What is Compaction


Merging micro-files in Delta tables to improve Spark job performance.

Synonyms:
File consolidation, Small file optimization
Definition:
Compaction merges many small files in a data lake into fewer, larger files, which improves read performance and reduces metadata overhead.
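
To illustrate the idea, here is a minimal sketch of how a compaction pass might decide which small files to merge. It is not how Delta Lake or Spark implements compaction; the `DataFile` type, the `plan_compaction` function, and the size values are all hypothetical, and the grouping uses a simple first-fit-decreasing bin-packing heuristic chosen for illustration only.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DataFile:
    # Hypothetical description of one file in a table: its path and size.
    path: str
    size_bytes: int


def plan_compaction(files, target_bytes):
    """Group small files into bins of at most target_bytes each.

    Each returned bin is a list of files that could be rewritten as a
    single larger file. Files already at or above target_bytes are left
    alone, and bins holding only one file are dropped because rewriting
    them would not reduce the file count.
    """
    # Only files smaller than the target are candidates for merging.
    candidates = sorted(
        (f for f in files if f.size_bytes < target_bytes),
        key=lambda f: f.size_bytes,
        reverse=True,
    )

    bins = []  # each entry is [total_size, [files]]
    for f in candidates:
        # First fit: place the file in the first bin with enough room.
        for b in bins:
            if b[0] + f.size_bytes <= target_bytes:
                b[0] += f.size_bytes
                b[1].append(f)
                break
        else:
            bins.append([f.size_bytes, [f]])

    return [b[1] for b in bins if len(b[1]) > 1]


if __name__ == "__main__":
    table_files = [
        DataFile("part-0001", 40),
        DataFile("part-0002", 30),
        DataFile("part-0003", 20),
        DataFile("part-0004", 10),
        DataFile("part-0005", 100),  # already at target size, untouched
    ]
    for group in plan_compaction(table_files, target_bytes=100):
        print([f.path for f in group])
```

In practice, Delta Lake users usually rely on the built-in OPTIMIZE command to perform this planning and rewriting rather than writing it by hand.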

Variations:
Delta compaction process
