Parquet

From P-20W+ Community of Innovation Wiki
Jump to navigation Jump to search

Overview[edit | edit source]

Parquet is a language-independent, open-source file format designed with a columnar structure to optimize querying and aggregation of specific fields. The columnar design also facilitates more efficient file compression, which allows for ease of storage for large volumes of data. Parquet files also include self-describing metadata, embedding the format and meaning within the data itself.