We created Parquet to make the advantages of compressed, efficient columnar data representation available to any project in the Hadoop ecosystem. Parquet is built from the ground up with complex nested data structures in mind, and uses the record shredding and assembly algorithm described in the Dremel paper. We believe this approach is superior to simple flattening of nested name spaces. Parquet is built to support very efficient compression and encoding schemes. | Apache Parquet
The latest version of parquet-format is 2.8.0. To check the validity of this release, use its release manager OpenPGP key, OpenPGP signature, and SHA-512 checksum. Older releases can be found in the Archives of the Apache Software Foundation: https://archive.apache.org/dist/parquet/ | Parquet – Parquet
The latest version of parquet-format is 2.7.0. To check the validity of this release, use its release manager OpenPGP key, OpenPGP signature, and SHA-512 checksum. Older releases can be found in the Archives of the Apache Software Foundation: https://archive.apache.org/dist/parquet/ | Apache Parquet
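
The SHA-512 part of the release check can be sketched in Python as follows. This is a minimal illustration, not the project's official procedure: the function names are hypothetical, the published digest is assumed to have been copied by hand from the release's SHA-512 file, and the OpenPGP signature check (normally done with a tool such as GnuPG) is not covered.

```python
import hashlib


def sha512_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA-512 digest of a file, reading it in chunks."""
    digest = hashlib.sha512()
    with open(path, "rb") as fh:
        # Read fixed-size chunks so large archives are not loaded into memory at once.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_sha512(path, published_digest):
    """Compare a file's digest with a published hex digest.

    Whitespace and letter case in the published value are ignored, since
    checksum files sometimes group the hex digits or use upper case.
    """
    normalized = "".join(published_digest.split()).lower()
    return sha512_of_file(path) == normalized
```

A call might look like `matches_published_sha512("downloaded-release.tar.gz", digest_text)`, where both the file name and `digest_text` are placeholders for the archive you downloaded and the digest published alongside it. A matching checksum only shows the download was not corrupted; the OpenPGP signature is what ties the archive to the release manager's key.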