Skip to content

Invitation to provide feedback on New Parquet Encoding: Adaptive Lossless Floating Point Compression #26279

@alamb

Description

@alamb

Description

👋 from the Parquet community (again)

The Parquet community is working on a proposal to implement ALP: Adaptive Lossless floating-Point Compression, (SIGMOD 2024) as a new Parquet encoding

Given that Polars is a major player in the open source Parquet world, and may feel pressure to implement newly added encodings, I think having your feedback on the spec as a stakeholder would be immensely valuable.

The proposed addition is here : Parquet ALP Encoding Specification (Google Doc)

There is also a prototype C++ implementation for review here

See also: mailing list discussion (link) and parquet-format PR

Thank you for your consideration

cc @ritchie46 and @orlp as I believe you have commented about Parquet things in the past

Metadata

Metadata

Assignees

No one assigned

    Labels

    A-io-parquetArea: reading/writing Parquet filesenhancementNew feature or an improvement of an existing feature

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions