[Parquet] Optimize struct reading #9216

@jhorstmann

Is your feature request related to a problem or challenge? Please describe what you are trying to do.

The benchmarks added in #9209 show a significant overhead when reading data nested in structs from parquet. Profiling shows a simple performance hotspot in appending individual bits to a BooleanBufferBuilder.

Describe the solution you'd like

We can use the new extend_trusted_len method introduced in #9137 to optimize this pattern.
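To illustrate why a bulk extend helps, here is a minimal, stdlib-only sketch. `BitBuilder` and `extend_exact` are hypothetical stand-ins, not arrow-rs's `BooleanBufferBuilder` or the actual `extend_trusted_len` from #9137, whose signature and internals are not shown here. The sketch only assumes an LSB-first bit-packed layout and contrasts per-bit appends with packing 64 bits at a time from an iterator of known length.

```rust
/// Hypothetical bit-packed builder (NOT the arrow-rs type), LSB-first within each u64.
#[derive(Default)]
pub struct BitBuilder {
    words: Vec<u64>,
    len: usize, // number of bits stored
}

impl BitBuilder {
    /// Slow path: every call recomputes the bit offset and may grow the buffer.
    pub fn append(&mut self, bit: bool) {
        let offset = self.len % 64;
        if offset == 0 {
            self.words.push(0);
        }
        if bit {
            *self.words.last_mut().unwrap() |= 1u64 << offset;
        }
        self.len += 1;
    }

    /// Bulk path: reserve once, then assemble whole words in a register.
    /// Uses the safe `ExactSizeIterator` as a stand-in for a trusted-length contract.
    pub fn extend_exact<I>(&mut self, mut iter: I)
    where
        I: ExactSizeIterator<Item = bool>,
    {
        // Reserve the extra words needed for all incoming bits up front.
        let total_words = (self.len + iter.len() + 63) / 64;
        self.words.reserve(total_words - self.words.len());

        // Fill the current partial word bit by bit until we are word-aligned.
        while self.len % 64 != 0 {
            match iter.next() {
                Some(bit) => self.append(bit),
                None => return,
            }
        }

        // Pack full 64-bit words without touching the buffer per bit.
        let mut remaining = iter.len();
        while remaining >= 64 {
            let mut word = 0u64;
            for i in 0..64 {
                // Panics (rather than misbehaving) if the iterator misreports its length.
                word |= (iter.next().unwrap() as u64) << i;
            }
            self.words.push(word);
            self.len += 64;
            remaining -= 64;
        }

        // Append the tail that does not fill a whole word.
        for bit in iter {
            self.append(bit);
        }
    }

    pub fn len(&self) -> usize {
        self.len
    }

    pub fn words(&self) -> &[u64] {
        &self.words
    }
}
```

Both paths produce the same buffer; the bulk path simply replaces most per-bit buffer updates with one push per 64 bits, which is the kind of saving this issue is after.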

Describe alternatives you've considered

Additional context

Labels: enhancement (Any new improvement worthy of an entry in the changelog)
