Is your feature request related to a problem or challenge? Please describe what you are trying to do.
Arrow has added support for run-end encoded (REE) arrays (apache/arrow#14176). Like dictionary arrays, REE arrays encode repeated values in a space-efficient manner that still allows fast processing.
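For readers unfamiliar with the layout, here is a minimal sketch of the idea in plain Rust using only the standard library. It is not the arrow-rs API: `RunEndEncoded`, `encode`, `len`, and `get` are hypothetical names chosen for illustration, and nulls are ignored. It assumes the usual REE layout of one "run ends" buffer holding the exclusive logical end index of each run, plus one value per run.

```rust
/// Simplified run-end encoded column (illustrative only, not arrow-rs).
/// `run_ends[i]` is the exclusive logical end index of run `i`;
/// `values[i]` is the value repeated throughout that run.
#[derive(Debug, PartialEq)]
struct RunEndEncoded<T> {
    run_ends: Vec<i32>,
    values: Vec<T>,
}

/// Collapse consecutive equal items into (run end, value) pairs.
fn encode<T: PartialEq + Clone>(input: &[T]) -> RunEndEncoded<T> {
    let mut run_ends: Vec<i32> = Vec::new();
    let mut values: Vec<T> = Vec::new();
    for (i, item) in input.iter().enumerate() {
        if values.last() == Some(item) {
            // Same value as the current run: extend the run by moving its end.
            *run_ends.last_mut().unwrap() = (i + 1) as i32;
        } else {
            // New value: start a new run that ends just after this item.
            run_ends.push((i + 1) as i32);
            values.push(item.clone());
        }
    }
    RunEndEncoded { run_ends, values }
}

impl<T> RunEndEncoded<T> {
    /// Logical length, which is the end of the last run.
    fn len(&self) -> usize {
        self.run_ends.last().map_or(0, |&end| end as usize)
    }

    /// Look up a logical index by binary searching the run ends.
    fn get(&self, logical_index: usize) -> Option<&T> {
        if logical_index >= self.len() {
            return None;
        }
        // Index of the first run whose end is strictly greater than the index.
        let physical = self
            .run_ends
            .partition_point(|&end| (end as usize) <= logical_index);
        self.values.get(physical)
    }
}
```

With this sketch, encoding `["a", "a", "a", "b", "c", "c"]` stores three runs instead of six values, and a lookup costs a binary search over the run ends rather than a direct index. That trade-off is what makes kernels that operate per run, instead of per element, attractive.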
Describe the solution you'd like
Implement REE in arrow-rs. Some likely candidates:
Remaining tasks:
- `RunArray`s #7487
- `build_extend` and `build_extend_nulls` #7671
- `PartialEq` for RunArray #7691
- `RunArray` documentation #9019
- `RunArray`s #9018
- `RunEndEncoded` arrays in `arrow-json` #9359
- `RunEndEncoded` in ord comparator #9360

Additional context
Among other things, @brancz is working to improve aggregation performance in DataFusion using RunArrays; see
RunArrays datafusion#16011