Skip to content

[GLUTEN-11550][VL][UT] Enable GlutenVariantEndToEndSuite and GlutenVariantShreddingSuite#11726

Open
baibaichen wants to merge 1 commit intoapache:mainfrom
baibaichen:fix/GlutenVariantEndToEndSuite-GlutenVariantShreddingSuite
Open

[GLUTEN-11550][VL][UT] Enable GlutenVariantEndToEndSuite and GlutenVariantShreddingSuite#11726
baibaichen wants to merge 1 commit intoapache:mainfrom
baibaichen:fix/GlutenVariantEndToEndSuite-GlutenVariantShreddingSuite

Conversation

@baibaichen
Copy link
Contributor

@baibaichen baibaichen commented Mar 9, 2026

What changes are proposed in this pull request?

Detect variant shredded structs (produced by Spark's PushVariantIntoScan optimization) in VeloxValidatorApi.validateSchema() by checking for __VARIANT_METADATA_KEY metadata on struct fields. When detected, fall back to Spark's native reader since Velox cannot read variant shredding encoding in Parquet files.

Changes:

  • VeloxValidatorApi.scala: Added variant shredded struct detection in validateSchema(). When all fields of a struct have __VARIANT_METADATA_KEY metadata, the schema validation fails and triggers fallback to Spark's native execution.
  • VeloxTestSettings.scala (spark40 & spark41): Enabled GlutenVariantEndToEndSuite and GlutenVariantShreddingSuite.

How was this patch tested?

  • GlutenVariantEndToEndSuite: 14 tests passed on both spark40 and spark41
  • GlutenVariantShreddingSuite: 8 tests passed on both spark40 and spark41

Was this patch authored or co-authored using generative AI tooling?

Generated-by: GitHub Copilot CLI

Related issue: #11550

…riantShreddingSuite

Detect variant shredded structs (produced by Spark's PushVariantIntoScan)
in VeloxValidatorApi.validateSchema() by checking for __VARIANT_METADATA_KEY
metadata on struct fields. When detected, fall back to Spark's native reader
since Velox cannot read variant shredding encoding in Parquet files.

Enable both suites in VeloxTestSettings for spark40 and spark41.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
@baibaichen baibaichen force-pushed the fix/GlutenVariantEndToEndSuite-GlutenVariantShreddingSuite branch from d508a12 to 8f1d876 Compare March 9, 2026 14:13
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CORE works for Gluten Core VELOX

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant