Skip to content

Conversation

@linliu-code
Copy link
Collaborator

@linliu-code linliu-code commented Feb 7, 2026

Describe the issue this Pull Request addresses

There's a change in behavior for for SparkHoodieTableFileIndex since 0.14.1. The StructType(partitionFields) returned doesn't have the full path and causing data validation failures. This behavior was changed as part of this PR https://github.com/apache/hudi/pull/9863/changes

Summary and Changelog

If there's a table with a nested partition column whose leaf name conflicts with another top level field the partitionedSchema passed to the new file group reader is incorrect. The fix is to return the partition field with the full path name instead of the inner field name.

Impact

Medium

Risk Level

Low.

Documentation Update

Contributor's checklist

  • Read through contributor's guide
  • Enough context is provided in the sections above
  • Adequate tests were added if applicable

@github-actions github-actions bot added the size:M PR with lines of changes in (100, 300] label Feb 7, 2026
@linliu-code linliu-code force-pushed the nested_partitioning branch 3 times, most recently from d6f9ca7 to 413fa60 Compare February 8, 2026 01:00
@linliu-code linliu-code changed the title fix: Reproduce nested partition columns pruning data validation failure fix: Support data pruning using nested partition columns Feb 8, 2026
@linliu-code linliu-code marked this pull request as ready for review February 8, 2026 05:50
@linliu-code linliu-code requested a review from yihua February 8, 2026 05:54
@nsivabalan
Copy link
Contributor

@hudi-bot run azure

@linliu-code
Copy link
Collaborator Author

@hudi-bot run azure

The command seems not working. Let me push it again to trigger the Azure test.

@apache apache deleted a comment from hudi-bot Feb 10, 2026
@hudi-bot
Copy link
Collaborator

CI report:

Bot commands @hudi-bot supports the following commands:
  • @hudi-bot run azure re-run the last Azure build

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

size:M PR with lines of changes in (100, 300]

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants