[GLUTEN-10648][VL] Support Iceberg overwrite partitions dynamic by Zouxxyy · Pull Request #10823 · apache/gluten

Zouxxyy · 2025-09-30T09:54:15Z

What changes are proposed in this pull request?

Support Iceberg overwrite partitions dynamic

How was this patch tested?

github-actions · 2025-09-30T09:54:28Z

#10648

github-actions · 2025-09-30T09:54:45Z

Run Gluten Clickhouse CI on x86

jinchengchenghh · 2025-09-30T14:38:38Z

It is public, username gluten password hN2xX3uQ4m

jinchengchenghh · 2025-09-30T14:42:24Z

This is an unstable test, I will create a PR to fix it.

[2025-09-30T11:51:08.252Z] - Gluten - SPARK-35650: Coalesce number of partitions by AEQ *** FAILED ***
[2025-09-30T11:51:08.252Z]   2 did not equal 1 (ClickHouseAdaptiveQueryExecSuite.scala:84)

jinchengchenghh · 2025-09-30T14:44:58Z

The line number is different with source ClickHouseAdaptiveQueryExecSuite.scala:84

jinchengchenghh · 2025-09-30T14:53:10Z

backends-velox/src/main/scala/org/apache/gluten/backendsapi/velox/VeloxBackend.scala

  override def supportOverwriteByExpression(): Boolean =
    GlutenConfig.get.enableOverwriteByExpression && enableEnhancedFeatures()
+
+  override def supportOverwritePartitionsDynamic(): Boolean =


Here is the backend setting config, just return enableEnhancedFeatures(), and this change can trigger the CI again.

github-actions · 2025-09-30T15:05:24Z

Run Gluten Clickhouse CI on x86

zhztheplayer · 2025-10-01T07:35:17Z

gluten-substrait/src/main/scala/org/apache/gluten/backendsapi/BackendSettingsApi.scala


  def supportOverwriteByExpression(): Boolean = false
+
+  def supportOverwritePartitionsDynamic(): Boolean = false


Unnecessarily related to this PR, but we do need to have the new V2 columnar write operators covered in tests individually without having to enable the Iceberg writer, as they were design to be general. Vanilla Spark uses an in-memory catalog for testing the row-based V2 write operators. We may want to introduce something similar just for testing. #9896

Sounds good, v2 write is a general capability that can be used in all other lake formats. I know a bit about DSv2, and happy to help if needed.

Thanks! Feel free to open issues and PRs.

Spark uses an in-memory catalog for testing the row-based V2 write operators. We may want to introduce something similar just for testing.

@Zouxxyy Just recalled that other contributors might already work on something similar, let me confirm first to avoid duplicated work. :) I don't have their GitHub ID or Email at this moment but I will try to get them into the public discussion.

I've confirmed, they are not working on the test topic. So feel free to take if wanted. We may have public discussions about the further matters later on.

Thanks, I'd like to, testing is the foundation, which will make the integration more directional and reliable.

zhztheplayer · 2025-10-01T18:04:15Z

gluten-substrait/src/main/scala/org/apache/gluten/config/GlutenConfig.scala

      .createWithDefault(true)

+  val COLUMNAR_OVERWRIET_PARTITIONS_DYNAMIC_ENABLED =
+    buildConf("spark.gluten.sql.columnar.overwriteOverwritePartitionsDynamic")


Hi @Zouxxyy,

Should this be spark.gluten.sql.columnar.overwritePartitionsDynamic?

Sorry, my mistake, I might have copied the wrong content.

This should fix CI error on `AllVeloxConfiguration`.

v1

eb9a503

github-actions bot added CORE works for Gluten Core VELOX DOCS DATA_LAKE labels Sep 30, 2025

Zouxxyy mentioned this pull request Sep 30, 2025

[GLUTEN-10648][VL] Support Iceberg overwrite partitions dynamic #10760

Closed

jinchengchenghh mentioned this pull request Sep 30, 2025

[CH] Flaky test ClickHouseAdaptiveQueryExecSuite #10756

Open

jinchengchenghh reviewed Sep 30, 2025

View reviewed changes

update for comment

3819bcd

jinchengchenghh approved these changes Sep 30, 2025

View reviewed changes

zhztheplayer reviewed Oct 1, 2025

View reviewed changes

jinchengchenghh merged commit 971f590 into apache:main Oct 1, 2025
57 checks passed

zhztheplayer reviewed Oct 1, 2025

View reviewed changes

zhztheplayer added a commit to zhztheplayer/gluten that referenced this pull request Oct 1, 2025

[VL] Following apache#10823, correct the config option key

70cc549

This should fix CI error on `AllVeloxConfiguration`.

zhztheplayer mentioned this pull request Oct 1, 2025

[VL] Following #10823, correct the config option key #10830

Merged

zhztheplayer added a commit that referenced this pull request Oct 2, 2025

[VL] Following #10823, correct the config option key (#10830)

fd155d2

This should fix CI error on `AllVeloxConfiguration`.


		def supportOverwriteByExpression(): Boolean = false

		def supportOverwritePartitionsDynamic(): Boolean = false

Conversation

Zouxxyy commented Sep 30, 2025

What changes are proposed in this pull request?

How was this patch tested?

Uh oh!

github-actions bot commented Sep 30, 2025

Uh oh!

github-actions bot commented Sep 30, 2025

Uh oh!

jinchengchenghh commented Sep 30, 2025

Uh oh!

jinchengchenghh commented Sep 30, 2025

Uh oh!

jinchengchenghh commented Sep 30, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Sep 30, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

zhztheplayer Oct 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

zhztheplayer Oct 1, 2025 •

edited

Loading