Skip to content

[VL] Delta: Add Delta Lake write unit test for Spark 3.5 + Delta 3.3#10802

Merged
zhztheplayer merged 6 commits intoapache:mainfrom
zhztheplayer:wip-delta-write-ut
Sep 26, 2025
Merged

[VL] Delta: Add Delta Lake write unit test for Spark 3.5 + Delta 3.3#10802
zhztheplayer merged 6 commits intoapache:mainfrom
zhztheplayer:wip-delta-write-ut

Conversation

@zhztheplayer
Copy link
Member

@zhztheplayer zhztheplayer commented Sep 25, 2025

Delta doesn't release its testing Jars so we copy-paste some of their test code here as a quick solution to test Gluten with Delta.

In long-term, we may consider releasing a Delta test jar ourselves, or adopting download-and-patching for the sources to avoid maintaining Delta's test code in our code base.

In this PR, we only introduce one single test suite DeleteSQLSuite but it will cover the main Delta write functionality. We will add more in the future.

@github-actions github-actions bot added the VELOX label Sep 25, 2025
@zhztheplayer zhztheplayer changed the title [VL] Delta: Add Delta write unit tests for Spark 3.5 + Delta 3.3 [VL] Delta: Add a Delta Lake unit test for write for Spark 3.5 + Delta 3.3 Sep 25, 2025
@github-actions
Copy link

Run Gluten Clickhouse CI on x86

@github-actions github-actions bot added the CORE works for Gluten Core label Sep 26, 2025

val COLUMNAR_TABLE_CACHE_ENABLED =
buildConf("spark.gluten.sql.columnar.tableCache")
buildStaticConf("spark.gluten.sql.columnar.tableCache")
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I suspect this option is somehow leaked among different test suites so caused some strange errors. Let's correct it as static to align with its life scope and doing this will also help diagnosing the issue.

@github-actions
Copy link

Run Gluten Clickhouse CI on x86

@github-actions
Copy link

Run Gluten Clickhouse CI on x86

1 similar comment
@github-actions
Copy link

Run Gluten Clickhouse CI on x86

@zhztheplayer zhztheplayer changed the title [VL] Delta: Add a Delta Lake unit test for write for Spark 3.5 + Delta 3.3 [VL] Delta: Add Delta Lake write unit test for Spark 3.5 + Delta 3.3 Sep 26, 2025
Comment on lines +39 to +41
// FIXME: Excluded by Gluten as results are mismatch.
"test delete on temp view - nontrivial projection - SQL TempView",
"test delete on temp view - nontrivial projection - Dataset TempView"
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

TODO: Fix these cases.

Comment on lines +134 to +136
// FIXME: Excluded by Gluten as results are mismatch.
"test delete on temp view - nontrivial projection - SQL TempView",
"test delete on temp view - nontrivial projection - Dataset TempView"
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

TODO: Fix these cases.

Comment on lines +98 to +102
// FIXME: Enable the test.
// Skipping as function input_file_name doesn't get correctly resolved.
@Ignore
class DeleteSQLNameColumnMappingSuite extends DeleteSQLSuite
with DeltaColumnMappingEnableNameMode {
Copy link
Member Author

@zhztheplayer zhztheplayer Sep 26, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

TODO: Fix this suite.

This reverts commit 28fc63724e1ec4c4fb94cf7cb86851a5f364b016.
This reverts commit 5c4bc75.
@github-actions github-actions bot removed the CORE works for Gluten Core label Sep 26, 2025
@zhztheplayer zhztheplayer merged commit 56523dc into apache:main Sep 26, 2025
55 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants