[GLUTEN-11635] Enable partial fallback if parent node supports partial fallback by wecharyu · Pull Request #11637 · apache/gluten

wecharyu · 2026-02-20T16:41:28Z

What changes are proposed in this pull request?

How was this patch tested?

Add a new unit test.

select plus_one(col1) as col2, l_partkey from (
  select plus_one(l_orderkey) as col1, l_partkey from lineitem
)

Before this PR:

== Physical Plan ==
*(1) Project [if (isnull(col1#73L)) null else plus_one(knownnotnull(col1#73L)) AS col2#74L, l_partkey#1L]
+- *(1) Project [if (isnull(l_orderkey#0L)) null else plus_one(knownnotnull(l_orderkey#0L)) AS col1#73L, l_partkey#1L]
   +- VeloxColumnarToRow
      +- ^(1) BatchScanExecTransformer parquet file:/root/workspace/gluten-community/backends-velox/target/scala-2.13/test-classes/tpch-data-parquet/lineitem[l_orderkey#0L, l_partkey#1L] ParquetScan DataFilters: [], Format: parquet, Location: InMemoryFileIndex(1 paths)[file:/root/workspace/gluten-community/backends-velox/target/scala-2.13..., PartitionFilters: [], PushedAggregation: [], PushedFilters: [], PushedGroupBy: [], ReadSchema: struct<l_orderkey:bigint,l_partkey:bigint> RuntimeFilters: [] NativeFilters: []

After this PR:

VeloxColumnarToRow
+- ^(3) ProjectExecTransformer [_SparkPartialProject0#56L AS col2#38L, l_partkey#1L]
   +- ^(3) InputIteratorTransformer[col1#37L, l_partkey#1L, _SparkPartialProject0#56L]
      +- ColumnarPartialProject [if (isnull(col1#37L)) null else plus_one(knownnotnull(col1#37L)) AS col2#38L, l_partkey#1L] PartialProject List(if (isnull(col1#37L)) null else plus_one(knownnotnull(col1#37L)) AS _SparkPartialProject0#56L)
         +- ^(2) ProjectExecTransformer [_SparkPartialProject0#55L AS col1#37L, l_partkey#1L]
            +- ^(2) InputIteratorTransformer[l_orderkey#0L, l_partkey#1L, _SparkPartialProject0#55L]
               +- ColumnarPartialProject [if (isnull(l_orderkey#0L)) null else plus_one(knownnotnull(l_orderkey#0L)) AS col1#37L, l_partkey#1L] PartialProject List(if (isnull(l_orderkey#0L)) null else plus_one(knownnotnull(l_orderkey#0L)) AS _SparkPartialProject0#55L)
                  +- ^(1) BatchScanExecTransformer parquet file:/root/workspace/gluten-community/backends-velox/target/scala-2.13/test-classes/tpch-data-parquet/lineitem[l_orderkey#0L, l_partkey#1L] ParquetScan DataFilters: [], Format: parquet, Location: InMemoryFileIndex(1 paths)[file:/root/workspace/gluten-community/backends-velox/target/scala-2.13..., PartitionFilters: [], PushedAggregation: [], PushedFilters: [], PushedGroupBy: [], ReadSchema: struct<l_orderkey:bigint,l_partkey:bigint> RuntimeFilters: [] NativeFilters: []
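
For readers unfamiliar with the plan shape above, the split that each `ColumnarPartialProject` / `ProjectExecTransformer` pair appears to perform can be sketched with a toy model. This is only an illustration: `Expr`, `Col`, `UdfCall`, `Alias`, and `PartialProjectSketch` are hypothetical names, not Gluten or Spark classes, and the sketch assumes that only UDF calls need row-based evaluation.

```scala
// Toy model only: these types are hypothetical, not Gluten or Spark classes.
sealed trait Expr
final case class Col(name: String) extends Expr
final case class UdfCall(udf: String, arg: Expr) extends Expr // assumed not offloadable
final case class Alias(child: Expr, name: String) extends Expr

object PartialProjectSketch {
  // rowSide: expressions evaluated row by row (the "PartialProject List" in the plan).
  // nativeSide: the project list left for the native project operator.
  final case class Split(rowSide: Seq[Alias], nativeSide: Seq[Expr])

  def split(projectList: Seq[Expr]): Split = {
    var counter = 0
    val rowSide = Seq.newBuilder[Alias]
    val nativeSide = projectList.map {
      case Alias(u: UdfCall, outName) =>
        // Evaluate the UDF on the row side under a temporary name,
        // then let the native side rename that column to the requested alias.
        val tmp = s"_SparkPartialProject$counter"
        counter += 1
        rowSide += Alias(u, tmp)
        Alias(Col(tmp), outName)
      case other => other // plain columns pass through untouched
    }
    Split(rowSide.result(), nativeSide)
  }
}
```

Applied to the inner project list `[plus_one(l_orderkey) AS col1, l_partkey]`, the sketch yields a row side of `plus_one(l_orderkey) AS _SparkPartialProject0` and a native side of `[_SparkPartialProject0 AS col1, l_partkey]`, which mirrors the lower `ColumnarPartialProject` and `ProjectExecTransformer` nodes in the plan.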

Was this patch authored or co-authored using generative AI tooling?

No.

wecharyu · 2026-02-24T10:36:17Z

@jinchengchenghh @weixiuli Could you please take a look? Thanks.

backends-velox/src/main/scala/org/apache/gluten/extension/PartialProjectRule.scala

wForget · 2026-02-26T11:02:41Z

Is there always a benefit to this? It does a partial projection, but with an extra C2R/R2C conversion.

before: full columnar to rows -> row-based project1 -> row-based project2 -> rows to columnar
after: partial columnar to rows -> row-based project1 -> rows to columnar -> partial columnar to rows -> row-based project2 -> rows to columnar
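
The trade-off in the two pipelines above can be put into rough numbers with a toy cost model. It is a sketch under stated assumptions, not a measurement: `ConversionSketch` and `Conversion` are hypothetical names, and the model assumes that a full conversion touches every column while a partial conversion touches only the single column the UDF reads or writes.

```scala
object ConversionSketch {
  // One row/columnar transition and the number of columns it has to convert.
  final case class Conversion(kind: String, columns: Int)

  // "before": one full C2R, two row-based projects, then one full R2C.
  def before(passThrough: Int): Seq[Conversion] = {
    val width = passThrough + 1 // pass-through columns plus the UDF column
    Seq(Conversion("C2R", width), Conversion("R2C", width))
  }

  // "after": each of the two projects does its own partial C2R and R2C,
  // assumed here to convert one column each.
  def after: Seq[Conversion] =
    Seq.fill(2)(Seq(Conversion("C2R", 1), Conversion("R2C", 1))).flatten

  def transitions(plan: Seq[Conversion]): Int = plan.size
  def columnsConverted(plan: Seq[Conversion]): Int = plan.map(_.columns).sum
}
```

Under these assumptions the "after" pipeline always has four transitions instead of two, but the column work depends on the width of the row: with one pass-through column both pipelines convert four columns in total, while with nine pass-through columns "before" converts twenty and "after" still converts four. Whether that offsets the extra transitions in practice is exactly the open question in this comment.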

jinchengchenghh · 2026-03-05T15:20:42Z

Could you share numbers showing how much improvement this PR provides?

wecharyu · 2026-03-06T11:27:17Z

I ran a simple test that reads from one table and writes to another. It shows only a small improvement; I think the benefit would be much larger if the operations offloaded to native were more complex. cc: @jinchengchenghh @wForget

insert overwrite table dev_spark_auxiliary.wechar_tbl2
select plus_one(col1) as col2, col3 from (
  select col1, log_type as col3
  from test_db.test_tbl
  lateral view simpleUDTF(item_id) as col1
  where grass_region='BR' and regional_date='2026-03-01'
)

jinchengchenghh · 2026-03-06T12:22:41Z

backends-velox/src/main/scala/org/apache/gluten/extension/PartialProjectRule.scala

-          case other => other
-        }
-    }
+    newPlan


newPlan -> wrapped

github-actions bot added the VELOX label Feb 20, 2026

wecharyu force-pushed the partial_fallback_enhence branch from 747d9fc to 8b4e841 Compare February 24, 2026 03:10

jinchengchenghh reviewed Feb 26, 2026

wecharyu added 3 commits March 6, 2026 16:42

[GLUTEN-11635] Enable partial fallback if parent node supports partial fallback

a4c0e9a

enable the nested project test in spark 3.4 or later

758cb29

Run all partial fallback rules in batch

413cb80

wecharyu force-pushed the partial_fallback_enhence branch from 8b4e841 to 413cb80 Compare March 6, 2026 08:43

jinchengchenghh approved these changes Mar 6, 2026

jinchengchenghh reviewed Mar 6, 2026

address comment

04591d6

jinchengchenghh merged commit 68dd66a into apache:main Mar 10, 2026
111 of 113 checks passed

jinchengchenghh merged 4 commits into apache:main from wecharyu:partial_fallback_enhence

3 participants
