Skip to content

[CORE] Iceberg: Implement input file expressions for Iceberg tables#10831

Closed
JunhyungSong wants to merge 2 commits intoapache:mainfrom
JunhyungSong:main
Closed

[CORE] Iceberg: Implement input file expressions for Iceberg tables#10831
JunhyungSong wants to merge 2 commits intoapache:mainfrom
JunhyungSong:main

Conversation

@JunhyungSong
Copy link

Support input_file_name(), input_file_block_start(), and input_file_block_length() for Iceberg tables which were missing in the initial implementation, #6021.

@github-actions github-actions bot added CORE works for Gluten Core VELOX DATA_LAKE labels Oct 2, 2025
@github-actions
Copy link

github-actions bot commented Oct 2, 2025

Run Gluten Clickhouse CI on x86

case p @ ProjectExec(projectList, child: BatchScanExecTransformerBase)
if projectList.exists(containsInputFileRelatedExpr) =>
child.copy(output = p.output.asInstanceOf[Seq[AttributeReference]])
child.withNewOutput(p.output.asInstanceOf[Seq[AttributeReference]])
Copy link
Member

@zhztheplayer zhztheplayer Oct 2, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copy link
Author

@JunhyungSong JunhyungSong Oct 2, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Would you open an issue for this? Thanks!

We may want to end up with a common solution for different lake formats.

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@github-actions
Copy link

github-actions bot commented Oct 2, 2025

Run Gluten Clickhouse CI on x86

@zhztheplayer zhztheplayer changed the title Implement input file expressions for Iceberg tables [CORE] Iceberg: Implement input file expressions for Iceberg tables Oct 3, 2025
 - Support input_file_name(), input_file_block_start(), and
   input_file_block_length() for Iceberg tables which were missing in
   the OSS
@github-actions
Copy link

github-actions bot commented Oct 3, 2025

Run Gluten Clickhouse CI on x86

@github-actions
Copy link

github-actions bot commented Oct 3, 2025

Run Gluten Clickhouse CI on x86

@JunhyungSong
Copy link
Author

@zhztheplayer Can you check what the Clickhouse CI failure is? I don't have permission to check the details. Can you also trigger other workflows like backends-velox? It looks like a maintainer approval is needed.

@zhztheplayer
Copy link
Member

@JunhyungSong CH failure is as following:

[2025-10-03T22:33:48.417Z] [ERROR] /home/jenkins/agent/workspace/gluten/gluten-ci/ut-stage-3/gluten-iceberg/src/main/scala/org/apache/iceberg/spark/source/GlutenIcebergSourceUtil.scala:68: error: value location is not a member of org.apache.iceberg.DataFile
[2025-10-03T22:33:48.421Z] [ERROR]             val path = task.file().location()
[2025-10-03T22:33:48.421Z] [ERROR]                                    ^
[2025-10-03T22:33:48.421Z] [ERROR] one error found
[2025-10-03T22:33:48.539Z] [ERROR] /home/jenkins/agent/workspace/gluten/gluten-ci/ut-stage-1/gluten-iceberg/src/main/scala/org/apache/iceberg/spark/source/GlutenIcebergSourceUtil.scala:68: error: value location is not a member of org.apache.iceberg.DataFile
[2025-10-03T22:33:48.543Z] [ERROR]             val path = task.file().location()
[2025-10-03T22:33:48.543Z] [ERROR]                                    ^
[2025-10-03T22:33:48.543Z] [ERROR] one error found
[2025-10-03T22:33:48.543Z] [ERROR] exception compilation error occurred!!!
[2025-10-03T22:33:48.543Z] org.apache.commons.exec.ExecuteException: Process exited with an error: 1 (Exit value: 1)
[2025-10-03T22:33:48.543Z]     at org.apache.commons.exec.DefaultExecutor.executeInternal (DefaultExecutor.java:404)
[2025-10-03T22:33:48.543Z]     at org.apache.commons.exec.DefaultExecutor.execute (DefaultExecutor.java:166)
[2025-10-03T22:33:48.543Z]     at org.apache.commons.exec.DefaultExecutor.execute (DefaultExecutor.java:153)
[2025-10-03T22:33:48.543Z]     at scala_maven_executions.JavaMainCallerByFork.run (JavaMainCallerByFork.java:95)
[2025-10-03T22:33:48.543Z]     at scala_maven.ScalaCompilerSupport.compile (ScalaCompilerSupport.java:173)
[2025-10-03T22:33:48.543Z]     at scala_maven.ScalaCompilerSupport.doExecute (ScalaCompilerSupport.java:86)
[2025-10-03T22:33:48.543Z]     at scala_maven.ScalaMojoSupport.execute (ScalaMojoSupport.java:310)
[2025-10-03T22:33:48.543Z]     at scala_maven.ScalaCompileMojo.execute (ScalaCompileMojo.java:108)
[2025-10-03T22:33:48.543Z]     at org.apache.maven.plugin.DefaultBuildPluginManager.executeMojo (DefaultBuildPluginManager.java:137)
[2025-10-03T22:33:48.543Z]     at org.apache.maven.lifecycle.internal.MojoExecutor.execute (MojoExecutor.java:210)
[2025-10-03T22:33:48.543Z]     at org.apache.maven.lifecycle.internal.MojoExecutor.execute (MojoExecutor.java:156)
[2025-10-03T22:33:48.543Z]     at org.apache.maven.lifecycle.internal.MojoExecutor.execute (MojoExecutor.java:148)
[2025-10-03T22:33:48.543Z]     at org.apache.maven.lifecycle.internal.LifecycleModuleBuilder.buildProject (LifecycleModuleBuilder.java:117)
[2025-10-03T22:33:48.543Z]     at org.apache.maven.lifecycle.internal.LifecycleModuleBuilder.buildProject (LifecycleModuleBuilder.java:81)
[2025-10-03T22:33:48.543Z]     at org.apache.maven.lifecycle.internal.builder.singlethreaded.SingleThreadedBuilder.build (SingleThreadedBuilder.java:56)
[2025-10-03T22:33:48.543Z]     at org.apache.maven.lifecycle.internal.LifecycleStarter.execute (LifecycleStarter.java:128)
[2025-10-03T22:33:48.543Z]     at org.apache.maven.DefaultMaven.doExecute (DefaultMaven.java:305)
[2025-10-03T22:33:48.543Z]     at org.apache.maven.DefaultMaven.doExecute (DefaultMaven.java:192)
[2025-10-03T22:33:48.543Z]     at org.apache.maven.DefaultMaven.execute (DefaultMaven.java:105)
[2025-10-03T22:33:48.543Z]     at org.apache.maven.cli.MavenCli.execute (MavenCli.java:972)
[2025-10-03T22:33:48.543Z]     at org.apache.maven.cli.MavenCli.doMain (MavenCli.java:293)
[2025-10-03T22:33:48.543Z]     at org.apache.maven.cli.MavenCli.main (MavenCli.java:196)
[2025-10-03T22:33:48.543Z]     at sun.reflect.NativeMethodAccessorImpl.invoke0 (Native Method)
[2025-10-03T22:33:48.543Z]     at sun.reflect.NativeMethodAccessorImpl.invoke (NativeMethodAccessorImpl.java:62)
[2025-10-03T22:33:48.543Z]     at sun.reflect.DelegatingMethodAccessorImpl.invoke (DelegatingMethodAccessorImpl.java:43)
[2025-10-03T22:33:48.543Z]     at java.lang.reflect.Method.invoke (Method.java:498)
[2025-10-03T22:33:48.543Z]     at org.codehaus.plexus.classworlds.launcher.Launcher.launchEnhanced (Launcher.java:282)
[2025-10-03T22:33:48.543Z]     at org.codehaus.plexus.classworlds.launcher.Launcher.launch (Launcher.java:225)
[2025-10-03T22:33:48.543Z]     at org.codehaus.plexus.classworlds.launcher.Launcher.mainWithExitCode (Launcher.java:406)
[2025-10-03T22:33:48.543Z]     at org.codehaus.plexus.classworlds.launcher.Launcher.main (Launcher.java:347)

@zhztheplayer
Copy link
Member

@JunhyungSong

The public account / password for CH CI is documented here, although it was a little bit too deeply documented...

@github-actions
Copy link

This PR is stale because it has been open 45 days with no activity. Remove stale label or comment or this will be closed in 10 days.

@github-actions github-actions bot added the stale stale label Nov 28, 2025
@github-actions
Copy link

github-actions bot commented Dec 8, 2025

This PR was auto-closed because it has been stalled for 10 days with no activity. Please feel free to reopen if it is still valid. Thanks.

@github-actions github-actions bot closed this Dec 8, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CORE works for Gluten Core DATA_LAKE stale stale VELOX

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants