Skip to content

fix(cudf): Fix table scan fallback condition and library#14728

Closed
jinchengchenghh wants to merge 6 commits intofacebookincubator:mainfrom
jinchengchenghh:cudf_connector
Closed

fix(cudf): Fix table scan fallback condition and library#14728
jinchengchenghh wants to merge 6 commits intofacebookincubator:mainfrom
jinchengchenghh:cudf_connector

Conversation

@jinchengchenghh
Copy link
Collaborator

@jinchengchenghh jinchengchenghh commented Sep 4, 2025

Gluten needs to link velox_cudf_parquet_connector, so it cannot be OBJECT
Aligns with LocalFileSystem to extractFilePath https://github.com/facebookincubator/velox/blob/main/velox/common/file/FileSystems.cpp#L109-L121

Related PR: apache/gluten#10622
Resolves: #14730

@netlify
Copy link

netlify bot commented Sep 4, 2025

Deploy Preview for meta-velox canceled.

Name Link
🔨 Latest commit b63490a
🔍 Latest deploy log https://app.netlify.com/projects/meta-velox/deploys/68cda3610b72540007ea5510

@meta-cla meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Sep 4, 2025
Comment on lines +60 to +67
VELOX_CHECK(
start <=
static_cast<uint64_t>(std::numeric_limits<cudf::size_type>::max()),
"ParquetConnectorSplit `start` must be less than or equal to 2^31");
VELOX_CHECK(
length <=
static_cast<uint64_t>(std::numeric_limits<cudf::size_type>::max()),
"ParquetConnectorSplit `length` must be less than or equal to 2^31");
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Shouldn't start and length be set from constructor args like in #14129. I believe these changes are duplicate as well

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, I just move it from header file to cpp file, the #14129. should be merged first

@jinchengchenghh jinchengchenghh changed the title fix(cudf): Fix table scan fall back condition and library fix(cudf): Fix table scan fallback condition and library Sep 10, 2025
@jinchengchenghh
Copy link
Collaborator Author

@mhaseeb123 Could you help review again? Thanks!

@mhaseeb123
Copy link
Collaborator

LGTM thanks.

@jinchengchenghh
Copy link
Collaborator Author

The red CI is not caused by this PR #14920

@jinchengchenghh jinchengchenghh added the ready-to-merge PR that have been reviewed and are ready for merging. PRs with this tag notify the Velox Meta oncall label Sep 22, 2025
@facebook-github-bot
Copy link
Contributor

@xiaoxmeng has imported this pull request. If you are a Meta employee, you can view this in D83066640.

@facebook-github-bot
Copy link
Contributor

@xiaoxmeng merged this pull request in 62fe64a.

jinchengchenghh added a commit to jinchengchenghh/velox that referenced this pull request Sep 24, 2025
…ubator#14728)

Summary:
Gluten needs to link velox_cudf_parquet_connector, so it cannot be OBJECT
Aligns with LocalFileSystem to extractFilePath https://github.com/facebookincubator/velox/blob/main/velox/common/file/FileSystems.cpp#L109-L121

Related PR: apache/gluten#10622
Resolves: facebookincubator#14730

Pull Request resolved: facebookincubator#14728

Reviewed By: zacw7

Differential Revision: D83066640

Pulled By: xiaoxmeng

fbshipit-source-id: 8835a883b4c517b0097e3afc1c83ac3994a91f38
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Merged ready-to-merge PR that have been reviewed and are ready for merging. PRs with this tag notify the Velox Meta oncall

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Cudf parquet connector failed by not exists file

3 participants