Skip to content

Conversation

@Zyiqin-Miranda
Copy link
Member

Summary

In this PR, we persist the latest snapshot id Ray converter processed up to as part of snapshot properties. This snapshot id property serve as the same purpose as high watermark.

Key Changes

1. Snapshot Properties vs Table Properties

  • Snapshot Properties (sourceSnapshotId, sourceSequenceNumber): Track the input snapshots that were processed by the Ray converter. These represent the "high watermark" of source data that has been converted.
  • Table Properties: Track the output snapshots produced by the Ray converter itself (the converter-generated snapshot IDs).

Why This Matters: This distinction allows operators to understand both what data was processed (snapshot properties) and what output was generated (table properties), enabling precise incremental processing for CDC usecase and avoiding reprocessing of already-converted data.

Rationale

Explain the reasoning behind the changes and their benefits to the project.

Changes

List the major changes made in this pull request.

Impact

Discuss any potential impacts the changes may have on existing functionalities.

Testing

Describe how the changes have been tested, including both automated and manual testing strategies.
If this is a bugfix, explain how the fix has been tested to ensure the bug is resolved without introducing new issues.

Regression Risk

If this is a bugfix, assess the risk of regression caused by this fix and steps taken to mitigate it.

Checklist

  • Unit tests covering the changes have been added

    • If this is a bugfix, regression tests have been added
  • E2E testing has been performed

Additional Notes

Any additional information or context relevant to this PR.

Copy link
Collaborator

@yankevn yankevn left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@Zyiqin-Miranda Zyiqin-Miranda merged commit af8ce43 into ray-project:2.0 Dec 9, 2025
3 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants