Concurrent storage write batch support #425
Conversation
🦋 Changeset detected. Latest commit: 0fb42f9. The changes in this PR will be included in the next version bump. This PR includes changesets to release 17 packages.
Force-pushed from 6522374 to 841273e
stevensJourney
left a comment
Looks really cool. I could not spot anything major that stands out as a potential issue.
```diff
 if (afterId) {
   // Insert or update
-  const after_key: SourceKey = { g: this.group_id, t: sourceTable.id, k: afterId };
+  const after_key: SourceKey = { g: this.group_id, t: sourceTable.id as bson.ObjectId, k: afterId };
```
Shouldn't this type cast be using mongoTableId?
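To illustrate the suggestion, here is a minimal sketch of what a checked `mongoTableId` helper could look like in place of the raw `as bson.ObjectId` cast. The helper's implementation is an assumption (only its name appears in the review), and `ObjectIdLike` stands in for `bson.ObjectId` to keep the sketch self-contained:

```typescript
// ObjectIdLike is a stand-in for bson.ObjectId in this sketch.
type ObjectIdLike = { toHexString(): string };

interface SourceTableLike {
  id: string | ObjectIdLike;
}

// Hypothetical helper: narrow the table id to an ObjectId with a runtime
// check, instead of an unchecked `as` cast that would hide a string id.
function mongoTableId(table: SourceTableLike): ObjectIdLike {
  if (typeof table.id === 'string') {
    // Fail loudly rather than silently treating a string id as an ObjectId.
    throw new Error(`Expected an ObjectId table id, got string: ${table.id}`);
  }
  return table.id;
}

const table: SourceTableLike = {
  id: { toHexString: () => '507f1f77bcf86cd799439011' }
};
console.log(mongoTableId(table).toHexString()); // '507f1f77bcf86cd799439011'
```

Compared to a cast, the check turns a wrong storage backend pairing into an immediate error instead of a corrupted key.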
```ts
$cond: [
  can_checkpoint,
  {
    $max: ['$last_checkpoint', { $literal: this.persisted_op }, { $toLong: '$keepalive_op' }]
```
I'm not sure about the specifics here, but I noticed the Postgres implementation coerces the potentially null persisted_op to zero. Should we do that here as well?
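A minimal sketch of that suggestion, assuming the field names from the snippet above: wrapping the possibly-null `persisted_op` in MongoDB's standard `$ifNull` aggregation operator would coerce it to zero before taking the max, mirroring the Postgres behavior. The surrounding pipeline is omitted.

```typescript
// May be null before anything has been persisted (illustrative value).
const persisted_op: number | null = null;

// Sketch of the $max expression with the null coercion added; $ifNull
// replaces a null/missing first argument with the fallback (0 here).
const lastCheckpointExpr = {
  $max: [
    '$last_checkpoint',
    { $ifNull: [{ $literal: persisted_op }, 0] }, // null -> 0, as in Postgres storage
    { $toLong: '$keepalive_op' }
  ]
};
console.log(JSON.stringify(lastCheckpointExpr));
```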
This modifies the APIs and implementation of `BucketStorageBatch` to allow using multiple batches concurrently without introducing consistency issues. The first goal of this is to allow reading the replication stream while snapshotting - see #426 for a Postgres implementation.

The actual writes to the database still rely on exclusive database locks to a large extent, so there isn't true concurrency yet. But it does allow running a replication stream concurrently with replication snapshots without breaking.
Specifically, this changes:

- Unified `commit()` and `keepalive()`. The only remaining difference is that keepalive is essentially `commit({allowEmptyCommit: true})`.
- `snapshot_done` is now explicitly set after completing a snapshot, rather than being set automatically on commit. No commit will go through until `snapshot_done` is set.
- `current_data` deletes are now soft deletes (storing an empty document/row instead of deleting the document/row), converted into hard deletes on the next commit. We need this to consistently handle snapshots while streaming.
- Changed `SourceTable.id` to be strongly typed, specifically checking for `string` in Postgres storage and `ObjectId` in MongoDB storage. This unfortunately had a big knock-on effect in tests, including now having different checksums in Postgres storage vs MongoDB storage.

Other more minor changes:
- Added a `cause` to `ReplicationAbortedError`.
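The `current_data` soft-delete scheme described in the list above can be sketched in miniature. This is not the PR's implementation; it is a simplified in-memory model (all names are illustrative) of the stated semantics: a delete writes an empty record so a concurrent snapshot still sees the key, and the next commit turns accumulated soft deletes into hard deletes.

```typescript
interface CurrentDataRow {
  key: string;
  data: Uint8Array | null; // null marks a soft-deleted row
}

class CurrentDataStore {
  private rows = new Map<string, CurrentDataRow>();

  put(key: string, data: Uint8Array): void {
    this.rows.set(key, { key, data });
  }

  // Soft delete: keep the row (with empty data) so readers that started
  // before this write, e.g. an in-progress snapshot, still observe the key.
  softDelete(key: string): void {
    this.rows.set(key, { key, data: null });
  }

  // On commit, convert soft deletes into hard deletes.
  commit(): void {
    for (const [key, row] of this.rows) {
      if (row.data == null) this.rows.delete(key);
    }
  }

  has(key: string): boolean {
    return this.rows.has(key);
  }
}

const store = new CurrentDataStore();
store.put('a', new Uint8Array([1]));
store.softDelete('a');
console.log(store.has('a')); // true: still visible until commit
store.commit();
console.log(store.has('a')); // false: hard-deleted on commit
```

The two-phase delete is what lets a snapshot and a live replication stream run against the same batch without the stream's deletes pulling rows out from under the snapshot.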