Feature/lxl 4524 #1686

jannistsiroyannis · 2026-01-27T13:28:11Z

Adds history archiving (of deleted records) to librisXL. A change to secret.properties (setting a base path for the archive) is necessary. Until it is set, this does nothing (but also isn't destructive in any way).

The history archive consists of gzipped json-lines files. Each around 100Mb in size and named/tagged with the time of archiving. These can be easily searched for individual records, typically using "zcat | grep". Each line consists of an 'original' version and a set of diffs for each following version.

… think.

…ining linearity.

…-history.

andersju

Nice, LGTM but see comments! Additionally:

I suggest moving cutoff time (line 71), batch size (73), and runtime (210) to constants at the top. Maybe also the checkpoint threshold (155).
logger.info walPath on trigger, or at least archiveRoot on startup?

housekeeping/src/main/groovy/whelk/housekeeping/HistoryArchiver.java

whelk-core/src/test/groovy/whelk/diff/LinearitySpec.groovy

housekeeping/src/main/groovy/whelk/housekeeping/HistoryArchiver.java

jannistsiroyannis · 2026-01-28T12:08:43Z

Nice, LGTM but see comments! Additionally:

* I suggest moving cutoff time (line 71), batch size (73), and runtime (210) to constants at the top. Maybe also the checkpoint threshold (155).

* logger.info walPath on trigger, or at least archiveRoot on startup?

Thank you! Excellent feedback! ⭐ I agree with and have fixed all of the issues, except the moving of values to the top. I find this to be useful only if the values are used more than once, otherwise i prefer them where they are used.

olovy

Requested change: We shouldn't delete the tombstone in lddb. We should be able to tell that the resource has existed even though we delete the history (possibly including the last version in lddb).

olovy · 2026-01-30T10:48:00Z

Is the added complexity of generating diffs worth it now that we archive the history to disk instead of trying to compact the history table?

jannistsiroyannis added 17 commits March 5, 2025 15:38

Add a first experimental json diff.

f44360b

First impl of json-patch. Still missing root target for some verbs, i…

ce91f59

… think.

Add a few linearity tests, to start with.

30066a9

Fix a Path-bug (in replace).

4ee9206

Additional test case

2f52f71

Real record test case.

acbcdad

Add explanation.

c3310c4

Make diff/patch handle the existence of null-values correctly, mainta…

955dd75

…ining linearity.

Merge branch 'develop' into feature/lxl-4524

ca9ceac

emergency commit, failing computer

26c8375

Temporary commit

5c0030a

Getting there.

d63a7e3

Working diff history writing.

9aee650

Progress towards history archiving.

0f26fb0

Backing out of compressed history, in favor of just archiving.

a17e04f

Revert changes made to Postgresql-component for the now scrapped diff…

4f6e95b

…-history.

Working history archiving.

7f45f2c

jannistsiroyannis requested review from andersju, kaipoykio, kwahlin and olovy January 27, 2026 13:28

andersju reviewed Jan 28, 2026

View reviewed changes

Fix review comments.

31e398e

andersju approved these changes Jan 28, 2026

View reviewed changes

olovy requested changes Jan 30, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature/lxl 4524 #1686

Feature/lxl 4524 #1686

Uh oh!

jannistsiroyannis commented Jan 27, 2026

Uh oh!

andersju left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jannistsiroyannis commented Jan 28, 2026

Uh oh!

olovy left a comment

Uh oh!

olovy commented Jan 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Feature/lxl 4524 #1686

Are you sure you want to change the base?

Feature/lxl 4524 #1686

Uh oh!

Conversation

jannistsiroyannis commented Jan 27, 2026

Uh oh!

andersju left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jannistsiroyannis commented Jan 28, 2026

Uh oh!

olovy left a comment

Choose a reason for hiding this comment

Uh oh!

olovy commented Jan 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants