Feature/lxl 4134 #1583

jannistsiroyannis · 2025-04-10T09:06:30Z

This changes Elastic indexing to occur strictly in order with postgres writes.

Every change is logged (in a table) with a sequence number (assigned within the postgres transaction), and indexing occurs by reading that log in the same order. There is still a tolerance for bad data, in the sense that any 400-response from Elastic will be error-logged but ignored. Other types of errors however (be they timeouts, broken connections, or whatever else) are NOT TOLERATED, and will be retried, in order (forever).

A big change for developers in this, is that the housekeeping module must now be running for any indexing to occur at all. It is however perfectly fine to start it up late (it will catch up on its own).

…fortunately) be tolerated. Other errors will still be retried in order.

olovy · 2025-04-10T12:38:37Z

whelk-core/src/main/groovy/whelk/component/PostgreSQLComponent.groovy

@@ -1624,24 +1658,21 @@ class PostgreSQLComponent {
    boolean saveVersion(Document doc, Connection connection, Date createdTime,


Could be changed to void ?

olovy · 2025-04-10T12:49:18Z

whelk-core/src/main/groovy/whelk/Whelk.groovy

@@ -505,7 +385,7 @@ class Whelk {
     */
    boolean storeAtomicUpdate(String id, boolean minorUpdate, boolean writeIdenticalVersions, String changedIn, String changedBy, UpdateAgent updateAgent) {
        Document preUpdateDoc = null


I believe this variable can be removed?

olovy · 2025-04-10T12:49:45Z

whelk-core/src/main/groovy/whelk/Whelk.groovy

@@ -524,13 +403,12 @@ class Whelk {
    void storeAtomicUpdate(Document doc, boolean minorUpdate, boolean writeIdenticalVersions, String changedIn, String changedBy, String oldChecksum) {
        normalize(doc)
        Document preUpdateDoc = storage.load(doc.shortId)


This variable is unused now and can be removed

olovy

Nice! LGTM!

Remaining possibility for small inconsistencies:
If an elastic request fails after increment/decerement reverse links they will be bumbed again when the whole document is retried. I think this is fine for now. Lots of added complexity to fix this?
The fix would be to reindex them instead of just updating the counters the second time around?

I would like to have a gauge for "seconds behind" in /metrics so that we can track it.
I can add that.

Interesting to see the transcribed (groovy -> java) indexing code. A bit more clunky but not that bad.

It is no longer relevant as indexing is no longer a direct effect of these processes, and so there is no speedup to be had.

jannistsiroyannis added 9 commits March 31, 2025 10:30

Add and populate the change-log table.

8ef9ec8

Untested progress

c9d1dde

Encode large numbers as json-text.

ced5e51

Move indexing out of whelk.

a03e179

First actual indexing. Retry queue must go, however.

5183dab

Working robust indexing. No vacation-mode yet.

6372aed

Add catch-up after reindexing.

da5048d

Restore handling of 400-class errors in ElasticSearch, these must (un…

8f29085

…fortunately) be tolerated. Other errors will still be retried in order.

Clean up

4814656

jannistsiroyannis requested review from andersju, kaipoykio, lrosenstrom, olovy and kwahlin April 10, 2025 09:06

Fix b0rken crud unit-tests.

4a84db4

olovy reviewed Apr 10, 2025

View reviewed changes

olovy approved these changes Apr 10, 2025

View reviewed changes

jannistsiroyannis and others added 8 commits April 11, 2025 10:32

Remove the 'skip index' options for whelktool and dataset loading.

ca4a006

It is no longer relevant as indexing is no longer a direct effect of these processes, and so there is no speedup to be had.

Fix indexing of new records

c5bd9fd

Add automatic cleaning of the change log.

f8c9067

Handle missing doc in updateReverseLinkCounter

1eb9f33

Fix some warnings

a8595ca

Add gauge for latency

39b5ecd

Fix inverted condition in hasChangedMainEntityId

0ed9fc8

Merge branch 'develop' into feature/lxl-4134

2c64b2b

olovy marked this pull request as draft May 14, 2025 13:51

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Feature/lxl 4134 #1583

Feature/lxl 4134 #1583

Uh oh!

jannistsiroyannis commented Apr 10, 2025

Uh oh!

olovy Apr 10, 2025

Uh oh!

olovy Apr 10, 2025

Uh oh!

olovy Apr 10, 2025

Uh oh!

olovy left a comment •

edited

Loading

Uh oh!

Uh oh!

		@@ -1624,24 +1658,21 @@ class PostgreSQLComponent {
		boolean saveVersion(Document doc, Connection connection, Date createdTime,

Feature/lxl 4134 #1583

Are you sure you want to change the base?

Feature/lxl 4134 #1583

Uh oh!

Conversation

jannistsiroyannis commented Apr 10, 2025

Uh oh!

olovy Apr 10, 2025

Choose a reason for hiding this comment

Uh oh!

olovy Apr 10, 2025

Choose a reason for hiding this comment

Uh oh!

olovy Apr 10, 2025

Choose a reason for hiding this comment

Uh oh!

olovy left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

olovy left a comment •

edited

Loading