Add genesis sync test to CI #7561

chong-he · 2025-06-05T04:29:09Z

Issue Addressed

Add e2e sync tests for lookup and range sync to CI #7550

Use existing code from @jimmygchen in #7530 and modify for genesis sync test. Thanks @jimmygchen !

…test

…ighthouse into add-genesis-sync-test

chong-he · 2025-06-06T00:57:33Z

This PR is ready for review. In this test, we use the minimal preset so that it is faster. The local testnet script is also updated to build for minimal:

The rebuild image -b true (link here) is required or Kurtosis wouldn't start

For 120s stop duration, we trigger short distance sync, the lookup sync. We see in the logs:

DEBUG Created block lookup                          peers: [PeerId("16Uiu2HAkzuo2s4zS8Ab1FVPU1GAue212NT3YhHVRgJwE9S1HQ12J")], block_root: 0x149f7747579266942795085564847f3e903de77524ddb59de3077fae5b238432, awaiting_parent: "none", id: 1, service: "lookup_sync"

For 300s stop duration, we trigger long distance sync, the range sync. We see in the logs:

DEBUG Syncing new finalized chain                   id: 1, component: "range_sync"
DEBUG Sync RPC request sent                         method: "BlocksByRange", slots: 8, epoch: 0, peer: 16Uiu2HAm2pJM6EUvzVtN5JwaKyXBx8DAyg9dSXrNPBc81UCP8WBp, id: 3/2/RangeSync/0/1, chain: 1, service: "range_sync"

All sync completed very fast, under 5s. So I reduce the polling time in the script to 0.5s so that we can see some outputs in the terminal, and set a shorter timeout of 5 minutes.

…ighthouse into add-genesis-sync-test

scripts/tests/genesis-sync.sh

jimmygchen · 2025-06-06T13:12:51Z

scripts/tests/genesis-sync.sh

+# ------------------------------------------------------
+# Interval for polling the /lighthouse/syncing endpoint for sync status
+# Reduce the polling time so that some progress can be seen
+POLL_INTERVAL_SECS=0.5


This is very frequent, I think it's not necessary and we could probably live with something like 5s, or 6s (1 poll per slot)? Don't forgot HTTP API requests goes through BeaconProcessor as well and it's Priority::P0, which means it will be prioritised over most tasks.

lighthouse/beacon_node/http_api/src/lib.rs

Line 4384 in 7e2df6b

task_spawner.blocking_json_task(Priority::P0, move || {

As discussed, the sync completed very fast, like under 2s, e.g., for sync-electra-300s, we see from the logs for cl-3-lighthouse-geth:

Jun 06 06:33:34.585 DEBUG Range request block roots retrieved req_type: "BlocksByRange", start_slot: 8, req_count: 8, roots_count: 8, source: "store", elapsed: 89.035µs, finalized_slot: 32 Jun 06 06:33:34.774 DEBUG Range request block roots retrieved req_type: "BlocksByRange", start_slot: 16, req_count: 8, roots_count: 8, source: "store", elapsed: 40.275µs, finalized_slot: 32 Jun 06 06:33:34.774 DEBUG Range request block roots retrieved req_type: "BlocksByRange", start_slot: 24, req_count: 8, roots_count: 8, source: "store", elapsed: 61.705µs, finalized_slot: 32

and the sync is completed about 0.2-0.3 second later:

Jun 06 06:33:35.000 INFO Synced peers: "3", exec_hash: "0xa95848dca51fbedfce6f84bd2d7f2df0b459a5672c8c8fb118ad44b321170f2d (unverified)", finalized_root: 0x57274ba74a8a091caa527a1af01269489c333da085503a0ae76d3534ecbe03d0, finalized_epoch: 4, epoch: 6, block: "0x2fb201cc9202c794b31212e49d8c828274fa76669db0453993bd8650a19dad10", slot: 52

So if we poll every 5s, we may not see any output, unless we increase the sync distance (stop the node for longer periods of time

jimmygchen · 2025-06-06T13:21:18Z

scripts/tests/genesis-sync.sh

+    echo "${node_type} status: ${status:-Unknown}"
+
+    # The test is complete when the node is synced
+    if [ "$status" = "Synced" ]; then


I removed the "Synced" check in the checkpoint sync script, because of the issue we talked about on the call - there's a bug and LH may switch to Synced even though it's not.

But I guess for forward sync this is fine, as it should only switch to this status if we're within certain distance. If this doesn' work well, we could consider using the standard node/syncing API, as it also returns the sync distance, which provides a bit more certainty.

I looked at the sync-electra-300s logs, and realised it didn't even start syncing, i suspect LH is reporting Synced status too early. Would it be worth looking at the sync distance?

For forward sync it looks ok? This is the output for sync-electra-300s:
https://github.com/sigp/lighthouse/actions/runs/15484117320/job/43595585222?pr=7561

We can increase the poll time to 1-2s if that's better?

I have also tested using the beacon API /eth/v1/node/syncing, the output could look like this (also with 0.5s poll time):

olling sync status until nodes are synced or timeout of 5 mins supernode status: Synced (head_slot: 51, sync_distance: 0) supernode completed sync in 1 seconds fullnode status: No response or null response fullnode status: No response or null response fullnode status: No response or null response fullnode status: Syncing (head_slot: 23, sync_distance: 28, is_optimistic: true, el_offline: false) fullnode status: Synced (head_slot: 51, sync_distance: 0) fullnode completed sync in 3 seconds Genesis sync test complete! Both supernode and fullnode have synced successfully. Supernode time: 1 seconds Fullnode time: 3 seconds

It has the head_slot (which I believe is the same as target_slot in the lighthouse/syncing endpoint (the beacon API also has the some other info in the response like el_offline and is_optimistic). If the beacon API is better, I can switch to that

chong-he added 3 commits June 5, 2025 11:48

add genesis test

2430a42

update time

c0a364c

Add CI

b7056a4

chong-he added test improvement Improve tests work-in-progress PR is a work-in-progress syncing infra-ci labels Jun 5, 2025

chong-he and others added 14 commits June 5, 2025 14:02

exclude jobs

c7607f8

Merge remote-tracking branch 'origin/unstable' into add-genesis-sync-…

3c858de

…test

merge unstable

ff7c910

exclude genesis test

70b50b9

revise

725ec9c

yml file

3d7e83c

Merge branch 'unstable' into add-genesis-sync-test

8ef5283

update script

cb643bf

Merge branch 'add-genesis-sync-test' of https://github.com/chong-he/l…

0f4f6b1

…ighthouse into add-genesis-sync-test

minimal

e44df3c

minimal

04093d2

-b false no rebuild image

8c0a3ea

have to rebuild image

635fbe6

Merge branch 'unstable' into add-genesis-sync-test

67de688

chong-he added ready-for-review The code is ready for review and removed work-in-progress PR is a work-in-progress labels Jun 6, 2025

chong-he requested a review from jimmygchen June 6, 2025 00:58

chong-he added 3 commits June 6, 2025 09:12

delete comment

56a4b2c

Merge branch 'add-genesis-sync-test' of https://github.com/chong-he/l…

61b496e

…ighthouse into add-genesis-sync-test

rebuild image

01f2812

jimmygchen reviewed Jun 6, 2025

View reviewed changes

scripts/tests/genesis-sync.sh Outdated Show resolved Hide resolved

change to -b false

8254067

chong-he added 4 commits June 6, 2025 13:55

add spec-minimal to build Lighthouse

3f76dbe

features spec-minimal

020da9f

docker spec-minimal

d7b0926

remove test-suite change

da06778

jimmygchen reviewed Jun 6, 2025

View reviewed changes

jimmygchen added waiting-on-author The reviewer has suggested changes and awaits thier implementation. and removed ready-for-review The code is ready for review labels Jun 8, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add genesis sync test to CI #7561

Add genesis sync test to CI #7561

Uh oh!

chong-he commented Jun 5, 2025

Uh oh!

chong-he commented Jun 6, 2025

Uh oh!

Uh oh!

jimmygchen Jun 6, 2025

Uh oh!

chong-he Jun 9, 2025

Uh oh!

jimmygchen Jun 6, 2025

Uh oh!

jimmygchen Jun 6, 2025 •

edited

Loading

Uh oh!

chong-he Jun 9, 2025 •

edited

Loading

Uh oh!

Uh oh!

Add genesis sync test to CI #7561

Are you sure you want to change the base?

Add genesis sync test to CI #7561

Uh oh!

Conversation

chong-he commented Jun 5, 2025

Issue Addressed

Uh oh!

chong-he commented Jun 6, 2025

Uh oh!

Uh oh!

jimmygchen Jun 6, 2025

Choose a reason for hiding this comment

Uh oh!

chong-he Jun 9, 2025

Choose a reason for hiding this comment

Uh oh!

jimmygchen Jun 6, 2025

Choose a reason for hiding this comment

Uh oh!

jimmygchen Jun 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

chong-he Jun 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

jimmygchen Jun 6, 2025 •

edited

Loading

chong-he Jun 9, 2025 •

edited

Loading