Conversation

@SpyCheese
Member

No description provided.

@github-actions

Blocking Issues

  • validator/import-db-slice-local.cpp:551-560 – The new pipelined path calls Db::add_handle_to_archive (which writes the handle into the archive index and flips the handle_moved_to_archive flag) before we know that the block will actually be applied (apply_block_async_3 / ValidatorManager::new_block). If new_block later rejects the block (bad state, timeout, etc.), apply_blocks_async propagates the failure and the importer exits, but by then the archive already contains the new handle even though the block was never applied or fully archived. On the next retry the handle is seen as “already moved”, so BlockArchiver skips the add‑handle step, leaving an inconsistent archive (no files, moved_to_archive == false, but the LT index already advanced). Previously the add‑handle happened inside BlockArchiver, i.e. only after new_block triggered archiving and only if that succeeded. We need either to defer add_handle_to_archive until new_block completes, or to roll back the handle/lt‑db entry when a later phase fails; otherwise a transient failure strands half‑archived handles and the import can’t be retried cleanly.

  • validator/import-db-slice-local.cpp:99-104 – process_package now silently swallows any FileReference::create error and simply does return true;. Before this change we set S = F.move_as_error() and aborted the package, so the caller logged which file name was malformed or corrupted. With the new behaviour a broken entry (e.g. a truncated proof filename) is skipped without any warning, the package reports success, and the importer only fails much later with a generic “no block data/proof” error – or worse, if the file was optional, the block is silently missing. At minimum we should keep surfacing the parsing error so operators know the archive on disk is corrupted; silently ignoring unknown file names makes diagnosing archive issues almost impossible.
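The shape of the fix can be sketched in miniature. Note that `parse_filename` and `process_entry` below are illustrative names, not the actual TON API: the point is only that a parse failure must reach the caller instead of being swallowed with `return true;`.

```cpp
#include <cassert>
#include <string>
#include <variant>

// Illustrative stand-in for FileReference::create: on success returns a
// parsed reference, on failure an error string the caller can log and
// turn into a hard failure for the whole package.
struct FileRef {
  std::string name;
};

std::variant<FileRef, std::string> parse_filename(const std::string& raw) {
  if (raw.empty() || raw.find('\0') != std::string::npos) {
    return std::string("malformed archive entry name: '" + raw + "'");
  }
  return FileRef{raw};
}

// Sketch of the caller: surface the parse error instead of silently
// skipping the entry and reporting package success.
bool process_entry(const std::string& raw, std::string& error_out) {
  auto r = parse_filename(raw);
  if (auto* err = std::get_if<std::string>(&r)) {
    error_out = *err;  // propagate instead of `return true;`
    return false;
  }
  return true;
}
```

With this pattern the importer fails at the broken entry with a concrete file name, rather than much later with a generic missing-data error.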

@github-actions

No blocking bugs jumped out from this diff. The refactoring is large but the new coroutine‑based importer, async archiving changes, and option plumbing all look internally consistent, and I didn’t spot a correctness regression in the code paths I inspected.

Residual risks / things to double‑check:

  • validator/import-db-slice-local.cpp:551 – apply_block_async_2 explicitly states that add_handle must be performed in order for the LT index updates to stay monotonic, but the current implementation launches every Db::add_handle_to_archive through td::actor::all with no serialization. If the actor scheduler ever reorders those calls, older seqnos could be treated as duplicates and skipped inside ArchiveSlice::add_handle.
  • validator/import-db-slice-local.cpp – the pipeline still calls ValidatorManager::new_block, which unconditionally runs Db::archive. Because the importer now pre‑archives each handle via Db::add_handle_to_archive, BlockArchiver still replays the entire proof/data copying step, defeating the “Optimize importing archive packages” goal and potentially doubling the I/O for each block. Consider short‑circuiting ValidatorManager::new_block when the handle is already archived to avoid undoing the optimization.
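The skip-on-reorder hazard in the first bullet can be modeled without any actor machinery. `LtIndex` below is a toy stand-in (our names, not TON interfaces) for an index that, like ArchiveSlice::add_handle, silently drops entries whose seqno is not strictly increasing; it shows why the adds must be serialized rather than launched through td::actor::all.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Toy model of a monotonic index: entries with a seqno at or below the
// last accepted one are treated as duplicates and skipped.
struct LtIndex {
  std::uint32_t last_seqno = 0;
  std::vector<std::uint32_t> accepted;
  void add_handle(std::uint32_t seqno) {
    if (seqno <= last_seqno) {
      return;  // looks like a duplicate; silently dropped
    }
    last_seqno = seqno;
    accepted.push_back(seqno);
  }
};

// Feeding handles one at a time, in seqno order (the serialized
// equivalent of awaiting each add before starting the next), guarantees
// nothing is dropped; any reordering can lose entries.
std::size_t add_in_order(LtIndex& idx, const std::vector<std::uint32_t>& seqnos) {
  for (auto s : seqnos) {
    idx.add_handle(s);
  }
  return idx.accepted.size();
}
```

Delivering {1, 2, 3} in order accepts all three; a scheduler-style reordering to {2, 1, 3} silently loses seqno 1.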

@SpyCheese marked this pull request as draft on December 15, 2025, 14:58
@github-actions

  • validator/import-db-slice-local.cpp: the new coroutine flow drops every td::Result returned by the async helpers. run_inner() co_awaits process_masterchain_blocks, process_shard_blocks, store_data, and apply_blocks without unwrapping the result, so any error from these steps is ignored and the importer will still report success. Use TRY_STATUS/TRY_RESULT (or co_await ... .wrap() and check) to propagate failures instead of discarding them.
  • validator/import-db-slice-local.cpp: try_advance_shard_client_seqno() now returns td::actor::Task<bool>, but process_shard_blocks() loops with while (co_await try_advance_shard_client_seqno()). Task<T>::await_resume yields td::Result<T>, not T, so the loop condition never inspects the returned bool (and may not even compile). Even if it compiles, it will treat any OK result as true and can spin forever when the function returns false. The result needs to be unwrapped (TRY_RESULT(bool advanced, co_await …)) and the contained bool should drive the loop.
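The second bullet is easiest to see with a stripped-down stand-in for td::Result (no coroutines; the names here are ours): the contained bool, not the Result wrapper, must drive the loop, otherwise any OK result keeps it spinning.

```cpp
#include <cassert>

// Minimal stand-in for td::Result<T>: either a value or an error.
template <class T>
struct Result {
  bool ok;
  T value;
  bool is_ok() const { return ok; }
  T move_as_ok() { return value; }
};

// Correct pattern: unwrap the Result first (TRY_RESULT in the real
// code), then let the contained bool decide whether to continue.
// Using the Result itself as the condition would treat every OK
// result as "keep going", even one that carries `false`.
int drain(int remaining) {
  int steps = 0;
  auto try_advance = [&]() -> Result<bool> {
    if (remaining > 0) {
      --remaining;
      return {true, true};   // OK, made progress
    }
    return {true, false};    // OK, but no progress: loop must stop
  };
  while (true) {
    Result<bool> r = try_advance();
    if (!r.is_ok()) {
      break;  // real code would propagate the error here
    }
    if (!r.move_as_ok()) {
      break;  // the contained bool terminates the loop
    }
    ++steps;
  }
  return steps;
}
```

With the unwrapped bool in control, `drain` terminates after exactly as many steps as there is progress to make.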

@github-actions

No blocking correctness issues jumped out while reviewing the diff.

  • The only behavioral change that feels risky is in validator/import-db-slice-local.cpp: process_package now ignores FileReference::create failures (previously it treated them as fatal), so archives with malformed filenames or corrupted entries will now be accepted and the offending files silently skipped. That could hide broken packages and lead to partial imports without an error. If the intent wasn’t to relax validation, consider keeping a failure path (or at least logging the skipped entries).

If you want, I can add a small guard/log around that spot to keep corrupted archives from being treated as successful imports.

@github-actions

  • validator/import-db-slice-local.cpp:561-577: in the permanent celldb path we now apply blocks via apply_blocks_async_*. store_data() writes block data while the handle isn’t marked as moved-to-archive, so ArchiveManager::add_file stores it in temp packages and never sets moved_to_archive. apply_blocks_async_2 later only calls add_handle_to_archive, which sets the handle-moved flag but still doesn’t set moved_to_archive. By the time apply_block_async_4 runs, moved_to_archive() remains false for freshly imported blocks, so the new CHECK(handle->moved_to_archive()) will trip and abort every import when permanent_celldb is enabled. You likely need to move the data to the final archive (or set the flag) before this check or drop the assertion.

@github-actions

  • validator/import-db-slice-local.cpp:489-519 (permanent celldb path) applies all masterchain/shard blocks in parallel. The tasks_3 batch calls ValidatorManager::new_block for every handle concurrently, so masterchain blocks can reach the manager out of order. The manager expects strict sequencing (CHECK(seqno == last_masterchain_seqno_ + 1) in validator/manager-disk.cpp:830-840) and will crash or corrupt state if a later masterchain block is delivered first. Masterchain blocks need to be fed to new_block sequentially (or otherwise ordered) when importing in permanent mode.
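A minimal model of the sequencing invariant makes the crash mode concrete. `Manager` and `new_block` here are simplified stand-ins for the real manager (not the actual interfaces): the CHECK in manager-disk.cpp corresponds to the throw below, and only strictly ordered delivery satisfies it.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <stdexcept>
#include <vector>

// Toy model of the manager-side invariant: new_block aborts unless
// masterchain seqnos arrive in strict +1 order, mirroring
// CHECK(seqno == last_masterchain_seqno_ + 1).
struct Manager {
  std::uint32_t last_masterchain_seqno = 0;
  void new_block(std::uint32_t seqno) {
    if (seqno != last_masterchain_seqno + 1) {
      throw std::runtime_error("seqno gap: the CHECK would abort here");
    }
    last_masterchain_seqno = seqno;
  }
};

// Sequential import: restore ascending seqno order before delivery,
// then feed blocks one at a time instead of in a parallel batch.
std::uint32_t import_sequentially(Manager& m, std::vector<std::uint32_t> seqnos) {
  std::sort(seqnos.begin(), seqnos.end());
  for (auto s : seqnos) {
    m.new_block(s);
  }
  return m.last_masterchain_seqno;
}
```

Delivering the same set of blocks concurrently (i.e. in whatever order the batch completes) can hand the manager a later seqno first and trip the check, which is why the masterchain handles need a serialized path.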
