Conversation

@AlexandruCihodaru
Contributor

Add support for handling blockchain revert. This is useful in testing.

Changes:

  • Add ChainEvent::Reverted variant to represent backward blockchain progression
  • Implement handle_reverted() method that:
    • Collects transactions from retracted blocks via included_transactions cache or by fetching block bodies from the API
    • Removes all views beyond the revert point to prevent zombie views
    • Removes included transactions from mempool (they can be resubmitted later)
    • Updates enactment state (recent_finalized_block and recent_best_block)
    • Ensures a valid view exists at the revert target block
  • Add early return in maintain() for Reverted events to prevent normal forward-progression logic from running

These changes fix cases where reverting would leave zombie views in the view store, causing problems in subsequent operations.

Note: Transactions that were pending may not be visible after the revert if they fail revalidation.
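
For reference, a minimal sketch of the shape this takes. These are standalone, simplified types: the real ChainEvent in sc-transaction-pool-api is generic over the block type, and the names below follow this description rather than the final code.

  // Hypothetical, simplified sketch of the new event variant and the early
  // return in maintain() described above.
  enum ChainEvent<Hash> {
      NewBestBlock { hash: Hash },
      Finalized { hash: Hash },
      /// New variant: the chain moved backwards; `hash` is the new (older) head.
      Reverted { hash: Hash },
  }

  fn handle_reverted(_new_head: u64) {
      // collect txs from retracted blocks, remove zombie views,
      // update enactment state, ensure a valid view at the revert target
  }

  fn maintain(event: ChainEvent<u64>) {
      if let ChainEvent::Reverted { hash } = event {
          handle_reverted(hash);
          return; // skip the normal forward-progression logic below
      }
      // ... NewBestBlock / Finalized handling as before ...
  }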


Signed-off-by: Alexandru Cihodaru <[email protected]>
@AlexandruCihodaru
Contributor Author

/cmd fmt

@AlexandruCihodaru
Contributor Author

/cmd prdoc --audience runtime_dev --bump patch

@michalkucharczyk
Contributor

michalkucharczyk commented Nov 28, 2025

DQ:

          D1-E1-F1-G1-..-X1
         /
A - B - C - D2-E2-F2-G2-..-X2
         \
          D3-E3-F3-G3-..-X3

Is this a realistic scenario? Should we handle it properly? If we revert from X1 to B, should we also remove all transactions included on the D2-...-X2 and D3-...-X3 forks (as we do for D1-...-X1)?

If we assume that revert can only be called when there is a single fork, should we somehow check this in the handle_revert function (or at least document it)?

@AlexandruCihodaru
Contributor Author

DQ:

          D1-E1-F1-G1-..-X1
         /
A - B - C - D2-E2-F2-G2-..-X2
         \
          D3-E3-F3-G3-..-X3

Is this a realistic scenario? Should we handle it properly? If we revert from X1 to B, should we also remove all transactions included on the D2-...-X2 and D3-...-X3 forks (as we do for D1-...-X1)?

If we assume that revert can only be called when there is a single fork, should we somehow check this in the handle_revert function (or at least document it)?

Excellent question. I think that in anvil it is not possible to have such a scenario, but I believe we should delete the transactions on all possible paths.

@iulianbarbu
Contributor

iulianbarbu left a comment

Looks good in general. It would be great to capture in the event the idea of reverting all existing forks, so that this logic is applicable not just to single-chain nodes like anvil-polkadot - but tbh I'm not sure how difficult that is.

@AlexandruCihodaru
Contributor Author

/cmd fmt

@AlexandruCihodaru added the T0-node ("This PR/Issue is related to the topic 'node'.") label on Dec 2, 2025
@re-gius
Contributor

re-gius commented Jan 23, 2026

Changes applied to the handle_reverted() implementation in #10867

  1. Always create a fresh view at the new head by populating it from the current mempool state. We do not rely on stale views because:
    - Old views may contain transactions from now-reverted blocks, but we want to remove those
    - Old views won't contain transactions submitted after the view was created, but we want to include those
  2. Atomic view removal with race-condition prevention (sketched after this list):
    - Step 5 now holds all view locks simultaneously (following the same pattern as ViewStore::insert_new_view_sync)
    - We also abort the view removal if no view exists at the new head, to avoid an inconsistent state
  3. Updated documentation
  4. Added view-state assertions to tests: fatp_revert_multiple_blocks_does_not_resubmit now verifies that a view exists at the new head and that reverted views are removed
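
A minimal sketch of the lock-all-then-mutate pattern from point 2, using hypothetical stand-in types (the real ViewStore keeps views in a map keyed by block hash and differs in detail):

  use std::sync::{Arc, Mutex, MutexGuard};

  struct View { at: u64 } // hypothetical stand-in for a per-block view

  fn remove_views_above(views: &[Arc<Mutex<Option<View>>>], revert_to: u64) {
      // Acquire every lock first, in a fixed order, so removal is atomic with
      // respect to concurrent view insertion (no deadlock as long as all
      // writers take the locks in the same order).
      let mut guards: Vec<MutexGuard<'_, Option<View>>> =
          views.iter().map(|v| v.lock().unwrap()).collect();
      // Only then mutate: no reader can observe a half-cleaned state where
      // some zombie views are gone and others still linger.
      for guard in guards.iter_mut() {
          if guard.as_ref().is_some_and(|view| view.at > revert_to) {
              **guard = None; // drop the zombie view beyond the revert point
          }
      }
  }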

I guess this PR is ready for review now

@michalkucharczyk
Contributor

michalkucharczyk commented Jan 26, 2026

We had some offline discussions with @AlexandruCihodaru regarding this PR. I am leaving here the main concerns I still have about it:

1. Inconsistent handling of included vs. in-pool transactions

The current implementation removes only transactions that were included in reverted blocks. Transactions still in the ready/future queues are left untouched, even though they may have been submitted at the same time.

  Example:
  Block N-1: submit tx0, tx1, tx2 (all ready, prio: tx2 > tx1 > tx0, txs are "heavy")
  Block N:   tx2 included (InBlock)
  Block N+1: tx1 included (InBlock)

  Revert to N-1

After revert:

  • tx1, tx2 -> removed (they were included in reverted blocks)
  • tx0 -> stays in pool (was never included)

This behavior is inconsistent and hard to understand. It is impossible to control which transactions we have in the pool.

I would propose providing an explicit API for removal - decouple "remove transactions" from "revert chain" (they are orthogonal operations), giving node builders the flexibility to implement their desired behavior.
The new method would be "removal without banning", so transactions can be resubmitted after reverting.
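
A sketch of the proposed shape, with made-up minimal types - the key point is what the body does not do: nothing is ever added to the ban list.

  use std::collections::HashSet;

  type TxHash = [u8; 32];

  struct Pool {
      ready: HashSet<TxHash>,
      banned: HashSet<TxHash>,
  }

  impl Pool {
      /// Removal without banning: the caller decides which transactions go,
      /// and they remain resubmittable afterwards.
      fn remove_transactions(&mut self, hashes: &[TxHash]) -> Vec<TxHash> {
          hashes
              .iter()
              .filter(|h| self.ready.remove(*h)) // removed from the pool...
              .copied()                          // ...but `banned` is untouched
              .collect()
      }
  }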

2. Missing Dropped event for watchers

When using submit_and_watch, the watcher may hang indefinitely if a transaction is silently removed after reversal:

  watcher = pool.submit_and_watch(tx);
  pool.maintain(NewBlock(N, [tx]));  
  // ...
  // watcher receives InBlock event
  // ...
  pool.maintain(Revert(N-1));        
  // tx removed, but watcher never notified => hangs forever

The event flow Ready -> InBlock -> Dropped is valid and should be emitted when transactions are removed due to a revert.
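
A minimal stand-in for a watcher consumer makes the hang concrete (local types; the real watcher is a stream of TransactionStatus values):

  use std::sync::mpsc::Receiver;

  enum Status { Ready, InBlock(u64), Dropped }

  // Without a terminal event after the revert, this loop blocks in recv() forever.
  fn watch(events: Receiver<Status>) {
      while let Ok(event) = events.recv() {
          match event {
              Status::Ready => println!("ready"),
              Status::InBlock(block) => println!("in block #{block}"),
              Status::Dropped => break, // terminal: revert cleanup must emit this
          }
      }
  }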

3. Better documentation of behavior needed

Whatever behavior is chosen, it should be documented (perhaps in ChainEvent::Reverted docs) so users know:

  • Which transactions get removed on revert,
  • Whether they need to resubmit pending transactions,
  • What events to expect from watchers,

Possible approaches

  1. Remove only included transactions (current) - working, but inconsistent and hard to control,
  2. Remove all transactions (included + ready + future) - but this requires resubmission (which is probably intended),
  3. Provide an explicit API for removal - decouple "remove transactions" from "revert chain", giving node builders the flexibility to implement their desired behavior.

I think approach 3 is the cleanest.

@bkchr
Member

bkchr commented Jan 26, 2026

@michalkucharczyk hadn't we discussed some time ago that it would be the simplest to just re-create the tx pool? So, not requiring any of this code here?

@michalkucharczyk
Contributor

@michalkucharczyk hadn't we discussed some time ago that it would be the simplest to just re-create the tx pool? So, not requiring any of this code here?

Could be a solution, but it may have some limitations, e.g.:

B0->B1->B2->B3
  • reverting to B2 would "kill" view for B1, so you would not be able to build a block on top of it.
  • killing the pool means you need to resubmit all transactions,

It depends on the requirements the manual-seal / anvil node has for the reverting mechanism. Honestly, I am not sure how they should work, so I am trying to build a mechanism that is flexible enough to handle different scenarios and does not pull anvil-node specifics into the generic pool.

@re-gius
Contributor

re-gius commented Jan 27, 2026

New changes from 121a9be

Decoupled the transaction removal from the chain-revert handling. Now:

  1. ChainEvent::Reverted only handles view management:
    - Removes views beyond the revert point
    - Creates a fresh view at the new head
    - Does NOT touch the mempool
  2. remove_transactions() is a separate API for explicit transaction removal without banning:
    - Node builders call it when/if they want to remove specific transactions
    - Gives flexibility: remove all, remove only reverted-block txs, or keep everything

Implementation Details

  • Dependents are notified: unlike report_invalid, we emit Dropped events for dependent transactions to prevent their watchers from hanging indefinitely.
  • Dependents' hashes are not returned: exactly like report_invalid, we only return the hashes of transactions that were also in the input list of transactions to remove (see the sketch below).
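
A self-contained sketch of that behavior, with hypothetical minimal types (the real pool tracks dependencies via provided/required tags and uses a real event type, not strings):

  use std::collections::{HashMap, HashSet};
  use std::sync::mpsc::Sender;

  type TxHash = u64;

  struct Pool {
      in_pool: HashSet<TxHash>,
      dependents: HashMap<TxHash, Vec<TxHash>>, // tx -> txs requiring its tags
      watchers: HashMap<TxHash, Sender<&'static str>>,
  }

  impl Pool {
      fn remove_transactions(&mut self, to_remove: &[TxHash]) -> Vec<TxHash> {
          let mut queue = to_remove.to_vec();
          let mut removed = HashSet::new();
          while let Some(hash) = queue.pop() {
              if !self.in_pool.remove(&hash) {
                  continue; // unknown or already processed
              }
              removed.insert(hash);
              if let Some(deps) = self.dependents.remove(&hash) {
                  queue.extend(deps); // cascade removal to dependents
              }
              if let Some(watcher) = self.watchers.remove(&hash) {
                  let _ = watcher.send("Dropped"); // dependents get notified too
              }
          }
          // Like report_invalid: only hashes from the caller's list are
          // returned, even though dependents were removed (and notified) too.
          to_remove.iter().filter(|h| removed.contains(*h)).copied().collect()
      }
  }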

@michalkucharczyk what do you think of this new implementation?

@bkchr
Member

bkchr commented Jan 27, 2026

  • reverting to B2 would "kill" view for B1, so you would not be able to build a block on top of it.

  • killing the pool means you need to resubmit all transactions,

You can just rebuild B1 by sending all the transactions again. I would assume that the anvil stuff still has all the transactions. Right now this pull request is trying to add a feature that is never used for normal operations and will never be used for them.

@michalkucharczyk
Contributor

Technically you would need to "import a block" (call the pool's maintain) and resubmit transactions, but this is a detail.

I see your point: if we can get all the functionality and avoid new complexity in the code, I am all in :). The question is whether the anvil node would be happy with this - I don't know the answer…

Maybe @AlexandruCihodaru or @alindima can comment on this proposal.

@michalkucharczyk
Contributor

Also, if we want to have the anvil node in our contracts (reliability) toolset, then it becomes a normal-like operation: we need to support it, test it, etc…

@bkchr
Member

bkchr commented Jan 27, 2026

Also, if we want to have the anvil node in our contracts (reliability) toolset, then it becomes a normal-like operation: we need to support it, test it, etc…

By "normal operation" I meant anything that you need to run a blockchain network. This here is for testing. I'm not saying that this is not important, but if we can achieve the same results if we don't need to modify the internals of tx pool, we should not do this. Just adds more complexity that we can move closer to where it is needed (anvil node).

@alindima
Contributor

Chain reversion is something that can happen in polkadot under "normal" protocol operation (although in the case of disputes, which are exceptional). I remember @sandreim mentioning some issues with the txpool on reversion as well (on some polkadot-based network or locally, not on an anvil-based instance).

I would assume that the anvil stuff still has all the transactions.

No, anvil uses the txpool from substrate; it does not have a wrapper over it. Of course, we could have implemented our own txpool, but there was no good reason for it at the time (the substrate txpool looked flexible enough for our use case). Since chain reversion is something that needs to work regardless of whether or not anvil uses it, I'd much rather solve this problem for good here than add a reimplementation to work around it.

@michalkucharczyk
Contributor

I think the point is to re-use the existing txpool in an anvil-specific pool wrapper. The inner pool could be dropped and a new instance of the substrate pool created as the inner one when a reversion happens.

I don't have enough information about anvil-node scenarios to judge if this approach is feasible, and if all scenarios can be covered.

The ultimate goal is to reduce complexity in the existing pool.
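
A sketch of that wrapper idea, with made-up stand-in types (InnerPool here stands in for the real substrate pool; the real constructor and submit signatures differ):

  type Hash = u64;
  type Extrinsic = Vec<u8>;

  // Stand-in for the real substrate pool; only what the sketch needs.
  struct InnerPool { at: Hash, txs: Vec<Extrinsic> }

  impl InnerPool {
      fn new(at: Hash) -> Self { Self { at, txs: Vec::new() } }
      fn submit(&mut self, xt: Extrinsic) { self.txs.push(xt); }
  }

  // Anvil-specific wrapper: the inner pool is disposable, the wrapper is the
  // source of truth for everything ever submitted.
  struct PoolWrapper {
      inner: InnerPool,
      seen: Vec<Extrinsic>,
  }

  impl PoolWrapper {
      fn submit(&mut self, xt: Extrinsic) {
          self.seen.push(xt.clone());
          self.inner.submit(xt);
      }

      fn on_revert(&mut self, new_head: Hash) {
          self.inner = InnerPool::new(new_head); // drop old pool, build fresh one
          for xt in &self.seen {
              self.inner.submit(xt.clone()); // replay; a filter could go here
          }
      }
  }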

@paritytech-workflow-stopper

All GitHub workflows were cancelled due to a failure in one of the required jobs.
Failed workflow url: https://github.com/paritytech/polkadot-sdk/actions/runs/21433377790
Failed job name: cargo-clippy

@bkchr
Member

bkchr commented Jan 29, 2026

Chain reversion is something that can happen in polkadot under "normal" protocol operation (although in the case of disputes, which are exceptional). I remember @sandreim mentioning some issues with the txpool on reversion as well (on some polkadot-based network or locally, not on an anvil-based instance).

Chain reversions are not happening under normal protocol operation. We fork if an invalid candidate is found by approval voting. But we do not revert; in particular, we do not revert the finalized chain.

@sandreim
Contributor

sandreim commented Jan 29, 2026

Chain reversion is something that can happen in polkadot under "normal" protocol operation (although in the case of disputes, which are exceptional). I remember @sandreim mentioning some issues with the txpool on reversion as well (on some polkadot-based network or locally, not on an anvil-based instance).

Chain reversions are not happening under normal protocol operation. We fork if an invalid candidate is found by approval voting. But we do not revert; in particular, we do not revert the finalized chain.

The context is that some block was dropped on the RC (never backed on chain).

"revert" indeed is not the right word here, forking is accurate, but the percieved effect by the user is that the chain state has been reverted up to the blockheight that we start building the fork on.

What I was expecting to happen is that the transactions that were included in the abandoned fork are included in the new one.

@bkchr
Member

bkchr commented Jan 29, 2026

What I was expecting to happen is that the transactions that were included in the abandoned fork are included in the new one.

That is happening and if not, it is a bug :) Maybe directly try the forkaware transaction pool, but this should only be required on parachains. For the relay chain the normal tx pool should handle it correctly.

But ahh, yeah, for the fork case in Polkadot we will not change the best chain until we have a longer/better chain than the old one. So, the old tx pool will not insert the transactions. If you use the forkaware tx pool, it should fix this behavior.

@sandreim
Contributor

That is happening and if not, it is a bug :) Maybe directly try the forkaware transaction pool, but this should only be required on parachains. For the relay chain the normal tx pool should handle it correctly.

But ahh, yeah, for the fork case in Polkadot we will not change the best chain until we have a longer/better chain than the old one. So, the old tx pool will not insert the transactions. If you use the forkaware tx pool, it should fix this behavior.

I was using the FATP and I was hitting this on every session boundary (when we clearly drop blocks).

@bkchr
Copy link
Member

bkchr commented Jan 29, 2026

I was using the FATP and I was hitting this on every session boundary (when we clearly drop blocks).

https://github.com/paritytech/polkadot-sdk/issues/new/choose and ping @michalkucharczyk :D

@re-gius
Contributor

re-gius commented Jan 29, 2026

After reading your comments and investigating the original anvil implementation in more detail, I propose we simplify this PR to keep only what's truly fundamental for the anvil revert methods, namely the handle_reverted logic.
What we need is cleaning up zombie views and updating most_recent_view properly - everything that's currently inside the handle_reverted method.
As for removing transactions or manipulating the mempool, we don't necessarily need it, both because the original anvil does not restore removed txns in the mempool when reverting, and because this adds more complexity to the transaction pool.

What do you think? @bkchr @michalkucharczyk

@bkchr
Member

bkchr commented Jan 29, 2026

What we need is cleaning up zombie views and updating most_recent_view properly - everything that's currently inside the handle_reverted method.

But wouldn't this be solved by just recreating the tx pool?

@re-gius
Contributor

re-gius commented Jan 29, 2026

But wouldn't this be solved by just recreating the tx pool?

That would be technically possible. We would need to copy all transactions from the mempool and carry them to the new pool. I can try implementing it directly in anvil-polkadot.

The remaining issue that won't be solved is that watchers from submit_and_watch may be orphaned and hang forever waiting for notifications. We may still accept this behavior for a dev/testing tool, but it's a bug.

@re-gius
Contributor

re-gius commented Jan 29, 2026

I dove deeper into anvil-polkadot, and the "recreate pool on revert" approach is quite involved: it affects several functionalities and has a couple of bugs. These are:

  1. submit_and_watch listeners are orphaned: after the old pool is dropped, clients waiting for transaction updates will hang or timeout.
  2. Lost pool metadata on resubmit: when we replay transactions on the new pool to mimic Anvil, we only have the raw extrinsic bytes. So we lose the TransactionSource (all become Local), the watch status, and the old internal pool status - this is probably unexpected for anvil users (?)

Moreover, all stream subscribers require an explicit refresh/reset: the mining engine needs to be refreshed, pending transaction filters need to be recreated... This is not a bug, but more of a tedious operation.

After all that, I still believe that supporting some basic revert functionality in the Substrate transaction pool remains useful for Anvil and for whoever needs a bug-free revert on a Substrate chain.
EDIT: the remove_transactions logic, however, is not necessary; it's just a nice-to-have that allows flexible transaction-removal policies on revert.

@michalkucharczyk
Contributor

Hm, you should also intercept submit / submit_and_watch in the wrapper. You can also intercept the listener, right? So an orphaned listener should not be a problem.

I don't think the transaction source is a problem. You want to resubmit transactions (to the new inner pool) that were previously submitted to the wrapper, right?

@re-gius
Contributor

re-gius commented Jan 30, 2026

Hm, you should also intercept submit / submit_and_watch in the wrapper. You can also intercept the listener, right? So an orphaned listener should not be a problem.

I don't think the transaction source is a problem. You want to resubmit transactions (to the new inner pool) that were previously submitted to the wrapper, right?

You're right. I can intercept all submissions and store the metadata in the wrapper, and I can also adapt all relevant streams (like the pending transaction filters and the mining engine) to check for pool recreation at each poll.

So, in the end, it's technically possible to build a bug-free tx pool wrapper to support reverts, but it's more error-prone and significantly more complex than allowing a basic revert in the Substrate pool. If you think that revert complexity doesn’t justify changing Substrate, I’m fine limiting the changes to Anvil. Please let me know which direction you prefer.
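
For illustration, a sketch of that interception bookkeeping (all names hypothetical): the wrapper owns the client-facing channel, so recreating the inner pool can never orphan a watcher, and the recorded source and extrinsic allow a faithful replay.

  use std::sync::mpsc::{channel, Receiver, Sender};

  type Extrinsic = Vec<u8>;

  #[derive(Clone, Copy)]
  enum Source { External, Local }

  // Per-transaction record kept by the wrapper so nothing is lost when the
  // inner pool is recreated.
  struct Record {
      xt: Extrinsic,
      source: Source,                        // preserved, not collapsed to Local
      watcher: Option<Sender<&'static str>>, // client-facing end stays alive
  }

  struct Wrapper {
      records: Vec<Record>,
  }

  impl Wrapper {
      // Intercepted submit_and_watch: hand the client a receiver owned by the
      // wrapper, not by the inner pool, so pool recreation cannot orphan it.
      // The wrapper forwards events from the current inner pool into `watcher`.
      fn submit_and_watch(&mut self, xt: Extrinsic, source: Source) -> Receiver<&'static str> {
          let (tx, rx) = channel();
          self.records.push(Record { xt, source, watcher: Some(tx) });
          rx
      }
  }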

@bkchr
Member

bkchr commented Jan 30, 2026

but it's more error-prone and significantly more complex than allowing a basic revert in the Substrate pool. If you think that revert complexity doesn’t justify changing Substrate, I’m fine limiting the changes to Anvil. Please let me know which direction you prefer.

Let's try to implement it and compare it. I don't see why this should be complicated.
