
Conversation


@anxolin anxolin commented Dec 5, 2025

Disclaimer: I implemented this feature mostly using Claude, so please review carefully. The logic looks good to me now. Also, feel free to take ownership of this PR and apply commits on top.

Problem

The current metrics-based bad token detection incorrectly bans entire tokens when settlement simulations fail, even though failures are often caused by solver-specific issues, not the tokens themselves. This leads to legitimate tokens like
WETH or USDC being banned (for specific colocated solvers).

Example scenario:

  • A solution with WETH→USDC + DAI→USDT fails due to a solver routing bug, a reverting flashloan, a failing hook, or some other reason
  • All 4 tokens (WETH, USDC, DAI, USDT) get failure marks
  • Eventually, good tokens get banned

Settlement simulations can fail for many reasons unrelated to token quality:

  • Solver bugs
  • Slippage issues
  • Insufficient solver balance
  • RPC/infrastructure errors
  • Invalid solution encoding

Solution

Implement order-level banning with metrics-based detection tracking failures per order UID instead of per token.
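To make the idea concrete, here is a minimal sketch of metrics-based per-order banning. This is illustrative only: the real detector keeps statistics in an `Arc<DashMap<order::Uid, OrderStatistics>>` (quoted in the review below), while this sketch uses a plain `HashMap` keyed by a string UID, and the exact ban rule is an assumption derived from the configuration options.

```rust
use std::collections::HashMap;

// Sketch only: a std HashMap stands in for the driver's Arc<DashMap<...>>,
// and the ban rule below is inferred from the configuration options.
#[derive(Default)]
struct OrderStatistics {
    attempts: u32,
    fails: u32,
}

struct Detector {
    failure_ratio: f64,
    required_measurements: u32,
    counter: HashMap<String, OrderStatistics>,
}

impl Detector {
    /// Records one simulation outcome for the given order UID.
    fn measure(&mut self, uid: &str, failure: bool) {
        let stats = self.counter.entry(uid.to_string()).or_default();
        stats.attempts += 1;
        stats.fails += u32::from(failure);
    }

    /// An order is considered bad once enough measurements exist and the
    /// failure ratio crosses the configured threshold.
    fn is_banned(&self, uid: &str) -> bool {
        self.counter.get(uid).is_some_and(|stats| {
            stats.attempts >= self.required_measurements
                && f64::from(stats.fails) / f64::from(stats.attempts)
                    >= self.failure_ratio
        })
    }
}
```

With the configuration below (ratio 0.9, 5 required measurements), an order is only frozen after at least five simulations, of which at least 90% failed.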

Changes

  • New bad_orders module mirroring bad_tokens structure
  • Added order_uids() to solutions for tracking
  • Modified encoding flow to track both tokens and orders
  • New configuration options for order-level banning
  • Added bad_orders_detected Prometheus metric

Configuration

[solver.my-solver]
enable-metrics-bad-order-detection = true
metrics-bad-order-detection-failure-ratio = 0.9
metrics-bad-order-detection-required-measurements = 5
metrics-bad-order-detection-order-freeze-time = "1h"

Note: Simulation-based bad token detection (which directly tests token behavior) remains unchanged and continues to work correctly.

Caveats

I think this PR improves things by preventing us from blaming tokens for an issue with a specific order. We can now have a similar issue where one problematic order causes another order to be banned because they are part of the same solution.

In practice, this affects only one solver (it is done in the driver), and I don't think it will happen often. However, future PRs could enhance the granularity and try to simulate things in isolation, where possible, to find the culprit of reverts. Nevertheless, this should be far less intrusive and give fewer false positives than the old check.

Test

cargo test --package driver bad_orders

@github-actions

github-actions bot commented Dec 5, 2025

All contributors have signed the CLA ✍️ ✅
Posted by the CLA Assistant Lite bot.

@anxolin anxolin changed the title Add bad order detection Fix: Ban problematic orders instead of tokens on simulation failures Dec 5, 2025
@anxolin anxolin marked this pull request as ready for review December 5, 2025 12:38
@anxolin anxolin requested a review from a team as a code owner December 5, 2025 12:38
@anxolin
Author

anxolin commented Dec 5, 2025

I have read the CLA Document and I hereby sign the CLA

github-actions bot added a commit that referenced this pull request Dec 5, 2025
pub struct Detector {
    failure_ratio: f64,
    required_measurements: u32,
    counter: Arc<DashMap<order::Uid, OrderStatistics>>,
Contributor

The order needs to be dropped from this cache once it is executed. We shouldn't accumulate all the failed orders here indefinitely.
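A minimal sketch of the eviction this comment asks for, assuming the cache is exposed to the settlement pipeline (a std `HashMap` stands in for the `DashMap`, and `forget` is a hypothetical method name, not the PR's actual API):

```rust
use std::collections::HashMap;

#[derive(Default)]
struct OrderStatistics {
    attempts: u32,
    fails: u32,
}

struct Detector {
    counter: HashMap<String, OrderStatistics>,
}

impl Detector {
    /// Hypothetical hook: once an order is executed (or expires), its
    /// statistics are no longer useful, so drop the entry to keep the
    /// cache from growing indefinitely.
    fn forget(&mut self, uid: &str) {
        self.counter.remove(uid);
    }
}
```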

Comment on lines +103 to +111
    .and_modify(|counter| {
        counter.attempts += 1;
        counter.fails += u32::from(failure);
    })
    .or_insert_with(|| OrderStatistics {
        attempts: 1,
        fails: u32::from(failure),
        flagged_unsupported_at: None,
    });
Contributor

And we shouldn't accumulate successful orders here for sure.
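One way to act on this comment, sketched with the same stand-in types: only create an entry when a failure is observed, so purely successful orders never occupy the cache. The exact policy is an assumption for illustration, not necessarily what the PR ended up doing.

```rust
use std::collections::HashMap;

#[derive(Default)]
struct OrderStatistics {
    attempts: u32,
    fails: u32,
}

struct Detector {
    counter: HashMap<String, OrderStatistics>,
}

impl Detector {
    /// Variant of the update above that never allocates an entry for a
    /// success: a failure creates or updates the entry, while a success
    /// only updates an order that is already being tracked.
    fn measure(&mut self, uid: &str, failure: bool) {
        if failure {
            let stats = self.counter.entry(uid.to_string()).or_default();
            stats.attempts += 1;
            stats.fails += 1;
        } else if let Some(stats) = self.counter.get_mut(uid) {
            stats.attempts += 1;
        }
    }
}
```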
