cmd/evm: add enginetest command for direct engine fixture execution by spencer-tb · Pull Request #34650 · ethereum/go-ethereum

spencer-tb · 2026-04-03T14:10:50Z

Summary

Add evm enginetest command that runs blockchain_test_engine fixtures directly against a lightweight Engine API handler, without requiring Hive or full client startup. Also add --workers flag to all three test runners (enginetest, blocktest, statetest) for parallel fixture file processing.

`evm enginetest`

A new direct runner for Engine API test fixtures. Implements a lightweight engine handler that mirrors the core logic of eth/catalyst.ConsensusAPI:

Version-specific NewPayloadV1-V5 parameter validation
ExecutableDataToBlock payload conversion and block insertion via InsertBlockWithoutSetHead
ForkchoiceUpdated with SetCanonical head management (including initial FCU to genesis)
Invalid block ancestor tracking with proper PayloadStatusV1 responses
EngineAPIError code validation against fixture expectations (errorCode, validationError)

This exercises the actual engine code path (two-phase insert-then-canonicalize via forkchoice), not just InsertChain like blocktest.
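The two-phase flow described above can be illustrated with a toy model. This is only a sketch: `toyChain`, `newPayload`, and `forkchoiceUpdated` are hypothetical names and do not reflect geth's `core.BlockChain` or the handler in this PR. The status strings are borrowed from the Engine API for flavor. The point is that inserting a block and making it canonical are separate steps.

```go
package main

import (
	"errors"
	"fmt"
)

// block is a toy stand-in for a chain block; only the hash and parent link matter here.
type block struct {
	hash, parent string
}

// toyChain is a hypothetical in-memory chain, not geth's core.BlockChain.
type toyChain struct {
	known map[string]block // every block inserted so far
	head  string           // current canonical head
}

func newToyChain(genesis string) *toyChain {
	return &toyChain{
		known: map[string]block{genesis: {hash: genesis}},
		head:  genesis,
	}
}

// newPayload models phase one: store the block if its parent is known,
// but leave the canonical head untouched.
func (c *toyChain) newPayload(b block) string {
	if _, ok := c.known[b.parent]; !ok {
		return "SYNCING"
	}
	c.known[b.hash] = b
	return "VALID"
}

// forkchoiceUpdated models phase two: move the head to an already-inserted block.
func (c *toyChain) forkchoiceUpdated(head string) error {
	if _, ok := c.known[head]; !ok {
		return errors.New("unknown head " + head)
	}
	c.head = head
	return nil
}

func main() {
	c := newToyChain("g")
	status := c.newPayload(block{hash: "a", parent: "g"})
	fmt.Println(status, c.head) // the head is still the genesis after phase one
	if err := c.forkchoiceUpdated("a"); err != nil {
		fmt.Println("forkchoice failed:", err)
	}
	fmt.Println(c.head) // the head moves only after the forkchoice update
}
```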

Benchmarks

Tested against EEST v5.3.0 stable fixtures on Apple M-series.

For reference, Hive runs of the same test suite on the same geth version:

consume engine: 2h 48m
consume rlp: 4h 29m

evm enginetest — exercises the same engine code paths as consume engine (40,523 tests):

Workers	Time	Speedup vs serial	vs Hive consume engine
1	1m02s	1x	~162x
8	12.0s	5.2x	~840x

evm blocktest — exercises the same execution paths as consume rlp (43,924 tests):

Workers	Time	Speedup vs serial
1	1m06s	1x
8	12.7s	5.2x

evm statetest (40,553 tests):

Workers	Time	Speedup vs serial
1	21.8s	1x
8	4.4s	4.9x

Hive parity

Tested against v5.3.0 stable release — exact same 4 failures as Hive consume engine on geth master:

eip7002/test_system_contract_deployment[CancunToPragueAtTime15k-deploy_after_fork-nonzero_balance]
eip7002/test_system_contract_deployment[CancunToPragueAtTime15k-deploy_after_fork-zero_balance]
eip7251/test_system_contract_deployment[CancunToPragueAtTime15k-deploy_after_fork-nonzero_balance]
eip7251/test_system_contract_deployment[CancunToPragueAtTime15k-deploy_after_fork-zero_balance]

Usage

# Run engine fixtures
evm enginetest /path/to/blockchain_tests_engine/

# With parallel workers
evm enginetest --workers 8 /path/to/blockchain_tests_engine/

# Filter by regex
evm enginetest --run "eip4844" /path/to/fixtures/

# Human-readable output
evm enginetest --human /path/to/fixtures/

# Same --workers flag on blocktest and statetest
evm blocktest --workers 8 /path/to/blockchain_tests/
evm statetest --workers 8 /path/to/state_tests/

kevaundray · 2026-04-03T18:50:03Z

tests/engine_test_util.go

+			return engine.PayloadStatusV1{Status: engine.INVALID}, engineParamsErr("nil versionedHashes post-cancun")
+		case p.BeaconRoot == nil:
+			return engine.PayloadStatusV1{Status: engine.INVALID}, engineParamsErr("nil beaconRoot post-cancun")
+		case !h.checkFork(params.Timestamp, forks.Cancun, forks.Prague, forks.Osaka, forks.BPO1, forks.BPO2, forks.BPO3, forks.BPO4, forks.BPO5):


Will need to double check this for my own sanity -- for some reason I was expecting it to just say forks.Cancun

This matches the real ConsensusAPI.NewPayloadV3 at api.go:204 which allows V3 for Cancun through BPO5. So maybe can be changed there too but not 100% certain :)

Ah I see, seems I was remembering this:

go-ethereum/eth/catalyst/api.go

Line 708 in d8cb8a9

case !api.checkFork(params.Timestamp, forks.Cancun):
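
The difference between the two call sites is easier to see with a toy version of the check. This sketch assumes the check means "the fork active at this timestamp is one of those listed". The fork names, the `schedule` table, and both helpers are hypothetical and are not taken from geth.

```go
package main

import "fmt"

type fork int

const (
	shanghai fork = iota
	cancun
	prague
	osaka
)

// schedule lists hypothetical activation timestamps in ascending order.
var schedule = []struct {
	f  fork
	at uint64
}{{shanghai, 0}, {cancun, 100}, {prague, 200}, {osaka, 300}}

// latestFork returns the last fork whose activation time is <= ts.
func latestFork(ts uint64) fork {
	cur := schedule[0].f
	for _, s := range schedule {
		if ts >= s.at {
			cur = s.f
		}
	}
	return cur
}

// checkFork reports whether the fork active at ts is one of the allowed forks.
func checkFork(ts uint64, allowed ...fork) bool {
	cur := latestFork(ts)
	for _, f := range allowed {
		if f == cur {
			return true
		}
	}
	return false
}

func main() {
	// Prague is active at timestamp 250 in this toy schedule.
	fmt.Println(checkFork(250, cancun))                // false
	fmt.Println(checkFork(250, cancun, prague, osaka)) // true
}
```

Under this reading, a check that lists only one fork rejects payloads from every later fork, while a check that lists a range accepts the same method version across all of them.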

kevaundray · 2026-04-05T12:22:48Z

tests/engine_test_util.go

+	if postCheck != nil {
+		defer postCheck(result, chain)
+	}


Minor nit: I think we may want to use a closure here because `result` would be evaluated at the callsite and not when the defer is triggered.

Small self-contained example to elaborate:

package main

import "fmt"

func runBuggy(postCheck func(error, string)) (result error) {
	chain := "some-chain"
	// `result` is evaluated at the callsite, so it's nil here, not the value set by the time the defer fires.
	if postCheck != nil {
		defer postCheck(result, chain)
	}
	result = fmt.Errorf("payload 3: expected VALID, got INVALID")
	return result
}

func runFixed(postCheck func(error, string)) (result error) {
	chain := "some-chain"
	// closure captures `result` by reference, so it reads the final value when the defer fires.
	if postCheck != nil {
		defer func() { postCheck(result, chain) }()
	}
	result = fmt.Errorf("payload 3: expected VALID, got INVALID")
	return result
}

func main() {
	check := func(res error, chain string) {
		fmt.Println(" postCheck got error:", res)
	}
	fmt.Println("buggy:")
	runBuggy(check)
	fmt.Println("fixed:")
	runFixed(check)
}

kevaundray · 2026-04-05T12:35:26Z

cmd/evm/blockrunner.go

 	var tests map[string]*tests.BlockTest
 	if err = json.Unmarshal(src, &tests); err != nil {
-		return nil, err
+		return nil, nil // Skip non-fixture JSON files


Just want to confirm that this also skips errors from malformed fixture files?

- Add lastBlockHash to blocktest/enginetest, lastPayloadStatus to enginetest
- Remove stateRoot from blocktest/enginetest (only statetest has it)
- Report validation/rejection error in `error` even when test passes, for negative tests (expected exceptions)
- Enables EELS consume direct to map errors through ExceptionMapper and verify correct exception for every invalid test

MariusVanDerWijden · 2026-04-07T11:37:39Z

I like the idea. I don't know if we need --workers though; we could just default to runtime.NumCPU()

spencer-tb force-pushed the feat/evm-enginetest branch 5 times, most recently from 29e3f4c to 00957ea Compare April 3, 2026 16:47

spencer-tb added 3 commits April 3, 2026 17:52

cmd/evm: add enginetest command for direct engine fixture execution

2ef6227

cmd/evm: add --workers flag to blocktest for parallel file processing

58fe592

cmd/evm: add --workers flag to statetest for parallel file processing

d46370a

spencer-tb force-pushed the feat/evm-enginetest branch from 00957ea to d46370a Compare April 3, 2026 16:53

spencer-tb marked this pull request as ready for review April 3, 2026 17:01

spencer-tb requested review from MariusVanDerWijden and lightclient as code owners April 3, 2026 17:01

This was referenced Apr 3, 2026

cmd/evm: add enginetest command and parallel workers for test runners erigontech/erigon#20315

Draft

feat: consume direct support for all EL clients with engine fixture runners ethereum/execution-spec-tests#2319

Open

kevaundray reviewed Apr 3, 2026

This was referenced Apr 4, 2026

nethtest: add --engineTest, --stateTest, --jsonout, --workers flags NethermindEth/nethermind#11035

Draft

evmtool: add engine-test subcommand and parallel workers for test runners besu-eth/besu#10184

Draft

cmd/evm: add initial forkchoice update to genesis in enginetest

1063416

kevaundray reviewed Apr 5, 2026

spencer-tb added 4 commits April 5, 2026 19:45

cmd/evm: always include error field in JSON output

ffc5899

cmd/evm: add --ndjson flag for streaming JSON output

6f0ae11

tests: move block insertion debug output from stdout to stderr

2ca9720

cmd/evm: remove --ndjson flag (not needed for consume direct)

c8fdb1d

spencer-tb mentioned this pull request Apr 5, 2026

feat(test-consume): direct with per type result models and exception matching ethereum/execution-specs#2622

Draft

4 tasks

spencer-tb marked this pull request as draft April 5, 2026 23:04

spencer-tb wants to merge 9 commits into ethereum:master from spencer-tb:feat/evm-enginetest

3 participants
