feat(fill): add `--gas-benchmark-values` command to support single genesis file #1895

LouisTsai-Csie · 2025-07-11T14:21:02Z

🗒️ Description

This PR introduces a new fill option, --gas-benchmark-values. Supply a comma-separated list of gas amounts (in millions) to set the values used during benchmarking.
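
As a rough illustration of the input format described above, the sketch below turns a comma-separated list of gas amounts in millions into absolute gas values. The helper name `parse_gas_benchmark_values` is hypothetical and not taken from this PR; the actual plugin may parse and validate the flag differently.

```python
def parse_gas_benchmark_values(raw: str) -> list[int]:
    """Convert e.g. "1,10,30" (millions of gas) into absolute gas amounts."""
    values = []
    for item in raw.split(","):
        item = item.strip()
        if not item:
            continue  # tolerate stray commas such as "1,,10"
        millions = int(item)  # raises ValueError on non-integer input
        if millions <= 0:
            raise ValueError(f"gas value must be positive, got {millions}")
        values.append(millions * 1_000_000)
    return values


# Hypothetical usage with the values from the fill command in this description.
print(parse_gas_benchmark_values("1,10,30,60,90,120"))
```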

The PR also adds two example tests in tests/benchmark/test_worst_blocks.py. To generate their fixtures, run:

uv run fill -v tests/benchmark/test_worst_blocks.py::test_block_full_data \
  --fork Prague \
  --gas-benchmark-values 1,10,30,60,90,120 \
  --generate-pre-alloc-groups \
  --clean

Flag --generate-pre-alloc-groups is required for the enginex fixture format.

The command creates two directories:

fixtures/blockchain_tests_engine_x/benchmark/worst_blocks
fixtures/blockchain_tests_engine_x/pre_alloc

Because only one preAllocGroup is produced, this process generates a single genesis file.

To generate the genesis file, follow the documentation to run hive locally, then run the extract_config command.

For example: uv run extract_config --fixture fixtures/blockchain_tests_engine_x/pre_alloc/0x10763c36b27696c5.json

I would prefer to refactor the benchmark tests in a separate PR; I have updated the issue with this task.

I’ve reviewed the Filling Test section, and I see that the command and flag descriptions are generated by this script. However, I’m happy to contribute additional documentation if needed.

I added three pytest plugin test cases:
Case 1: Verify that the --gas-benchmark-values flag is added.
Case 2: Verify that the flag works as expected when provided.
Case 3: Verify that non-benchmark tests are not affected.

Run them with the following command:

python -m pytest src/pytest_plugins/filler/tests/test_benchmarking.py -v

🔗 Related Issues or PRs

Issue #1891

✅ Checklist

All: Ran fast tox checks to avoid unnecessary CI fails, see also Code Standards and Enabling Pre-commit Checks:
```
uvx --with=tox-uv tox -e lint,typecheck,spellcheck,markdownlint
```
All: PR title adheres to the repo standard - it will be used as the squash commit message and should start with type(scope):.
All: Considered adding an entry to CHANGELOG.md.
All: Considered updating the online docs in the ./docs/ directory.
All: Set appropriate labels for the changes (only maintainers can apply labels).
Tests: Ran mkdocs serve locally and verified the auto-generated docs for new tests in the Test Case Reference are correctly formatted.
Tests: For PRs implementing a missed test case, update the post-mortem document to add an entry to the list.
Ported Tests: All converted JSON/YML tests from ethereum/tests or tests/static have been assigned @ported_from marker.

marioevz

Looks great! I think this is a great change for maintainability, and it makes it easier for us to generate more vectors as they are required.

A small downside is that we have to remove Environment().gas_limit from most of the tests, but I would say the earlier we do it, the better.

cc @jsign for some feedback on my comments.

Thanks!

.github/configs/feature.yaml

src/pytest_plugins/filler/filler.py

jsign · 2025-07-11T16:39:14Z

Nice! @LouisTsai-Csie @marioevz, is this compatible with supporting all the test formats too? (i.e. #1778).

Mostly asking because I think this comes from wanting to simplify the single genesis for perfnets, and I'm wondering whether it will still be fine for the other formats that we need for zkVMs.

marioevz · 2025-07-11T18:19:21Z

Nice! @LouisTsai-Csie @marioevz, is this compatible with supporting all the test formats too? (i.e. #1778).

Mostly asking because I think this comes from wanting to simplify the single genesis for perfnets, and I'm wondering whether it will still be fine for the other formats that we need for zkVMs.

It should be compatible out of the box, but I'll take another look and raise any concerns if there are any.

danceratopz

Thanks, this looks great @LouisTsai-Csie!

It's a shame that this didn't occur to me up front in #1891, but I'd suggest that we move this code to a new plugin that gets activated with fill by default. This should work well due to the composability of pytest plugins.

This means, we:

Add these changes (and other benchmarking-related pytest config, if any) to a separate pytest plugin; I'd suggest src/pytest_plugins/filler/benchmarking.py.

Enable this plugin using -p via the fill command's pytest ini:

execution-spec-tests/src/cli/pytest_commands/pytest_ini_files/pytest-fill.ini

Lines 11 to 22 in 0f7c73a

    addopts =
        -p pytest_plugins.concurrency
        -p pytest_plugins.filler.pre_alloc
        -p pytest_plugins.filler.filler
        -p pytest_plugins.filler.ported_tests
        -p pytest_plugins.filler.static_filler
        -p pytest_plugins.shared.execute_fill
        -p pytest_plugins.forks.forks
        -p pytest_plugins.eels_resolver
        -p pytest_plugins.help.help
        --tb short
        --ignore tests/cancun/eip4844_blobs/point_evaluation_vectors/

All benchmarking-related plugin customizations (e.g. pytest_addoption, pytest_generate_tests, etc.) currently in filler/filler.py can be moved directly to filler/benchmarking.py. This keeps the benchmarking logic self-contained. Pytest hooks from both modules should compose as expected.
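
A minimal sketch of what such a self-contained plugin module could look like. `pytest_addoption` and `pytest_generate_tests` are standard pytest hooks, and the `gas_benchmark_value` fixture name follows this PR's discussion; the option `dest`, the help text, the test id format, and the parametrization details are assumptions and not the merged implementation.

```python
"""Hypothetical benchmarking plugin sketch (not the merged code)."""


def pytest_addoption(parser):
    """Register the benchmarking flag on pytest's command line."""
    group = parser.getgroup("benchmarking", "Options for benchmark filling")
    group.addoption(
        "--gas-benchmark-values",
        action="store",
        dest="gas_benchmark_values",  # assumed destination name
        default=None,
        help="Comma-separated gas amounts, in millions, to benchmark with.",
    )


def pytest_generate_tests(metafunc):
    """Parametrize only the tests that request `gas_benchmark_value`."""
    if "gas_benchmark_value" not in metafunc.fixturenames:
        return  # non-benchmark tests are left untouched
    raw = metafunc.config.getoption("gas_benchmark_values")
    if raw is None:
        return  # flag not given: the fixture must be provided some other way
    values = [int(v) * 1_000_000 for v in raw.split(",")]
    ids = [f"benchmark-gas-value_{v // 1_000_000}M" for v in values]  # assumed id format
    metafunc.parametrize("gas_benchmark_value", values, ids=ids)
```

Because the hooks live in their own module, enabling or disabling benchmarking support would come down to adding or removing one `-p` line in the ini file.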

To cleanly handle options/values that are specific to benchmarking, I'd suggest the following approach. If you agree with it, feel free to go for it!

1. Define a filling mode enum in `filler/filler.py`:

from enum import StrEnum, unique

@unique
class FillMode(StrEnum):
    CONSENSUS = "consensus"
    BENCHMARKING = "benchmarking"

2. In the filler plugin (`filler.py`), set the default:

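from _pytest.config import Config

# FillMode is defined in this same module (step 1), so it needs no import here.

def pytest_configure(config: Config) -> None:
    # Only set the default if no other plugin has chosen a mode already.
    if not hasattr(config, "fill_mode"):
        config.fill_mode = FillMode.CONSENSUS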

3. In the benchmarking plugin (`filler/benchmarking.py`), override only if `--gas-benchmark-values` is set:

from _pytest.config import Config
from .filler import FillMode

def pytest_configure(config: Config) -> None:
    # Switch to benchmarking mode only when the flag was passed on the command line.
    if config.getoption("--gas-benchmark-values") is not None:
        config.fill_mode = FillMode.BENCHMARKING

4. Example usage in filler logic, wrapped in a fixture:

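import pytest
from ethereum_test_tools import Environment  # assumed import path for Environment
from .filler import FillMode

GIGA_GAS = 1_000_000_000

@pytest.fixture
def env(request: pytest.FixtureRequest) -> Environment:  # noqa: D103
    # The pytest config object is reached through the built-in `request` fixture.
    if request.config.fill_mode == FillMode.BENCHMARKING:
        return Environment(gas_limit=GIGA_GAS)
    return Environment()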
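
To check that the default/override split in steps 2 and 3 does not depend on the order in which the two plugins are configured, here is a small stand-alone simulation. It uses a `SimpleNamespace` as a stand-in for pytest's config object and a `gas_benchmark_values` attribute in place of `config.getoption(...)`; both are assumptions made so the sketch runs without pytest. The `str`/`Enum` mixin is used instead of `StrEnum` only so the sketch also runs on Python versions before 3.11.

```python
from enum import Enum
from types import SimpleNamespace


class FillMode(str, Enum):
    """Stand-in for the proposed fill mode enum."""

    CONSENSUS = "consensus"
    BENCHMARKING = "benchmarking"


def filler_configure(config) -> None:
    """Mimics step 2: set the default only if no mode has been chosen yet."""
    if not hasattr(config, "fill_mode"):
        config.fill_mode = FillMode.CONSENSUS


def benchmarking_configure(config) -> None:
    """Mimics step 3: override only when the benchmarking flag was given."""
    if config.gas_benchmark_values is not None:
        config.fill_mode = FillMode.BENCHMARKING


def resolve_mode(flag_value, hooks) -> FillMode:
    """Run the given hooks in order against a fresh stand-in config."""
    config = SimpleNamespace(gas_benchmark_values=flag_value)
    for hook in hooks:
        hook(config)
    return config.fill_mode


both_orders = [
    (filler_configure, benchmarking_configure),
    (benchmarking_configure, filler_configure),
]
for hooks in both_orders:
    # Same result for either hook order: flag given, then flag absent.
    print(resolve_mode("1,10", hooks).value, resolve_mode(None, hooks).value)
```

In either order the mode resolves to benchmarking when the flag is given and to consensus otherwise, which is what the `hasattr` guard in the filler plugin is there to ensure.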

src/pytest_plugins/filler/filler.py

tests/conftest.py

danceratopz

Thanks! This looks great to me!

One comment below.

tests/conftest.py

LouisTsai-Csie · 2025-07-15T10:23:39Z

@danceratopz Thank you for the review. I have two questions:

Should I add test cases under src/cli/tests/? I noticed your recent PR included tests there.
Should I add documentation for the new flag? I’m happy to do that, but I ran into some issues building the docs locally with mkdocs (related to cairosvg and missing libcairo on macOS). Let me know if there’s a preferred workaround or if I should just update the markdown and let CI verify the build (not a good idea).

danceratopz · 2025-07-15T12:57:17Z

@danceratopz Thank you for review, but I am wondering the following:

* Should I add test cases under `src/cli/tests/`? I noticed your recent [PR](https://github.com/ethereum/execution-spec-tests/pull/1855/files#diff-5c3633f8cbee135e20eb35f9537277edaf7ff69714db9f5c0993431a312ca5f5) included tests there.

I don't think it's strictly necessary for the PR, but some sanity check that the flag works is nice, of course. Recently, I've been pointing Claude at unit testing tasks.

* Should I add documentation for the new flag? I’m happy to do that, but I ran into some [issues](https://github.com/ethereum/execution-spec-tests/issues/1908) building the docs locally with `mkdocs` (related to cairosvg and missing `libcairo` on `macOS`). Let me know if there’s a preferred workaround or if I should just update the markdown and let CI verify the build (not a good idea).

Does this work?

uvx --with=tox-uv tox -e mkdocs

If so, it's because of the macOS trick found in these lines (you can then set the env var locally):

execution-spec-tests/tox.ini

Lines 56 to 58 in dfdd433

    # Required for `cairosvg` so tox can find `libcairo-2`.
    # https://squidfunk.github.io/mkdocs-material/plugins/requirements/image-processing/?h=cairo#cairo-library-was-not-found
    DYLD_FALLBACK_LIBRARY_PATH = /opt/homebrew/lib

marioevz

Looks great! I applied my suggestions locally and execute is working with the new flag! 🎉

src/cli/pytest_commands/pytest_ini_files/pytest-fill.ini

src/pytest_plugins/filler/benchmarking.py

src/pytest_plugins/filler/filler.py

marioevz

Awesome work! I would like to see if we could rebase and use gas_benchmark_value in all benchmark tests, so that we can prepare for the next benchmark release if possible.

.github/configs/feature.yaml

src/pytest_plugins/filler/tests/test_benchmarking.py

marioevz

LGTM, thanks!

…nesis file (ethereum#1895)

* feat(fill): add benchmark gas valu command to support single genesis file
* refactor(tests): update benchmark test for supported command
* refactor(benchmark): consolidate benchmark configurations into a single entry
* doc(fill): update command description and changelog
* chore(fill): remove legacy gas benchmark values command
* refactor(fill): create gas benchmakr value pytest plugin
* test(fill): add pytest plugin test and update state test
* refactor(fill): add env fixture for benchmarking with gas limit configuration
* refactor: support both fill and execute mode
* fix: update ci flag and test command

LouisTsai-Csie self-assigned this Jul 11, 2025

LouisTsai-Csie added scope:fill Scope: fill command feature:benchmark labels Jul 11, 2025

LouisTsai-Csie force-pushed the fill-benchmark-command branch from 8675c6c to d0413c8 Compare July 11, 2025 15:22

marioevz reviewed Jul 11, 2025

.github/configs/feature.yaml (outdated)

src/pytest_plugins/filler/filler.py (outdated)

src/pytest_plugins/filler/filler.py (outdated)

src/pytest_plugins/filler/filler.py (outdated)

marioevz mentioned this pull request Jul 11, 2025

feat(benchmark): create new benchmark_test test type #1896

Closed

LouisTsai-Csie requested review from danceratopz and marioevz July 14, 2025 02:13

LouisTsai-Csie marked this pull request as ready for review July 14, 2025 02:13

danceratopz reviewed Jul 14, 2025

src/pytest_plugins/filler/filler.py (outdated)

src/pytest_plugins/filler/filler.py (outdated)

src/pytest_plugins/filler/filler.py (outdated)

LouisTsai-Csie force-pushed the fill-benchmark-command branch from d0413c8 to 946de75 Compare July 15, 2025 09:09

LouisTsai-Csie commented Jul 15, 2025

src/pytest_plugins/filler/filler.py (outdated)

LouisTsai-Csie commented Jul 15, 2025

tests/conftest.py (outdated)

LouisTsai-Csie commented Jul 15, 2025

tests/conftest.py (outdated)

danceratopz reviewed Jul 15, 2025

tests/conftest.py (outdated)

marioevz reviewed Jul 15, 2025

LouisTsai-Csie force-pushed the fill-benchmark-command branch from fa71d8d to 7e5f501 Compare July 18, 2025 15:21

marioevz reviewed Jul 21, 2025

.github/configs/feature.yaml (outdated)

src/pytest_plugins/filler/tests/test_benchmarking.py (outdated)

LouisTsai-Csie added 8 commits July 22, 2025 16:50

feat(fill): add benchmark gas valu command to support single genesis file

e9ff58d

refactor(tests): update benchmark test for supported command

1e6ba64

refactor(benchmark): consolidate benchmark configurations into a single entry

3091dc2

doc(fill): update command description and changelog

606b7b7

chore(fill): remove legacy gas benchmark values command

894002e

refactor(fill): create gas benchmakr value pytest plugin

51ed538

test(fill): add pytest plugin test and update state test

543ce90

refactor(fill): add env fixture for benchmarking with gas limit configuration

172f359

refactor: support both fill and execute mode

ca18d04

LouisTsai-Csie force-pushed the fill-benchmark-command branch from 7e5f501 to ca18d04 Compare July 22, 2025 08:53

fix: update ci flag and test command

5555d12

marioevz approved these changes Jul 22, 2025

marioevz merged commit 323b2b3 into ethereum:main Jul 22, 2025
14 checks passed

marioevz mentioned this pull request Jul 25, 2025

chore(benchmark): don't fill benchmark tests by default #1920

Merged

5 tasks

LouisTsai-Csie mentioned this pull request Aug 21, 2025

src(eth_config): added eth_config simulator prototype #2054

Merged

8 tasks

danceratopz mentioned this pull request Oct 2, 2025

feat(benchmark,fill): generate all gas value benchmarks in one fill execution #1891

Closed


4 participants
