Conversation

@rly
Contributor

@rly rly commented Sep 9, 2025

Fix #149, fix #148.

Params are now just a tuple of dictionaries. Each benchmark takes one parameter dictionary. File read benchmarks have keys "name", "https_url". Data slicing benchmarks have keys "name", "https_url", "object_name", and "slice_range". I could create dataclasses instead of using dicts, but this keeps things pretty flexible while we iterate. Let me know if you think that would be worth changing. This change adds a couple more boilerplate lines in each benchmark function to unpack the dictionary, but I think that is OK.

I added a "name" key because it helps with display and logging; it is not used anywhere else.
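To make the new scheme concrete, here is a minimal sketch of what a benchmark using the tuple-of-dicts parameterization might look like. The key names ("name", "https_url") come from this description; the function name and URL are illustrative, not the actual benchmark code.

```python
# Params are a tuple of dictionaries; each benchmark run receives one dict.
params = (
    {
        "name": "EcephysTestCase",
        "https_url": "https://dandiarchive.s3.amazonaws.com/blobs/30c/6cc/"
        "30c6cc7b-4d17-4237-9786-66623a6c65eb",
    },
)


def track_network_read_example(case: dict):
    # The couple of boilerplate lines mentioned above: unpack the dict.
    name = case["name"]
    https_url = case["https_url"]
    # ... open and read the file at https_url here ...
    return name, https_url


name, url = track_network_read_example(params[0])
```

Data slicing benchmarks would unpack "object_name" and "slice_range" the same way.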

BaseBenchmark now just holds common settings for benchmarks, setting:

    rounds = 1
    repeat = 1
    warmup_time = 0.0

I updated the docs accordingly.

Note: The "results" key in the intermediate results json file from asv now looks like this:

"results": {
  "network_tracking_remote_file_reading.HDF5H5pyFileReadBenchmark.track_network_read_hdf5_h5py_fsspec_https_no_cache": [
    [
      true,
      true,
      true
    ],
    [
      [
        "{'name': 'EcephysTestCase', 'https_url': 'https://dandiarchive.s3.amazonaws.com/blobs/30c/6cc/30c6cc7b-4d17-4237-9786-66623a6c65eb'}",
        "{'name': 'OphysTestCase', 'https_url': 'https://dandiarchive.s3.amazonaws.com/blobs/38c/c24/38cc240b-77c5-418a-9040-a7f582ff6541'}",
        "{'name': 'IcephysTestCase', 'https_url': 'https://dandiarchive.s3.amazonaws.com/blobs/c98/3a4/c983a4e1-097a-402c-bda8-e6a41cb7e24a'}"
      ]
    ],
    "594b2cc8d025210bda2b732969917c4a169cac0d9eba056eedb9452627346a38",
    1757392268936,
    32.685,
    null,
    null,
    null,
    null,
    null,
    null,
    [
      {
        "total_transfer_in_number_of_packets": 7869,
        "total_traffic_in_number_of_web_packets": 620,
        "amount_downloaded_in_number_of_packets": 7249,
        "amount_uploaded_in_number_of_packets": 620,
        "total_transfer_in_bytes": 10744542,
        "amount_downloaded_in_bytes": 10698584,
        "amount_uploaded_in_bytes": 45958,
        "total_transfer_time_in_seconds": 1.8256669999998583,
        "network_total_time_in_seconds": 2.2093868255615234
      },
      {
        "total_transfer_in_number_of_packets": 2916,
        "total_traffic_in_number_of_web_packets": 105,
        "amount_downloaded_in_number_of_packets": 2811,
        "amount_uploaded_in_number_of_packets": 105,
        "total_transfer_in_bytes": 4163646,
        "amount_downloaded_in_bytes": 4155891,
        "amount_uploaded_in_bytes": 7755,
        "total_transfer_time_in_seconds": 0.9806429999999909,
        "network_total_time_in_seconds": 1.3765041828155518
      },
      {
        "total_transfer_in_number_of_packets": 3840,
        "total_traffic_in_number_of_web_packets": 370,
        "amount_downloaded_in_number_of_packets": 3470,
        "amount_uploaded_in_number_of_packets": 370,
        "total_transfer_in_bytes": 5166375,
        "amount_downloaded_in_bytes": 5135446,
        "amount_uploaded_in_bytes": 30929,
        "total_transfer_time_in_seconds": 0.9876550000000466,
        "network_total_time_in_seconds": 1.157383918762207
      }
    ]
  ]
},

I adjusted _reduce_results.py to account for that -- results[1] is already a serialization of the single parameter dict. No flattening is required.

@rly
Contributor Author

rly commented Sep 9, 2025

@CodyCBakerPhD I don't think the database or flask app need to change because the parameter case is still a string, but since the benchmark results/parameters are in a new format, does the database version need to be bumped?

@rly rly requested a review from CodyCBakerPhD September 9, 2025 06:06
@CodyCBakerPhD
Collaborator

does the database version need to be bumped?

Definitely

@CodyCBakerPhD
Collaborator

@rly Testing it out now (literally on my own device and mechanistically by adding actual tests to the CI lol)

@CodyCBakerPhD
Collaborator

@rly Ran as expected, but the results file does not get handled:

E:\GitHub\nwb_benchmarks\src\nwb_benchmarks\setup\_reduce_results.py:62: UserWarning: In intermediate results for test case time_remote_slicing.HDF5PyNWBRemfilePreloadedNoCacheContinuousSliceBenchmark.time_slice:
        Length mismatch between parameters (1) and result samples (15)!
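For context, a hypothetical reconstruction of the kind of consistency check that would emit this warning: compare the number of parameter cases against the number of result samples for a test case. The function name and exact message format are illustrative, not the actual _reduce_results.py code.

```python
import warnings


def check_case_lengths(test_case_name, parameter_cases, result_samples):
    # Warn when the serialized parameter list and the result samples
    # disagree in length, instead of silently misaligning them.
    if len(parameter_cases) != len(result_samples):
        warnings.warn(
            f"In intermediate results for test case {test_case_name}:\n"
            f"\tLength mismatch between parameters ({len(parameter_cases)}) "
            f"and result samples ({len(result_samples)})!"
        )
```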

Removed unnecessary git fetch command from workflow.
@rly
Contributor Author

rly commented Sep 9, 2025

Huh. I thought I tested reduce_results before pushing, but maybe not. I think results[1] should be results[1][0]. I'll test locally and update.
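A minimal illustration of the off-by-one-level indexing at issue, assuming the asv "results" entry shape shown earlier in this thread: index 1 holds a list whose first element is the list of serialized parameter dicts, so one extra `[0]` is needed. The entry below is a trimmed stand-in, not real benchmark output.

```python
# Trimmed stand-in for one entry under the asv "results" key.
results_entry = [
    [True, True, True],  # index 0: per-case success flags
    [  # index 1: a one-element list wrapping the parameter strings
        [
            "{'name': 'EcephysTestCase', 'https_url': 'https://example'}",
            "{'name': 'OphysTestCase', 'https_url': 'https://example'}",
        ]
    ],
]

# results_entry[1] is the wrapper list; the parameter strings are one
# level deeper, hence results_entry[1][0] rather than results_entry[1].
parameter_cases = results_entry[1][0]
```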

@CodyCBakerPhD
Collaborator

FYI I ran nwb_benchmarks run --bench time_remote_slicing.HDF5PyNWBRemfilePreloadedNoCacheContinuousSliceBenchmark.time_slice for testing

@CodyCBakerPhD
Collaborator

@rly CI is up and running, and reproduces the issue I see (even in the file_reading benchmark)

@rly
Contributor Author

rly commented Sep 9, 2025

Tests pass. This all looks good to me. I have not done a full run through the benchmarks on this branch but will do so after merging.

@CodyCBakerPhD
Collaborator

Works for me!

@CodyCBakerPhD CodyCBakerPhD merged commit a4f20be into main Sep 9, 2025
3 checks passed
@CodyCBakerPhD CodyCBakerPhD deleted the refactor_params branch September 9, 2025 19:29
Development

Successfully merging this pull request may close these issues.

Continuous slicing benchmarks run many more iterations than they should
asv warmup phase can be expensive