ci: Add codspeed for performance monitoring #2516

FBruzzesi · 2025-05-08T13:47:55Z

What type of PR is this? (check all applicable)

Related issues

Related issue [Enh]: Better benchmarking routine #805

Checklist

Code follows style guide (ruff)
Tests added
Documented the changes

If you have comments or can explain your changes, please do so below

codspeed-hq · 2025-05-08T14:29:50Z

CodSpeed Performance Report

Congrats! CodSpeed is installed 🎉

🆕 22 new benchmarks were detected.

You will start to see performance impacts in the reports once the benchmarks are run from your default branch.

Detected benchmarks

test_benchmark_scripts[query_path0] (323.5 µs)
test_benchmark_scripts[query_path10] (322.9 µs)
test_benchmark_scripts[query_path11] (325 µs)
test_benchmark_scripts[query_path12] (327.3 µs)
test_benchmark_scripts[query_path13] (319.7 µs)
test_benchmark_scripts[query_path14] (320.7 µs)
test_benchmark_scripts[query_path15] (321 µs)
test_benchmark_scripts[query_path16] (323 µs)
test_benchmark_scripts[query_path17] (319.8 µs)
test_benchmark_scripts[query_path18] (320.7 µs)
test_benchmark_scripts[query_path19] (319.5 µs)
test_benchmark_scripts[query_path1] (320.4 µs)
test_benchmark_scripts[query_path20] (320.6 µs)
test_benchmark_scripts[query_path21] (375.8 µs)
test_benchmark_scripts[query_path2] (319.8 µs)
test_benchmark_scripts[query_path3] (322.5 µs)
test_benchmark_scripts[query_path4] (321.9 µs)
test_benchmark_scripts[query_path5] (320.2 µs)
test_benchmark_scripts[query_path6] (319 µs)
test_benchmark_scripts[query_path7] (319 µs)
test_benchmark_scripts[query_path8] (325.1 µs)
test_benchmark_scripts[query_path9] (319.2 µs)

FBruzzesi · 2025-05-08T14:49:54Z

Ok, so:

even with 10% of data, it takes 30 mins to run
in the codspeed website I can see that these benchmarks come with a warning:

Warning

This benchmark contains 32 system calls, totalling 39.1 s of execution time. Since they cannot be consistently instrumented, those calls are not included in the measure. Please switch to the Walltime instrument to accurately measure system calls. Learn more about measurement and system calls.

which to me indicates that the numbers in the report here are not tracking what we would like to see.

Additionally, we don't get the split by backend, which is also something I would like to see if we integrate performance tooling.
To do that we would need to have the benchmark on a lower level such as in execute_query

dangotbanned · 2025-05-18T19:10:37Z

tpch/generate_data.py

@@ -8,10 +8,11 @@
 import pyarrow.csv as pc
 import pyarrow.parquet as pq

-if not Path("data").exists():
-    Path("data").mkdir()
-
 SCALE_FACTOR = 0.1


@FBruzzesi (#805 (comment))

In #972 we were using TPCH with 0.25 ratio and it was taking ~40mins to run IIRC. That's a bit much for what I would consider fast iteration - maybe a ratio of 0.1 is more reasonable to start with

IIRC the docs for the duckdb TPCH tests used 0.01 - so we can go lower

I found the bit in the docs that used 0.01 (https://duckdb.org/docs/0.10/extensions/tpch#listing-expected-answers)

To produced the expected results for all queries on scale factors 0.01, 0.1, and 1, run

If we can run these with 10x less data, surely we should right?

The current run has been going for almost 2 hours 😅
(https://github.com/narwhals-dev/narwhals/actions/runs/15098359607/job/42436026213?pr=2516)

The current run has been going for almost 2 hours 😅

Yes I have been monitoring it - it's a bit odd, isn't it? I am not fully sure what's going on 🤔

ci: Add codspeed for performance monitoring

8d0199d

FBruzzesi added ci performance labels May 8, 2025

FBruzzesi added 3 commits May 8, 2025 15:54

simplify

d130a87

typo

b9e8ff4

need to use proper action

4a4c47a

FBruzzesi added 9 commits May 18, 2025 18:09

Merge branch 'main' into ci/add-codspeed-for-perf-monitoring

44234d2

refactor in the wild

dfac302

try with binding

5d3a598

try to simplify

f8c4bd2

parametrize backend

12bdeed

something is off with dask

f698e7a

one more

04b13e2

numpy<2

6a62144

ok I think we are there

cda5ee0

dangotbanned reviewed May 18, 2025

View reviewed changes

FBruzzesi closed this Jun 2, 2025

FBruzzesi deleted the ci/add-codspeed-for-perf-monitoring branch June 16, 2025 10:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

ci: Add codspeed for performance monitoring #2516

ci: Add codspeed for performance monitoring #2516

Uh oh!

FBruzzesi commented May 8, 2025

Uh oh!

codspeed-hq bot commented May 8, 2025

Detected benchmarks

Uh oh!

FBruzzesi commented May 8, 2025

Uh oh!

dangotbanned May 18, 2025 •

edited

Loading

Uh oh!

FBruzzesi May 18, 2025

Uh oh!

Uh oh!

ci: Add codspeed for performance monitoring #2516

ci: Add codspeed for performance monitoring #2516

Uh oh!

Conversation

FBruzzesi commented May 8, 2025

What type of PR is this? (check all applicable)

Related issues

Checklist

If you have comments or can explain your changes, please do so below

Uh oh!

codspeed-hq bot commented May 8, 2025

CodSpeed Performance Report

Congrats! CodSpeed is installed 🎉

Detected benchmarks

Uh oh!

FBruzzesi commented May 8, 2025

Uh oh!

dangotbanned May 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

FBruzzesi May 18, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

dangotbanned May 18, 2025 •

edited

Loading