Conversation

@stan-dot
Contributor

@stan-dot stan-dot commented Feb 3, 2025

Fixes #1003

Instructions to reviewer on how to test:

  1. Check the tests
  2. Confirm that the check printed either `✅ All tests ran within the acceptable time limit.` or `❌ The following tests exceeded the 1s threshold:`

Checks for reviewer

  • Would the PR title make sense to a scientist reading a set of release notes?
  • If a new device has been added, does it follow the standards?
  • If changing the API for a pre-existing device, ensure that any beamlines using this device have updated their Bluesky plans accordingly
  • Have the connection tests for the relevant beamline(s) been run via dodal connect ${BEAMLINE}

@stan-dot stan-dot added enhancement New feature or request github_actions Pull requests that update GitHub Actions code low priority Not needed for production in the near future Developer Experience labels Feb 3, 2025
@stan-dot stan-dot self-assigned this Feb 3, 2025
@stan-dot stan-dot linked an issue Feb 3, 2025 that may be closed by this pull request
@stan-dot stan-dot force-pushed the 1003-add-ci-job-that-fails-prs-if-tests-are-taking-1s branch from d3e68ee to deed495 Compare February 4, 2025 10:48
@stan-dot stan-dot requested a review from DominicOram February 4, 2025 10:51
@stan-dot stan-dot marked this pull request as ready for review February 4, 2025 10:52
@stan-dot stan-dot requested a review from a team as a code owner February 4, 2025 10:52
Contributor

@DominicOram DominicOram left a comment


Great, thank you. I think once we merge in main it will pass again, as I moved the CLI test in #1000. The only issue is that it now looks like the tests are being run twice: the logs are printing results twice, and if you compare the time on https://github.com/DiamondLightSource/dodal/actions/runs/13132250037/job/36639728998 to https://github.com/DiamondLightSource/dodal/actions/runs/13133823256/job/36644639445?pr=1033 it's ~2x the time.

@stan-dot
Contributor Author

stan-dot commented Feb 4, 2025

Looking into it now.

Update: noticed where this is happening.

@stan-dot stan-dot force-pushed the 1003-add-ci-job-that-fails-prs-if-tests-are-taking-1s branch from 519fb11 to 8535a08 Compare February 4, 2025 12:07
@stan-dot
Contributor Author

stan-dot commented Feb 4, 2025

Reproducing locally:

 1027 passed, 4 skipped, 40 deselected in 31.79s ==================================================================
_____________________________________________________________________________________ summary ______________________________________________________________________________________
  report: commands succeeded
  congratulations :)
  
================================================================= 1027 passed, 4 skipped, 40 deselected in 31.28s ==================================================================
_____________________________________________________________________________________ summary ______________________________________________________________________________________
tests: commands succeeded
congratulations :)

Beginning to suspect that the culprit is pytest running twice, because both a coverage report and a JSON report for timing are requested:
---------- coverage: platform linux, python 3.10.16-final-0 ----------

this line only occurs once in this run https://github.com/DiamondLightSource/dodal/actions/runs/13135068486/job/36648559977?pr=1033

reading the docs...
https://pytest-cov.readthedocs.io/en/latest/reporting.html
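
The two reports don't inherently require two runs; a hypothetical consolidation into a single pytest invocation could look like the fragment below. The flags are real pytest-cov and pytest-json-report options; merging them into one `addopts` line is the assumption here, not what the PR necessarily did:

```toml
# Hypothetical pyproject.toml fragment: one pytest run emitting both the
# coverage report and the timing JSON, so the suite executes only once.
[tool.pytest.ini_options]
addopts = "--cov=dodal --cov-report=term --json-report --json-report-file=report.json"
```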

@stan-dot
Contributor Author

stan-dot commented Feb 4, 2025

Tried to replicate the time doubling on a smaller scale, and the undesired doubling does not happen there.

@stan-dot
Contributor Author

stan-dot commented Feb 4, 2025

❌ The following tests exceeded the 1s threshold:
 - system_tests/test_cli.py::test_cli_version: 3.46s
 

should I exclude this?

@stan-dot
Contributor Author

stan-dot commented Feb 4, 2025

It's possible that adding the dev dependency pytest-json-report is not ideal, and that a JSON report could be generated solely by pytest-cov.

Update: the cov.json does not include timing data.

Therefore pytest-json-report is the right tool after all.

@DominicOram
Contributor

should I exclude this?

No, as I said above, it will be fixed if you merge in main. Happy to discuss this issue further but with so many comments above it's hard for me to know if I'm getting a notification that requires action or not. It would be helpful to me if you could summarise findings into one comment once you have solved the issue or need input from someone else, thanks!

@stan-dot
Contributor Author

stan-dot commented Feb 4, 2025

I thought I had already merged in main when I made that comment.

@stan-dot stan-dot force-pushed the 1003-add-ci-job-that-fails-prs-if-tests-are-taking-1s branch from 90c7aa7 to 229f472 Compare February 4, 2025 15:56
@DominicOram
Contributor

I thought that I had already merged in main at the moment of making that comment

Ah, my bad. I think that the solution is that we need different thresholds for system tests vs unit tests. Can we set like a 5s limit on system tests or we can just ignore if they go over the duration, I don't mind which.

@stan-dot stan-dot assigned stan-dot and unassigned stan-dot Feb 4, 2025
@stan-dot
Contributor Author

stan-dot commented Feb 4, 2025

Can we set like a 5s limit on system tests or we can just ignore if they go over the duration, I don't mind which.

Making it 5s for system tests.
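
In CI this could be two invocations of the same script with different thresholds. A sketch of hypothetical GitHub Actions steps (the step names and the `system-report.json` filename are illustrative; the unit-report invocation matches the log quoted later in this thread):

```yaml
# Illustrative workflow steps: one duration threshold per test suite.
- name: Check unit test durations
  run: python .github/scripts/check_test_durations.py unit-report.json 1

- name: Check system test durations
  run: python .github/scripts/check_test_durations.py system-report.json 5
```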

@codecov

codecov bot commented Feb 4, 2025

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 97.69%. Comparing base (81496ff) to head (04b7aa5).
Report is 1 commits behind head on main.

Additional details and impacted files
@@           Coverage Diff           @@
##             main    #1033   +/-   ##
=======================================
  Coverage   97.69%   97.69%           
=======================================
  Files         160      160           
  Lines        6629     6629           
=======================================
  Hits         6476     6476           
  Misses        153      153           

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

@stan-dot
Contributor Author

stan-dot commented Feb 4, 2025

this seems to work now @DominicOram

@stan-dot
Contributor Author

stan-dot commented Feb 4, 2025

coverage took a nosedive though

@DominicOram
Contributor

this seems to work now @DominicOram
coverage took a nosedive though

Similar to above, it's not clear to me from these comments what you feel the next steps are on this? Are you still looking at the coverage issue? Would you like help on it? Do you want me to merge it despite the coverage issue?

@stan-dot
Contributor Author

stan-dot commented Feb 7, 2025

These are files that didn't have author revisions, but contain unexpected coverage changes

I haven't seen this before, investigating

https://docs.codecov.com/docs/unexpected-coverage-changes

https://app.codecov.io/gh/DiamondLightSource/dodal/pull/1033/indirect-changes#f0948950da8d813eb6b190daf1763a51-R2

@stan-dot stan-dot force-pushed the 1003-add-ci-job-that-fails-prs-if-tests-are-taking-1s branch from 01664bd to 7e5c832 Compare February 7, 2025 15:25
@stan-dot
Contributor Author

it's not clear to me from these comments what you feel the next steps are on this? Are you still looking at the coverage issue?

Yes, I have no idea why the coverage is going down. I have never seen this before. Presumably it would be good if someone else took a look at it too.

@stan-dot
Contributor Author

At the moment the CI failure is due to test time:

Run python .github/scripts/check_test_durations.py unit-report.json 1
❌ The following tests exceeded the 1.00s threshold:

  • tests/devices/unit_tests/test_apple2_undulator.py::test_given_gate_never_closes_then_setting_gaps_times_out: 3.01s
  • tests/devices/unit_tests/test_apple2_undulator.py::test_given_gate_never_closes_then_setting_phases_times_out: 1.02s
  • tests/devices/unit_tests/test_apple2_undulator.py::test_given_gate_never_closes_then_setting_jaw_phases_times_out: 1.01s

@Relm-Arrowny
Contributor

atm the CI fail is due to test time:

Run python .github/scripts/check_test_durations.py unit-report.json 1 ❌ The following tests exceeded the 1.00s threshold:

  • tests/devices/unit_tests/test_apple2_undulator.py::test_given_gate_never_closes_then_setting_gaps_times_out: 3.01s
  • tests/devices/unit_tests/test_apple2_undulator.py::test_given_gate_never_closes_then_setting_phases_times_out: 1.02s
  • tests/devices/unit_tests/test_apple2_undulator.py::test_given_gate_never_closes_then_setting_jaw_phases_times_out: 1.01s

I can have a look. I am pretty sure they run close to 1s, but I am not sure about the 3.01s.

@Relm-Arrowny
Contributor

I can have a look I am pretty sure they run close to 1 sec but I am not sure about the 3.01s.

Yes, the main version definitely took 3 seconds, and it will be fixed by #1032.

@DominicOram
Contributor

add timeout = 1 in tool.pytest.ini_options ?

Yeah, I don't mind solving it like this. But I guess it would be annoying if this meant that:

  • No subsequent tests are run after the timeout one
  • You don't get to see if the timeout test failed for other reasons

But I think we can live with these things :)

Presumably it'd be good if someone took a look at it too.

Ok, I will try and find some time to take a look

@stan-dot
Contributor Author

stan-dot commented Feb 10, 2025

============================================= 1050/1090 tests collected (40 deselected) in 5.48s =============================================

for

pytest -m 'not (s03 or adsim)' --collect-only

while for system_tests:

=============================================== 2/42 tests collected (40 deselected) in 3.43s ================================================

and for tests:

======================================================= 1048 tests collected in 4.02s ========================================================

so the same number of tests is collected in total (2 + 1048 = 1050).

@stan-dot stan-dot force-pushed the 1003-add-ci-job-that-fails-prs-if-tests-are-taking-1s branch from 7e5c832 to 9a07d22 Compare February 10, 2025 14:52
@stan-dot
Contributor Author

waiting for #1032 to be fixed

@stan-dot stan-dot force-pushed the 1003-add-ci-job-that-fails-prs-if-tests-are-taking-1s branch from 372c3dd to 63d628c Compare February 11, 2025 10:41
@stan-dot
Contributor Author

@Relm-Arrowny how is that other PR going?

#1033 (comment)

@stan-dot stan-dot force-pushed the 1003-add-ci-job-that-fails-prs-if-tests-are-taking-1s branch from 63d628c to 2702f52 Compare February 18, 2025 11:29
@DominicOram
Contributor

@Relm-Arrowny how is that other PR going?

#1033 (comment)

This is on me to review. I will do so today

@Relm-Arrowny
Contributor

add timeout = 1 in tool.pytest.ini_options ?

Yeah, I don't mind solving it like this. But I guess it would be annoying if this meant that:

  • No subsequent tests are run after the timeout one
  • You don't get to see if the timeout test failed for other reasons

I have not tested it lately, but I am 80% sure it runs all the tests even if one times out, and the timed-out test runs to completion. The timeout just fails it afterwards.
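
The `timeout = 1` suggestion refers to the pytest-timeout plugin; a sketch of what the configuration could look like. The `timeout` and `timeout_method` ini options are real pytest-timeout settings; choosing the `signal` method here is an assumption about how the behaviour discussed above would be achieved:

```toml
# Sketch: pytest-timeout configuration. With method = "signal" (POSIX,
# main thread only), a timed-out test is failed via SIGALRM and the
# remaining tests still run; the "thread" method, used where signals
# are unavailable, terminates the whole pytest process instead.
[tool.pytest.ini_options]
timeout = 1
timeout_method = "signal"
```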

Contributor

@DominicOram DominicOram left a comment


Great, thank you! Passing now that the long tests have been fixed

@DominicOram DominicOram merged commit 23f4a16 into main Feb 19, 2025
19 checks passed
@DominicOram DominicOram deleted the 1003-add-ci-job-that-fails-prs-if-tests-are-taking-1s branch February 19, 2025 10:23
@DominicOram DominicOram changed the title add one second test check to CI Add check for slow tests to CI Feb 19, 2025

