I don't know when exactly this test started failing, but it seems to be now systematically failling for >= py3.8. One example: https://github.com/Epistimio/orion/actions/runs/4737274539/jobs/8428037187?pr=1096#step:5:3870