The hourly import currently succeeds even when there's individual flakes that failed. The PR validation does fail when any fails.
We should probably make the import job run to completion when one flake fails, but then fail with a clear report/issue, so we notice when flakes stop working, rather than only noticing when an unrelated PR fails