Open
Description
Moving this to it's own issue after being mentioned in #1763 (comment) - it wasn't immediately trivial in trying to get the eval harness to ensure a specific ordering of tasks (we would want successive generation - non-generation tasks). Will follow up in another PR.