This is a big one, but it would be nice to have some automated way of comparing e.g. the accelerate benchmarks to the reference implementations.