Building many images in parallel mode seems to overwhelm the Docker daemon.
The easy workaround is to first build all images in sequential mode, then run the benchmark in parallel mode.
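For reference, here is a rough sketch of that workaround in Python. It is not the benchmark's actual code: `build_image`, `build_then_run`, `run_task`, and `max_workers` are hypothetical names, and it assumes the images can be built with the plain `docker build -t <tag> <context>` CLI command.

```python
import subprocess
from concurrent.futures import ThreadPoolExecutor


def build_image(tag, context_dir):
    """Build one image with the docker CLI; raises CalledProcessError on failure."""
    subprocess.run(["docker", "build", "-t", tag, context_dir], check=True)


def build_then_run(images, run_task, build=build_image, max_workers=4):
    """Build every image one at a time, then run the tasks in parallel.

    images:   list of (tag, context_dir) pairs (hypothetical layout)
    run_task: callable that takes an image tag and returns a result
    """
    # Phase 1: sequential builds, so the daemon handles one build at a time.
    for tag, context_dir in images:
        build(tag, context_dir)

    # Phase 2: all images already exist, so the parallel phase only runs tasks.
    tags = [tag for tag, _ in images]
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(run_task, tags))
```

Passing `build` as a parameter is only there so the two phases can be exercised without a running daemon.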
This isn't a high priority to fix. I just want a tracking issue so that, if we figure out why this happens and fix it, I can update my code.
Thanks!