Untrained primitive model, tiny model runs at about 400ms-500ms per frame on CPU, but is normal on GPU, 5ms per frame. Why is CPU running slow?