It would be be great if you could create fp8 versions of the models :) thanks for lower vram or faster generation