Is there an example, or a set of steps, for running the model on GPUs other than H100s, including the flash attention version needed for them? Thanks!