replace cutlass submodule references with explicit build step #3234

jacobgorm · 2025-12-08T16:34:32Z

this has the benefit of not weighing down CI builds on non-CUDA platforms, and also enables shallow clone to make builds on CUDA platforms lighter weight. The cutlass checkout alone is 210MiB, which this commit saves on platforms that don't need it.

I don't have a modern enough CUDA GPU available to run with flash-attention, but I have tested on older CUDA machine as well as on MacOS, and modulo the asserts for the NVIDIA hardware version, the build completes as expected.

this has the benefit of not weighing down CI builds on non-CUDA platforms, and also enables shallow clone to make builds on CUDA platforms lighter weight. The cutlass checkout alone is 210MiB, which this commit saves on platforms that don't need it. I don't have a modern enough CUDA GPU available to run with flash-attention, but I have tested on older CUDA machine as well as on MacOS, and modulo the asserts for the NVIDIA hardware version, the build completes as expected.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

replace cutlass submodule references with explicit build step #3234

replace cutlass submodule references with explicit build step #3234

Uh oh!

jacobgorm commented Dec 8, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

replace cutlass submodule references with explicit build step #3234

Are you sure you want to change the base?

replace cutlass submodule references with explicit build step #3234

Uh oh!

Conversation

jacobgorm commented Dec 8, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant