
Respect TORCH_CUDA_ARCH_LIST to speed up builds#100

Open
d3banjan wants to merge 1 commit into Dao-AILab:main from d3banjan:fix/respect-torch-cuda-arch-list

Conversation

@d3banjan

Fixes #39

Summary

  • When TORCH_CUDA_ARCH_LIST is set, parse it and generate only the requested -gencode flags instead of hardcoding all supported architectures
  • When unset, behavior is completely unchanged (existing hardcoded flags remain as fallback)
  • This is the standard PyTorch convention already used by flash-attention, xformers, and PyTorch's own cpp_extension.py
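The parsing described above can be sketched as follows. This is an illustrative reconstruction, not the PR's actual code: the function name and fallback handling are assumptions, and the `+PTX` suffix is simply stripped, matching the behavior described in the test plan below.

```python
# Sketch: turn TORCH_CUDA_ARCH_LIST (e.g. "7.5;8.0;8.6+PTX") into
# nvcc -gencode flags, falling back to a hardcoded list when unset.
def parse_arch_list(env_value, fallback_flags):
    if not env_value:
        # Env var unset: keep the existing hardcoded flags (unchanged behavior).
        return list(fallback_flags)
    flags = []
    # PyTorch convention allows ";" or space as separators.
    for arch in env_value.replace(" ", ";").split(";"):
        if not arch:
            continue
        # Strip a "+PTX" suffix and drop the dot: "8.6+PTX" -> "86".
        num = arch.removesuffix("+PTX").replace(".", "")
        flags.append(f"-gencode=arch=compute_{num},code=sm_{num}")
    return flags
```

In a `setup.py`, this would be called with `os.environ.get("TORCH_CUDA_ARCH_LIST")` and the existing hardcoded flag list as the fallback.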

Motivation

Building from source currently compiles for all supported GPU architectures regardless of the target hardware. For users targeting a single architecture (e.g. TORCH_CUDA_ARCH_LIST="8.6"), this makes builds ~5–7x slower than necessary.

Test plan

  • pip install -e . with TORCH_CUDA_ARCH_LIST unset — verify all existing gencode flags are emitted (unchanged behavior)
  • TORCH_CUDA_ARCH_LIST="8.6" pip install -e . — verify only -gencode arch=compute_86,code=sm_86 appears in nvcc output
  • TORCH_CUDA_ARCH_LIST="7.5;8.0" pip install -e . — verify both architectures are emitted
  • TORCH_CUDA_ARCH_LIST="8.6+PTX" pip install -e . — verify PTX suffix is stripped and compute_86 is used

When TORCH_CUDA_ARCH_LIST is set, use it to generate -gencode flags
instead of hardcoding all supported architectures. This is the standard
PyTorch convention used by flash-attention, xformers, and PyTorch's own
cpp_extension.py. When the env var is unset, behavior is unchanged.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>


Development

Successfully merging this pull request may close these issues.

TORCH_CUDA_ARCH_LIST support
