
Commit bd2b033

Added the device version checks (#2307)
## 📌 Description

Added a device compute-capability check so the cuDNN FP8 prefill test only runs on supported hardware.

## 🚀 Pull Request Checklist

Thank you for contributing to FlashInfer! Before we review your pull request, please make sure the following items are complete.

### ✅ Pre-commit Checks

- [x] I have installed `pre-commit` by running `pip install pre-commit` (or used your preferred method).
- [x] I have installed the hooks with `pre-commit install`.
- [x] I have run the hooks manually with `pre-commit run --all-files` and fixed any reported issues.

> If you are unsure about how to set up `pre-commit`, see [the pre-commit documentation](https://pre-commit.com/).

## 🧪 Tests

- [x] Tests have been added or updated as needed.
- [x] All tests are passing (`unittest`, etc.).

## Summary by CodeRabbit

* **Tests**
  * Improved test suite with a refined hardware check: an FP8-related test now requires a specific GPU compute capability so it only runs on compatible hardware, reducing false skips and improving reliability.
1 parent 866773b commit bd2b033

1 file changed

Lines changed: 9 additions & 0 deletions

File tree

tests/attention/test_cudnn_prefill.py

```diff
@@ -4,6 +4,8 @@
 import flashinfer
 import cudnn
 
+from flashinfer.utils import get_compute_capability
+
 
 @pytest.mark.parametrize("batch_size", [1, 4])
 @pytest.mark.parametrize("s_qo", [8, 17, 700])
@@ -214,6 +216,13 @@ def test_cudnn_prefill_fp8(
     torch.manual_seed(seed)
     device = "cuda:0"
 
+    major, _ = get_compute_capability(torch.device(device))
+
+    if major != 10:
+        pytest.skip(
+            f"cuDNN FP8 prefill is not supported on compute capability {major}, skipping test"
+        )
+
     actual_seq_lens_q = torch.randint(
         1, s_qo + 1, (batch_size, 1, 1, 1), dtype=torch.int32, device=device
     )
```
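For context, the added guard follows the common pytest pattern of probing the GPU before a hardware-specific test body runs. The sketch below reproduces that pattern outside the FlashInfer test suite, using the stock PyTorch call `torch.cuda.get_device_capability` in place of `flashinfer.utils.get_compute_capability`; the test name and wording are illustrative, not part of this commit.

```python
# Minimal sketch of the compute-capability gate shown in the diff above.
# torch.cuda.get_device_capability returns a (major, minor) tuple, e.g.
# (10, 0) on compute-capability-10.x GPUs, analogous to what the
# flashinfer.utils.get_compute_capability helper returns in the test.
import pytest
import torch


def test_fp8_path_runs_only_on_cc10():  # hypothetical test name
    if not torch.cuda.is_available():
        pytest.skip("CUDA device required")

    major, _minor = torch.cuda.get_device_capability(torch.device("cuda:0"))
    if major != 10:
        # Same shape as the check added to test_cudnn_prefill_fp8:
        # skip rather than fail on GPUs without this FP8 support.
        pytest.skip(f"requires compute capability 10.x, got {major}.x")

    # ... the FP8-specific body of the test would follow here ...
```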
