-2. Download the tarball (e.g., `llama.cpp-bXXXX-cuda-12.8.tar.gz`)
+2. Download the tarball matching your host CPU architecture: `-amd64` for x86_64, `-arm64` for aarch64. Filename format: `llama.cpp-bXXXX-cuda-<cuda>-<arch>.tar.gz`
3. Extract the archive:
```bash
-tar -xzf llama.cpp-bXXXX-cuda-12.8.tar.gz
-cd cuda-12.6
+# x86_64 host
+tar -xzf llama.cpp-bXXXX-cuda-12.8-amd64.tar.gz
+# aarch64 host (e.g. Grace Blackwell, DGX Spark)
+tar -xzf llama.cpp-bXXXX-cuda-12.8-arm64.tar.gz
+cd cuda-12.8
```
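
The architecture choice in step 2 can be automated. The following is a minimal sketch, not part of the README: the helper names `arch_suffix` and `tarball_name` are hypothetical, and `bXXXX` stays the same build-tag placeholder used above. It assumes only the filename format given in step 2, `llama.cpp-bXXXX-cuda-<cuda>-<arch>.tar.gz`.

```bash
#!/usr/bin/env bash
# Sketch only: arch_suffix and tarball_name are hypothetical helpers,
# not something shipped with the release.

# Map a machine name (as printed by `uname -m`) to the tarball suffix.
arch_suffix() {
  case "$1" in
    x86_64|amd64)  echo "amd64" ;;
    aarch64|arm64) echo "arm64" ;;
    *) echo "unsupported architecture: $1" >&2; return 1 ;;
  esac
}

# Build the tarball filename from a build tag, CUDA version, and machine name.
tarball_name() {
  local build="$1" cuda="$2" machine="$3" suffix
  suffix="$(arch_suffix "$machine")" || return 1
  echo "llama.cpp-${build}-cuda-${cuda}-${suffix}.tar.gz"
}

# Print the filename expected for the current host.
tarball_name "bXXXX" "12.8" "$(uname -m)"
```

On an x86_64 host the last line prints the `-amd64` filename and on an aarch64 host the `-arm64` one; any other machine name produces an error on stderr instead of a guess.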
### Run
@@ -73,7 +87,7 @@ cat VERSION.txt
- NVIDIA GPU with compute capability 7.5 or higher
- Appropriate NVIDIA driver for your CUDA version:
  - CUDA 12.8+: Driver >= 570.15
-- Linux x86_64 (Ubuntu 22.04 compatible)
+- Linux x86_64 or aarch64 (Ubuntu 22.04 compatible)
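
The requirements listed in this hunk can be checked before extracting anything. This is a rough sketch under stated assumptions, not part of the README: `version_ge` is a hypothetical helper that relies on GNU `sort -V`, and the GPU query assumes an `nvidia-smi` recent enough to report the `compute_cap` field. The thresholds (driver 570.15, compute capability 7.5) are the ones stated above.

```bash
#!/usr/bin/env bash
# Sketch only: version_ge is a hypothetical helper, not shipped with the release.

# Succeed if version $1 >= version $2 (uses GNU sort's version ordering).
version_ge() {
  [ "$(printf '%s\n%s\n' "$1" "$2" | sort -V | head -n1)" = "$2" ]
}

# Host architecture must be one of the two supported by the tarballs.
case "$(uname -m)" in
  x86_64|aarch64) echo "architecture: ok" ;;
  *)              echo "architecture: unsupported ($(uname -m))" ;;
esac

# Query the first GPU only; skipped entirely when nvidia-smi is not installed.
if command -v nvidia-smi >/dev/null 2>&1; then
  IFS=', ' read -r driver cap < <(
    nvidia-smi --query-gpu=driver_version,compute_cap --format=csv,noheader | head -n1
  )
  if version_ge "$driver" "570.15"; then
    echo "driver $driver: ok"
  else
    echo "driver $driver: needs >= 570.15 for CUDA 12.8+"
  fi
  if version_ge "$cap" "7.5"; then
    echo "compute capability $cap: ok"
  else
    echo "compute capability $cap: needs >= 7.5"
  fi
else
  echo "nvidia-smi not found; cannot check driver or GPU"
fi
```

Version ordering is used rather than a numeric comparison because driver versions such as `570.124.06` have more than two components and would compare incorrectly as decimals.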