feat(rocm): add gfx908 (MI100) and gfx90a (MI210) GPU support by kenvandine · Pull Request #2092 · lemonade-sdk/lemonade

kenvandine · 2026-06-03T18:37:47Z

Summary

Adds AMD Instinct MI100 (CDNA1/gfx908) and MI200/MI210 (CDNA2/gfx90a) to the lemonade backend, following llamacpp-rocm nightly build support added in lemonade-sdk/llamacpp-rocm#103.

- Add gfx908 and gfx90a to ROCM_ARCH_MAPPING with both the direct arch string (used via the HSA/WSL path) and the KFD-computed variants (gfx9008, gfx9010) produced by the native Linux digit-only parsing path
- Extend the gfx arch regex from \d{4} to [0-9a-f]{3,4} to match 3-char and alphanumeric arch strings like gfx908 and gfx90a
- Add MI100/MI200/MI210/Arcturus/Aldebaran marketing name recognition as a fallback in identify_rocm_arch_from_name
- Register gfx908 and gfx90a as supported families for the llamacpp rocm and sd-cpp rocm backends
- Add human-readable device family names for both new architectures
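For readers skimming the list above, here is a minimal sketch of how the mapping lookup and the marketing-name fallback could fit together. It is not the PR's code: `kArchMapping`, `arch_from_name`, and `resolve_arch` are hypothetical names, the table holds only the four entries named in the summary, and trying the table before the name is an assumption.

```cpp
#include <algorithm>
#include <cctype>
#include <map>
#include <string>

// Four entries reconstructed from the summary; the real ROCM_ARCH_MAPPING has more rows.
static const std::map<std::string, std::string> kArchMapping = {
    {"gfx908",  "gfx908"},  // direct arch string (HSA/WSL path)
    {"gfx9008", "gfx908"},  // KFD-computed variant (native Linux)
    {"gfx90a",  "gfx90a"},  // direct arch string (HSA/WSL path)
    {"gfx9010", "gfx90a"},  // KFD-computed variant (native Linux)
};

// Hypothetical helper: case-insensitive substring match on the device name.
static std::string arch_from_name(std::string name) {
    std::transform(name.begin(), name.end(), name.begin(),
                   [](unsigned char c) { return static_cast<char>(std::tolower(c)); });
    auto has = [&](const char* needle) { return name.find(needle) != std::string::npos; };
    if (has("mi100") || has("arcturus")) return "gfx908";
    if (has("mi200") || has("mi210") || has("aldebaran")) return "gfx90a";
    return "";  // unknown device
}

// Hypothetical helper: table lookup first, name fallback second (assumed order).
static std::string resolve_arch(const std::string& detected, const std::string& device_name) {
    auto it = kArchMapping.find(detected);
    if (it != kArchMapping.end()) return it->second;
    return arch_from_name(device_name);
}
```

Under these assumptions, `resolve_arch("gfx9010", "")` resolves through the table, while `resolve_arch("", "AMD Instinct MI100")` resolves through the name fallback.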

Dependencies

Depends on lemonade-sdk/llamacpp-rocm#103, "Add MI-100 (gfx908) and MI-210 (gfx90a) GPU support" (adds gfx908/gfx90a nightly builds)

Feature request

Closes llamacpp-rocm#102, "MI-100 and MI-210 GPU support"

Test plan

- Verify lemonade backends reports rocm as supported on a system with an AMD Instinct MI100 (gfx908)
- Verify lemonade backends reports rocm as supported on a system with an AMD Instinct MI200/MI210 (gfx90a)
- Verify lemonade pull + lemonade run works end-to-end with a model on both GPU targets once the llamacpp-rocm#103 builds land
- Confirm existing RDNA2/3/3.5/4 detection is unaffected
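The last item can be partly checked without hardware. The sketch below compares the old and new patterns quoted in the summary against a few sample arch strings. Using `std::regex_match` for whole-string matching is an assumption about how the real code applies the pattern, and `kOldArch`, `kNewArch`, `matches`, and `regressions` are hypothetical names.

```cpp
#include <regex>
#include <string>
#include <vector>

// Patterns quoted from the summary; the "gfx" prefix and full-string anchoring are assumptions.
static const std::regex kOldArch(R"(gfx\d{4})");
static const std::regex kNewArch(R"(gfx[0-9a-f]{3,4})");

// True when the whole string matches the pattern.
static bool matches(const std::regex& re, const std::string& s) {
    return std::regex_match(s, re);
}

// Returns every sample the old pattern accepted but the new pattern rejects.
static std::vector<std::string> regressions(const std::vector<std::string>& samples) {
    std::vector<std::string> lost;
    for (const auto& s : samples) {
        if (matches(kOldArch, s) && !matches(kNewArch, s)) lost.push_back(s);
    }
    return lost;
}
```

Calling `regressions({"gfx1030", "gfx1100", "gfx1151", "gfx1201"})` should return an empty vector, since any four-digit string also satisfies `[0-9a-f]{3,4}`. Under the same assumptions, `gfx908` and `gfx90a` match only the new pattern.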

🤖 Generated with Claude Code

Adds AMD Instinct MI100 (CDNA1/gfx908) and MI200/MI210 (CDNA2/gfx90a) to the lemonade backend, following the llamacpp-rocm nightly build support added in lemonade-sdk/llamacpp-rocm#103.

- Add gfx908 and gfx90a to ROCM_ARCH_MAPPING with both the direct arch string (used via HSA/WSL path) and the KFD-computed variants (gfx9008, gfx9010) produced by the native Linux digit-only parsing path
- Extend the gfx arch regex from \d{4} to [0-9a-f]{3,4} to match 3-char and alphanumeric arch strings like gfx908 and gfx90a
- Add MI100/MI200/MI210/Arcturus/Aldebaran marketing name recognition as a fallback in identify_rocm_arch_from_name
- Register gfx908 and gfx90a as supported families for llamacpp rocm and sd-cpp rocm backends
- Add human-readable device family names for both new architectures

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

superm1 · 2026-06-04T02:54:49Z

 // Empty string means "no ROCm binary for this ISA" — skip for get_rocm_arch / install filenames.
 const std::map<std::string, std::string> ROCM_ARCH_MAPPING = {
+    // CDNA1 - AMD Instinct MI100 (Arcturus)
+    {"gfx908",  "gfx908"},   // Direct arch string (from HSA/WSL path)


what is this

superm1 · 2026-06-04T02:54:57Z

+    {"gfx9008", "gfx908"},   // KFD-computed string on native Linux (90008 → gfx9008)
+
+    // CDNA2 - AMD Instinct MI200/MI210 (Aldebaran)
+    {"gfx90a",  "gfx90a"},   // Direct arch string (from HSA/WSL path)


what is this

superm1 · 2026-06-04T02:56:26Z

+    // CDNA1 GPUs (gfx908 architecture) - AMD Instinct MI100
+    if (device_lower.find("mi100") != std::string::npos ||
+        device_lower.find("arcturus") != std::string::npos) {
+        return "gfx908";
+    }
+
+    // CDNA2 GPUs (gfx90a architecture) - AMD Instinct MI200/MI210
+    if (device_lower.find("mi200") != std::string::npos ||
+        device_lower.find("mi210") != std::string::npos ||
+        device_lower.find("aldebaran") != std::string::npos) {
+        return "gfx90a";
+    }
+


where did all this come from?

superm1

don't you need lemonade/llama.cpp too?

kenvandine · 2026-06-04T11:55:15Z

Yes, lemonade-sdk/llama.cpp#15

superm1 · 2026-06-04T12:19:31Z

> Yes, lemonade-sdk/llama.cpp#15

That's for openvino though. We need rocm change.

kenvandine · 2026-06-04T15:46:27Z

> > Yes, lemonade-sdk/llama.cpp#15
>
> That's for openvino though. We need rocm change.

Sorry, crossed the streams there. Our llama.cpp already includes these in the gpu_targets.

kenvandine marked this pull request as draft June 3, 2026 18:38

Merge branch 'main' into kenvandine/gfx90a_gfx908

40520d7

kenvandine requested a review from superm1 June 3, 2026 22:17

kenvandine marked this pull request as ready for review June 3, 2026 23:52

superm1 reviewed Jun 4, 2026

Merge branch 'main' into kenvandine/gfx90a_gfx908

4fa6eb8

github-actions bot added labels on Jun 6, 2026: engine::llamacpp (llama.cpp backend (LlamaCppServer); GPU/CPU LLM inference (Vulkan, ROCm, Metal)), runtime::rocm (AMD ROCm runtime), enhancement (New feature or request)

kenvandine wants to merge 3 commits into main from kenvandine/gfx90a_gfx908
