You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
|`gfx-target-chooser`| Pick the right `gfx942` / `gfx90a` / `gfx1100` target and matching compiler flags. |
57
65
|`mi300x-tuner`| Opinionated training and inference tuning for MI300X, including TunableOp, FSDP, and FlashAttention. |
58
66
|`rocm-container-picker`| Map a workload to a known-good `rocm/*` container image. |
59
-
|`ryzen-ai-deploy`| Prepare, quantize, and deploy models to Ryzen AI NPUs across the ONNX, PyTorch, and hybrid CPU/NPU/iGPU paths. |
60
-
61
-
### Application integration
62
-
63
-
Embed AMD-optimized AI into end-user applications.
67
+
|`gfx-target-chooser`| Pick the right `gfx942` / `gfx90a` / `gfx1100` target and matching compiler flags. |
64
68
65
-
| Skill | What it does |
66
-
| --- | --- |
67
-
|`local-ai-app-integration`| Integrate local AI into cloud LLM apps for offline support, better privacy, and lower API costs. |
68
-
|`local-ai-use`| Route image generation, text-to-speech, and speech-to-text through a local AI Server to reduce token/cost usage. |
69
69
70
70
### Cross-stack porting
71
71
@@ -86,14 +86,12 @@ Close the loop from trace to fix to ship.
86
86
| --- | --- |
87
87
|`rocprof-capture`| Capture and interpret a `rocprof` trace for a workload. |
88
88
|`omniperf-tune`| Run `omniperf`, locate the bottleneck, and suggest the fix. |
89
-
|`migraphx-deploy`| Compile an ONNX model with MIGraphX and benchmark it on a target. |
90
-
|`rocm-ci-template`| Drop-in GitHub Actions for AMD-targeted projects. |
91
89
92
90
> Skills land incrementally; see [Status](#status) for what is available today.
93
91
94
92
## A federated catalog
95
93
96
-
The AMD stack is large and moves fast. ROCm, HIP, MIGraphX, vLLM-AMD, Ryzen AI, and framework integrations each have their own team, release cadence, and validation matrix. A single monorepo of skills, maintained by one central team, would always be a step behind.
94
+
The AMD stack is large and moves fast. ROCm, HIP, Ryzen AI, and framework integrations each have their own team, release cadence, and validation matrix. A single monorepo of skills, maintained by one central team, would always be a step behind.
97
95
98
96
So skills here are **federated**: each skill is owned and versioned by the team that owns the product it describes, and this repository is the **catalog** that brings them together.
99
97
@@ -116,13 +114,6 @@ So skills here are **federated**: each skill is owned and versioned by the team
- The `cuda-to-hip` skill lives with the HIP project.
122
-
-`rocm-doctor` lives with the ROCm release tree.
123
-
-`ryzen-ai-deploy` ships with Ryzen AI.
124
-
-`local-ai-app-integration` is incubating in this repo today and will graduate to `lemonade-sdk/lemonade`.
125
-
126
117
Each skill stays close to the engineers who ship the underlying product, the CI that validates it, and the release tag that pins it.
127
118
128
119
This repo also acts as an **incubator**: a skill can start its life under `skills/` here to iterate quickly, then graduate to its product repo and be re-pointed from `catalog/` once it has a clear owner, with no change for installed users.
Once a skill is installed, reference it in plain language while talking to your agent. For example:
184
175
185
-
- "Integrate local AI capabilities into my app with Embeddable Lemonade."
186
-
- "Use the `pytorch-rocm-setup` skill to get PyTorch running on this MI300X node."
187
-
- "Use the `cuda-to-hip` skill to convert these CUDA kernels and flag anything that needs manual review."
188
-
- "Use the `migraphx-deploy` skill to compile this ONNX model for `gfx942` and benchmark it."
189
-
- "Use the `omniperf-tune` skill to find the bottleneck in this training step."
176
+
- "Use AMD Skills to integrate local AI capabilities into my app with Embeddable Lemonade."
177
+
- "Use AMD Skills to get PyTorch running on this MI300X node."
178
+
- "Use AMD Skills to convert these CUDA kernels and flag anything that needs manual review."
190
179
191
180
The agent loads the matching `SKILL.md` and any helper scripts, then carries out the task. In most cases the agent will pick the right skill on its own from the description; explicit invocation is a fallback, not a requirement.
192
181
193
182
## Contributing a skill
194
183
195
-
We welcome contributions from AMD engineers, partners, and the community. There are two contribution paths, matching how the catalog is organized.
184
+
We welcome contributions from AMD engineers, and selected partners. There are two contribution paths, matching how the catalog is organized.
0 commit comments