Add serving-llms-on-instinct skill by Mahdi-CV · Pull Request #58 · amd/skills

Mahdi-CV · 2026-06-12T23:28:48Z

Adds the serving-llms-on-instinct skill: end-to-end LLM inference serving on AMD Instinct GPUs (MI300X/MI325X/MI350X/MI355X) with vLLM on ROCm. The skill handles GPU detection, environment validation, vLLM configuration, launch, and health verification, and refuses non-servable models (diffusion, audio, embeddings, rerankers) with an explanation.
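The health-verification step could be sketched roughly as follows. This is an illustrative sketch only, not code from this PR: it assumes the endpoint is vLLM's OpenAI-compatible server, which exposes a `/health` route, and the names `http_ok` and `wait_until_healthy` are hypothetical.

```python
import time
import urllib.error
import urllib.request


def http_ok(url, timeout=5.0):
    """Return True if a GET to `url` answers with HTTP 200."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.status == 200
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, or a non-2xx response.
        return False


def wait_until_healthy(base_url, probe=http_ok, attempts=30, delay_s=2.0):
    """Poll `<base_url>/health` until it succeeds or attempts run out.

    `probe` is injectable so the loop can be exercised without a server.
    """
    url = base_url.rstrip("/") + "/health"
    for _ in range(attempts):
        if probe(url):
            return True
        time.sleep(delay_s)
    return False
```

A caller would run something like `wait_until_healthy("http://localhost:8000")` after launching the server. The host and port here are assumptions, not values taken from the skill.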

What's included

SKILL.md and reference.md: skill definition and runtime guidance
scripts/detect.py: GPU detection via amd-smi (local or remote host)
scripts/validate.py: environment validation with auto-fix
scripts/sync_recipes.py: refresh recipes from vLLM recipes + Docker Hub
scripts/estimate_vram.py: weight + KV-cache VRAM estimation (handles quantized models)
data/recipes_cache.json: model configs synced from vllm-project/recipes
data/gpu_overrides.json: GPU-specific docker flags and legacy model configs
data/blacklist.json: models that cannot be served as LLM endpoints
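To illustrate what a weight + KV-cache estimate involves, here is a minimal sketch using the standard back-of-the-envelope formulas. It is not the logic of scripts/estimate_vram.py, which this page does not show. The function name and parameters are hypothetical, and the sketch ignores activations, framework overhead, and quantization metadata.

```python
GIB = 2**30


def estimate_vram_gib(num_params, weight_bits, num_layers, num_kv_heads,
                      head_dim, context_len, max_seqs=1, kv_bytes_per_elem=2):
    """Return (weights_gib, kv_cache_gib) for a decoder-only transformer.

    Weights: one value of `weight_bits` bits per parameter, so a 4-bit
    quantized model needs roughly a quarter of the FP16 footprint.
    KV cache: a key and a value vector (factor of 2) per layer, per KV head,
    per token, for every concurrently held sequence.
    """
    weight_bytes = num_params * weight_bits / 8
    kv_cache_bytes = (2 * num_layers * num_kv_heads * head_dim
                      * context_len * max_seqs * kv_bytes_per_elem)
    return weight_bytes / GIB, kv_cache_bytes / GIB


# Hypothetical 8B-parameter model: 32 layers, 8 KV heads, head_dim 128,
# FP16 weights and FP16 KV cache, one 8192-token sequence.
weights, kv = estimate_vram_gib(8e9, 16, 32, 8, 128, 8192)
print(f"weights ~ {weights:.1f} GiB, KV cache ~ {kv:.1f} GiB")
```

Comparing the sum of the two figures, plus some headroom, against the detected GPU's memory is one way to decide whether a model fits on a single device or needs tensor parallelism.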

Registration

Added the plugin entry to .claude-plugin/marketplace.json and .cursor-plugin/marketplace.json
Updated the README skills table: serving-llms-on-instinct moved from planned to in-repo

danielholanda

PR looks great. Next step here is to add a quick walkthrough so other folks have a bit more guidance when trying this:
#58

Add serving-llms-on-instinct skill

3cb2f89

Mahdi-CV requested a review from danielholanda June 12, 2026 23:28

Show connection table after endpoint is healthy

9bde991

danielholanda assigned Mahdi-CV Jun 15, 2026

danielholanda added 3 commits June 15, 2026 13:20

Address direct prompting

2ab0bef

Address false positives

d0861ba

Address false positives

8f5c755

danielholanda approved these changes Jun 15, 2026

danielholanda merged commit f553383 into main Jun 15, 2026
18 checks passed

danielholanda merged 5 commits into main from instinct-inference-skill

2 participants
