## Setting up the Environment

<!-- @os:linux -->
<!-- @test:id=create-venv timeout=120 hidden=True -->
```bash
sudo apt update
sudo apt install -y python3-venv
python3 -m venv venv
source venv/bin/activate
python3 --version
pip --version
```
<!-- @test:end -->
<!-- @setup:id=activate-venv command="source venv/bin/activate" -->
<!-- @os:end -->

### Installing Basic Dependencies

<!-- @os:linux -->
pip install huggingface_hub
```

<!-- @os:linux -->
<!-- @test:id=install-deps timeout=300 hidden=True setup=activate-venv -->
```bash
python3 -m pip install --upgrade pip
python3 -m pip install huggingface_hub
```
<!-- @test:end -->
<!-- @os:end -->

### Install LLaMA Factory

LLaMA Factory depends on PyTorch. You should already have it installed per the requirements above.

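To confirm that PyTorch is ready before proceeding, a quick check (assuming the virtual environment created above is active) looks like this:

```python
# Sanity check: LLaMA Factory needs PyTorch, so verify it imports.
# Assumes the virtual environment created above is active.
import importlib.util

if importlib.util.find_spec("torch") is None:
    print("PyTorch is NOT installed; install it before continuing.")
else:
    import torch

    print(f"PyTorch {torch.__version__}; CUDA available: {torch.cuda.is_available()}")
```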
Download the source code from the [LLaMA Factory official GitHub repository](https://github.com/hiyouga/LlamaFactory) and install its dependencies.

<!-- @os:linux -->
<!-- @test:id=install-llamafactory timeout=900 setup=activate-venv -->
```bash
git clone --depth 1 https://github.com/hiyouga/LlamaFactory.git
cd LlamaFactory
pip install -e .
pip install -r requirements/metrics.txt
```
<!-- @test:end -->
<!-- @os:end -->

<!-- @os:linux -->
<!-- @test:id=verify-llamafactory-cli timeout=60 hidden=True setup=activate-venv -->
```bash
cd LlamaFactory
llamafactory-cli version || python -m llamafactory.cli version || true
command -v llamafactory-cli
```
<!-- @test:end -->
<!-- @os:end -->

Having successfully installed LLaMA Factory, let's run fine-tuning with it.

LLaMA Factory supports multiple fine-tuning schemes.

| LoRA fine-tuning | [examples/train_lora](https://github.com/hiyouga/LlamaFactory/tree/main/examples/train_lora) |
| QLoRA fine-tuning | [examples/train_qlora](https://github.com/hiyouga/LlamaFactory/tree/main/examples/train_qlora) |

<!-- @os:linux -->
<!-- @test:id=verify-llamafactory-files timeout=60 hidden=True setup=activate-venv -->
```python
import os
import sys

base = "LlamaFactory"
required = [
    "examples/train_lora/qwen3_lora_sft.yaml",
    "examples/inference/qwen3_lora_sft.yaml",
    "examples/merge_lora/qwen3_lora_sft.yaml",
]

missing = [p for p in required if not os.path.exists(os.path.join(base, p))]
if missing:
    print(f"FAIL: Missing required files: {missing}")
    sys.exit(1)

print("PASS: Required LLaMA Factory example files exist")
```
<!-- @test:end -->
<!-- @os:end -->

These example configuration files specify model parameters, fine-tuning method parameters, dataset parameters, evaluation parameters, and more. You can adjust them to your own needs. In this playbook, we will use [qwen3_lora_sft.yaml](https://github.com/hiyouga/LlamaFactory/blob/main/examples/train_lora/qwen3_lora_sft.yaml).
77143
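To give a sense of what such a file contains, here is an abridged, illustrative sketch of a LoRA SFT config. The field names mirror the ones edited later in this playbook, but the values shown (model id, dataset name) are assumptions; consult the actual `qwen3_lora_sft.yaml` for the authoritative configuration:

```yaml
# Illustrative excerpt only -- see examples/train_lora/qwen3_lora_sft.yaml
# in the repository for the real, complete configuration.
model_name_or_path: Qwen/Qwen3-8B   # assumed model id, for illustration
stage: sft                          # supervised fine-tuning
finetuning_type: lora
lora_rank: 8
dataset: identity                   # assumed dataset name, for illustration
output_dir: saves/qwen3_lora_sft
per_device_train_batch_size: 1
gradient_accumulation_steps: 8
num_train_epochs: 3.0
logging_steps: 10
save_steps: 500
```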
**Key parameters explained:**
llamafactory-cli train examples/train_lora/qwen3_lora_sft.yaml
```

<!-- @os:linux -->
<!-- @test:id=quick-train-llamafactory-lora timeout=1800 hidden=True setup=activate-venv -->
```bash
cd LlamaFactory

cp examples/train_lora/qwen3_lora_sft.yaml examples/train_lora/qwen3_lora_sft_ci.yaml

sed -i 's/lora_rank: 8/lora_rank: 6/g' examples/train_lora/qwen3_lora_sft_ci.yaml || true
sed -i 's|output_dir: .*|output_dir: saves/qwen3_lora_sft_ci|g' examples/train_lora/qwen3_lora_sft_ci.yaml || true
sed -i 's/overwrite_output_dir: false/overwrite_output_dir: true/g' examples/train_lora/qwen3_lora_sft_ci.yaml || true
sed -i 's/per_device_train_batch_size: .*/per_device_train_batch_size: 1/g' examples/train_lora/qwen3_lora_sft_ci.yaml || true
sed -i 's/gradient_accumulation_steps: .*/gradient_accumulation_steps: 1/g' examples/train_lora/qwen3_lora_sft_ci.yaml || true
sed -i 's/num_train_epochs: .*/num_train_epochs: 1/g' examples/train_lora/qwen3_lora_sft_ci.yaml || true
sed -i 's/logging_steps: .*/logging_steps: 1/g' examples/train_lora/qwen3_lora_sft_ci.yaml || true
sed -i 's/save_steps: .*/save_steps: 5/g' examples/train_lora/qwen3_lora_sft_ci.yaml || true

llamafactory-cli train examples/train_lora/qwen3_lora_sft_ci.yaml
```
<!-- @test:end -->
<!-- @os:end -->

After running LLM fine-tuning, all generated outputs are stored in `output_dir`, including model checkpoint files, configuration files, and training metrics.
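As a quick manual check, you can list that directory. The path below is the `output_dir` used by this playbook's CI config and is only an assumption; substitute the `output_dir` from the config you actually trained with:

```shell
# List the fine-tuning outputs. The path is illustrative; use the
# output_dir value from the config you actually trained with.
OUT_DIR="LlamaFactory/saves/qwen3_lora_sft_ci"
ls -l "$OUT_DIR" 2>/dev/null || echo "No outputs yet at $OUT_DIR"
```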

<p align="center">
  <img src="assets/qwen3_lora.png" alt="Qwen3 LoRA Fine-tuning" width="600" />
</p>

<!-- @os:linux -->
<!-- @test:id=verify-llamafactory-train-output timeout=120 hidden=True setup=activate-venv -->
```python
import os
import sys
import glob

out_dir = "LlamaFactory/saves/qwen3_lora_sft_ci"
if not os.path.isdir(out_dir):
    print(f"FAIL: Missing output directory: {out_dir}")
    sys.exit(1)

required = [
    "adapter_config.json",
    "trainer_state.json",
    "training_args.bin",
]
missing = [f for f in required if not os.path.exists(os.path.join(out_dir, f))]
if missing:
    print(f"FAIL: Missing required files: {missing}")
    sys.exit(1)

adapter_weights = glob.glob(os.path.join(out_dir, "adapter_model*.safetensors")) + glob.glob(os.path.join(out_dir, "adapter_model*.bin"))
if not adapter_weights:
    print("FAIL: Missing adapter weights")
    sys.exit(1)

print("PASS: LLaMA Factory training output looks correct")
print(f"Found adapter weights: {adapter_weights}")
```
<!-- @test:end -->
<!-- @os:end -->

### Test the fine-tuned model

The result of exporting the fine-tuned model is shown below.

  <img src="assets/qwen3_export.png" alt="Export Qwen3 Fine-Tuned model" width="600" />
</p>

<!-- @os:linux -->
<!-- @test:id=export-llamafactory-model timeout=1800 hidden=True setup=activate-venv -->
```bash
cd LlamaFactory
pip install pyyaml

python - <<'PY'
import yaml
from pathlib import Path

src = Path("examples/merge_lora/qwen3_lora_sft.yaml")
dst = Path("examples/merge_lora/qwen3_lora_sft_ci.yaml")

cfg = yaml.safe_load(src.read_text())

cfg["adapter_name_or_path"] = "saves/qwen3_lora_sft_ci"
cfg["export_dir"] = "saves/qwen3_lora_sft_ci_merged"

dst.write_text(yaml.safe_dump(cfg, sort_keys=False))
print(f"Wrote {dst}")
PY

llamafactory-cli export examples/merge_lora/qwen3_lora_sft_ci.yaml
```
<!-- @test:end -->
<!-- @os:end -->

<!-- @os:linux -->
<!-- @test:id=verify-llamafactory-export-output timeout=120 hidden=True setup=activate-venv -->
```python
import os
import sys
import glob

out_dir = "LlamaFactory/saves/qwen3_lora_sft_ci_merged"
if not os.path.isdir(out_dir):
    print(f"FAIL: Missing export directory: {out_dir}")
    sys.exit(1)

required = ["config.json"]
missing = [f for f in required if not os.path.exists(os.path.join(out_dir, f))]
if missing:
    print(f"FAIL: Missing required export files: {missing}")
    sys.exit(1)

model_files = (
    glob.glob(os.path.join(out_dir, "*.safetensors")) +
    glob.glob(os.path.join(out_dir, "pytorch_model*.bin"))
)
if not model_files:
    print("FAIL: Missing merged model weights")
    sys.exit(1)

print("PASS: Exported merged model output looks correct")
```
<!-- @test:end -->
<!-- @os:end -->

## Using LLaMA Factory GUI
