| name | triton-hip-reference-kernel-search |
|---|---|
| description | Search a corpus of Triton/HIP kernel patterns and adapt them to optimize kernels for AMD GPUs; use to find similar ops and reuse tiling/occupancy strategies. |
- Use when you need real kernel templates (attention, layernorm, matmul, activations) to adapt for AMD/ROCm.
- Do not load the entire corpus; grep targeted snippets instead.
- Search `references/train_crawl.json` with ripgrep for relevant ops; keep context tight.
- Extract only needed code and descriptions; rewrite for wave64 occupancy, LDS tiling, vectorized/coalesced access, and bank-conflict avoidance.
- Cite source file and lines; pair with reflection prompts to validate correctness and performance.
- `references/SEARCH.md`: Grep commands and tips for slicing snippets efficiently.
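
The search-and-cite steps above can also be sketched in Python for cases where ripgrep is not available. This is a minimal illustration, not the skill's actual tooling: `search_corpus` and `Hit` are hypothetical names, and it assumes the corpus stores one record per line, which the skill does not state. It streams the file instead of loading it, caps the number of hits, trims each snippet to a small window around the match, and produces a `path:line` citation for each hit.

```python
import re
from typing import List, NamedTuple


class Hit(NamedTuple):
    """One matching corpus line (hypothetical helper type)."""
    path: str
    line_no: int  # 1-based line number within the corpus file
    snippet: str  # trimmed window of text around the match

    def cite(self) -> str:
        # "path:line" form, suitable for citing the source of a snippet.
        return f"{self.path}:{self.line_no}"


def search_corpus(path: str, pattern: str,
                  max_hits: int = 5, max_chars: int = 400) -> List[Hit]:
    """Stream the corpus line by line and return at most max_hits matches.

    Assumption: one record per line. The file is never read whole, and
    each snippet is limited to max_chars characters to keep context tight.
    """
    rx = re.compile(pattern)
    hits: List[Hit] = []
    with open(path, encoding="utf-8", errors="replace") as fh:
        for line_no, line in enumerate(fh, start=1):
            m = rx.search(line)
            if m is None:
                continue
            # Start the window up to half of max_chars before the match.
            start = max(0, m.start() - max_chars // 2)
            snippet = line[start:start + max_chars].rstrip("\n")
            hits.append(Hit(path, line_no, snippet))
            if len(hits) >= max_hits:
                break  # stop early rather than scanning the whole corpus
    return hits
```

For example, `search_corpus("references/train_crawl.json", r"layer_?norm")` would return up to five trimmed snippets, and calling `cite()` on each hit gives the file and line to quote alongside the adapted kernel.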