name: triton-hip-reference-kernel-search
description: Search and adapt Triton/HIP kernel patterns from a corpus to optimize kernels for AMD GPUs; use to find similar ops and reuse tiling/occupancy strategies.

AMD Kernel Patterns

  • Use when you need real kernel templates (attention, layernorm, matmul, activations) to adapt for AMD/ROCm.
  • Do not load the entire corpus; grep targeted snippets instead.
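A targeted search can be sketched in Python as well. This is a minimal illustration, not the skill's own tooling: `grep_snippets` and `cite` are hypothetical helpers, and the sketch assumes the corpus is line-oriented text. It streams the file line by line, so the whole corpus is never held in memory, and it truncates long lines to keep snippets small.

```python
import re
from collections import deque


def grep_snippets(path, pattern, context=2, max_hits=5, max_chars=200):
    """Stream a text file and return up to max_hits matches with nearby lines.

    Hypothetical helper: assumes the corpus is line-oriented text.
    A match that falls inside the trailing context of an earlier hit is
    folded into that hit rather than reported separately.
    """
    regex = re.compile(pattern)
    before = deque(maxlen=context)  # rolling window of preceding lines
    hits = []
    remaining = 0  # trailing context lines still owed to the latest hit
    with open(path, encoding="utf-8", errors="replace") as handle:
        for lineno, raw in enumerate(handle, start=1):
            line = raw.rstrip("\n")[:max_chars]  # truncate very long lines
            if remaining > 0:
                hits[-1]["lines"].append(line)
                hits[-1]["end"] = lineno
                remaining -= 1
            elif len(hits) >= max_hits:
                break  # enough snippets; stop reading the file
            elif regex.search(line):
                hits.append({
                    "start": lineno - len(before),
                    "end": lineno,
                    "lines": [*before, line],
                })
                remaining = context
            before.append(line)
    return hits


def cite(path, hit):
    """Format a file-and-lines citation for a snippet."""
    return f"{path}:{hit['start']}-{hit['end']}"
```

For example, `grep_snippets("references/train_crawl.json", r"layernorm", context=3)` would return at most five short snippets, each of which can be cited with `cite(...)`.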

How to use

  • Search references/train_crawl.json with ripgrep for relevant ops; limit the context lines around each match.
  • Extract only the code and descriptions you need, then rewrite them for wave64 occupancy, LDS tiling, vectorized/coalesced access, and bank-conflict avoidance.
  • Cite source file and lines; pair with reflection prompts to validate correctness and performance.
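When adapting a snippet, a quick sanity check of the tile shape against wave64 and LDS limits can catch obvious problems before any profiling. The sketch below is a rough planning aid, not a performance model. The helper names are hypothetical, and the hardware constants (64-lane wavefronts, 64 KiB of LDS per workgroup, 32 banks of 4 bytes) are assumptions typical of GCN/CDNA-class parts that should be verified for the actual target.

```python
import math

# Assumed hardware parameters; verify against your target GPU.
WAVE_SIZE = 64           # lanes per wavefront in wave64 mode
LDS_BYTES = 64 * 1024    # LDS available to one workgroup
LDS_BANKS = 32           # number of LDS banks
BANK_WIDTH = 4           # bytes per bank word


def padded_row_elems(cols, elem_bytes):
    """Pad a tile row by one bank word when the row stride would map
    every row to the same bank (stride is a multiple of 128 bytes)."""
    if elem_bytes not in (1, 2, 4):
        raise ValueError("sketch only handles 1-, 2-, or 4-byte elements")
    row_bytes = cols * elem_bytes
    if row_bytes % (LDS_BANKS * BANK_WIDTH) == 0:
        return cols + BANK_WIDTH // elem_bytes
    return cols


def tile_plan(rows, cols, elem_bytes, threads):
    """Summarize LDS usage and wavefront count for one workgroup tile."""
    padded = padded_row_elems(cols, elem_bytes)
    lds_used = rows * padded * elem_bytes
    return {
        "padded_cols": padded,
        "lds_bytes": lds_used,
        "fits_lds": lds_used <= LDS_BYTES,
        "waves": math.ceil(threads / WAVE_SIZE),
        "full_waves": threads % WAVE_SIZE == 0,  # no partially filled wave
    }
```

For instance, `tile_plan(64, 32, 4, 256)` describes a 64x32 float32 tile run by 256 threads: each row is padded from 32 to 33 elements, and the workgroup fills exactly four wavefronts.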

References

  • references/SEARCH.md: Grep commands and tips for slicing snippets efficiently.