Skip to content

Modified sageattn_ultravico for AMD/ROCm compatibility with `triton-windows`

Notifications You must be signed in to change notification settings

0xDELUXA/ComfyUI-WanVideoWrapper-ultravico_AMD

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

22 Commits
 
 
 
 
 
 
 
 

Repository files navigation

View Changes

You can see the exact modifications made for AMD and triton-windows compatibility here.

Technical Changes:

  • Block sizes modified for better ROCm compatibility:
    • BLKQ: 128, BLKK: 64
    • BLOCK_M: 128, BLOCK_N: 64
  • Numerical Stability: Kept accumulation in float32.
  • Optimization: Triton autotuning added.

Tested on RDNA 4 with TheRock ROCm 7, PyTorch and triton-windows release v3.6.0-windows.post25.

Installation (PowerShell)

cd ComfyUI/custom_nodes/ComfyUI-WanVideoWrapper/ultravico/
Remove-Item -Recurse -Force sageattn
git clone https://github.com/0xDELUXA/ComfyUI-WanVideoWrapper-ultravico_AMD sageattn

About

Modified sageattn_ultravico for AMD/ROCm compatibility with `triton-windows`

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages