fast cpu backend #1961

Open
mshuaibii wants to merge 4 commits into main from umas-fast-cpu-backend-v2

@mshuaibii (Contributor) commented Apr 14, 2026

[image]

misko added 2 commits April 13, 2026 05:59
C++ fused Wigner rotation kernels (gather+rotate+permute) with OpenMP
parallelism, compiled via torch.utils.cpp_extension. Falls back to
pure PyTorch if compilation fails.
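
The build-or-fall-back behaviour described above can be sketched roughly as follows. This is only an illustration of the pattern, not the PR's code: `load_with_fallback`, `reference_rotate`, and `broken_build` are hypothetical names, and the failing build is a stand-in for a `torch.utils.cpp_extension` compilation that raises.

```python
import logging
from typing import Callable

logger = logging.getLogger(__name__)


def load_with_fallback(build_fast: Callable[[], Callable], reference: Callable) -> Callable:
    """Return the kernel produced by build_fast(), or `reference` if the build raises."""
    try:
        return build_fast()
    except Exception as exc:  # e.g. no compiler, missing OpenMP, unsupported platform
        logger.warning("fast kernel unavailable (%s); using reference path", exc)
        return reference


def reference_rotate(values):
    # Hypothetical stand-in for the pure PyTorch path (the math is a placeholder).
    return [2 * v for v in values]


def broken_build():
    # Hypothetical stand-in for a C++ extension build that fails.
    raise RuntimeError("no C++ compiler found")


# The build fails here, so callers transparently get the reference path.
rotate = load_with_fallback(broken_build, reference_rotate)
```

Resolving the kernel once at import or setup time keeps the hot path free of try/except checks; callers use `rotate` without knowing which implementation was chosen.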

Includes unit tests verifying forward/backward correctness against
PyTorch reference implementation.
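
A forward/backward check of this kind might look like the sketch below. The tensor layouts are assumptions, since the PR text does not show them: node features of shape `[N, M, C]`, one `[M, M]` Wigner matrix per edge, and a permutation over the `M` coefficient rows. A per-edge loop stands in for the fused C++ kernel, which is not reproduced here.

```python
import torch


def rotate_reference(x, edge_src, wigner, perm):
    """Vectorized PyTorch reference: gather, rotate, then permute."""
    gathered = x[edge_src]                 # gather: [E, M, C]
    rotated = torch.bmm(wigner, gathered)  # rotate: [E, M, M] @ [E, M, C]
    return rotated[:, perm, :]             # permute the coefficient rows


def rotate_loop(x, edge_src, wigner, perm):
    """Per-edge loop, used here as a stand-in for the fused kernel under test."""
    rows = []
    for e in range(edge_src.numel()):
        rotated = wigner[e] @ x[edge_src[e]]  # [M, M] @ [M, C]
        rows.append(rotated[perm])
    return torch.stack(rows)


torch.manual_seed(0)
N, E, M, C = 4, 6, 9, 3  # assumed sizes, chosen only for the example
x = torch.randn(N, M, C, dtype=torch.float64, requires_grad=True)
edge_src = torch.randint(0, N, (E,))
wigner = torch.randn(E, M, M, dtype=torch.float64)
perm = torch.randperm(M)

out_ref = rotate_reference(x, edge_src, wigner, perm)
out_loop = rotate_loop(x, edge_src, wigner, perm)
forward_ok = torch.allclose(out_ref, out_loop)

# Random weights make the loss sensitive to every output element.
weights = torch.randn_like(out_ref)
(grad_ref,) = torch.autograd.grad((out_ref * weights).sum(), x)
(grad_loop,) = torch.autograd.grad((out_loop * weights).sum(), x)
backward_ok = torch.allclose(grad_ref, grad_loop)
```

Using float64 keeps the comparison tight; a float32 kernel would likely need explicit `rtol`/`atol` tolerances in `torch.allclose`.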

Register umas_fast_cpu execution mode that uses C++ fused Wigner
kernels for CPU inference. Includes _EdgeDegreeScatterFunction for
reduced backward overhead, and auto-fallback from GPU to CPU backend
in maybe_update_settings_backend when CUDA is unavailable.
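
The GPU-to-CPU auto-fallback could be sketched as below. This does not reproduce `maybe_update_settings_backend`, whose signature is not shown in the PR text: `Settings`, `with_available_backend`, and the GPU mode name `umas_fast_gpu` are assumptions made for illustration; only `umas_fast_cpu` comes from the commit message.

```python
from dataclasses import dataclass, replace

GPU_MODE = "umas_fast_gpu"  # assumed name for the GPU execution mode
CPU_MODE = "umas_fast_cpu"  # execution mode named in the commit message


@dataclass(frozen=True)
class Settings:
    """Hypothetical settings object holding the requested execution mode."""
    execution_mode: str


def with_available_backend(settings: Settings, cuda_available: bool) -> Settings:
    """Swap the GPU mode for the CPU mode when CUDA is unavailable."""
    if settings.execution_mode == GPU_MODE and not cuda_available:
        return replace(settings, execution_mode=CPU_MODE)
    # Any other mode, or a machine with CUDA, is left unchanged.
    return settings


requested = Settings(execution_mode=GPU_MODE)
resolved = with_available_backend(requested, cuda_available=False)
```

In practice the flag would presumably come from `torch.cuda.is_available()`; passing it in as a parameter keeps the function easy to test on machines with or without a GPU.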

@mshuaibii added the enhancement (New feature or request) and minor (Minor version release) labels Apr 14, 2026
@meta-cla (bot) added the cla signed label Apr 14, 2026
2 participants