add rvv fp16 kernels for f16-gemm, f16-igemm, f16-dwconv, f16-spmm #9594

Merged
copybara-service[bot] merged 1 commit into google:master from ken-unger:f16-gemm-spmm-dwconv-rvv on Mar 2, 2026

@ken-unger
Contributor

Add RVV FP16 kernels for f16-gemm, f16-igemm, f16-dwconv, f16-spmm

  • Relevant tests pass and benchmarks were executed.

Fixed definition of XNN_HAVE_FLOAT16 for RVV in src/xnnpack/math.h
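
For readers unfamiliar with compile-time feature gating, here is a minimal sketch of how a build might decide whether RVV FP16 code can be compiled at all. It is not the definition used in src/xnnpack/math.h, which this PR does not quote. The macro `EXAMPLE_HAVE_RVV_FP16` and the function `example_have_rvv_fp16` are hypothetical names; `__riscv` and `__riscv_zvfh` are the toolchain-predefined macros for the RISC-V target and the vector half-precision (Zvfh) extension.

```c
// Hypothetical illustration only: this is NOT the XNN_HAVE_FLOAT16 definition
// from src/xnnpack/math.h. It shows the general shape of a compile-time gate.
#if defined(__riscv) && defined(__riscv_zvfh)
  // The compiler was invoked with an -march string that enables Zvfh,
  // so vector FP16 arithmetic intrinsics are available.
  #define EXAMPLE_HAVE_RVV_FP16 1
#else
  // Any other target (or RISC-V without Zvfh): no vector FP16 arithmetic.
  #define EXAMPLE_HAVE_RVV_FP16 0
#endif

// Exposes the compile-time decision as a value, e.g. for logging or tests.
int example_have_rvv_fp16(void) {
  return EXAMPLE_HAVE_RVV_FP16;
}
```

A gate like this only reflects what the compiler was told to target. It says nothing about the CPU the binary eventually runs on.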

Note that since there is no scalar fallback for FP16 kernels, not all the operator and subgraph tests will execute for RVV FP16 until the remaining kernels are implemented/integrated. I'll continue with qd8-f16 and other RVV kernels in subsequent PRs.

Collaborator

@dsharlet left a comment

This looks good, but see my comment on the previous fp16 PR: #9516 (comment)

I think we need hardware-config support for detecting fp16 support.
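
To illustrate the kind of runtime check being asked for, here is a rough sketch of gating kernel selection on a hardware-config flag. Every name in it (`example_hw_config`, `example_kernel_fn`, `example_select_f16_kernel`) is hypothetical and does not reflect XNNPACK's actual hardware-config API. The kernel body uses `float` as a stand-in so the sketch compiles on any target.

```c
#include <stdbool.h>
#include <stddef.h>

// Hypothetical runtime capability flags; not XNNPACK's real hardware config.
struct example_hw_config {
  bool has_rvv;       // base RISC-V vector extension detected at runtime
  bool has_rvv_fp16;  // vector FP16 arithmetic detected at runtime
};

// Hypothetical kernel signature: y[i] += a * x[i] for i in [0, n).
typedef void (*example_kernel_fn)(size_t n, float a, const float* x, float* y);

// Stand-in for an RVV FP16 kernel. A real one would use RVV intrinsics and
// half-precision data; plain float keeps this sketch portable.
static void example_rvv_fp16_kernel(size_t n, float a, const float* x, float* y) {
  for (size_t i = 0; i < n; i++) {
    y[i] += a * x[i];
  }
}

// Returns the FP16 kernel only when the hardware reports support for it.
// Returns NULL otherwise: with no scalar FP16 fallback, the caller has to
// treat the operator as unsupported rather than pick a slower kernel.
example_kernel_fn example_select_f16_kernel(const struct example_hw_config* hw) {
  if (hw != NULL && hw->has_rvv && hw->has_rvv_fp16) {
    return example_rvv_fp16_kernel;
  }
  return NULL;
}
```

The point of the NULL return is that a compile-time macro alone cannot protect a binary built with FP16 support from running on a core that lacks it.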

copybara-service bot pushed a commit that referenced this pull request Mar 2, 2026
--
8dc139c by Ken Unger <ken.j.unger@gmail.com>:

add rvv fp16 kernels for f16-gemm, f16-igemm, f16-dwconv, f16-spmm

FUTURE_COPYBARA_INTEGRATE_REVIEW=#9594 from ken-unger:f16-gemm-spmm-dwconv-rvv 8dc139c
PiperOrigin-RevId: 876474909
@copybara-service bot merged commit cd2a0d4 into google:master on Mar 2, 2026
24 checks passed
@ken-unger deleted the f16-gemm-spmm-dwconv-rvv branch on March 26, 2026 03:28

2 participants