Improve performance for SGEMVN on NEOVERSEN1 #5225

Merged
martin-frbg merged 1 commit into OpenMathLib:develop on Apr 17, 2025

annop-w
Contributor

@annop-w annop-w commented Apr 14, 2025

Performance improvement with 1 thread on NEOVERSEV2:

[Image: sgemv_n]

@annop-w
Contributor Author

annop-w commented Apr 15, 2025

Closed in favor of #5220

@annop-w annop-w closed this Apr 15, 2025
@abhishek-iitmadras
Contributor

Hi @annop-w

Can this be used for Neoverse N1?

@annop-w
Contributor Author

annop-w commented Apr 16, 2025

@abhishek-iitmadras Yes, absolutely. Actually I can try to benchmark on N1.

@annop-w annop-w reopened this Apr 16, 2025
@annop-w
Contributor Author

annop-w commented Apr 16, 2025

Here is the performance speedup on NEOVERSEN1:

[Image: c6g_sgemv_n]

I will modify this PR to target N1.

@annop-w annop-w changed the title from "Improve performance for SGEMVN on NEOVERSEN2" to "Improve performance for SGEMVN on NEOVERSEN1" Apr 16, 2025
@annop-w
Contributor Author

annop-w commented Apr 16, 2025

> I will modify this PR to target N1.

Done

@martin-frbg martin-frbg merged commit dd38b4e into OpenMathLib:develop Apr 17, 2025
85 of 86 checks passed
@annop-w annop-w deleted the gemv_n branch April 22, 2025 09:38