Use faster scalar version for trigo function #1073

Open
serge-sans-paille wants to merge 1 commit into master from feature/faster-scalar-trigo


@serge-sans-paille
Owner

Second try.

@serge-sans-paille
Owner Author

@JohanMabille what's your take on this?

@JohanMabille

If it turns out to be faster than the scalar (std) version, we can implement the same algorithms for scalars as the ones operating on batches (so we save the transfer to and from simd registers).
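To make the idea concrete, here is a minimal sketch of a batch-style algorithm (range reduction followed by a branch-free polynomial) written directly for a scalar `double`, so no value ever moves in or out of a SIMD register. The function name `sin_poly_scalar` is hypothetical, and the plain Taylor coefficients are an assumption chosen for readability; this is not the code from the PR, and a production kernel would more likely use minimax coefficients and a more careful reduction for large arguments.

```cpp
#include <cmath>

// Hypothetical helper for illustration only; not part of any library.
// Computes sin(x) for moderate |x| with a reduction + polynomial scheme.
inline double sin_poly_scalar(double x)
{
    constexpr double pi = 3.14159265358979323846;

    // Range reduction: x = k*pi + r with r in [-pi/2, pi/2].
    const double k = std::nearbyint(x / pi);
    const double r = x - k * pi;
    const double r2 = r * r;

    // Taylor polynomial of sin(r)/r up to the r^12 term, in Horner form.
    double p = 1.0 / 6227020800.0;   //  1/13!
    p = p * r2 - 1.0 / 39916800.0;   // -1/11!
    p = p * r2 + 1.0 / 362880.0;     //  1/9!
    p = p * r2 - 1.0 / 5040.0;       // -1/7!
    p = p * r2 + 1.0 / 120.0;        //  1/5!
    p = p * r2 - 1.0 / 6.0;          // -1/3!
    p = p * r2 + 1.0;

    // sin(k*pi + r) = (-1)^k * sin(r): flip the sign when k is odd.
    const double s = r * p;
    return std::fmod(k, 2.0) != 0.0 ? -s : s;
}
```

With the truncation used here, the result should agree with `std::sin` to roughly nine decimal places for small and moderate inputs; accuracy degrades for very large `|x|` because the reduction uses a single rounded value of pi.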

@serge-sans-paille
Owner Author

The speedup may come from the use of SSE instructions, though; this needs more investigation. But yes, this is faster for the flagged functions, at least on my laptop, although the result is probably hardware-dependent.
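Since the result is likely hardware-dependent, one way to check it on another machine is a small timing harness like the sketch below. The names `timing_result` and `time_scalar_function` are hypothetical and this is not the benchmark used for the PR; it only measures average wall-clock time per call and returns a checksum so the compiler cannot discard the calls.

```cpp
#include <chrono>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical result type for illustration.
struct timing_result
{
    double nanoseconds_per_call; // average wall-clock time per call
    double checksum;             // sum of all results, keeps the calls alive
};

// Calls f on every input, `repeats` times over, and reports the average cost.
template <class F>
timing_result time_scalar_function(F f, const std::vector<double>& inputs, std::size_t repeats)
{
    using clock = std::chrono::steady_clock;

    double checksum = 0.0;
    const auto start = clock::now();
    for (std::size_t r = 0; r < repeats; ++r)
    {
        for (double x : inputs)
        {
            checksum += f(x);
        }
    }
    const auto stop = clock::now();

    const double total_ns = std::chrono::duration<double, std::nano>(stop - start).count();
    const double calls = static_cast<double>(repeats) * static_cast<double>(inputs.size());
    return { calls > 0.0 ? total_ns / calls : 0.0, checksum };
}
```

Passing `[](double x) { return std::sin(x); }` and the candidate replacement to the same harness, with the same inputs and compiler flags, gives two numbers that can be compared across machines. A quick measurement like this is noisy, so it is a sanity check rather than a rigorous benchmark.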
