-
Notifications
You must be signed in to change notification settings - Fork 31
Pass Discrete Sequence to GPU #652
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
Conversation
… sequences, moved spatial gradient calculation to own function.
Codecov Report✅ All modified and coverable lines are covered by tests. Additional details and impacted files@@ Coverage Diff @@
## master #652 +/- ##
==========================================
- Coverage 91.22% 91.16% -0.06%
==========================================
Files 57 58 +1
Lines 3248 3250 +2
==========================================
Hits 2963 2963
- Misses 285 287 +2
Flags with carried forward coverage won't be shown. Click here to find out more.
🚀 New features to boost your workflow:
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
KomaMRI Benchmarks
Details
| Benchmark suite | Current: 17e586e | Previous: 11d3180 | Ratio |
|---|---|---|---|
MRI Lab/Bloch/CPU/2 thread(s) |
387199837.5 ns |
337633469 ns |
1.15 |
MRI Lab/Bloch/CPU/4 thread(s) |
289067546 ns |
276282606 ns |
1.05 |
MRI Lab/Bloch/CPU/8 thread(s) |
242467570.5 ns |
209852165 ns |
1.16 |
MRI Lab/Bloch/CPU/1 thread(s) |
655549193 ns |
555065232 ns |
1.18 |
MRI Lab/Bloch/GPU/CUDA |
20962776 ns |
21231134 ns |
0.99 |
MRI Lab/Bloch/GPU/oneAPI |
85944808 ns |
77053690 ns |
1.12 |
MRI Lab/Bloch/GPU/Metal |
104643083 ns |
95540333 ns |
1.10 |
MRI Lab/Bloch/GPU/AMDGPU |
25102288 ns |
24763685 ns |
1.01 |
Slice Selection 3D/Bloch/CPU/2 thread(s) |
1651013192 ns |
1592087059 ns |
1.04 |
Slice Selection 3D/Bloch/CPU/4 thread(s) |
911694348 ns |
889539516 ns |
1.02 |
Slice Selection 3D/Bloch/CPU/8 thread(s) |
599249406.5 ns |
565401528.5 ns |
1.06 |
Slice Selection 3D/Bloch/CPU/1 thread(s) |
3109371230 ns |
3029269071 ns |
1.03 |
Slice Selection 3D/Bloch/GPU/CUDA |
31822745 ns |
32639703 ns |
0.97 |
Slice Selection 3D/Bloch/GPU/oneAPI |
133804680 ns |
121489006.5 ns |
1.10 |
Slice Selection 3D/Bloch/GPU/Metal |
112492041 ns |
111152750 ns |
1.01 |
Slice Selection 3D/Bloch/GPU/AMDGPU |
32559879.5 ns |
32636098 ns |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
|
The macos fails are due to actions/runner-images#13341. |
|
Compare benchmarks with #653. Update: macos cache problem fixed. |
cncastillo
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks very good, no significant performance regressions, but some of them could be explained by some of the suggestions below.
In preparation for higher order sequences, this adds: