Tidy up GPU code, speed up and simplify

I think we can probably simplify the GPU code quite a bit.

- [ ] Look at maybe simplifying error handling using the examples in the CUDA developer docs.
- [ ] Use streams to improve performance.
- [ ] Get CUFFT working with MPI and multi-GPU (hopefully we can do this if we have one rank orchestrate things).
- [ ] Allow Mhysa to specify what GPU each rank should connect to, that would allow us to do away with the use of MPS.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tidy up GPU code, speed up and simplify #40

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Tidy up GPU code, speed up and simplify #40

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions