Open
Description
Checklist
- I've read the contribution guidelines.
- I've searched other issues and no duplicate issues were found.
- I've agreed with the maintainers that I can plan this task.
Description
Background:
We have long struggled with the latency in our sensing/perception pipeline. Even though we use very rudimentary algorithms, the latency is considerable enough that we can not opt to use more complex ones due to fear of increasing reaction times.
In the same vein, autoware has proven to be too heavy for new sensors with higher data rates and we have had to resort to down sample approaches to bound the execution times, even though using the new sensors fully holds the potential to increase the overall performance of autoware.
Previous conversations:
- In the sensing / perception WG, the idea of acceleration and some PoCs were discussed. There was no opposition other than compatibility and granularity when possible.
- A similar feature request was mentioned in https://github.com/orgs/autowarefoundation/discussions/5396
Purpose
Accelerate the sensing/perception pipeline via cuda-based acceleration
Possible approaches
- Accelerate the pointcloud preprocessing pipeline
- Accelerate nodes that use the pointcloud in the perception pipeline
- Implement a transport layer for CUDA so that we can avoid host->device->host copies whenever possible
Definition of done
PRs that implement
- acceleration of the sensing perception pipeline
- type adaptation and negotiation
are merged to universe and/or autoware
Metadata
Metadata
Assignees
Labels
Type
Projects
Status
In Progress