
lucas/onnx_gpu_backend#24

Merged
Edwardius merged 12 commits into main from lucas/onnx_gpu_backend on Dec 5, 2025
Conversation

@lucasreljic
Contributor

Adding the deep_ort_gpu_backend_plugin to allow inference with onnxruntime using GPU-based execution providers. Uses the onnxruntime_gpu_vendor package.

  • Tested with the object_detection branch on CUDA v12.2
  • Currently only tested with CUDA as the execution provider; with further debugging, TensorRT might also be possible
  • Passes inputs in CPU memory, just like deep_ort_backend_plugin, since onnxruntime copies them to GPU memory anyway. There are methods for passing GPU memory directly, which could be explored once ROS NITROS is implemented (see the sketch after this list).
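A minimal sketch, not the plugin's actual code, of how the setup described above might look with the ONNX Runtime C++ API: the CUDA execution provider is registered on the session options, and the input tensor is created in CPU memory so onnxruntime handles the copy to the GPU. The model path, tensor names, and shapes below are placeholders.

```cpp
#include <onnxruntime_cxx_api.h>

#include <array>
#include <vector>

int main() {
  Ort::Env env(ORT_LOGGING_LEVEL_WARNING, "deep_ort_gpu_backend");

  // Register the CUDA execution provider; onnxruntime falls back to the
  // CPU provider for any ops the GPU provider cannot handle.
  Ort::SessionOptions session_options;
  OrtCUDAProviderOptions cuda_options{};
  cuda_options.device_id = 0;
  session_options.AppendExecutionProvider_CUDA(cuda_options);

  // "model.onnx" is a placeholder path.
  Ort::Session session(env, "model.onnx", session_options);

  // Inputs live in CPU memory, mirroring deep_ort_backend_plugin;
  // onnxruntime copies them to GPU memory before execution.
  std::array<int64_t, 4> shape{1, 3, 640, 640};
  std::vector<float> input_data(1 * 3 * 640 * 640, 0.0f);
  Ort::MemoryInfo memory_info =
      Ort::MemoryInfo::CreateCpu(OrtArenaAllocator, OrtMemTypeDefault);
  Ort::Value input_tensor = Ort::Value::CreateTensor<float>(
      memory_info, input_data.data(), input_data.size(),
      shape.data(), shape.size());

  // Placeholder I/O names; the real names come from the loaded model.
  const char* input_names[] = {"images"};
  const char* output_names[] = {"output0"};
  auto outputs = session.Run(Ort::RunOptions{nullptr},
                             input_names, &input_tensor, 1,
                             output_names, 1);
  return 0;
}
```

TensorRT could presumably be registered the same way via `AppendExecutionProvider_TensorRT` with its own provider options, which is likely where the "further debugging" mentioned above would start.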

Contributor

@Edwardius Edwardius left a comment


Thanks for working on this! Some comments.

@Edwardius Edwardius self-requested a review December 5, 2025 04:11
Contributor

@Edwardius Edwardius left a comment


@lucasreljic tests pass. Works with TensorRT and CUDA.

@Edwardius Edwardius marked this pull request as ready for review December 5, 2025 18:49
@Edwardius Edwardius merged commit 3f95427 into main Dec 5, 2025
2 checks passed
