Skip to content

Brian/tensorrt ep#25

Closed
brianzheng206 wants to merge 29 commits intomainfrom
brian/tensorrt_ep
Closed

Brian/tensorrt ep#25
brianzheng206 wants to merge 29 commits intomainfrom
brian/tensorrt_ep

Conversation

@brianzheng206
Copy link
Contributor

@brianzheng206 brianzheng206 commented Dec 6, 2025

tensorrt ep + cuda fallback for yolo inferencing at around 28-32 hz

  • probably gonna change the name of the node later on, its called deep_yolo_inference for now
  • tensorrt engines can be optionally cached, so only the first startup is slow
  • batch size is fixed at 3. can be dynamic but that means multiple trt engines

tested for 12.4.1 and 12.8.0 cudnn runtime on ubuntu22.04

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants