Skip to content

Creating TRT Cache much slower on Linux than on Windows #23380

Open
@BengtGustafsson

Description

Describe the issue

When we create .engine files on Windows we see a run time of 15-30 s and it is not very dependent on the input sizes we use.

On Linux the same networks on the same hardware can take up to 10 minutes to optimize. The hardware includes a T1000 GPU and a recent Intel CPU.

Any ideas on this? Could it be that we run Linux via Docker?

To reproduce

Set up to create .engine files on Windows and Linux. check the time it takes.

Urgency

No response

Platform

Linux

OS Version

Ubuntu 20.04

ONNX Runtime Installation

Built from Source

ONNX Runtime Version or Commit ID

1.20

ONNX Runtime API

C++

Architecture

X64

Execution Provider

TensorRT

Execution Provider Library Version

CUDA 11.6, TrT 10.4.0.26

Metadata

Assignees

Labels

ep:TensorRTissues related to TensorRT execution providerplatform:windowsissues related to the Windows platform

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions