Add infrastructure for auto EP selection #24430
You can commit the suggested changes from lintrunner.
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Bunch of build failures. Missing a dxcore header?
…EpInfrastructure_PR
Address PR comment
Disable EP registration/auto selection testing on non-Windows platforms. There's no device discovery so it can't be used.
…thub.com/microsoft/onnxruntime into skottmckay/AutoSelectEpInfrastructure_PR
Adding ORT_API_CALL to the factory functions should hopefully fix the x86 CI errors.
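For context on why a calling-convention macro matters on x86: on 32-bit Windows builds the default convention (cdecl) and stdcall clean up the stack differently, so a function pointer type and the function it points to must agree. The sketch below is only an illustration under assumptions. `MY_API_CALL`, `EpFactory`, `CreateFactory`, `ReleaseFactory`, and `FactoryEntry` are hypothetical stand-ins, not the PR's actual factory functions, and the macro only mimics what ORT_API_CALL is understood to do on MSVC builds.

```cpp
#include <cassert>
#include <string>

// Hypothetical stand-in for ORT_API_CALL (assumption): stdcall on MSVC Windows
// builds, empty elsewhere. On x64 the keyword is accepted and ignored; on x86
// it changes who cleans up the stack, so declarations must match.
#if defined(_WIN32) && defined(_MSC_VER)
#define MY_API_CALL __stdcall
#else
#define MY_API_CALL
#endif

// Hypothetical factory type, for illustration only.
struct EpFactory {
  std::string name;
};

// The function pointer types carry the calling convention...
using CreateFactoryFn = EpFactory*(MY_API_CALL*)(const char* name);
using ReleaseFactoryFn = void(MY_API_CALL*)(EpFactory* factory);

// ...and the definitions must use the same one. Omitting the macro on either
// side is a type mismatch that 32-bit MSVC rejects at compile time.
EpFactory* MY_API_CALL CreateFactory(const char* name) {
  return new EpFactory{name};
}

void MY_API_CALL ReleaseFactory(EpFactory* factory) {
  delete factory;
}

// A registration entry that stores both function pointers.
struct FactoryEntry {
  CreateFactoryFn create;
  ReleaseFactoryFn release;
};

const FactoryEntry kEntry{&CreateFactory, &ReleaseFactory};
```

Calling `kEntry.create("example")` and later `kEntry.release(...)` goes through the typed pointers, which is where a convention mismatch would otherwise surface on x86.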
Add ORT_API_CALL fixes Co-authored-by: Adrian Lizarraga <[email protected]>
…EpInfrastructure_PR
…cies a lot simpler, and in the future we need to provide that to an EP author so it is reasonable for it to be implemented in onnxruntime_session. Address PR comments. Add onnxruntime_session to the CUDA EP dependencies.
Co-authored-by: Adrian Lizarraga <[email protected]>
…thub.com/microsoft/onnxruntime into skottmckay/AutoSelectEpInfrastructure_PR
- move provider bridge support to the EP side
- change SessionOptionsAppendExecutionProvider_V2 to take OrtEpDevices as input
- better device discovery (pending final implementation)
Avoid double lock for Provider Get when it calls Load.
Co-authored-by: Adrian Lizarraga <[email protected]>
Disable setting device_id for DML. Need to investigate why we get an invalid bus number in device discovery. The device discovery code is temporary and most likely has an issue.
…thub.com/microsoft/onnxruntime into skottmckay/AutoSelectEpInfrastructure_PR
Will follow up on any comments in a separate PR.
Description
Add infrastructure to enable auto EP selection.
Device discovery for CPU/GPU/NPU on Windows.
Supports internal (CPU/DML/WebGPU) and provider bridge (CUDA) EPs currently.
Infrastructure will be used with plugin EPs next.
The selection policy implementation will be added next. In the interim, a temporary function with manually specified selection lets unit tests cover the end-to-end flow.
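To make the "manually specified selection" idea concrete, here is a minimal sketch, not the PR's actual function. It assumes device discovery yields a list of (EP name, device type) pairs and that a test picks devices by naming the EP explicitly. `DeviceType`, `EpDevice`, and `SelectByEpName` are hypothetical names introduced for illustration.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical device categories, mirroring the CPU/GPU/NPU discovery described above.
enum class DeviceType { CPU, GPU, NPU };

// Hypothetical pairing of an execution provider with a discovered device.
struct EpDevice {
  std::string ep_name;
  DeviceType device_type;
};

// Manual selection: return pointers to every discovered entry whose EP name
// matches the one the caller asked for. A real selection policy would rank
// candidates instead of requiring an exact name.
std::vector<const EpDevice*> SelectByEpName(const std::vector<EpDevice>& discovered,
                                            const std::string& ep_name) {
  std::vector<const EpDevice*> selected;
  for (const EpDevice& candidate : discovered) {
    if (candidate.ep_name == ep_name) {
      selected.push_back(&candidate);
    }
  }
  return selected;
}
```

A unit test could build a small list of discovered devices, call `SelectByEpName` with one EP name, and hand the result to session setup. An empty result means the requested EP has no matching device on the machine.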
Motivation and Context