Add QNN EP HTP shared memory allocator #23136
Merged
61 commits
110a3bc
save work
edgchen1 0ba3a2f
save work
edgchen1 8436b14
add logging for setting QNN tensor memory, update comment
edgchen1 c9826f4
add option to enable HTP shared memory allocator to onnxruntime_perf_…
edgchen1 c07c35e
hack - try to cache mem handles in QnnModel
edgchen1 60dc837
Remove duplicate include.
edgchen1 24e072f
hack, continued - move cache out to SharedContext
edgchen1 e66cbef
Merge remote-tracking branch 'origin/main' into edgchen1/qnn_ep_rpcmem
edgchen1 8c515da
move mem handle registration to allocator
edgchen1 18e2780
hook up some test code
edgchen1 09ddce5
Merge remote-tracking branch 'origin/main' into edgchen1/qnn_ep_rpcmem
edgchen1 a65bb71
rename to RpcMemAllocator to HtpSharedMemoryAllocator
edgchen1 bfb135e
Merge remote-tracking branch 'origin/main' into edgchen1/qnn_ep_rpcmem
edgchen1 f179a0d
remove onnx protobuf dependency from allocator.h, add shared provider…
edgchen1 7645ef4
remove unused CPUAllocator::TensorAlloc declaration
edgchen1 1043732
Check for nullptr when trying to free
baijumeswani 022f4bc
move mem handle management to QNN backend manager
edgchen1 c527dee
remove IAllocator::TensorAlloc()
edgchen1 e4f72b3
document IAllocator::Free
edgchen1 39ff901
remove IAllocator__TensorAlloc
edgchen1 1bed5a4
Merge remote-tracking branch 'origin/main' into edgchen1/qnn_ep_rpcmem
edgchen1 d70db84
fix android build warning
edgchen1 45ef883
remove shared mem handles from shared context
edgchen1 d2e7b3c
remove allocation clean up callback removal, use weak_ptrs in allocat…
edgchen1 c892c18
some clean up
edgchen1 b295eef
more clean up
edgchen1 13f5e30
add helper to get qnn error message
edgchen1 d5eace1
use make_shared for QnnBackendManager
edgchen1 bacbcdc
add test to qnn_basic_test.cc, document allocator parameter.
edgchen1 30cd9ed
Merge remote-tracking branch 'origin/main' into edgchen1/qnn_ep_rpcmem
edgchen1 b29ab61
rename variables
edgchen1 67a54b8
revert changes to onnxruntime/test/providers/qnn/max_min_op_test.cc
edgchen1 c0569e2
fix formatting
edgchen1 dd45c84
skip test if not android and not windows
edgchen1 959d8df
update comment
edgchen1 ab48516
remove QnnBackendManager::ReleaseQnnContextMemHandles declaration, up…
edgchen1 4a3f6c3
add onnxruntime_c_api.h include to ortmemoryinfo.h
edgchen1 65ce4b1
Merge remote-tracking branch 'origin/main' into edgchen1/qnn_ep_rpcmem
edgchen1 ff12541
rename GetQnnTensorDataSize to GetQnnTensorDataSizeInBytes
edgchen1 5e6e103
add QnnBackendManager::Create function to ensure shared_ptr usage
edgchen1 78e86cc
make some QnnBackendManager member functions private, update comment
edgchen1 e665a2b
document GetOrRegister functions
edgchen1 425023b
add enable_htp_shared_memory_allocator to available_keys
edgchen1 781a4a0
Merge remote-tracking branch 'origin/main' into edgchen1/qnn_ep_rpcmem
edgchen1 4d29208
make DlError return const char*
edgchen1 568c9a7
Use ORT_DISALLOW_COPY_ASSIGNMENT_AND_MOVE for SharedContext
edgchen1 8b95535
use safeint instead of manually checking against int max
edgchen1 515999c
add/update doc for enable_htp_shared_memory_allocator option
edgchen1 6986839
formatting
edgchen1 00b286b
add some comments about HtpSharedmemoryAllocator impl
edgchen1 88dec64
initialize with QNN_MEM_DESRIPTOR_INIT
edgchen1 4ca3ea7
address comments
edgchen1 7a88c3f
rework context handle ownership
edgchen1 f373035
add / update tests
edgchen1 e86ff2e
add check for qnn tensor dynamic shape
edgchen1 6fa33f0
Add comment about multi-threading considerations
edgchen1 4101cca
fix test comment
edgchen1 14af7ad
fix formatting
edgchen1 2f5c93c
add ifdef to use htp backend if on arm64 or linux.
edgchen1 b868a9f
Merge remote-tracking branch 'origin/main' into edgchen1/qnn_ep_rpcmem
edgchen1 7ca4552
fix typo
edgchen1