onnxruntime/tools
Adrian Lizarraga 02b1ff5fa2
[QNN EP] Support multithreaded inference of a single session (#18981)
### Description
- Add mutex to protect QNN API calls for executing a graph and
extracting the corresponding profile data.
- Ensures QNN EP's execute function does not store unnecessary state
(i.e., input and output buffer pointers do not need to be stored as
class members.)

### Motivation and Context
Allow calling `session.Run()` from multiple threads when using QNN EP.
2024-01-04 13:32:48 -08:00
..
android_custom_build Update NDK version to 26.1.10909125 (#18493) 2023-11-17 14:14:01 -08:00
ci_build [QNN EP] Support multithreaded inference of a single session (#18981) 2024-01-04 13:32:48 -08:00
doc Disable PERF* rules in ruff to allow better readability (#16834) 2023-07-25 15:38:22 -07:00
nuget Split Onnxruntime Nuget GPU package (#18819) 2023-12-22 16:57:16 +08:00
perf_view fixed #16873 (#16932) 2023-09-26 09:57:01 -07:00
python Update to allow large models to be checked for mobile support. (#18357) 2023-11-17 07:20:16 +10:00
scripts Remove dnf update from docker build scripts (#17551) 2023-09-21 07:33:29 -07:00