onnxruntime/onnxruntime
wejoncy 86870114eb
[CoreML] support coreml model cache (#23065)
### Description
Refactor compute plan profiling

Support cache coreml model to speed up session initialization. this is
only support by user provided entry and user responsible to manage the
cache


With the cache, session initialization time can be reduced by 50% or
more:
|model| before| after|
|--|--|--|
|yolo11.onnx| 0.6s|0.1s|
|yolo11-fp16.onnx|1.8s|0.1s|


### Motivation and Context
<!-- - Why is this change required? What problem does it solve?
- If it fixes an open issue, please link to the issue here. -->

---------

Co-authored-by: wejoncy <wejoncy@.com>
Co-authored-by: Scott McKay <skottmckay@gmail.com>
2024-12-31 09:29:41 +08:00
..
contrib_ops Enable pointer-generator T5 models in BeamSearch (#23134) 2024-12-22 21:30:49 -08:00
core [CoreML] support coreml model cache (#23065) 2024-12-31 09:29:41 +08:00
lora
python Integrate onnx 1.17.0 (#21897) 2024-12-24 09:02:02 -08:00
test [CoreML] support coreml model cache (#23065) 2024-12-31 09:29:41 +08:00
tool/etw
wasm [WebNN] Fixed WebNN Module undefined issue (#22795) 2024-11-11 21:31:24 -08:00
__init__.py
ReformatSource.ps1
ReformatSourcePython.bat
VSCodeCoverage.runsettings