onnxruntime/include/onnxruntime/core
Ted Themistokleous 11e7a1b8f2
[MIGraphX EP] Add migraphx ep save load compiles (#20643)
### Description

Adds the ability for MIGraphX EP to save off or load compiled models to
save time between inferences.

Via Command line

User should be able to set the save ability with
ORT_MIGRAPHX_SAVE_COMPILED_MODEL
ORT_MIGRAPHX_SAVE_COMPILE_PATH

User should be able to set the load ability with
ORT_MIGRAPHX_LOAD_COMPILED_MODEL
ORT_MIGRAPHX_LOAD_COMPILE_PATH

via Onnxruntime API

migx_save_compiled_model
migx_save_model_name
migx_load_compiled_model
migx_load_model_name

### Motivation and Context

The motivation for this is to leverage MIGraphX's existing API to
save/load models after our compile step of graph optimization. For
larger models or models which were compiled with additional tuning
steps, this saves time after first compile and inference run, and thus
speeds up the user experience in order to encourage development.

---------

Co-authored-by: Ted Themistokleous <tedthemistokleous@amd.com>
2024-06-17 11:24:31 +08:00
..
common Fully dynamic ETW controlled logging for ORT and QNN logs (#20537) 2024-06-06 21:11:14 -07:00
eager Run clang-format in CI (#15524) 2023-04-18 09:26:58 -07:00
framework Release backward inputs per static graph ref count (#20804) 2024-06-14 14:33:01 +08:00
graph Introduce memory efficient topological sort (#20258) 2024-04-23 08:00:23 +08:00
optimizer fix compilation error in no absl build (#15769) 2023-05-02 08:20:49 -07:00
platform Bump linter versions (#18341) 2023-11-08 13:04:40 -08:00
providers [TensorRT EP] Support engine hardware compatibility (#20669) 2024-05-28 18:12:56 -07:00
session [MIGraphX EP] Add migraphx ep save load compiles (#20643) 2024-06-17 11:24:31 +08:00