onnxruntime/onnxruntime/python
Ted Themistokleous 11e7a1b8f2
[MIGraphX EP] Add migraphx ep save load compiles (#20643)
### Description

Adds the ability for MIGraphX EP to save off or load compiled models to
save time between inferences.

Via Command line

User should be able to set the save ability with
ORT_MIGRAPHX_SAVE_COMPILED_MODEL
ORT_MIGRAPHX_SAVE_COMPILE_PATH

User should be able to set the load ability with
ORT_MIGRAPHX_LOAD_COMPILED_MODEL
ORT_MIGRAPHX_LOAD_COMPILE_PATH

via Onnxruntime API

migx_save_compiled_model
migx_save_model_name
migx_load_compiled_model
migx_load_model_name

### Motivation and Context

The motivation for this is to leverage MIGraphX's existing API to
save/load models after our compile step of graph optimization. For
larger models or models which were compiled with additional tuning
steps, this saves time after first compile and inference run, and thus
speeds up the user experience in order to encourage development.

---------

Co-authored-by: Ted Themistokleous <tedthemistokleous@amd.com>
2024-06-17 11:24:31 +08:00
..
backend Bump linter versions (#18341) 2023-11-08 13:04:40 -08:00
datasets
providers/tvm
tools Fix wrong quantization type in quantization tool (#20954) 2024-06-14 07:55:13 -07:00
torch_cpp_extensions remove const cast for DLManagedTensor (#20015) 2024-03-22 10:39:19 -07:00
training
__init__.py
_ld_preload.py
_pybind_state.py.in Fix invalid escape sequence (#17145) 2023-08-15 10:29:54 -07:00
exported_symbols.lst
numpy_helper.h
onnxruntime_collect_build_info.py
onnxruntime_inference_collection.py Implement numpy array over CPU OrtValues on return values (#20539) 2024-05-08 10:56:36 -07:00
onnxruntime_pybind.h
onnxruntime_pybind_exceptions.cc
onnxruntime_pybind_exceptions.h
onnxruntime_pybind_iobinding.cc Implement numpy array over CPU OrtValues on return values (#20539) 2024-05-08 10:56:36 -07:00
onnxruntime_pybind_mlvalue.cc [CPU EP] Int4 support for QuantizeLinear, DequantizeLinear, and Transpose (#20362) 2024-05-30 18:56:24 -07:00
onnxruntime_pybind_mlvalue.h Implement numpy array over CPU OrtValues on return values (#20539) 2024-05-08 10:56:36 -07:00
onnxruntime_pybind_module.cc Fix warning C4003 in ORT python binding code (#18612) 2023-11-30 08:07:47 -08:00
onnxruntime_pybind_ortvalue.cc Implement numpy array over CPU OrtValues on return values (#20539) 2024-05-08 10:56:36 -07:00
onnxruntime_pybind_quant.cc Block-wise 4b quantization matmul operator change (#18172) 2023-11-03 15:29:43 -07:00
onnxruntime_pybind_schema.cc [DML EP] Improve python API perf (#20331) 2024-04-17 17:33:37 -07:00
onnxruntime_pybind_sparse_tensor.cc Implement numpy array over CPU OrtValues on return values (#20539) 2024-05-08 10:56:36 -07:00
onnxruntime_pybind_state.cc [MIGraphX EP] Add migraphx ep save load compiles (#20643) 2024-06-17 11:24:31 +08:00
onnxruntime_pybind_state.h
onnxruntime_pybind_state_common.cc
onnxruntime_pybind_state_common.h OpenVINO EP Rel 1.18 Changes (#20337) 2024-04-19 00:31:38 -07:00
onnxruntime_validation.py fix: "UserWarning: Unsupported Windows version (11). ONNX Runtime sup… (#19845) 2024-03-15 12:41:44 -07:00
pybind.def
version_script.lds
version_script_expose_onnx_protobuf.lds