onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-05-22 22:01:08 +00:00

History

Ted Themistokleous 11e7a1b8f2 [MIGraphX EP] Add migraphx ep save load compiles (#20643 ) ### Description Adds the ability for MIGraphX EP to save off or load compiled models to save time between inferences. Via Command line User should be able to set the save ability with ORT_MIGRAPHX_SAVE_COMPILED_MODEL ORT_MIGRAPHX_SAVE_COMPILE_PATH User should be able to set the load ability with ORT_MIGRAPHX_LOAD_COMPILED_MODEL ORT_MIGRAPHX_LOAD_COMPILE_PATH via Onnxruntime API migx_save_compiled_model migx_save_model_name migx_load_compiled_model migx_load_model_name ### Motivation and Context The motivation for this is to leverage MIGraphX's existing API to save/load models after our compile step of graph optimization. For larger models or models which were compiled with additional tuning steps, this saves time after first compile and inference run, and thus speeds up the user experience in order to encourage development. --------- Co-authored-by: Ted Themistokleous <tedthemistokleous@amd.com>		2024-06-17 11:24:31 +08:00
..
common	Fully dynamic ETW controlled logging for ORT and QNN logs (#20537 )	2024-06-06 21:11:14 -07:00
eager	Run clang-format in CI (#15524 )	2023-04-18 09:26:58 -07:00
framework	Release backward inputs per static graph ref count (#20804 )	2024-06-14 14:33:01 +08:00
graph	Introduce memory efficient topological sort (#20258 )	2024-04-23 08:00:23 +08:00
optimizer	fix compilation error in no absl build (#15769 )	2023-05-02 08:20:49 -07:00
platform	Bump linter versions (#18341 )	2023-11-08 13:04:40 -08:00
providers	[TensorRT EP] Support engine hardware compatibility (#20669 )	2024-05-28 18:12:56 -07:00
session	[MIGraphX EP] Add migraphx ep save load compiles (#20643 )	2024-06-17 11:24:31 +08:00