From 02c122d6e478d69ee97b3ecdd0aac5cd4f76ba8c Mon Sep 17 00:00:00 2001
From: Emma Ning <43255631+EmmaNingMS@users.noreply.github.com>
Date: Fri, 27 Sep 2019 13:10:40 -0700
Subject: [PATCH] Add OLive in perf tuning section (#1772)

* Add OLive in perf tuning section

* Add OLive to perf tuning section

* Update README.md

* Update ONNX_Runtime_Perf_Tuning.md
---
 README.md                        | 2 ++
 docs/ONNX_Runtime_Perf_Tuning.md | 2 ++
 2 files changed, 4 insertions(+)

diff --git a/README.md b/README.md
index 1810b24fd3..6804b197af 100644
--- a/README.md
+++ b/README.md
@@ -132,6 +132,8 @@ ONNX Runtime can be deployed to the cloud for model inferencing using [Azure Mac
 ## Performance Tuning
 ONNX Runtime is open and extensible, supporting a broad set of configurations and execution providers for model acceleration. For performance tuning guidance, please see [this page](./docs/ONNX_Runtime_Perf_Tuning.md).
 
+To tune performance for ONNX models, the [ONNX Go Live tool "OLive"](https://github.com/microsoft/OLive) provides an easy-to-use pipeline for converting models to ONNX and optimizing performance for inferencing with ONNX Runtime.
+
 ***
 # Examples and Tutorials
 ## Python
diff --git a/docs/ONNX_Runtime_Perf_Tuning.md b/docs/ONNX_Runtime_Perf_Tuning.md
index ec39b06e00..129fe7ba74 100644
--- a/docs/ONNX_Runtime_Perf_Tuning.md
+++ b/docs/ONNX_Runtime_Perf_Tuning.md
@@ -91,6 +91,8 @@ whether next task is ready or not. Use PASSIVE if your CPU usage already high, u
 Yes, we have created a tool named onnxruntime_perf_test.exe, and you find it at the build drop.
 You can use this tool to test all those knobs easily. Please find the usage of this tool by onnxruntime_perf_test.exe -h
 
+The [ONNX Go Live "OLive" tool](https://github.com/microsoft/OLive) provides an easy-to-use pipeline for converting models to ONNX and optimizing performance with ONNX Runtime. The tool can help identify the optimal runtime configuration to get the best performance on the target hardware for the model.
+
 ## How to enable profiling and view the generated JSON file?
 
 You can enable ONNX Runtime latency profiling in code: