onnxruntime/tools/python/util/optimize_onnx_model.py

#!/usr/bin/env python3
# Copyright (c) Microsoft Corporation. All rights reserved.
# Licensed under the MIT License.
from __future__ import annotations

import argparse
import os
import pathlib

from .onnx_model_utils import get_optimization_level, optimize_model


def optimize_model_helper():
    parser = argparse.ArgumentParser(
        f"{os.path.basename(__file__)}:{optimize_model_helper.__name__}",
        description="""
                                     Optimize an ONNX model using ONNX Runtime to the specified level.
                                     See https://onnxruntime.ai/docs/performance/model-optimizations/graph-optimizations.html for more
                                     details of the optimization levels.""",
    )

    parser.add_argument(
        "--opt_level",
        default="basic",
        choices=["disable", "basic", "extended", "all"],
        help="Optimization level to use.",
    )
    parser.add_argument(
        "--log_level",
        choices=["debug", "info", "warning", "error"],
        type=str,
        required=False,
        default="error",
        help="Log level. Defaults to Error so we don't get output about unused initializers "
        "being removed. Warning or Info may be desirable in some scenarios.",
    )

    parser.add_argument("input_model", type=pathlib.Path, help="Provide path to ONNX model to update.")
    parser.add_argument("output_model", type=pathlib.Path, help="Provide path to write optimized ONNX model to.")

    args = parser.parse_args()

    if args.log_level == "error":
        log_level = 3
    elif args.log_level == "debug":
        log_level = 0  # ORT verbose level
    elif args.log_level == "info":
        log_level = 1
    elif args.log_level == "warning":
        log_level = 2

    optimize_model(args.input_model, args.output_model, get_optimization_level(args.opt_level), log_level)


if __name__ == "__main__":
    optimize_model_helper()
Add range of helpers for making usage of ORT Mobile easier. (#10458) * Add range of helpers for making usage of ORT Mobile easier. 2022-02-17 21:35:25 +00:00			`#!/usr/bin/env python3`
			`# Copyright (c) Microsoft Corporation. All rights reserved.`
			`# Licensed under the MIT License.`
Update win-ort-main to tip main 250123 (#23473) ### Description This PR is to update the win-ort-main branch to the tip main branch as of 2025-01-23. ### PR List ddf0d377a7 [QNN EP] Add LoggingManager::HasDefaultLogger() to provider bridge API (#23467) 05fbbdf91f [QNN EP] Make QNN EP a shared library (#23120) 1336566d7f Add custom vcpkg ports (#23456) 2e1173c411 Update the compile flags for vcpkg packages (#23455) 1f628a9858 [Mobile] Add BrowserStack Android MAUI Test (#23383) 009cae0ec8 [js/webgpu] Optimize ConvTranspose (Continue) (#23429) 04a4a694cb Use onnx_protobuf.h to suppress some GCC warnings (#23453) 2e3b62b4b0 Suppress some strict-aliasing related warnings in WebGPU EP (#23454) b708f9b1dc Bump ruff from 0.9.1 to 0.9.2 (#23427) c0afc66b2a [WebNN] Remove workarounds for TFLite backend (#23406) 8a821ff7f9 Bump vite from 6.0.7 to 6.0.11 in /js/web/test/e2e/exports/testcases/vite-default (#23446) 220c1a203e Make ORT and Dawn use the same protobuf/abseil source code (#23447) b7b5792147 Change MacOS-13 to ubuntu on for android-java-api-aar-test.yml. (#23444) 19d0d2a30f WIP: Dp4MatMulNBits accuracy level 4 matmul for WebGPU EP (#23365) 95b8effbc4 [QNN EP]: Clean up QNN logging resources if an error occurs during initialization (#23435) 626134c5b5 Bump clang-format from 19.1.6 to 19.1.7 (#23428) 0cf975301f Fix eigen external deps (#23439) f9440aedce Moving RN_CI Android Testing to Linux (#23422) 1aa5902ff4 [QNN EP] workaround for QNN validation bug for Tanh with uint16 quantized output (#23432) 7f5582a0e2 Seperate RN andriod and IOS into 2 separated Stages. (#23400) 73deac2e7f Implement some missing element wise Add/Sub/Mul/Div/Neg operations for CPU and CUDA EPs (#23090) 949fe42af4 Upgrade Java version from react-native/android to Java 17 (#23066) 0892c23463 Update Qnn SDK default version to 2.30 (#23411) 94c099bcec Fix type cast build error (#23423) d633e571d1 [WebNN EP] Fix AddInitializersToSkip issues (#23354) e988ef00e2 [QNN EP] Fix regression for MatMul with two quantized/dynamic uint16 inputs (#23419) 7538795f6b Update onnxruntime binary size checks ci pipeline's docker image (#23405) 6c5ea41cad Revert "[QNN EP] Clean up correctly from a partial setup (#23320)" (#23420) e866804bbe Enable comprehension simplification in ruff rules (#23414) 0a5f1f392c bugfix: string_view of invalid memory (#23417) 4cc38e0277 fix crash when first input of BatchNormalization is 1-D (#23387) 033441487f Target py310 and modernize codebase with ruff (#23401) 87341ac010 [QNN EP] Fix segfault when unregistering HTP shared memory handles (#23402) ### Motivation and Context This update includes the change to make QNN-EP a shared library. --------- Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: Adrian Lizarraga <adlizarraga@microsoft.com> Co-authored-by: Justin Chu <justinchuby@users.noreply.github.com> Co-authored-by: Yulong Wang <7679871+fs-eire@users.noreply.github.com> Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com> Co-authored-by: Changming Sun <chasun@microsoft.com> Co-authored-by: Peishen Yan <peishen.yan@intel.com> Co-authored-by: Tianlei Wu <tlwu@microsoft.com> Co-authored-by: Hector Li <hecli@microsoft.com> Co-authored-by: Jian Chen <cjian@microsoft.com> Co-authored-by: Alexis Tsogias <1114095+Zyrin@users.noreply.github.com> Co-authored-by: junchao-zhao <68935141+junchao-loongson@users.noreply.github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: sushraja-msft <44513542+sushraja-msft@users.noreply.github.com> Co-authored-by: Wanming Lin <wanming.lin@intel.com> Co-authored-by: Jiajia Qin <jiajiaqin@microsoft.com> Co-authored-by: Caroline Zhu <wolfivyaura@gmail.com> 2025-01-23 17:12:03 +00:00			`from __future__ import annotations`
Add range of helpers for making usage of ORT Mobile easier. (#10458) * Add range of helpers for making usage of ORT Mobile easier. 2022-02-17 21:35:25 +00:00
			`import argparse`
			`import os`
			`import pathlib`

			`from .onnx_model_utils import get_optimization_level, optimize_model`


			`def optimize_model_helper():`
Format all python files under onnxruntime with black and isort (#11324) Description: Format all python files under onnxruntime with black and isort. After checking in, we can use .git-blame-ignore-revs to ignore the formatting PR in git blame. #11315, #11316 2022-04-26 16:35:16 +00:00			`parser = argparse.ArgumentParser(`
			`f"{os.path.basename(__file__)}:{optimize_model_helper.__name__}",`
			`description="""`
Add range of helpers for making usage of ORT Mobile easier. (#10458) * Add range of helpers for making usage of ORT Mobile easier. 2022-02-17 21:35:25 +00:00			`Optimize an ONNX model using ONNX Runtime to the specified level.`
Fix url in the code documentation (graph optimizations) (#16770) ### Description Fix a wrong url in the documentation as mentioned in issue #16678. ### Motivation and Context Better documentation. 2023-07-20 14:02:22 +00:00			`See https://onnxruntime.ai/docs/performance/model-optimizations/graph-optimizations.html for more`
Format all python files under onnxruntime with black and isort (#11324) Description: Format all python files under onnxruntime with black and isort. After checking in, we can use .git-blame-ignore-revs to ignore the formatting PR in git blame. #11315, #11316 2022-04-26 16:35:16 +00:00			`details of the optimization levels.""",`
			`)`

			`parser.add_argument(`
			`"--opt_level",`
			`default="basic",`
			`choices=["disable", "basic", "extended", "all"],`
			`help="Optimization level to use.",`
			`)`
			`parser.add_argument(`
			`"--log_level",`
			`choices=["debug", "info", "warning", "error"],`
			`type=str,`
			`required=False,`
			`default="error",`
			`help="Log level. Defaults to Error so we don't get output about unused initializers "`
			`"being removed. Warning or Info may be desirable in some scenarios.",`
			`)`

			`parser.add_argument("input_model", type=pathlib.Path, help="Provide path to ONNX model to update.")`
			`parser.add_argument("output_model", type=pathlib.Path, help="Provide path to write optimized ONNX model to.")`
Add range of helpers for making usage of ORT Mobile easier. (#10458) * Add range of helpers for making usage of ORT Mobile easier. 2022-02-17 21:35:25 +00:00
			`args = parser.parse_args()`

Format all python files under onnxruntime with black and isort (#11324) Description: Format all python files under onnxruntime with black and isort. After checking in, we can use .git-blame-ignore-revs to ignore the formatting PR in git blame. #11315, #11316 2022-04-26 16:35:16 +00:00			`if args.log_level == "error":`
Add range of helpers for making usage of ORT Mobile easier. (#10458) * Add range of helpers for making usage of ORT Mobile easier. 2022-02-17 21:35:25 +00:00			`log_level = 3`
Format all python files under onnxruntime with black and isort (#11324) Description: Format all python files under onnxruntime with black and isort. After checking in, we can use .git-blame-ignore-revs to ignore the formatting PR in git blame. #11315, #11316 2022-04-26 16:35:16 +00:00			`elif args.log_level == "debug":`
Add range of helpers for making usage of ORT Mobile easier. (#10458) * Add range of helpers for making usage of ORT Mobile easier. 2022-02-17 21:35:25 +00:00			`log_level = 0 # ORT verbose level`
Format all python files under onnxruntime with black and isort (#11324) Description: Format all python files under onnxruntime with black and isort. After checking in, we can use .git-blame-ignore-revs to ignore the formatting PR in git blame. #11315, #11316 2022-04-26 16:35:16 +00:00			`elif args.log_level == "info":`
Add range of helpers for making usage of ORT Mobile easier. (#10458) * Add range of helpers for making usage of ORT Mobile easier. 2022-02-17 21:35:25 +00:00			`log_level = 1`
Format all python files under onnxruntime with black and isort (#11324) Description: Format all python files under onnxruntime with black and isort. After checking in, we can use .git-blame-ignore-revs to ignore the formatting PR in git blame. #11315, #11316 2022-04-26 16:35:16 +00:00			`elif args.log_level == "warning":`
Add range of helpers for making usage of ORT Mobile easier. (#10458) * Add range of helpers for making usage of ORT Mobile easier. 2022-02-17 21:35:25 +00:00			`log_level = 2`

			`optimize_model(args.input_model, args.output_model, get_optimization_level(args.opt_level), log_level)`


Format all python files under onnxruntime with black and isort (#11324) Description: Format all python files under onnxruntime with black and isort. After checking in, we can use .git-blame-ignore-revs to ignore the formatting PR in git blame. #11315, #11316 2022-04-26 16:35:16 +00:00			`if __name__ == "__main__":`
Add range of helpers for making usage of ORT Mobile easier. (#10458) * Add range of helpers for making usage of ORT Mobile easier. 2022-02-17 21:35:25 +00:00			`optimize_model_helper()`