onnxruntime/tools
Scott McKay e7a524fea9
Update to allow large models to be checked for mobile support. (#18357)
### Description
<!-- Describe your changes. -->
Update usability checker and related infrastructure to support checking
models > 2GB.
- Add ability to set flag to keep initializers as external data
- we optimize the model as part of the checking so need to write out a
new copy.
- Handle issue with ONNX shape inferencing silently failing
- use API that supports large models but requires writing the model to a
new file
  - automate cleanup of that copy of the model

### Motivation and Context
<!-- - Why is this change required? What problem does it solve?
- If it fixes an open issue, please link to the issue here. -->
Allow analysis of LLMs to determine gaps for mobile usage.

---------

Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com>
2023-11-17 07:20:16 +10:00
..
android_custom_build Update NDK to 26.0.10792818 (#17852) 2023-10-12 14:08:43 -07:00
ci_build Adding new yaml file for downloading cuda, and trt from azure blob (#18443) 2023-11-14 19:47:39 -08:00
doc Disable PERF* rules in ruff to allow better readability (#16834) 2023-07-25 15:38:22 -07:00
nuget Rework/cleanup the C# build infrastructure for nuget packages. (#18127) 2023-11-03 09:05:17 -07:00
perf_view fixed #16873 (#16932) 2023-09-26 09:57:01 -07:00
python Update to allow large models to be checked for mobile support. (#18357) 2023-11-17 07:20:16 +10:00
scripts Remove dnf update from docker build scripts (#17551) 2023-09-21 07:33:29 -07:00