onnxruntime/include/onnxruntime/core
Scott McKay eb8f6c7c52
Transpose optimizer enhancements (#15117)
### Description
<!-- Describe your changes. -->
- Add debug infrastructure to dump out model at various stages of
transpose optimization.
- Handle more scenarios where Transpose -> Reshape can be merged.
- Run L1 optimizers after layout transform to constant fold initializers
that had their layout changed.
- Use cost check for Concat post layout transform as pushing a Transpose
through it can potentially add Transpose nodes to multiple other inputs.
- Update internal testing EP to support test where you want it to take
all nodes, use NHWC layout, and to use dummy static kernels instead of
compiling so the ops in the graph post-initialization can be counted.
- Misc cleanup in InferenceSession to not unnecessarily pass args to
TransposeGraph for class members.

### Motivation and Context
<!-- - Why is this change required? What problem does it solve?
- If it fixes an open issue, please link to the issue here. -->
Address perf issue seen with model where a Transpose gets blocked by a
Reshape that could have been treated as a Transpose.

---------

Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com>
2023-03-28 08:28:17 +10:00
..
common Improve compatibility with certain STL's 2023-02-21 14:06:16 -08:00
eager
framework FasterTransformer model wrapper using custom op (#15013) 2023-03-20 09:05:30 -07:00
graph Introduce RemovableAttributes (#14868) 2023-03-07 12:37:12 +01:00
optimizer Pass SessionOptions to XnnpackProviderFactoryCreator. (#13318) 2022-12-10 14:23:46 +08:00
platform Improve thread pool creation failure handling. (#13313) 2022-10-15 17:57:19 -07:00
providers remove disable_cpu_soft temporarily 2023-03-15 13:23:56 +08:00
session Transpose optimizer enhancements (#15117) 2023-03-28 08:28:17 +10:00