onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-05-18 21:21:17 +00:00

History

Scott McKay eb8f6c7c52 Transpose optimizer enhancements (#15117 ) ### Description <!-- Describe your changes. --> - Add debug infrastructure to dump out model at various stages of transpose optimization. - Handle more scenarios where Transpose -> Reshape can be merged. - Run L1 optimizers after layout transform to constant fold initializers that had their layout changed. - Use cost check for Concat post layout transform as pushing a Transpose through it can potentially add Transpose nodes to multiple other inputs. - Update internal testing EP to support test where you want it to take all nodes, use NHWC layout, and to use dummy static kernels instead of compiling so the ops in the graph post-initialization can be counted. - Misc cleanup in InferenceSession to not unnecessarily pass args to TransposeGraph for class members. ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> Address perf issue seen with model where a Transpose gets blocked by a Reshape that could have been treated as a Transpose. --------- Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com>		2023-03-28 08:28:17 +10:00
..
common	Improve compatibility with certain STL's	2023-02-21 14:06:16 -08:00
eager
framework	FasterTransformer model wrapper using custom op (#15013 )	2023-03-20 09:05:30 -07:00
graph	Introduce RemovableAttributes (#14868 )	2023-03-07 12:37:12 +01:00
optimizer	Pass SessionOptions to XnnpackProviderFactoryCreator. (#13318 )	2022-12-10 14:23:46 +08:00
platform	Improve thread pool creation failure handling. (#13313 )	2022-10-15 17:57:19 -07:00
providers	remove disable_cpu_soft temporarily	2023-03-15 13:23:56 +08:00
session	Transpose optimizer enhancements (#15117 )	2023-03-28 08:28:17 +10:00