onnxruntime/onnxruntime/core/optimizer
cloudhan 7ed3bfde51
Fix FusedConv for ROCm (#15460)
1. Fix undesired runtime optimization for non-Relu activation.
3. Fix false positive runtime error log due to fusion failure.
2023-04-17 11:41:00 +08:00
..
compute_optimizer Optimize SCE loss compute (#15401) 2023-04-13 13:02:12 +08:00
qdq_transformer [QNN EP] Qnn batchnorm Op support (#15222) 2023-04-10 10:36:57 -07:00
selectors_actions Switch GSL to MS GSL 4.0.0 (#13416) 2022-10-29 04:15:20 -07:00
transpose_optimizer Make internal nhwc schema registrations complete (#15278) 2023-03-30 08:55:14 -07:00
attention_fusion.cc Add Bert/GPT2 fusion change for new attribute mask_filter_value in ORT optimizer (#14333) 2023-01-19 12:52:09 -08:00
attention_fusion.h Refactor transformers and other code to reduce memory allocation calls (#10523) 2022-02-24 16:17:14 -08:00
attention_fusion_helper.h Fix CI build with no Abseil. (#14400) 2023-01-24 09:17:35 -08:00
bias_dropout_fusion.cc [CUDA] Implement BitmaskDropout, BitmaskBiasDropout and BitmaskDropoutGrad (#11534) 2022-05-27 17:24:47 +08:00
bias_dropout_fusion.h Refactor transformers and other code to reduce memory allocation calls (#10523) 2022-02-24 16:17:14 -08:00
bias_gelu_fusion.cc Refactor transformers and other code to reduce memory allocation calls (#10523) 2022-02-24 16:17:14 -08:00
bias_gelu_fusion.h Refactor transformers and other code to reduce memory allocation calls (#10523) 2022-02-24 16:17:14 -08:00
bias_softmax_fusion.cc Test and fix optimizers LayerNormFusion, BiasSoftmaxFusion, Transpose for opset 18 (#14542) 2023-02-08 14:11:31 -08:00
bias_softmax_fusion.h Refactor transformers and other code to reduce memory allocation calls (#10523) 2022-02-24 16:17:14 -08:00
cast_elimination.cc
cast_elimination.h
common_subexpression_elimination.cc Ignore DequantizeLinear nodes in CommonSubexpressionElimination optimizer (#10934) 2022-03-23 08:46:01 +10:00
common_subexpression_elimination.h Refactor transformers and other code to reduce memory allocation calls (#10523) 2022-02-24 16:17:14 -08:00
concat_slice_elimination.cc Refactor transformers and other code to reduce memory allocation calls (#10523) 2022-02-24 16:17:14 -08:00
concat_slice_elimination.h Refactor transformers and other code to reduce memory allocation calls (#10523) 2022-02-24 16:17:14 -08:00
constant_folding.cc Add constant folding for simple QDQ Node Units (#15138) 2023-03-24 08:46:07 +10:00
constant_folding.h Refactor transformers and other code to reduce memory allocation calls (#10523) 2022-02-24 16:17:14 -08:00
constant_sharing.cc Share more constant initializers (#15461) 2023-04-14 07:41:07 -07:00
constant_sharing.h Share scalar constant initializer (#12878) 2022-10-10 13:32:33 +08:00
conv_activation_fusion.cc Fix FusedConv for ROCm (#15460) 2023-04-17 11:41:00 +08:00
conv_activation_fusion.h Convert ConvActivationFusion transformer to a selector action transformer. (#10687) 2022-03-02 13:47:55 +10:00
conv_add_act_fusion.cc Enable LeakyRelu latest and refactor fast_gelu_fusion to enable the script (#15003) 2023-03-13 14:20:11 -07:00
conv_add_act_fusion.h fuse Conv+Add+activation for CPU from different op-branch (#10987) 2022-04-01 09:25:17 +08:00
conv_add_fusion.cc
conv_add_fusion.h
conv_bn_fusion.cc Include opset 15 in Conv+BatchNormalization fusion (#11960) 2022-06-27 10:59:14 -07:00
conv_bn_fusion.h
conv_mul_fusion.cc
conv_mul_fusion.h
div_mul_fusion.cc
div_mul_fusion.h
double_qdq_pairs_remover.cc Removing Double QDQ from Graphs (#14024) 2023-01-16 19:06:57 -08:00
double_qdq_pairs_remover.h Removing Double QDQ from Graphs (#14024) 2023-01-16 19:06:57 -08:00
dropout_elimination.cc Create dedicated build for training api (#14136) 2023-01-10 20:58:04 -08:00
dropout_elimination.h
dynamic_quantize_matmul_fusion.cc Refactor transformers and other code to reduce memory allocation calls (#10523) 2022-02-24 16:17:14 -08:00
dynamic_quantize_matmul_fusion.h Refactor transformers and other code to reduce memory allocation calls (#10523) 2022-02-24 16:17:14 -08:00
embed_layer_norm_fusion.cc Switch GSL to MS GSL 4.0.0 (#13416) 2022-10-29 04:15:20 -07:00
embed_layer_norm_fusion.h Refactor transformers and other code to reduce memory allocation calls (#10523) 2022-02-24 16:17:14 -08:00
expand_elimination.cc
expand_elimination.h
fast_gelu_fusion.cc Enable LeakyRelu latest and refactor fast_gelu_fusion to enable the script (#15003) 2023-03-13 14:20:11 -07:00
fast_gelu_fusion.h Refactor transformers and other code to reduce memory allocation calls (#10523) 2022-02-24 16:17:14 -08:00
free_dim_override_transformer.cc Ensure graph resolve occurs after free dimension is overridden (#13634) 2022-11-14 18:39:29 -08:00
free_dim_override_transformer.h Switch GSL to MS GSL 4.0.0 (#13416) 2022-10-29 04:15:20 -07:00
gather_fusion.cc Fix Gather to Split optimizer (#14478) 2023-02-02 13:29:44 -08:00
gather_fusion.h Fix Gather to Split optimizer (#14478) 2023-02-02 13:29:44 -08:00
gelu_approximation.cc Refactor transformers and other code to reduce memory allocation calls (#10523) 2022-02-24 16:17:14 -08:00
gelu_approximation.h Refactor transformers and other code to reduce memory allocation calls (#10523) 2022-02-24 16:17:14 -08:00
gelu_fusion.cc Refactor transformers and other code to reduce memory allocation calls (#10523) 2022-02-24 16:17:14 -08:00
gelu_fusion.h Refactor transformers and other code to reduce memory allocation calls (#10523) 2022-02-24 16:17:14 -08:00
gemm_activation_fusion.cc Clarify usage of kOnnxDomainAlias. (#10962) 2022-03-25 09:52:59 +10:00
gemm_activation_fusion.h Refactor transformers and other code to reduce memory allocation calls (#10523) 2022-02-24 16:17:14 -08:00
gemm_sum_fusion.cc
gemm_sum_fusion.h
gemm_transpose_fusion.cc
gemm_transpose_fusion.h
graph_transformer.cc
graph_transformer_mgr.cc
graph_transformer_mgr.h Refactor transformers and other code to reduce memory allocation calls (#10523) 2022-02-24 16:17:14 -08:00
graph_transformer_utils.cc Share more constant initializers (#15461) 2023-04-14 07:41:07 -07:00
identical_children_consolidation.cc IdentityBuilder should add Delimit for each input (#14592) 2023-02-06 15:36:42 -08:00
identical_children_consolidation.h Consolidate Identical Children Nodes (#14026) 2023-01-06 09:03:10 -08:00
identity_elimination.cc
identity_elimination.h
initializer.cc Switch GSL to MS GSL 4.0.0 (#13416) 2022-10-29 04:15:20 -07:00
initializer.h [xnnpack] Have Initializer in Mobile related EPs in Minimal_build and creating EP specific dynamic-schema (#12555) 2022-09-06 14:32:15 +08:00
insert_cast_transformer.cc Share scalar constant initializer (#12878) 2022-10-10 13:32:33 +08:00
insert_cast_transformer.h
isinf_reducesum_fusion.cc Refactor transformers and other code to reduce memory allocation calls (#10523) 2022-02-24 16:17:14 -08:00
isinf_reducesum_fusion.h Refactor transformers and other code to reduce memory allocation calls (#10523) 2022-02-24 16:17:14 -08:00
layer_norm_fusion.cc Fix simplified layer norm fusion for training (#14866) 2023-03-07 13:59:20 -08:00
layer_norm_fusion.h Fix simplified layer norm fusion for training (#14866) 2023-03-07 13:59:20 -08:00
matmul_activation_fusion.cc [DML EP] Add MatMul + SoftMax fusion (#15240) 2023-04-11 08:31:04 -07:00
matmul_activation_fusion.h [DML EP] Add MatMul + SoftMax fusion (#15240) 2023-04-11 08:31:04 -07:00
matmul_add_fusion.cc
matmul_add_fusion.h Refactor transformers and other code to reduce memory allocation calls (#10523) 2022-02-24 16:17:14 -08:00
matmul_integer_to_float.cc Refactor transformers and other code to reduce memory allocation calls (#10523) 2022-02-24 16:17:14 -08:00
matmul_integer_to_float.h Refactor transformers and other code to reduce memory allocation calls (#10523) 2022-02-24 16:17:14 -08:00
matmul_scale_fusion.cc rocblas alt impl during backward pass only (#13352) 2022-11-10 00:47:06 +08:00
matmul_scale_fusion.h Refactor transformers and other code to reduce memory allocation calls (#10523) 2022-02-24 16:17:14 -08:00
matmul_transpose_fusion.cc rocblas alt impl during backward pass only (#13352) 2022-11-10 00:47:06 +08:00
matmul_transpose_fusion.h Refactor transformers and other code to reduce memory allocation calls (#10523) 2022-02-24 16:17:14 -08:00
nchwc_transformer.cc code clean (#12392) 2022-08-01 14:12:35 +08:00
nchwc_transformer.h
nhwc_transformer.cc Refactor the cost check used by the transpose optimizer (#14690) 2023-02-22 08:56:29 +10:00
nhwc_transformer.h make nhwc transformer only apply to CPU ep. (#11882) 2022-09-16 18:46:28 +10:00
noop_elimination.cc Support More Cases in NoOpElimination (#13460) 2022-11-01 10:39:52 +08:00
noop_elimination.h Support More Cases in NoOpElimination (#13460) 2022-11-01 10:39:52 +08:00
not_where_fusion.cc
not_where_fusion.h
optimizer_execution_frame.cc remove device_id parameter out of ExecutionProvider::GetAllocator() (#14580) 2023-02-13 10:01:07 -08:00
optimizer_execution_frame.h remove device_id parameter out of ExecutionProvider::GetAllocator() (#14580) 2023-02-13 10:01:07 -08:00
propagate_cast_ops.cc Switch GSL to MS GSL 4.0.0 (#13416) 2022-10-29 04:15:20 -07:00
propagate_cast_ops.h Refactor transformers and other code to reduce memory allocation calls (#10523) 2022-02-24 16:17:14 -08:00
quick_gelu_fusion.cc Do not apply QuickGeluFusion if an intermediate tensor is a graph output (#15109) 2023-04-06 10:17:06 -07:00
quick_gelu_fusion.h QuickGelu Fusion (#12417) 2022-10-28 18:12:07 +08:00
relu_clip_fusion.cc
relu_clip_fusion.h
reshape_fusion.cc Add support for 'axes' attr of unsqueeze in opset 13 and add ut (#14071) 2023-01-12 10:45:15 +08:00
reshape_fusion.h Refactor transformers and other code to reduce memory allocation calls (#10523) 2022-02-24 16:17:14 -08:00
rocm_blas_alt_impl.cc rocblas alt impl during backward pass only (#13352) 2022-11-10 00:47:06 +08:00
rocm_blas_alt_impl.h rocblas alt impl during backward pass only (#13352) 2022-11-10 00:47:06 +08:00
rule_based_graph_transformer.cc Refactor transformers and other code to reduce memory allocation calls (#10523) 2022-02-24 16:17:14 -08:00
skip_layer_norm_fusion.cc SkipLayerNorm fusion with different input and output type (#15500) 2023-04-13 23:07:47 -07:00
skip_layer_norm_fusion.h Refactor transformers and other code to reduce memory allocation calls (#10523) 2022-02-24 16:17:14 -08:00
slice_elimination.cc Refactor transformers and other code to reduce memory allocation calls (#10523) 2022-02-24 16:17:14 -08:00
slice_elimination.h
transformer_memcpy.cc
transformer_memcpy.h
unsqueeze_elimination.cc Refactor transformers and other code to reduce memory allocation calls (#10523) 2022-02-24 16:17:14 -08:00
unsqueeze_elimination.h
utils.cc Share more constant initializers (#15461) 2023-04-14 07:41:07 -07:00
utils.h Add Bert/GPT2 fusion change for new attribute mask_filter_value in ORT optimizer (#14333) 2023-01-19 12:52:09 -08:00