..
compute_optimizer
Define recomputable op list with domain/opset ( #19722 )
2024-03-07 09:12:12 +08:00
layout_transformation
Integration with ONNX 1.16.0 ( #19745 )
2024-04-12 09:46:49 -07:00
qdq_transformer
Update the functions in tensorprotoutils.h to use std::filesystem::path instead ( #20920 )
2024-06-28 20:03:57 -07:00
selectors_actions
MatMulNBits + Add fusion ( #20587 )
2024-05-16 11:00:59 -07:00
transpose_optimization
Update the functions in tensorprotoutils.h to use std::filesystem::path instead ( #20920 )
2024-06-28 20:03:57 -07:00
attention_fusion.cc
attention_fusion.h
Run clang-format in CI ( #15524 )
2023-04-18 09:26:58 -07:00
attention_fusion_helper.h
Replace float_t with float ( #16484 )
2023-06-27 09:28:38 -07:00
bias_dropout_fusion.cc
bias_dropout_fusion.h
bias_gelu_fusion.cc
[CUDA] Optimize BiasGelu/BiasGeluGrad Kernel ( #16608 )
2023-07-07 08:28:38 +08:00
bias_gelu_fusion.h
bias_softmax_fusion.cc
Run clang-format in CI ( #15524 )
2023-04-18 09:26:58 -07:00
bias_softmax_fusion.h
cast_elimination.cc
cast_elimination.h
common_subexpression_elimination.cc
Some Shape Related Fusions ( #19832 )
2024-03-12 10:29:27 +08:00
common_subexpression_elimination.h
concat_slice_elimination.cc
ORTModule log clean up ( #16795 )
2023-07-26 12:42:50 +08:00
concat_slice_elimination.h
Run clang-format in CI ( #15524 )
2023-04-18 09:26:58 -07:00
constant_folding.cc
Make session configuration options available to kernels via OpKernelInfo ( #18897 )
2024-01-13 10:02:43 +10:00
constant_folding.h
Make session configuration options available to kernels via OpKernelInfo ( #18897 )
2024-01-13 10:02:43 +10:00
constant_sharing.cc
Optimize constant sharing perf ( #20143 )
2024-04-09 12:04:36 +08:00
constant_sharing.h
Optimize constant sharing perf ( #20143 )
2024-04-09 12:04:36 +08:00
conv_activation_fusion.cc
Fix shape conv fuse opt ( #20282 )
2024-04-23 16:19:57 -07:00
conv_activation_fusion.h
conv_add_act_fusion.cc
[JS/Web]Added FusedConv. ( #17766 )
2023-11-01 15:34:51 -07:00
conv_add_act_fusion.h
conv_add_fusion.cc
conv_add_fusion.h
conv_bn_fusion.cc
conv_bn_fusion.h
conv_mul_fusion.cc
conv_mul_fusion.h
div_mul_fusion.cc
Run clang-format in CI ( #15524 )
2023-04-18 09:26:58 -07:00
div_mul_fusion.h
double_qdq_pairs_remover.cc
Extend DoubleQDQPairsRemover to handle sequences that end in duplicate DQ nodes ( #20759 )
2024-05-24 18:30:15 -07:00
double_qdq_pairs_remover.h
Extend DoubleQDQPairsRemover to handle sequences that end in duplicate DQ nodes ( #20759 )
2024-05-24 18:30:15 -07:00
dropout_elimination.cc
Run clang-format in CI ( #15524 )
2023-04-18 09:26:58 -07:00
dropout_elimination.h
dynamic_quantize_matmul_fusion.cc
dynamic_quantize_matmul_fusion.h
embed_layer_norm_fusion.cc
embed_layer_norm_fusion.h
expand_elimination.cc
Run clang-format in CI ( #15524 )
2023-04-18 09:26:58 -07:00
expand_elimination.h
fast_gelu_fusion.cc
Introduce float 8 types ( #14731 )
2023-05-30 13:25:58 -07:00
fast_gelu_fusion.h
Run clang-format in CI ( #15524 )
2023-04-18 09:26:58 -07:00
free_dim_override_transformer.cc
free_dim_override_transformer.h
gather_fusion.cc
Bug fix for gather fusion with on-device training ( #20891 )
2024-06-03 14:41:39 -07:00
gather_fusion.h
Merge GatherToSplitFusion and #19218 to a General Fusion ( #19600 )
2024-02-29 13:45:58 +08:00
gelu_approximation.cc
gelu_approximation.h
Run clang-format in CI ( #15524 )
2023-04-18 09:26:58 -07:00
gelu_fusion.cc
gelu_fusion.h
gemm_activation_fusion.cc
Disable gemm activation for non-float data types ( #19612 )
2024-02-23 18:20:22 -08:00
gemm_activation_fusion.h
gemm_sum_fusion.cc
Release backward inputs per static graph ref count ( #20804 )
2024-06-14 14:33:01 +08:00
gemm_sum_fusion.h
Run clang-format in CI ( #15524 )
2023-04-18 09:26:58 -07:00
gemm_transpose_fusion.cc
Fuse Cast + SoftmaxCrossEntropyLossInternal ( #20334 )
2024-04-29 14:12:10 +08:00
gemm_transpose_fusion.h
Run clang-format in CI ( #15524 )
2023-04-18 09:26:58 -07:00
graph_transformer.cc
General INFO logging tracking occurance of GraphTransformer modification ( #17819 )
2023-10-06 17:03:26 -07:00
graph_transformer_mgr.cc
Re-organize the transpose optimization and layout transformation files. ( #16246 )
2023-07-07 08:24:47 +10:00
graph_transformer_mgr.h
graph_transformer_utils.cc
Rollback 19832, Remove shape_input_merge Fusion ( #21179 )
2024-06-26 10:00:45 -07:00
identical_children_consolidation.cc
Fix static analysis bug ( #16114 )
2023-05-28 10:58:07 -07:00
identical_children_consolidation.h
Fix static analysis bug ( #16114 )
2023-05-28 10:58:07 -07:00
identity_elimination.cc
Run clang-format in CI ( #15524 )
2023-04-18 09:26:58 -07:00
identity_elimination.h
initializer.cc
Update the functions in tensorprotoutils.h to use std::filesystem::path instead ( #20920 )
2024-06-28 20:03:57 -07:00
initializer.h
Update the functions in tensorprotoutils.h to use std::filesystem::path instead ( #20920 )
2024-06-28 20:03:57 -07:00
insert_cast_transformer.cc
Fix cast removal bug ( #17953 )
2023-10-31 15:48:32 -07:00
insert_cast_transformer.h
Refine cast optimizer for safety ( #15658 )
2023-04-28 09:32:54 -07:00
isinf_reducesum_fusion.cc
Introduce float 8 types ( #14731 )
2023-05-30 13:25:58 -07:00
isinf_reducesum_fusion.h
label_encoder_fusion.cc
Label encoder fusion ( #19761 )
2024-04-01 09:41:10 -07:00
label_encoder_fusion.h
Label encoder fusion ( #19761 )
2024-04-01 09:41:10 -07:00
layer_norm_fusion.cc
Keep original name during fusion ( #20097 )
2024-03-28 08:40:34 +08:00
layer_norm_fusion.h
matmul_activation_fusion.cc
Run clang-format in CI ( #15524 )
2023-04-18 09:26:58 -07:00
matmul_activation_fusion.h
[DML EP] Add MatMul + SoftMax fusion ( #15240 )
2023-04-11 08:31:04 -07:00
matmul_add_fusion.cc
Release backward inputs per static graph ref count ( #20804 )
2024-06-14 14:33:01 +08:00
matmul_add_fusion.h
Run clang-format in CI ( #15524 )
2023-04-18 09:26:58 -07:00
matmul_bn_fusion.cc
Introduce new optimizer MatMul + BatchNormalization ( #17915 )
2023-10-24 19:41:10 -07:00
matmul_bn_fusion.h
Introduce new optimizer MatMul + BatchNormalization ( #17915 )
2023-10-24 19:41:10 -07:00
matmul_integer_to_float.cc
[DML] MatrixMultiplyIntegerToFloat ( #19608 )
2024-03-04 11:55:35 -08:00
matmul_integer_to_float.h
matmul_nbits_fusion.cc
MatMulNBits + Add fusion ( #20587 )
2024-05-16 11:00:59 -07:00
matmul_nbits_fusion.h
MatMulNBits + Add fusion ( #20587 )
2024-05-16 11:00:59 -07:00
matmul_scale_fusion.cc
Update the functions in tensorprotoutils.h to use std::filesystem::path instead ( #20920 )
2024-06-28 20:03:57 -07:00
matmul_scale_fusion.h
Add #pragma once to matmul_scale_fusion.h. ( #20538 )
2024-05-02 15:38:11 -07:00
matmul_transpose_fusion.cc
Keep original name during fusion ( #20097 )
2024-03-28 08:40:34 +08:00
matmul_transpose_fusion.h
nchwc_transformer.cc
nchwc_transformer.h
nhwc_transformer.cc
Re-organize the transpose optimization and layout transformation files. ( #16246 )
2023-07-07 08:24:47 +10:00
nhwc_transformer.h
Re-organize the transpose optimization and layout transformation files. ( #16246 )
2023-07-07 08:24:47 +10:00
noop_elimination.cc
Optimize KahnsTopologicalSort and PriorityNodeCompare ( #19475 )
2024-02-16 05:34:55 -08:00
noop_elimination.h
not_where_fusion.cc
Run clang-format in CI ( #15524 )
2023-04-18 09:26:58 -07:00
not_where_fusion.h
optimizer_execution_frame.cc
Update the functions in tensorprotoutils.h to use std::filesystem::path instead ( #20920 )
2024-06-28 20:03:57 -07:00
optimizer_execution_frame.h
Update the functions in tensorprotoutils.h to use std::filesystem::path instead ( #20920 )
2024-06-28 20:03:57 -07:00
pad_fusion.cc
Fix crash in PadFusion ( #18557 )
2023-12-22 11:03:34 -08:00
pad_fusion.h
Introduce new optimizer Pad + Conv/MaxPool ( #18136 )
2023-11-03 07:17:02 -07:00
pre_shape_node_elimination.cc
Eliminate safe nodes that are followed by a shape node. ( #16065 )
2023-06-26 16:35:07 +08:00
pre_shape_node_elimination.h
Eliminate safe nodes that are followed by a shape node. ( #16065 )
2023-06-26 16:35:07 +08:00
propagate_cast_ops.cc
Fuse Cast + SoftmaxCrossEntropyLossInternal ( #20334 )
2024-04-29 14:12:10 +08:00
propagate_cast_ops.h
Run clang-format in CI ( #15524 )
2023-04-18 09:26:58 -07:00
quick_gelu_fusion.cc
Keep original name during fusion ( #20097 )
2024-03-28 08:40:34 +08:00
quick_gelu_fusion.h
relu_clip_fusion.cc
relu_clip_fusion.h
reshape_fusion.cc
ORTModule log clean up ( #16795 )
2023-07-26 12:42:50 +08:00
reshape_fusion.h
rocm_blas_alt_impl.cc
rocm_blas_alt_impl.h
rule_based_graph_transformer.cc
Run clang-format in CI ( #15524 )
2023-04-18 09:26:58 -07:00
skip_layer_norm_fusion.cc
Broadcasting for SLN for CPU and CUDA ( #16510 )
2023-08-07 09:55:42 -07:00
skip_layer_norm_fusion.h
slice_elimination.cc
slice_elimination.h
stft_decomposition.cc
Implement STFT Decomposition transformer ( #19725 )
2024-03-08 15:02:58 -08:00
stft_decomposition.h
Implement STFT Decomposition transformer ( #19725 )
2024-03-08 15:02:58 -08:00
transformer_memcpy.cc
[ORTModule] ATen Efficient Attention and Triton Flash Attention ( #17959 )
2023-10-27 10:29:27 +08:00
transformer_memcpy.h
transpose_optimizer.cc
Resize and EP specific transpose optimization updates ( #17664 )
2023-09-29 08:11:36 +10:00
transpose_optimizer.h
Resize and EP specific transpose optimization updates ( #17664 )
2023-09-29 08:11:36 +10:00
unsqueeze_elimination.cc
unsqueeze_elimination.h
utils.cc
Fuse Cast + SoftmaxCrossEntropyLossInternal ( #20334 )
2024-04-29 14:12:10 +08:00
utils.h