onnxruntime/onnxruntime/python
Tianlei Wu 9407c3270c
GPT-2 attention fusion for transformers >= 4.27 (#16461)
### Description
Before transformers 4.27, the causal mask used the uint8 data type, so there
was an extra Cast node to convert it to bool. This adds a pattern
without the Cast node to support attention fusion for GPT-2 models exported
with transformers >= 4.27.
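
To illustrate the idea of matching the mask subgraph both with and without the Cast node, here is a minimal sketch. It is not the code from this change: `Node`, `match_parent_path`, `MASK_PATTERNS`, and `match_causal_mask` are hypothetical names over a toy single-parent graph, and the op sequences are shortened for illustration.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass
class Node:
    """Toy stand-in for a graph node (assumption: one parent per node)."""
    op_type: str
    parent: Optional["Node"] = None


def match_parent_path(node: Node, op_types: Sequence[str]):
    """Walk up from `node`; return the matched parents, or None on a mismatch."""
    matched = []
    current = node
    for op_type in op_types:
        current = current.parent
        if current is None or current.op_type != op_type:
            return None
        matched.append(current)
    return matched


# Patterns are tried in order (illustrative op sequences, not the real ones):
# 1. uint8 mask followed by a Cast to bool (older exports)
# 2. bool mask with no Cast (newer exports)
MASK_PATTERNS = (
    ("Cast", "Slice", "Slice"),
    ("Slice", "Slice"),
)


def match_causal_mask(where_node: Node):
    """Return (pattern, matched_nodes) for the first pattern that fits, else None."""
    for pattern in MASK_PATTERNS:
        path = match_parent_path(where_node, pattern)
        if path is not None:
            return pattern, path
    return None


# Usage: one graph shaped like an older export, one like a newer export.
old_export = Node("Where", Node("Cast", Node("Slice", Node("Slice"))))
new_export = Node("Where", Node("Slice", Node("Slice")))
old_match = match_causal_mask(old_export)
new_match = match_causal_mask(new_export)
```

Trying the longer pattern first and falling back to the shorter one keeps the older exports working while accepting graphs where the Cast node is absent.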

### Motivation and Context

https://github.com/microsoft/onnxruntime/issues/16453
2023-06-23 15:38:35 -07:00
..
backend Adopt linrtunner as the linting tool - take 2 (#15085) 2023-03-24 15:29:03 -07:00
datasets Adopt linrtunner as the linting tool - take 2 (#15085) 2023-03-24 15:29:03 -07:00
providers/tvm Bump ruff in CI (#15533) 2023-04-17 10:11:44 -07:00
tools GPT-2 attention fusion for transformers >= 4.27 (#16461) 2023-06-23 15:38:35 -07:00
torch_cpp_extensions Adopt linrtunner as the linting tool - take 2 (#15085) 2023-03-24 15:29:03 -07:00
training
__init__.py
_ld_preload.py
_pybind_state.py.in
exported_symbols.lst
numpy_helper.h
onnxruntime_collect_build_info.py Adopt linrtunner as the linting tool - take 2 (#15085) 2023-03-24 15:29:03 -07:00
onnxruntime_inference_collection.py [CANN] Add IOBinding Support For CANN EP (#15802) 2023-06-01 03:13:38 -07:00
onnxruntime_pybind.h
onnxruntime_pybind_exceptions.cc Run clang-format in CI (#15524) 2023-04-18 09:26:58 -07:00
onnxruntime_pybind_exceptions.h
onnxruntime_pybind_iobinding.cc Run clang-format in CI (#15524) 2023-04-18 09:26:58 -07:00
onnxruntime_pybind_mlvalue.cc [CANN] Add IOBinding Support For CANN EP (#15802) 2023-06-01 03:13:38 -07:00
onnxruntime_pybind_mlvalue.h [CANN] Add IOBinding Support For CANN EP (#15802) 2023-06-01 03:13:38 -07:00
onnxruntime_pybind_module.cc Expose build information in dynamic lib (#15643) 2023-04-28 21:57:31 -07:00
onnxruntime_pybind_ortvalue.cc [CANN] Add IOBinding Support For CANN EP (#15802) 2023-06-01 03:13:38 -07:00
onnxruntime_pybind_schema.cc Fix issues on Windows for Vitis AI (#15810) 2023-05-04 14:42:19 -07:00
onnxruntime_pybind_sparse_tensor.cc
onnxruntime_pybind_state.cc CUDA graph support for TRT EP (#16081) 2023-06-21 09:36:45 -07:00
onnxruntime_pybind_state.h
onnxruntime_pybind_state_common.cc
onnxruntime_pybind_state_common.h Remove onnxruntime_PYBIND_EXPORT_OPSCHEMA definition from onnxruntime (#15776) 2023-05-03 13:08:35 -07:00
onnxruntime_validation.py Adopt linrtunner as the linting tool - take 2 (#15085) 2023-03-24 15:29:03 -07:00
pybind.def
version_script.lds
version_script_expose_onnx_protobuf.lds