onnxruntime/onnxruntime/python/tools
Ye Wang 4e670f7ab1
Support larger hidden size in Attention Cuda kernel (#7002)
* Support larger hidden size in Attention Cuda kernel

* Update attention_transpose.cu

* review comments

* fix typo and add check in quantization

* update readme
2021-03-15 15:46:10 -07:00
| Name | Last commit | Date |
| --- | --- | --- |
| featurizer_ops | Update data_frame_tool to latest (#3919) | 2020-05-18 21:13:56 -07:00 |
| quantization | Reroute quantization tool readme to /docs page (#6854) | 2021-03-02 13:49:42 -08:00 |
| tensorrt/perf | Setup perf in docker and add features (#6582) | 2021-02-25 09:31:03 -08:00 |
| transformers | Support larger hidden size in Attention Cuda kernel (#7002) | 2021-03-15 15:46:10 -07:00 |
| __init__.py | Enable running PEP8 on python scripts using flake8 (#3928) | 2020-05-15 07:15:06 +10:00 |
| onnxruntime_test.py | Liqun/speech model loop to scan (#6070) | 2021-01-05 15:15:23 -08:00 |
| symbolic_shape_infer.py | Support symbolic shape infer in transformers tool (#6899) | 2021-03-10 21:37:12 -08:00 |