onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-06-16 01:33:39 +00:00

Ye Wang 4e670f7ab1 Support larger hidden size in Attention Cuda kernel (#7002)	2021-03-15 15:46:10 -07:00
* Support larger hidden size in Attention Cuda kernel
* Update attention_transpose.cu
* review comments
* fix typo and add check in quantization
* update readme
featurizer_ops	Update data_frame_tool to latest (#3919)	2020-05-18 21:13:56 -07:00
quantization	Reroute quantization tool readme to /docs page (#6854)	2021-03-02 13:49:42 -08:00
tensorrt/perf	Setup perf in docker and add features (#6582)	2021-02-25 09:31:03 -08:00
transformers	Support larger hidden size in Attention Cuda kernel (#7002)	2021-03-15 15:46:10 -07:00
__init__.py	Enable running PEP8 on python scripts using flake8 (#3928)	2020-05-15 07:15:06 +10:00
onnxruntime_test.py	Liqun/speech model loop to scan (#6070)	2021-01-05 15:15:23 -08:00
symbolic_shape_infer.py	Support symbolic shape infer in transformers tool (#6899)	2021-03-10 21:37:12 -08:00