onnxruntime/onnxruntime
Ye Wang 5f516899bf
optimize a bert model converted using tf2onnx (#5492)
* optimize a bert model converted using tf2onnx

* add test data

* update

* remove comments

* format

* Revert "format"

This reverts commit f8ae88cb564bce5caf4780e56561403f3ba3d524.

* Revert "remove comments"

This reverts commit 59d8a693581a731fd0291b70fe2c9cec6c4950fe.

* add a squeeze node to convert a 3-d mask to 2-d

* update

* update

* verify and add comments
2020-12-01 11:19:16 -08:00
contrib_ops Add Longformer Attention Cuda Op(#5932) 2020-11-25 13:52:10 -08:00
core Use CUDA's IsAllFinite kernel for ROCm 2020-11-30 09:24:22 -08:00
featurizers_ops/cpu
gsl
python optimize a bert model converted using tf2onnx (#5492) 2020-12-01 11:19:16 -08:00
test [NNAPI EP] Update squeeze ops (#5946) 2020-11-26 21:00:54 +10:00
tool/etw
.style.yapf
__init__.py Expose knobs to create and share (CPU) allocators across sessions in C# and Python (#5634) 2020-11-21 14:12:33 -08:00
ReformatSource.ps1
ReformatSourcePython.bat
VSCodeCoverage.runsettings