onnxruntime/onnxruntime
Viswanath Boga 6dee9b9d2d
attention fusion kernel refactoring (#8432)
* attention fusion kernel refactored

* consider the case of none in add_qk

* variabled added to check for pre-pack weights

* added a comment to PrePack()

* Optimized prepack and try to free the weights

* making comment sound better

* fixing a bug with optimizer.py

* commented out changes to be done

* removed comments

* make the private fn() private

* fix build

* making clean up fn static

* backed out optimizer tool change, needs more looking into
2021-07-23 17:46:39 -07:00
..
contrib_ops attention fusion kernel refactoring (#8432) 2021-07-23 17:46:39 -07:00
core Add more safety checks to the C API (#8474) 2021-07-23 15:41:27 -07:00
featurizers_ops/cpu
gsl Update C++ Standard from 14 to 17 (#8041) 2021-06-25 14:08:01 -07:00
python attention fusion kernel refactoring (#8432) 2021-07-23 17:46:39 -07:00
test Implement SparseTensor and infrastructure suppport and advance ONNX commit (#8038) 2021-07-22 15:24:36 -07:00
tool/etw
wasm Integrate onnxruntime-extensions into onnxruntime. (#8143) 2021-07-01 09:34:03 -07:00
.style.yapf
__init__.py Implement SparseTensor and infrastructure suppport and advance ONNX commit (#8038) 2021-07-22 15:24:36 -07:00
ReformatSource.ps1
ReformatSourcePython.bat
VSCodeCoverage.runsettings