onnxruntime/onnxruntime
RajalakshmiSR 8564fc1933
POWER10: Add optimized dgemm kernel (#9652)
* POWER10: Add optimized dgemm kernel

This patch makes use of POWER10 matrix multiply assist feature and
adds new DGEMM kernel.

* Indentation update

Co-authored-by: Rajalakshmi Srinivasaraghavan <rajis@linux.ibm.com>
2021-11-22 20:28:21 -08:00
..
contrib_ops Decoder Attention CUDA Op (#9792) 2021-11-19 19:25:36 -08:00
core POWER10: Add optimized dgemm kernel (#9652) 2021-11-22 20:28:21 -08:00
gsl Change TensorShape to typically not allocate heap memory (#9542) 2021-11-08 10:29:54 -08:00
python Expose IOBinding SynchronizeInputs/Outputs via C/C++/C# And Python APIs (#9823) 2021-11-22 09:45:31 -08:00
test Expose IOBinding SynchronizeInputs/Outputs via C/C++/C# And Python APIs (#9823) 2021-11-22 09:45:31 -08:00
tool/etw
wasm Merged PR 6524907: Fix merge conflicts from public ORT to WindowsAI ORT 2021-10-01 22:47:52 +00:00
.style.yapf
__init__.py Merged PR 6524907: Fix merge conflicts from public ORT to WindowsAI ORT 2021-10-01 22:47:52 +00:00
ReformatSource.ps1
ReformatSourcePython.bat
VSCodeCoverage.runsettings Merged PR 6622174: merge latest onnxruntime into dmldev 2021-10-30 19:59:33 +00:00