onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-06-03 23:49:44 +00:00

History

Yi Zhang caa67439b5 Add more F16 kernels of XNNPack (#22381 ) ### Description 1. Add Gemm, MatMul, Softmax, AveragePool and Resize F16 kernels This PR has included all changes in #22378 [AB#51066](https://aiinfra.visualstudio.com/6a833879-cd9b-44a4-a9de-adc2d818f13c/_workitems/edit/51066) [AB#51026](https://aiinfra.visualstudio.com/6a833879-cd9b-44a4-a9de-adc2d818f13c/_workitems/edit/51026) 2. Matrix B must be const and martrix A and B dim_size shoule NOT bigger than 2 in XNNPack, so I added 2 tests in matmul_test.cc to make sure it's really tested. (that is, compute() must be called.) ### Motivation and Context		2024-10-14 17:41:59 +08:00
..
contrib_ops	DecoderMaskedMultiHeadAttention CPU kernel. (#22292 )	2024-10-12 13:43:00 -07:00
core	Add more F16 kernels of XNNPack (#22381 )	2024-10-14 17:41:59 +08:00
lora	Accomodate BE platforms. Make sure we always write flatbuffers LE (#22375 )	2024-10-11 09:14:44 -07:00
python	Fix Whisper export for FP16 CUDA (#22410 )	2024-10-11 17:37:36 -07:00
test	Add more F16 kernels of XNNPack (#22381 )	2024-10-14 17:41:59 +08:00
tool/etw	Fix typos - 1st Wave (#21278 )	2024-07-11 13:35:08 +08:00
wasm	[WebNN EP] Enable IO Bindings with MLTensor (#21301 )	2024-09-27 17:24:21 -07:00
__init__.py	Multi-Lora support (#22046 )	2024-09-30 15:59:07 -07:00
ReformatSource.ps1
ReformatSourcePython.bat
VSCodeCoverage.runsettings