onnxruntime/onnxruntime
Yi Zhang caa67439b5
Add more F16 kernels of XNNPack (#22381)
### Description
1. Add Gemm, MatMul, Softmax, AveragePool and Resize F16 kernels

This PR has included all changes in #22378


[AB#51066](https://aiinfra.visualstudio.com/6a833879-cd9b-44a4-a9de-adc2d818f13c/_workitems/edit/51066)

[AB#51026](https://aiinfra.visualstudio.com/6a833879-cd9b-44a4-a9de-adc2d818f13c/_workitems/edit/51026)

2. In XNNPack, matrix B must be const and the dim_size of matrices A and B
must NOT be bigger than 2, so I added 2 tests in matmul_test.cc to make sure
the kernel is really tested (that is, compute() must be called).
### Motivation and Context
2024-10-14 17:41:59 +08:00
..
contrib_ops DecoderMaskedMultiHeadAttention CPU kernel. (#22292) 2024-10-12 13:43:00 -07:00
core Add more F16 kernels of XNNPack (#22381) 2024-10-14 17:41:59 +08:00
lora Accomodate BE platforms. Make sure we always write flatbuffers LE (#22375) 2024-10-11 09:14:44 -07:00
python Fix Whisper export for FP16 CUDA (#22410) 2024-10-11 17:37:36 -07:00
test Add more F16 kernels of XNNPack (#22381) 2024-10-14 17:41:59 +08:00
tool/etw Fix typos - 1st Wave (#21278) 2024-07-11 13:35:08 +08:00
wasm [WebNN EP] Enable IO Bindings with MLTensor (#21301) 2024-09-27 17:24:21 -07:00
__init__.py Multi-Lora support (#22046) 2024-09-30 15:59:07 -07:00
ReformatSource.ps1
ReformatSourcePython.bat
VSCodeCoverage.runsettings