onnxruntime/onnxruntime
snadampal 7fa6f4fca4
add arm64 bfloat16 fastmath mode option for transformers benchmarking script (#19294)
Add arm64 bfloat16 fastmath mode option for transformers benchmarking script.

### Motivation and Context
ONNX Runtime now supports bfloat16 fastmath GEMM kernels on arm64 platforms that support bfloat16 instructions. This PR updates the benchmark scripts so that mode can be tested.
2024-02-12 15:20:36 -08:00
contrib_ops Revert "Revert NeuralSpeed code for x64 MatMulNBits (#19382)" (#19474) 2024-02-09 09:24:54 -08:00
core Ovep 1.17.1 (#19482) 2024-02-12 12:31:08 -08:00
python add arm64 bfloat16 fastmath mode option for transformers benchmarking script (#19294) 2024-02-12 15:20:36 -08:00
test Disable CPU EP's allocator's arena when address sanitizer is enabled (#19485) 2024-02-12 09:39:49 -08:00
tool/etw
wasm [js/webgpu] Support capture and replay for jsep (#18989) 2024-01-30 18:28:03 -08:00
__init__.py [ORT 1.17.0 release] Bump up version to 1.18.0 (#19170) 2024-01-17 11:18:32 -08:00
ReformatSource.ps1
ReformatSourcePython.bat
VSCodeCoverage.runsettings