onnxruntime/onnxruntime
Edward Chen 150c4cb8fe
[MLAS AArch64] SQNBitGemm CompInt8 kernel (#18953)
Implement ARM NEON SQNBitGemm kernel that first block quantizes A to int8 and then does int8 multiplication.
2024-01-12 17:58:08 -08:00
..
contrib_ops [MLAS AArch64] SQNBitGemm CompInt8 kernel (#18953) 2024-01-12 17:58:08 -08:00
core [MLAS AArch64] SQNBitGemm CompInt8 kernel (#18953) 2024-01-12 17:58:08 -08:00
python [Quantization] Fix get_qnn_qdq_config to use new scale/zp np.array data types (#19114) 2024-01-12 17:02:32 -08:00
test [MLAS AArch64] SQNBitGemm CompInt8 kernel (#18953) 2024-01-12 17:58:08 -08:00
tool/etw
wasm [js/web/training] Add CreateTrainingSession (#17891) 2023-10-26 09:22:10 -07:00
__init__.py Removed all the deprecated python training code and related tests and utils (#18333) 2023-11-17 18:19:21 -08:00
ReformatSource.ps1
ReformatSourcePython.bat
VSCodeCoverage.runsettings