onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-06-12 00:59:23 +00:00

History

Adrian Lizarraga 514b4699b4 [QNN EP] Apply workaround for Conv validation bug when bias input is implicit (#21764 ) ### Description - Adds a dummy bias of all zeros when translating a Conv without an explicit bias input. This is a workaround for a QNN validation issue that fails when the optional bias input is not provided. - Corrects logic for unpacking of non-zero int4 zero-points. Bug does not impact models because we currently only support int4 zero-points equal to 0 (symmetric quant). But this would become an issue in the future if/when QNN supports non-zero int4 zero-points (so good to fix now). ### Motivation and Context Support Conv operators without a bias input on QNN EP with the latest QNN SDK.		2024-08-22 10:38:03 -07:00
..
common	add transform part of the dq matmul tool chain (#21374 )	2024-07-19 22:55:15 -07:00
contrib_ops	[CUDA] cuDNN Flash Attention (#21629 )	2024-08-20 08:50:22 -07:00
cuda_host
custom_op_registration
debug_node_inputs_outputs
flatbuffers	Enablement of onnxruntime for AIX and fixing issues related to big-endian platform. (#21133 )	2024-07-17 12:37:06 -07:00
framework	Fail tests gracefully for the minimal cuda build (#21391 )	2024-08-02 18:27:36 -07:00
fuzzing	Fix typos according to reviewdog report. (#21335 )	2024-07-22 13:37:32 -07:00
global_thread_pools	Remove references to the outdated CUDA EP factory method (#21549 )	2024-07-29 21:59:16 -07:00
ir	Fix typos according to reviewdog report. (#21335 )	2024-07-22 13:37:32 -07:00
logging_apis
mlas	Mlas int4 int8 with avx2/512 (#20687 )	2024-08-02 10:20:22 -07:00
onnx	[QNN EP] Add support for GatherElements (#15966 )	2024-08-19 14:33:40 -07:00
opaque_api
optimizer	[TransposeOptimizer] Fix axis for QuantizeLinear inserted after DQ (per-channel) -> Unsqueeze (#21793 )	2024-08-20 16:26:02 -07:00
perftest	OVEP - PR 1.19 (#21443 )	2024-07-24 23:45:31 -07:00
platform	Fix typos according to reviewdog report. (#21335 )	2024-07-22 13:37:32 -07:00
proto
providers	[QNN EP] Apply workaround for Conv validation bug when bias input is implicit (#21764 )	2024-08-22 10:38:03 -07:00
python	[CUDA] Update benchmark_mha.py to capture debug info to identify sdpa kernel (#21804 )	2024-08-21 17:30:16 -07:00
quantization
shared_lib	Remove references to the outdated CUDA EP factory method (#21549 )	2024-07-29 21:59:16 -07:00
testdata	[TransposeOptimizer] Fix axis for QuantizeLinear inserted after DQ (per-channel) -> Unsqueeze (#21793 )	2024-08-20 16:26:02 -07:00
unittest_main
util	[MIGraphX EP] Add support for MIGraphX Exhaustive tune flag (#46 ) (#21599 )	2024-08-21 07:32:12 -07:00
wasm
win_getopt
xctest
run_benchmark.py
run_benchmark.readme.md