onnxruntime/onnxruntime/test
Adrian Lizarraga 514b4699b4
[QNN EP] Apply workaround for Conv validation bug when bias input is implicit (#21764)
### Description
- Adds a dummy bias of all zeros when translating a Conv without an
explicit bias input. This works around a QNN validation bug in which
validation fails when the optional bias input is not provided.
- Corrects the unpacking logic for **non-zero int4** zero-points. The bug
does not affect current models because only int4 zero-points equal to 0
(symmetric quantization) are supported today, but it would surface if QNN
adds support for non-zero int4 zero-points, so it is worth fixing now.
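
As an illustration of the two items above, here is a minimal Python sketch. It is not the QNN EP code: the helper names `zero_bias_for_conv` and `unpack_int4` are hypothetical, and the sketch assumes the ONNX conventions that a Conv weight has shape `[M, C/group, kH, kW]` with a bias of length `M`, and that packed int4 data stores the first element in the low nibble of each byte.

```python
from typing import List, Sequence


def zero_bias_for_conv(weight_shape: Sequence[int]) -> List[int]:
    """Build an all-zero bias for a Conv whose bias input was omitted.

    Assumes the ONNX Conv layout, where weight_shape[0] is the number of
    output channels M and the bias is a 1-D tensor of length M.
    """
    out_channels = weight_shape[0]
    return [0] * out_channels


def unpack_int4(packed: bytes, count: int, signed: bool = True) -> List[int]:
    """Unpack `count` 4-bit values from bytes holding two values each.

    Assumes the ONNX packing order: even-indexed elements live in the low
    nibble, odd-indexed elements in the high nibble. Signed values are
    sign-extended from 4 bits, so the range is [-8, 7].
    """
    values = []
    for i in range(count):
        byte = packed[i // 2]
        nibble = (byte >> 4) & 0x0F if i % 2 else byte & 0x0F
        if signed and nibble >= 8:
            nibble -= 16  # sign-extend: 0b1000..0b1111 map to -8..-1
        values.append(nibble)
    return values


if __name__ == "__main__":
    print(zero_bias_for_conv((8, 3, 3, 3)))
    print(unpack_int4(bytes([0x87, 0x01]), 3))
```

Adding a zero bias leaves the Conv output unchanged, which is why it is a safe workaround. The sign-extension step only matters for non-zero values, which is consistent with the bug being invisible while all supported int4 zero-points are 0.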



### Motivation and Context
Support Conv operators without a bias input on QNN EP with the latest
QNN SDK.
2024-08-22 10:38:03 -07:00
..
| Name | Last commit | Date |
| --- | --- | --- |
| common | add transform part of the dq matmul tool chain (#21374) | 2024-07-19 22:55:15 -07:00 |
| contrib_ops | [CUDA] cuDNN Flash Attention (#21629) | 2024-08-20 08:50:22 -07:00 |
| cuda_host | | |
| custom_op_registration | | |
| debug_node_inputs_outputs | | |
| flatbuffers | Enablement of onnxruntime for AIX and fixing issues related to big-endian platform. (#21133) | 2024-07-17 12:37:06 -07:00 |
| framework | Fail tests gracefully for the minimal cuda build (#21391) | 2024-08-02 18:27:36 -07:00 |
| fuzzing | Fix typos according to reviewdog report. (#21335) | 2024-07-22 13:37:32 -07:00 |
| global_thread_pools | Remove references to the outdated CUDA EP factory method (#21549) | 2024-07-29 21:59:16 -07:00 |
| ir | Fix typos according to reviewdog report. (#21335) | 2024-07-22 13:37:32 -07:00 |
| logging_apis | | |
| mlas | Mlas int4 int8 with avx2/512 (#20687) | 2024-08-02 10:20:22 -07:00 |
| onnx | [QNN EP] Add support for GatherElements (#15966) | 2024-08-19 14:33:40 -07:00 |
| opaque_api | | |
| optimizer | [TransposeOptimizer] Fix axis for QuantizeLinear inserted after DQ (per-channel) -> Unsqueeze (#21793) | 2024-08-20 16:26:02 -07:00 |
| perftest | OVEP - PR 1.19 (#21443) | 2024-07-24 23:45:31 -07:00 |
| platform | Fix typos according to reviewdog report. (#21335) | 2024-07-22 13:37:32 -07:00 |
| proto | | |
| providers | [QNN EP] Apply workaround for Conv validation bug when bias input is implicit (#21764) | 2024-08-22 10:38:03 -07:00 |
| python | [CUDA] Update benchmark_mha.py to capture debug info to identify sdpa kernel (#21804) | 2024-08-21 17:30:16 -07:00 |
| quantization | | |
| shared_lib | Remove references to the outdated CUDA EP factory method (#21549) | 2024-07-29 21:59:16 -07:00 |
| testdata | [TransposeOptimizer] Fix axis for QuantizeLinear inserted after DQ (per-channel) -> Unsqueeze (#21793) | 2024-08-20 16:26:02 -07:00 |
| unittest_main | | |
| util | [MIGraphX EP] Add support for MIGraphX Exhaustive tune flag (#46) (#21599) | 2024-08-21 07:32:12 -07:00 |
| wasm | | |
| win_getopt | | |
| xctest | | |
| run_benchmark.py | | |
| run_benchmark.readme.md | | |