onnxruntime/tools/ci_build
Preetha Veeramalai 6baaaf5165
OVEP options to disable CPU fallback at compile time (#21166)
### Description
Provide user level options to control the fallback on CPU for models not
supported on Intel's NPU hardware.


### Motivation and Context
- Current workflow of OVEP allows safe fallback from OV NPU to OV CPU on
compilation failures. Also supports MLAS CPU fallback in presence of
unsupported custom ops.
- The PR provides a build-time option to disable fallback from OV NPU to
OV CPU.
- The session Option "kOrtSessionOptionsDisableCPUEPFallback" disables
OV CPU and MLAS CPU fallback.
- Also has bug fix for proto creation.

---------

Co-authored-by: jatinwadhwa921 <jatin.wadhwa@intel.com>
Co-authored-by: ankitm3k <ankit.maheshkar@intel.com>
2024-06-28 08:31:02 -07:00
..
github Add FP32 and INT4 test in Llama2 (#21187) 2024-06-28 06:18:26 +08:00
__init__.py
amd_hipify.py
build.py OVEP options to disable CPU fallback at compile time (#21166) 2024-06-28 08:31:02 -07:00
clean_docker_image_cache.py Bump ruff to 0.3.2 and black to 24 (#19878) 2024-03-13 10:00:32 -07:00
compile_triton.py
coverage.py
gen_def.py
get_docker_image.py Bump ruff to 0.3.2 and black to 24 (#19878) 2024-03-13 10:00:32 -07:00
logger.py
op_registration_utils.py Bump ruff to 0.3.2 and black to 24 (#19878) 2024-03-13 10:00:32 -07:00
op_registration_validator.py Bump ruff to 0.3.2 and black to 24 (#19878) 2024-03-13 10:00:32 -07:00
patch_manylinux.py
policheck_exclusions.xml
reduce_op_kernels.py
replace_urls_in_deps.py
requirements-transformers-test.txt test: refactor flash_attn tests to use parameterized (#20913) 2024-06-11 15:57:20 -07:00
set-trigger-rules.py
update_tsaoptions.py
upload_python_package_to_azure_storage.py