onnxruntime/tools/ci_build
Patrice Vignola 76434907fb
[DML EP] Add graph capture (#20257)
This adds a new "Graph Capture" option to the DML ep, similar to the
cuda graph functionality. Here's how graph capture works:

- A user can enable graph capture in the session options by setting
`ep.dml.enable_graph_capture` to `true`
- When they want to capture a run, they set `gpu_graph_id` in their
`RunOptions` to a number bigger than 0 (0 is reserved for internal use
according to the cuda graph documentation).
- Then, when they start the inference, the graph will be captured and
stored in the DML EP for future use
- When they execute the run for a second time with the same id, the
`ReplayGraph` function in the DML EP will be called instead of executing
the kernels, resulting in very low overhead and avoiding kernel
recompilation.

This feature can give up-to-par or even better performance than
specifying the static dimensions at session creation time, but is also
much more flexible.
2024-04-18 10:15:00 -07:00
..
github More fixes on random connection excepiton in Mac Build. (#20328) 2024-04-17 08:37:56 +08:00
__init__.py
amd_hipify.py [ROCm] Add SkipGroupNorm for ROCm EP (#19303) 2024-02-21 11:08:48 +08:00
build.py [DML EP] Add graph capture (#20257) 2024-04-18 10:15:00 -07:00
clean_docker_image_cache.py Bump ruff to 0.3.2 and black to 24 (#19878) 2024-03-13 10:00:32 -07:00
compile_triton.py
coverage.py
gen_def.py
get_docker_image.py Bump ruff to 0.3.2 and black to 24 (#19878) 2024-03-13 10:00:32 -07:00
logger.py
op_registration_utils.py Bump ruff to 0.3.2 and black to 24 (#19878) 2024-03-13 10:00:32 -07:00
op_registration_validator.py Bump ruff to 0.3.2 and black to 24 (#19878) 2024-03-13 10:00:32 -07:00
patch_manylinux.py
policheck_exclusions.xml
reduce_op_kernels.py
replace_urls_in_deps.py
requirements-transformers-test.txt Bump transformers from 4.36.0 to 4.38.0 in /tools/ci_build (#20272) 2024-04-16 14:21:12 -07:00
set-trigger-rules.py Add VP test in Stable diffusion pipeline (#19300) 2024-01-29 09:33:58 -08:00
update_tsaoptions.py
upload_python_package_to_azure_storage.py