onnxruntime/tools/ci_build/github/azure-pipelines
Tianlei Wu 72186bbb71
[CUDA] Build nhwc ops by default (#22648)
### Description

* Build cuda nhwc ops by default.
* Deprecate `--enable_cuda_nhwc_ops` in build.py and add
`--disable_cuda_nhwc_ops` option

Note that it requires cuDNN 9.x. If you build with cuDNN 8, NHWC ops
will be disabled automatically.

### Motivation and Context

In general, NHWC is faster than NCHW for convolution in Nvidia GPUs with
Tensor Cores, and this could improve performance for vision models.

This is the first step to prefer NHWC for CUDA in 1.21 release. Next
step is to do some tests on popular vision models. If it help in most
models and devices, set `prefer_nhwc=1` as default cuda provider option.
2024-11-06 09:54:55 -08:00
..
nodejs/templates Update pool to MacOS-13 (#17361) 2024-09-17 10:07:30 -07:00
nuget [TensorRT EP] Refactor TRT version update logic & apply TRT 10.5 (#22483) 2024-10-29 09:23:41 -07:00
stages Enable CUDA Python Test (#22717) 2024-11-05 16:26:50 -08:00
templates support Qnn 2 28 (#22724) 2024-11-05 15:41:15 -08:00
triggers
android-arm64-v8a-QNN-crosscompile-ci-pipeline.yml support Qnn 2 28 (#22724) 2024-11-05 15:41:15 -08:00
android-x86_64-crosscompile-ci-pipeline.yml Update pool to MacOS-13 (#17361) 2024-09-17 10:07:30 -07:00
bigmodels-ci-pipeline.yml [CUDA] Build nhwc ops by default (#22648) 2024-11-06 09:54:55 -08:00
binary-size-checks-pipeline.yml Clean up some mobile package related files and their usages. (#21606) 2024-08-05 16:38:20 -07:00
build-perf-test-binaries-pipeline.yml Refactor cuda packaging pipeline (#22542) 2024-10-23 08:14:10 -07:00
c-api-noopenmp-packaging-pipelines.yml support Qnn 2 28 (#22724) 2024-11-05 15:41:15 -08:00
c-api-training-packaging-pipelines.yml Move on-device training packages publish step (#21539) 2024-07-29 09:59:46 -07:00
cuda-packaging-pipeline.yml [TensorRT EP] Refactor TRT version update logic & apply TRT 10.5 (#22483) 2024-10-29 09:23:41 -07:00
linux-ci-pipeline.yml Update DNNL CI python to 310 (#22691) 2024-11-05 09:14:48 -08:00
linux-cpu-minimal-build-ci-pipeline.yml Update training packaging pipeline's docker files (#20853) 2024-05-30 23:48:42 -07:00
linux-dnnl-ci-pipeline.yml Update DNNL CI python to 310 (#22691) 2024-11-05 09:14:48 -08:00
linux-gpu-ci-pipeline.yml Update CMake (#22516) 2024-10-21 07:51:05 -07:00
linux-gpu-tensorrt-ci-pipeline.yml [TensorRT EP] Refactor TRT version update logic & apply TRT 10.5 (#22483) 2024-10-29 09:23:41 -07:00
linux-gpu-tensorrt-daily-perf-pipeline.yml [TensorRT EP] Refactor TRT version update logic & apply TRT 10.5 (#22483) 2024-10-29 09:23:41 -07:00
linux-migraphx-ci-pipeline.yml [ROCm] Python 3.10 in ROCm CI, and ROCm 6.2.3 in MigraphX CI (#22527) 2024-10-25 11:47:16 -07:00
linux-openvino-ci-pipeline.yml Memory Optimization for Compilation in OVEP (#21872) 2024-09-03 13:52:31 -07:00
linux-qnn-ci-pipeline.yml support Qnn 2 28 (#22724) 2024-11-05 15:41:15 -08:00
linux-rocm-ci-pipeline.yml [ROCm] Python 3.10 in ROCm CI, and ROCm 6.2.3 in MigraphX CI (#22527) 2024-10-25 11:47:16 -07:00
mac-ci-pipeline.yml Add a reminder in set-trigger-rules script (#21929) 2024-08-30 12:18:10 -07:00
mac-coreml-ci-pipeline.yml Update pool to MacOS-13 (#17361) 2024-09-17 10:07:30 -07:00
mac-ios-ci-pipeline.yml Specify iOS simulator runtime version (#22474) 2024-10-18 09:26:06 -07:00
mac-ios-packaging-pipeline.yml Update pool to MacOS-13 (#17361) 2024-09-17 10:07:30 -07:00
mac-react-native-ci-pipeline.yml Re-enable codesign for maven packages (#22308) 2024-10-04 14:30:17 -07:00
npm-packaging-pipeline.yml Re-enable codesign for maven packages (#22308) 2024-10-04 14:30:17 -07:00
nuget-cuda-publishing-pipeline.yml Set CUDA12 as default in GPU packages (#21438) 2024-07-25 10:17:16 -07:00
nuget-windows-ai.yml Nuget Windows AI Pipeline, Disable SDL Submodules. (#22711) 2024-11-04 08:39:28 -08:00
post-merge-jobs.yml [TensorRT EP] Refactor TRT version update logic & apply TRT 10.5 (#22483) 2024-10-29 09:23:41 -07:00
publish-nuget.yml Move on-device training packages publish step (#21539) 2024-07-29 09:59:46 -07:00
py-cuda-alt-package-test-pipeline.yml Adding new Python package testing pipeline for Cuda Alt (#22584) 2024-10-24 19:24:53 -07:00
py-cuda-alt-packaging-pipeline.yml Add pipauth to more ADO pipelines and enable CSV (#22612) 2024-10-28 16:39:22 -07:00
py-cuda-package-test-pipeline.yml [TensorRT EP] Refactor TRT version update logic & apply TRT 10.5 (#22483) 2024-10-29 09:23:41 -07:00
py-cuda-packaging-pipeline.yml Enable CUDA Python Test (#22717) 2024-11-05 16:26:50 -08:00
py-cuda-publishing-pipeline.yml Set CUDA12 as default in GPU packages (#21438) 2024-07-25 10:17:16 -07:00
py-dml-packaging-pipeline.yml Add pipauth to more ADO pipelines and enable CSV (#22612) 2024-10-28 16:39:22 -07:00
py-package-build-pipeline.yml OpenVINO EP Rel 1.18 Changes (#20337) 2024-04-19 00:31:38 -07:00
py-package-test-pipeline.yml Adding new Python package testing pipeline for Cuda Alt (#22584) 2024-10-24 19:24:53 -07:00
py-packaging-pipeline.yml support Qnn 2 28 (#22724) 2024-11-05 15:41:15 -08:00
qnn-ep-nuget-packaging-pipeline.yml support Qnn 2 28 (#22724) 2024-11-05 15:41:15 -08:00
rocm-nuget-packaging-pipeline.yml update pipline python version from 3.8 to 3.12 (#22517) 2024-10-21 07:50:31 -07:00
rocm-publish-nuget-pipeline.yml New rocm nuget publish pipeline (#22418) 2024-10-13 08:30:06 +08:00
web-ci-pipeline.yml Add a reminder in set-trigger-rules script (#21929) 2024-08-30 12:18:10 -07:00
win-ci-fuzz-testing.yml Update Node.js version from 18.x to 20.x in CI pipelines (#22576) 2024-10-24 07:34:42 -07:00
win-ci-pipeline.yml Remove training pipelines from Win CPI CI as redundant (#22190) 2024-09-23 18:15:41 -07:00
win-gpu-cuda-ci-pipeline.yml Add a reminder in set-trigger-rules script (#21929) 2024-08-30 12:18:10 -07:00
win-gpu-dml-ci-pipeline.yml Add a reminder in set-trigger-rules script (#21929) 2024-08-30 12:18:10 -07:00
win-gpu-doc-gen-ci-pipeline.yml Add a reminder in set-trigger-rules script (#21929) 2024-08-30 12:18:10 -07:00
win-gpu-reduce-op-ci-pipeline.yml Move jobs in onnxruntime-Win2022-GPU-T4 machine pool to onnxruntime-Win2022-GPU-A10 (#21023) 2024-06-12 22:04:40 -07:00
win-gpu-tensorrt-ci-pipeline.yml [TensorRT EP] Refactor TRT version update logic & apply TRT 10.5 (#22483) 2024-10-29 09:23:41 -07:00
win-gpu-training-ci-pipeline.yml Add a reminder in set-trigger-rules script (#21929) 2024-08-30 12:18:10 -07:00
win-gpu-webgpu-ci-pipeline.yml Add implementation of WebGPU EP (#22591) 2024-10-29 18:29:40 -07:00
win-qnn-arm64-ci-pipeline.yml support Qnn 2 28 (#22724) 2024-11-05 15:41:15 -08:00
win-qnn-ci-pipeline.yml support Qnn 2 28 (#22724) 2024-11-05 15:41:15 -08:00