onnxruntime/tools/ci_build/github/azure-pipelines
Tianlei Wu 72186bbb71
[CUDA] Build nhwc ops by default (#22648)
### Description

* Build cuda nhwc ops by default.
* Deprecate `--enable_cuda_nhwc_ops` in build.py and add
`--disable_cuda_nhwc_ops` option

Note that it requires cuDNN 9.x. If you build with cuDNN 8, NHWC ops
will be disabled automatically.

### Motivation and Context

In general, NHWC is faster than NCHW for convolution in Nvidia GPUs with
Tensor Cores, and this could improve performance for vision models.

This is the first step to prefer NHWC for CUDA in 1.21 release. Next
step is to do some tests on popular vision models. If it help in most
models and devices, set `prefer_nhwc=1` as default cuda provider option.
2024-11-06 09:54:55 -08:00
..
nodejs/templates Update pool to MacOS-13 (#17361) 2024-09-17 10:07:30 -07:00
nuget [TensorRT EP] Refactor TRT version update logic & apply TRT 10.5 (#22483) 2024-10-29 09:23:41 -07:00
stages Enable CUDA Python Test (#22717) 2024-11-05 16:26:50 -08:00
templates support Qnn 2 28 (#22724) 2024-11-05 15:41:15 -08:00
triggers
android-arm64-v8a-QNN-crosscompile-ci-pipeline.yml support Qnn 2 28 (#22724) 2024-11-05 15:41:15 -08:00
android-x86_64-crosscompile-ci-pipeline.yml Update pool to MacOS-13 (#17361) 2024-09-17 10:07:30 -07:00
bigmodels-ci-pipeline.yml [CUDA] Build nhwc ops by default (#22648) 2024-11-06 09:54:55 -08:00
binary-size-checks-pipeline.yml
build-perf-test-binaries-pipeline.yml Refactor cuda packaging pipeline (#22542) 2024-10-23 08:14:10 -07:00
c-api-noopenmp-packaging-pipelines.yml support Qnn 2 28 (#22724) 2024-11-05 15:41:15 -08:00
c-api-training-packaging-pipelines.yml
cuda-packaging-pipeline.yml [TensorRT EP] Refactor TRT version update logic & apply TRT 10.5 (#22483) 2024-10-29 09:23:41 -07:00
linux-ci-pipeline.yml Update DNNL CI python to 310 (#22691) 2024-11-05 09:14:48 -08:00
linux-cpu-minimal-build-ci-pipeline.yml
linux-dnnl-ci-pipeline.yml Update DNNL CI python to 310 (#22691) 2024-11-05 09:14:48 -08:00
linux-gpu-ci-pipeline.yml Update CMake (#22516) 2024-10-21 07:51:05 -07:00
linux-gpu-tensorrt-ci-pipeline.yml [TensorRT EP] Refactor TRT version update logic & apply TRT 10.5 (#22483) 2024-10-29 09:23:41 -07:00
linux-gpu-tensorrt-daily-perf-pipeline.yml [TensorRT EP] Refactor TRT version update logic & apply TRT 10.5 (#22483) 2024-10-29 09:23:41 -07:00
linux-migraphx-ci-pipeline.yml [ROCm] Python 3.10 in ROCm CI, and ROCm 6.2.3 in MigraphX CI (#22527) 2024-10-25 11:47:16 -07:00
linux-openvino-ci-pipeline.yml Memory Optimization for Compilation in OVEP (#21872) 2024-09-03 13:52:31 -07:00
linux-qnn-ci-pipeline.yml support Qnn 2 28 (#22724) 2024-11-05 15:41:15 -08:00
linux-rocm-ci-pipeline.yml [ROCm] Python 3.10 in ROCm CI, and ROCm 6.2.3 in MigraphX CI (#22527) 2024-10-25 11:47:16 -07:00
mac-ci-pipeline.yml Add a reminder in set-trigger-rules script (#21929) 2024-08-30 12:18:10 -07:00
mac-coreml-ci-pipeline.yml Update pool to MacOS-13 (#17361) 2024-09-17 10:07:30 -07:00
mac-ios-ci-pipeline.yml Specify iOS simulator runtime version (#22474) 2024-10-18 09:26:06 -07:00
mac-ios-packaging-pipeline.yml Update pool to MacOS-13 (#17361) 2024-09-17 10:07:30 -07:00
mac-react-native-ci-pipeline.yml Re-enable codesign for maven packages (#22308) 2024-10-04 14:30:17 -07:00
npm-packaging-pipeline.yml Re-enable codesign for maven packages (#22308) 2024-10-04 14:30:17 -07:00
nuget-cuda-publishing-pipeline.yml
nuget-windows-ai.yml Nuget Windows AI Pipeline, Disable SDL Submodules. (#22711) 2024-11-04 08:39:28 -08:00
post-merge-jobs.yml [TensorRT EP] Refactor TRT version update logic & apply TRT 10.5 (#22483) 2024-10-29 09:23:41 -07:00
publish-nuget.yml
py-cuda-alt-package-test-pipeline.yml Adding new Python package testing pipeline for Cuda Alt (#22584) 2024-10-24 19:24:53 -07:00
py-cuda-alt-packaging-pipeline.yml Add pipauth to more ADO pipelines and enable CSV (#22612) 2024-10-28 16:39:22 -07:00
py-cuda-package-test-pipeline.yml [TensorRT EP] Refactor TRT version update logic & apply TRT 10.5 (#22483) 2024-10-29 09:23:41 -07:00
py-cuda-packaging-pipeline.yml Enable CUDA Python Test (#22717) 2024-11-05 16:26:50 -08:00
py-cuda-publishing-pipeline.yml
py-dml-packaging-pipeline.yml Add pipauth to more ADO pipelines and enable CSV (#22612) 2024-10-28 16:39:22 -07:00
py-package-build-pipeline.yml
py-package-test-pipeline.yml Adding new Python package testing pipeline for Cuda Alt (#22584) 2024-10-24 19:24:53 -07:00
py-packaging-pipeline.yml support Qnn 2 28 (#22724) 2024-11-05 15:41:15 -08:00
qnn-ep-nuget-packaging-pipeline.yml support Qnn 2 28 (#22724) 2024-11-05 15:41:15 -08:00
rocm-nuget-packaging-pipeline.yml update pipline python version from 3.8 to 3.12 (#22517) 2024-10-21 07:50:31 -07:00
rocm-publish-nuget-pipeline.yml New rocm nuget publish pipeline (#22418) 2024-10-13 08:30:06 +08:00
web-ci-pipeline.yml Add a reminder in set-trigger-rules script (#21929) 2024-08-30 12:18:10 -07:00
win-ci-fuzz-testing.yml Update Node.js version from 18.x to 20.x in CI pipelines (#22576) 2024-10-24 07:34:42 -07:00
win-ci-pipeline.yml Remove training pipelines from Win CPI CI as redundant (#22190) 2024-09-23 18:15:41 -07:00
win-gpu-cuda-ci-pipeline.yml Add a reminder in set-trigger-rules script (#21929) 2024-08-30 12:18:10 -07:00
win-gpu-dml-ci-pipeline.yml Add a reminder in set-trigger-rules script (#21929) 2024-08-30 12:18:10 -07:00
win-gpu-doc-gen-ci-pipeline.yml Add a reminder in set-trigger-rules script (#21929) 2024-08-30 12:18:10 -07:00
win-gpu-reduce-op-ci-pipeline.yml
win-gpu-tensorrt-ci-pipeline.yml [TensorRT EP] Refactor TRT version update logic & apply TRT 10.5 (#22483) 2024-10-29 09:23:41 -07:00
win-gpu-training-ci-pipeline.yml Add a reminder in set-trigger-rules script (#21929) 2024-08-30 12:18:10 -07:00
win-gpu-webgpu-ci-pipeline.yml Add implementation of WebGPU EP (#22591) 2024-10-29 18:29:40 -07:00
win-qnn-arm64-ci-pipeline.yml support Qnn 2 28 (#22724) 2024-11-05 15:41:15 -08:00
win-qnn-ci-pipeline.yml support Qnn 2 28 (#22724) 2024-11-05 15:41:15 -08:00