onnxruntime/tools/ci_build/github
Ye Wang f35dd1407f
custom allreduce cuda kernel (#20703)
### Description

Conditionally route to a custom AllReduce kernel when the buffer size and
GPU count meet certain requirements. Otherwise, keep using NCCL's
AllReduce.

### Motivation and Context

---------

Co-authored-by: Ye Wang <wangye@microsoft.com@h100vm-ort.kxelwkzfzxguje5bxvwxxs135a.gvxx.internal.cloudapp.net>
Co-authored-by: Your Name <you@example.com>
2024-06-13 11:09:49 -07:00
..
android Remove deprecated "mobile" packages (#20941) 2024-06-07 16:20:32 -05:00
apple Remove deprecated "mobile" packages (#20941) 2024-06-07 16:20:32 -05:00
azure-pipelines custom allreduce cuda kernel (#20703) 2024-06-13 11:09:49 -07:00
js Remove deprecated "mobile" packages (#20941) 2024-06-07 16:20:32 -05:00
linux Updating cudnn from 8 to 9 on exsiting cuda 12 docker image (#20925) 2024-06-11 09:37:16 -07:00
pai fix rocm ci pipeline (#19525) 2024-02-15 00:02:08 -08:00
windows Fix Nuget Cuda pipeline package pipeline (#20741) 2024-05-24 09:15:57 -07:00
Doxyfile_csharp.cfg [C#] Rename unreleased API, add utilities (#16806) 2023-08-02 10:06:42 -07:00