onnxruntime/tools/ci_build
Yulong Wang 7a8fa12850
Add implementation of WebGPU EP (#22591)
### Description

This PR adds the actual implementation of the WebGPU EP based on
https://github.com/microsoft/onnxruntime/pull/22318.

This change includes the following:

<details>
<summary><b>core framework of WebGPU EP</b></summary>

  - WebGPU EP factory classes for:
    - handling WebGPU options
    - creating WebGPU EP instance
    - creating WebGPU context
  - WebGPU Execution Provider classes
    - GPU Buffer allocator
    - data transfer
  - Buffer management classes
    - Buffer Manager
    - BufferCacheManager
      - DisabledCacheManager
      - SimpleCacheManager
      - LazyReleaseCacheManager
      - BucketCacheManager
  - Program classes
    - Program (base)
    - Program Cache Key
    - Program Manager
  - Shader helper classes
    - Shader Helper
    - ShaderIndicesHelper
    - ShaderVariableHelper
  - Utils
    - GPU Query based profiler
    - compute context
    - string utils
  - Miscs
    - Python binding webgpu support (basic)
 
</details>

<details>
<summary><b>Kernel implementation</b></summary>


  - onnx.ai (default opset):
- Elementwise (math): Abs, Neg, Floor, Ceil, Reciprocal, Sqrt, Exp, Erf,
Log, Sin, Cos, Tan, Asin, Acos, Atan, Sinh, Cosh, Asinh, Acosh, Atanh,
Tanh, Not, Cast
- Elementwise (activation): Sigmoid, HardSigmoid, Clip, Elu, Relu,
LeakyRelu, ThresholdedRelu, Gelu
- Binary (math): Add, Sub, Mul, Div, Pow, Equal, Greater,
GreaterOrEqual, Less, LessOrEqual
    - (Tensors): Shape, Reshape, Squeeze, Unsqueeze
    - Where
    - Transpose
    - Concat
    - Expand
    - Gather
    - Tile
    - Range
    - LayerNormalization
  - com.microsoft
    - FastGelu
    - MatMulNBits
    - MultiHeadAttention
    - RotaryEmbedding
    - SkipLayerNormalization
    - LayerNormalization
    - SimplifiedLayerNormalization
    - SkipSimplifiedLayerNormalization

</details>

<details>
<summary><b>Build, test and CI pipeline integration</b></summary>

  - build works for Windows, macOS and iOS
  - support onnxruntime_test_all and python node test
  - added a new unit test for `--use_external_dawn` build flag.
  - updated MacOS pipeline to build with WebGPU support
  - added a new pipeline for WebGPU Windows

</details>

This change does not include:

- Node.js binding support for WebGPU (will be a separate PR)
2024-10-29 18:29:40 -07:00
..
github Add implementation of WebGPU EP (#22591) 2024-10-29 18:29:40 -07:00
requirements Replace inline pip install with pip install from requirements*.txt (#21106) 2024-07-22 12:39:10 -07:00
__init__.py
amd_hipify.py fix issue when build with hipblasLt on rocm6.1 (#22553) 2024-10-28 13:57:08 +08:00
build.py Add implementation of WebGPU EP (#22591) 2024-10-29 18:29:40 -07:00
compile_triton.py
coverage.py
gen_def.py Initial WebGPU EP checkin (#22318) 2024-10-08 16:10:46 -07:00
get_docker_image.py Fix docker image layer caching to avoid redundant docker building and transient connection exceptions. (#21612) 2024-08-06 21:37:09 +08:00
logger.py
op_registration_utils.py
op_registration_validator.py
patch_manylinux.py
policheck_exclusions.xml
reduce_op_kernels.py Update ruff and clang-format versions (#21479) 2024-07-24 11:50:11 -07:00
replace_urls_in_deps.py Update ruff and clang-format versions (#21479) 2024-07-24 11:50:11 -07:00
set-trigger-rules.py Initial WebGPU EP checkin (#22318) 2024-10-08 16:10:46 -07:00
update_tsaoptions.py
upload_python_package_to_azure_storage.py Update ruff and clang-format versions (#21479) 2024-07-24 11:50:11 -07:00