onnxruntime/onnxruntime
Jeff Daily 07317316cc
CUDA EP vs ROCM EP hipify audit (#17776)
Migrate most CUDA EP improvements and changes to the ROCM EP. The process
involves running hipify against all CUDA EP files (i.e. without excluding
any files from onnxruntime_rocm_hipify.cmake), comparing the output with
vimdiff against the ROCM EP files that are under source control, and
pulling in most changes. The changes are functional as well as
formatting-related. They make comparing the CUDA EP and ROCM EP easier,
though the formatting changes make the PR diff somewhat less obvious.
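
The comparison step described above was done by hand with vimdiff. As a
rough illustration only, the sketch below shows one way to list which
hipified files differ from their checked-in counterparts before opening
them in a diff tool. It assumes the hipify output has already been written
to a separate directory. The function name changed_files and both
directory arguments are hypothetical and are not part of the onnxruntime
build.

```python
import difflib
from pathlib import Path


def changed_files(hipified_root, tracked_root):
    """Map each relative path to its unified diff (tracked -> hipified).

    hipified_root: directory holding freshly generated hipify output
                   (hypothetical location, chosen by the caller).
    tracked_root:  directory holding the files under source control.
    Files that are identical in both trees are left out of the result.
    """
    hipified_root = Path(hipified_root)
    tracked_root = Path(tracked_root)
    diffs = {}
    for generated in sorted(hipified_root.rglob("*")):
        if not generated.is_file():
            continue
        rel = generated.relative_to(hipified_root)
        tracked = tracked_root / rel
        if not tracked.is_file():
            # Generated file has no checked-in counterpart yet.
            diffs[rel.as_posix()] = ["(no tracked counterpart)"]
            continue
        old = tracked.read_text(encoding="utf-8", errors="replace").splitlines()
        new = generated.read_text(encoding="utf-8", errors="replace").splitlines()
        delta = list(
            difflib.unified_diff(
                old,
                new,
                fromfile="tracked/" + rel.as_posix(),
                tofile="hipified/" + rel.as_posix(),
                lineterm="",
            )
        )
        if delta:
            diffs[rel.as_posix()] = delta
    return diffs
```

Calling changed_files with the two directories returns only the paths
that need review; each of those can then be inspected manually, as in the
audit described above.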

- hipify audit of onnxruntime/core/providers/rocm, enable ops
  - Loop
  - Scan
- hipify audit of onnxruntime/contrib_ops/rocm
- fix contrib ops search implementation
- enable more contrib ops
  - Affine
  - ComplexMul
  - ConvTransposeWithDynamicPads
  - Crop
  - DynamicSlice
  - FFT [Rfft, Irfft]
  - GreedySearch
  - ImageScaler
  - ParametricSoftplus
  - ScaledTanh
  - ThresholdRelu

---------

Co-authored-by: cloudhan <cloudhan@outlook.com>
2023-10-13 10:13:53 +08:00
contrib_ops CUDA EP vs ROCM EP hipify audit (#17776) 2023-10-13 10:13:53 +08:00
core CUDA EP vs ROCM EP hipify audit (#17776) 2023-10-13 10:13:53 +08:00
python Fix GroupNorm fusion: skip if num of channels not supported (#17869) 2023-10-11 22:45:22 -07:00
test CUDA EP vs ROCM EP hipify audit (#17776) 2023-10-13 10:13:53 +08:00
tool/etw
wasm [js/webgpu] support IO binding (#17480) 2023-09-29 11:24:42 -07:00
__init__.py Python API to check whether collective ops are available or not (#17730) 2023-09-29 14:11:05 -07:00
ReformatSource.ps1
ReformatSourcePython.bat
VSCodeCoverage.runsettings