onnxruntime/cmake/external
PeixuanZuo 4eac0db3af
[ROCm] Add GemmFastGelu CK implementation (#13759)
### Description
<!-- Describe your changes. -->

Add GemmFastGelu CK implementation.

TODO 
1. The performance of CK GemmFastGelu in ORT is not good as using CK
directly, still need to investigate the reason and improve the CK in
ORT.
`GemmFastGeluUnfused float16 NN m=49152 n=3072 k=768 2298.8064 us 100.89
tflops`
`withbias DeviceGemmMultipleD_Xdl_CShuffle<256, 256, 128, 32, 8, 8,
Default> LoopScheduler: Default, PipelineVersion: v1 float16 NN m=49152
n=3072 k=768 2401.9799 us 96.56 tflops`

### Motivation and Context
<!-- - Why is this change required? What problem does it solve?
- If it fixes an open issue, please link to the issue here. -->

Co-authored-by: peixuanzuo <peixuanzuo@linmif39a000004.zvflicr54joexhdgnhvmxrxygg.phxx.internal.cloudapp.net>
2023-01-05 17:53:30 +08:00
..
eigen@d10b27fe37 Downgrade Eigen (#8817) 2021-08-23 18:06:23 -07:00
emsdk@c220895fd1 revert cmake external file (#13459) 2022-10-26 11:38:15 -07:00
libprotobuf-mutator@7a2ed51a6b Onnxruntime fuzzing (#4341) 2020-07-06 16:34:34 -07:00
onnx@5a5f8a5935 revert cmake external file (#13459) 2022-10-26 11:38:15 -07:00
onnxruntime-extensions@d4b2aff0c8 Enable linking in exception throwing support library when build onnxruntime wasm. (#8973) 2021-09-10 22:09:16 +08:00
protobuf@a902b39270 revert cmake external file (#13459) 2022-10-26 11:38:15 -07:00
abseil-cpp.cmake Let Cmake decide where to place abseil (#14057) 2022-12-23 12:08:13 -08:00
abseil-cpp.natvis Update absl to the latest release (#13990) 2022-12-19 14:25:13 -08:00
composable_kernel.cmake [ROCm] Add GemmFastGelu CK implementation (#13759) 2023-01-05 17:53:30 +08:00
dml.cmake [DML EP] Upgrade DML to 1.10.0 (#13796) 2022-11-30 21:32:14 -08:00
dnnl.cmake [oneDNN ep] Update from oneDNN v2.7.0 to oneDNN v2.7.1 (#13536) 2022-11-02 15:57:49 -07:00
eigen.cmake Improve dependency management (#13523) 2022-12-01 09:51:59 -08:00
extensions.cmake Improve dependency management (#13523) 2022-12-01 09:51:59 -08:00
find_snpe.cmake Improve dependency management (#13523) 2022-12-01 09:51:59 -08:00
FindNumPy.cmake Initial bootstrap commit. 2018-11-19 16:48:22 -08:00
helper_functions.cmake Improve dependency management (#13523) 2022-12-01 09:51:59 -08:00
ipp-crypto.cmake [TVM EP] Hot fix of build on Windows of TVM EP with ipp-crypto (#12381) 2022-07-31 14:36:54 +02:00
mimalloc.cmake Improve dependency management (#13523) 2022-12-01 09:51:59 -08:00
onnx_minimal.cmake Improve dependency management (#13523) 2022-12-01 09:51:59 -08:00
onnx_protobuf.natvis Natvis adjustments to make debugging bearable (#13237) 2022-10-10 10:06:55 -07:00
onnxruntime_external_deps.cmake CloudEP (#13855) 2023-01-03 10:03:15 -08:00
protobuf_function.cmake Improve dependency management (#13523) 2022-12-01 09:51:59 -08:00
pybind11.cmake Improve dependency management (#13523) 2022-12-01 09:51:59 -08:00
pyxir.cmake Check for Python_EXECUTABLE in pyxir.cmake to fix Vitis AI EP build (#8631) 2021-08-24 08:39:50 -07:00
triton.cmake CloudEP (#13855) 2023-01-03 10:03:15 -08:00
tvm.cmake [tvm] Add support for int8 models, update TVM revision (#13519) 2022-11-08 11:28:32 -08:00
wil.cmake Improve dependency management (#13523) 2022-12-01 09:51:59 -08:00
xnnpack.cmake Improve dependency management (#13523) 2022-12-01 09:51:59 -08:00