onnxruntime/onnxruntime
Hector Li 2a6c69de2b
Implement the Concat CUDA kernel (#1333)
* Improve CUDA kernel performance for Concat. Implement the kernel code instead of using cudaMemCpy in a loop.

* Update the index lookup part for Concat & Split
2019-07-02 23:08:59 -07:00
..
contrib_ops Implement contrib op CropAndResize (#1277) 2019-06-24 18:34:35 -07:00
core Implement the Concat CUDA kernel (#1333) 2019-07-02 23:08:59 -07:00
python Use manylinux2010 image to build linux python wheels (#1282) 2019-06-27 15:45:06 -07:00
server move environment.h/cc from framework to session project/folder. (#1241) 2019-06-17 18:01:21 -07:00
test Implement the Concat CUDA kernel (#1333) 2019-07-02 23:08:59 -07:00
__init__.py Add ability to change the logging severity of the default logger. (#1165) 2019-06-12 08:54:03 +10:00
ReformatSource.ps1
ReformatSourcePython.bat
VSCodeCoverage.runsettings