onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-05-29 23:06:41 +00:00

History

Scott McKay 6cc57721f4 Change CUDA implementation of Transpose to support all fixed size tensor types (#2387 ) * Change CUDA implementation of Transpose to not use a typed kernel so we can support more types with minimum binary size. Add support for 8, 16, 32 and 64 bit types. Add unit tests. Add method so the implementation can be called directly (will be used by CUDA Scan very soon). * Disable TensorRT for MLFloat16 and int8 unit tests. * Address PR comment and add support for calling cublas implementation if type is mlfloat16.		2019-11-15 10:36:28 +10:00
..
automl_ops	Add AutoML to 3 main builds. (#1631 )	2019-08-16 18:06:16 -07:00
common	Move CXX API global into the header (#2228 )	2019-10-23 14:15:53 -07:00
contrib_ops	Avoid using the default logger in the graph lib and optimizers (#2361 )	2019-11-14 13:23:28 -08:00
framework	Avoid using the default logger in the graph lib and optimizers (#2361 )	2019-11-14 13:23:28 -08:00
ir	Avoid using the default logger in the graph lib and optimizers (#2361 )	2019-11-14 13:23:28 -08:00
mlas	MLAS: Implement U8S8 GEMV kernels (#2069 )	2019-10-09 11:54:16 -07:00
onnx	Avoid using the default logger in the graph lib and optimizers (#2361 )	2019-11-14 13:23:28 -08:00
opaque_api	Avoid using the default logger in the graph lib and optimizers (#2361 )	2019-11-14 13:23:28 -08:00
optimizer	Avoid using the default logger in the graph lib and optimizers (#2361 )	2019-11-14 13:23:28 -08:00
perftest	[v2] Add ACL (Arm Compute Library) execution provider (#2258 )	2019-10-31 12:25:36 -07:00
platform	Fixed tensor reference to const data and cleaned up Env API. (#1979 )	2019-10-24 10:28:13 -07:00
proto	Use protobuf-lite to reduce onnxruntime.dll size. (#639 )	2019-03-21 14:06:38 -07:00
providers	Change CUDA implementation of Transpose to support all fixed size tensor types (#2387 )	2019-11-15 10:36:28 +10:00
python	[NupharEP] Multiple optimizations (#2380 )	2019-11-14 10:40:33 -08:00
server	Update ORTSRV integration test model path (#2225 )	2019-10-23 13:48:20 -07:00
shared_lib	Move CXX API global into the header (#2228 )	2019-10-23 14:15:53 -07:00
testdata	Fix reuse logic in allocation planner. (#2393 )	2019-11-13 22:51:12 -08:00
tvm	Replace GSL with GSL-LITE submodule and fix up refs (#1920 )	2019-10-01 12:43:29 -07:00
util	Avoid using the default logger in the graph lib and optimizers (#2361 )	2019-11-14 13:23:28 -08:00
win_getopt