onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-28 20:11:22 +00:00

History

Hector Li ffb4b62826 Fix allocator issue for TensorRT IOBinding (#6240 ) * Fix issue: https://github.com/microsoft/onnxruntime/issues/6094 Root cause: we didn't expose the OrtMemoryInfo for TRT, so it will cause issue if user want use IObinding for Tensorrt. Short term fix, add the OrtMemoryInfo for TRT. Long term should unify the allocator for CUDA and TRT		2020-12-31 20:15:43 -08:00
..
codegen	Add ability to filter GraphViewer using IndexedSubGraph. (#5614 )	2020-11-04 07:08:18 +10:00
common	Support multi-loop parallel sections, use multi-loop sections in GRU (#5602 )	2020-11-10 12:24:57 +00:00
dll	populate file metadata for onnxruntime.dll (#2978 )	2020-02-07 11:00:28 -08:00
flatbuffers	Support for Sparse Initializers (#5540 )	2020-10-27 10:32:06 -07:00
framework	Fix allocator issue for TensorRT IOBinding (#6240 )	2020-12-31 20:15:43 -08:00
graph	Add Longformer Attention Cuda Op(#5932 )	2020-11-25 13:52:10 -08:00
language_interop_ops	Change the OrtCustomOp invocation as a constant. (#5506 )	2020-11-02 10:38:07 -08:00
mlas	MLAS: handle MlasGemm(M/N/K==0) cases (#6238 )	2020-12-30 23:25:10 -08:00
optimizer	Revert "Fuse MatMulIntegerToFloat only when scales are scalar (#6008 )" (#6169 )	2020-12-17 19:53:50 -08:00
platform	Deprecate Python global configuration functions [Part 1] (#5923 )	2020-12-15 11:32:43 -08:00
profile
protobuf	implement per-channel for quantizelinear and dequantizelinear (#4759 )	2020-08-21 12:08:50 -07:00
providers	Fix allocator issue for TensorRT IOBinding (#6240 )	2020-12-31 20:15:43 -08:00
session	Removed executor todo that looks dead. (#6234 )	2020-12-30 17:17:37 -08:00
util	Remove MKLML/openblas/jemalloc build config (#6212 )	2020-12-30 17:18:19 -08:00