onnxruntime/onnxruntime/core
Hector Li ffb4b62826
Fix allocator issue for TensorRT IOBinding (#6240)
* Fix issue: https://github.com/microsoft/onnxruntime/issues/6094

Root cause: we didn't expose the OrtMemoryInfo for TRT, which causes problems when a user wants to use IOBinding with TensorRT.

Short-term fix: add the OrtMemoryInfo for TRT. Long term, we should unify the allocators for CUDA and TRT.
2020-12-31 20:15:43 -08:00
codegen Add ability to filter GraphViewer using IndexedSubGraph. (#5614) 2020-11-04 07:08:18 +10:00
common Support multi-loop parallel sections, use multi-loop sections in GRU (#5602) 2020-11-10 12:24:57 +00:00
dll
flatbuffers Support for Sparse Initializers (#5540) 2020-10-27 10:32:06 -07:00
framework Fix allocator issue for TensorRT IOBinding (#6240) 2020-12-31 20:15:43 -08:00
graph Add Longformer Attention Cuda Op(#5932) 2020-11-25 13:52:10 -08:00
language_interop_ops Change the OrtCustomOp invocation as a constant. (#5506) 2020-11-02 10:38:07 -08:00
mlas MLAS: handle MlasGemm(M/N/K==0) cases (#6238) 2020-12-30 23:25:10 -08:00
optimizer Revert "Fuse MatMulIntegerToFloat only when scales are scalar (#6008)" (#6169) 2020-12-17 19:53:50 -08:00
platform Deprecate Python global configuration functions [Part 1] (#5923) 2020-12-15 11:32:43 -08:00
profile
protobuf implement per-channel for quantizelinear and dequantizelinear (#4759) 2020-08-21 12:08:50 -07:00
providers Fix allocator issue for TensorRT IOBinding (#6240) 2020-12-31 20:15:43 -08:00
session Removed executor todo that looks dead. (#6234) 2020-12-30 17:17:37 -08:00
util Remove MKLML/openblas/jemalloc build config (#6212) 2020-12-30 17:18:19 -08:00