onnxruntime/include/onnxruntime/core/framework
Scott McKay 6e430c0526
A few performance improvements coming out of ssd_mobilenet and ssd_resnet34 analysis (#1578)
* A few performance improvements:
 - Make the iteration in NonZero more efficient by using a raw pointer and simplifying the increment logic
   - add another unit test to check that the new logic works with a 3-dimensional tensor
   - gains about 2% for ssd_mobilenet
 - Avoid floating point operations on each iteration in Concat
   - gains about 0.5% for ssd_mobilenet and ssd_resnet34
 - Put the common case first in ExecutionFrame::AllocateAsPerAllocationPlan to avoid an unnecessary call to IsSparseTensor
   - gains about 0.05% for ssd_mobilenet
 - Minor tweak to put some ctors in the TensorShape header so they can be inlined more easily
2019-08-08 07:20:00 +10:00
alloc_kind.h Avoid copy of pre-existing value to subgraph output (#637) 2019-03-19 06:55:59 +10:00
allocator.h Move CopyTensor out of IExecutionProvider interface. (#1268) 2019-07-11 14:49:20 -07:00
customregistry.h CustomRegistry should use composition instead of inheritance 2019-04-05 14:14:10 -07:00
data_types.h Implementation of sparse tensor (#1121) 2019-06-06 11:50:38 -07:00
execution_provider.h Move CopyTensor out of IExecutionProvider interface. (#1268) 2019-07-11 14:49:20 -07:00
fence.h Remove unnecessary casts from OrtValue to MLValue (#1051) 2019-05-17 07:52:59 -07:00
framework_common.h Combine OrtValue and MLValue into one type (#1043) 2019-05-16 10:22:49 -07:00
func_api.h Change Compute function to return a status code instead of an integer. (#1139) 2019-06-04 08:34:32 -07:00
kernel_def_builder.h Make MemPatternPlanner on/off switchable in model weight loading (#989) 2019-05-16 14:39:09 -07:00
kernel_registry.h Move CopyTensor out of IExecutionProvider interface. (#1268) 2019-07-11 14:49:20 -07:00
ml_value.h Combine OrtValue and MLValue into one type (#1043) 2019-05-16 10:22:49 -07:00
op_kernel.h Return error message from status instead of swallowing it. (#1221) 2019-06-22 06:26:42 +10:00
op_kernel_info.h Move CopyTensor out of IExecutionProvider interface. (#1268) 2019-07-11 14:49:20 -07:00
op_node_proto_helper.h misc: Fix comment in op_node_proto_helper (#460) 2019-02-11 14:38:43 -08:00
run_options.h Add/correct missing SAL annotations + avoid using unsigned types (except where counts are involved). (#1451) 2019-07-22 23:25:53 -07:00
sparse_tensor.h Implementation of sparse tensor (#1121) 2019-06-06 11:50:38 -07:00
tensor.h Rename Tensor.Size() to Tensor.SizeInBytes() (#1502) 2019-07-26 14:15:53 -07:00
tensor_shape.h A few performance improvements coming out of ssd_mobilenet and ssd_resnet34 analysis (#1578) 2019-08-08 07:20:00 +10:00