1543 commits

Author SHA1 Message Date
Changming Sun
104f3b2a59 Exclude candy from CUDA tests 2019-11-05 15:22:09 -08:00
Changming Sun
143ae98a37
Fix a bug in onnxruntime_pybind_state.cc when TENSORRT is enabled (#2326) 2019-11-05 15:04:50 -08:00
George
8a102c6e99 apply eigen patch only for ACL. 2019-11-05 13:53:53 -08:00
Changming Sun
5ce4d4fc49 Fix a test failure when it runs on FreeBSD 2019-11-04 23:47:37 -08:00
Yufeng Li
035913d42f
Support int32_t for Reduction (#2317) 2019-11-04 20:52:01 -08:00
manashgoswami
d5c36bfff2 Updated links in docs (#2303)
* Update README.md

* Update README.md

* Update README.md
2019-11-03 09:10:56 -08:00
Faith Xu
556bae17a5 Fix versions table (#2309)
* Update table values

* Fix onnxml opset version
2019-11-03 08:58:21 -08:00
Yulong Wang
cba93f7c8d fix Gelu CPU: remove MayInplace() declaration (#2306) 2019-11-01 18:10:05 -07:00
Yulong Wang
204a6872d3
remove unused param 'input_count' in ConcatImpl (#2304) 2019-11-01 15:50:11 -07:00
Tianlei Wu
a6b2c9fc09
Fix mask in EmbedLayerNormalization (#2300) 2019-11-01 13:49:55 -07:00
KeDengMS
6e65dcf588
[NupharEP] symbolic_shape_infer improvements (#2299)
- Improves symbolic shape inference in the following ways:
1. Extend suggested merge to map to literals with --auto_merge. For example, MatMul of ['ax1', 'ax2'] x [128, 256] would now map 'ax2' to 128.
2. Add --int_max option to simplify computations like Min(100000, 'dim') to 'dim'. This helps ops like Slice generate the correct shape, i.e. start=0, end=Min(100000, dim - 2) on dim. It was previously treated as equal, since sympy cannot determine Min(100000, dim - 2) < dim.
- Fix a bug in the create_shared script on Windows where the AOT dll was not generated because linking failed when there were too many obj files.
- Fix a bug for Split since TOPI does not support split on symbolic dimension.
- Some build warning fixes for NupharEP.
2019-11-01 11:34:52 -07:00
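The two shape-inference behaviours described in the commit above (#2299) can be illustrated with a minimal sketch. This is not the code from the symbolic_shape_infer script: the function names `suggest_merge`, `matmul_shape` and `simplify_min` are hypothetical, shapes are modelled as plain Python lists of ints and strings, only 2-D MatMul inputs are handled, and the reading of --int_max as "any literal at or above this value counts as unbounded" is an assumption.

```python
# Illustrative sketch only; all names here are hypothetical.
# A shape is a list whose entries are ints (literals) or strs (symbolic dims).

def suggest_merge(lhs_shape, rhs_shape):
    """Return a {symbol: replacement} map for the contracted MatMul dimension."""
    k_lhs, k_rhs = lhs_shape[-1], rhs_shape[-2]
    if k_lhs == k_rhs:
        return {}
    if isinstance(k_lhs, str) and isinstance(k_rhs, int):
        return {k_lhs: k_rhs}   # symbol on the left, literal on the right
    if isinstance(k_rhs, str) and isinstance(k_lhs, int):
        return {k_rhs: k_lhs}   # literal on the left, symbol on the right
    if isinstance(k_lhs, str) and isinstance(k_rhs, str):
        return {k_rhs: k_lhs}   # two symbols: keep the left name (arbitrary choice)
    raise ValueError(f"incompatible MatMul dims: {k_lhs} vs {k_rhs}")


def matmul_shape(lhs_shape, rhs_shape):
    """Infer the output shape of a 2-D MatMul and report the merge that was applied."""
    merge = suggest_merge(lhs_shape, rhs_shape)
    lhs = [merge.get(d, d) for d in lhs_shape]
    rhs = [merge.get(d, d) for d in rhs_shape]
    return lhs[:-1] + rhs[-1:], merge


def simplify_min(literal, symbol, int_max):
    """Treat a literal at or above int_max as 'no bound', so Min collapses to the symbol."""
    if literal >= int_max:
        return symbol
    return ("Min", literal, symbol)  # left unsimplified


print(matmul_shape(["ax1", "ax2"], [128, 256]))  # (['ax1', 256], {'ax2': 128})
print(simplify_min(100000, "dim", 100000))       # dim
```

The first call mirrors the example in the commit message: 'ax2' is mapped to the literal 128 because it must match the contracted dimension of the right-hand operand.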
Tianlei Wu
bc85d43809
Dump cuda tensor data (#2243)
* dump cuda tensor

* move data_type definition

* Dump cuda tensors for cuda build only.
Output tensor location (if it is not in CPU or pinned)

* update for cuda build

* Update for code review feedback

* update for CR feedback

* use data transfer manager for tensor copy
2019-10-31 21:09:10 -07:00
Scott McKay
7a5de9c958
Add a python script with a number of helper actions for creating/editing/dumping onnx test runner format pb files (#2294)
* Add a python script with a number of helper actions for creating/editing/dumping onnx test running format pb files.
2019-11-01 06:39:14 +10:00
mikecaraman
358b517d49 [v2] Add ACL (Arm Compute Library) execution provider (#2258)
* Guard unused parameter

Guard unused parameter for Linux Arm and other cases.

* Add ACL (Arm Compute Library) execution provider

Add a new execution provider targeting Arm architecture based on Arm Compute Library.
Validated on NXP i.MX8QM CPU with ResNet50, MobileNetv2 and VGG models.
All unit tests are passing.

Comparative performance improvements for ResNet50v1 model obtained with
onnxruntime_perf_test:
             A72    2xA72    A53    4xA53
ACL vs CPU   16%    9%       21%    13%

Usage documentation available in ACL-ExecutionProvider.

* Fix eigen unused parameter

Fix eigen unused parameter error for Arm cross-compilation.
2019-10-31 12:25:36 -07:00
Yulong Wang
bf7fa091cc
NonMaxSuppression cuda implementation (#2082) 2019-10-31 11:53:22 -07:00
Changming Sun
67755adfd8 Bug Fix: NodeArg class has a move constructor but doesn't have a move assignment operator 2019-10-31 10:29:54 -07:00
RandySheriffH
d6849bd26c
Rashuai/cuda top k (#1919)
* implement cuda topk

* implement heap

* add type support

* refactor interface

* add support for sorting by index

* add test case

* use cub device radix sort

* register for opset 9 and 10

* add opset 9/10 declaration

* refactor code

* refactor code

* fix comment

* fix comment

* switch to scratched mem
2019-10-31 10:26:00 -07:00
Hariharan Seshadri
4bcd8bfca1
Fix CUDA Reduce ops (#2268)
* Add some tests for Reduction ops

* Exclude tensorrt for new tests

* Fix bug in CUDA Reduce ops

* Fix nit
2019-10-31 10:11:59 -07:00
Changming Sun
a5da5ff6f4 Remove onnxruntime_USE_EIGEN_THREADPOOL cmake option 2019-10-30 21:51:54 -07:00
KeDengMS
ff64d1f55b
Relax check for optimized model saving (#2291)
So user may save model with layout optimization.
2019-10-30 21:48:49 -07:00
Maik Riechert
ecfbb1bb99 Add missing guards to profiling calls (#1374)
* guard remaining profiler calls

* enforce proper usage of profile class
2019-10-31 14:28:49 +10:00
George Wu
aa041026e3
update Dockerfile.openvino (#2286)
* install miniconda before openvino installation. add networkx, defusedxml dependencies.

* apt-get update

* apt-get update

* merge Intel changes.
2019-10-30 13:58:24 -07:00
Tomasz Dołbniak
427e627805 Support for the Expand op with constant shape inputs (#2278)
* Disable the Expand op for non-const shape inputs

* Check if an input is constant with IsConstantInitializer
2019-10-30 13:22:45 -07:00
KeDengMS
e18c9582a8
[NupharEP] performance improvements (#2283)
* [Nuphar EP] performance improvements
1. Add new ops: Shape, Expand
2. Add support for steps in Slice
3. Simplify Gather
4. Always inline alias nodes
5. Transpose nodes whose inner loop is symbolic fall back to the CPU provider when vectorization is not possible
6. Add opt_inproj option to model_editor to extract MatMuls inside Scan for input projection to outside
2019-10-30 10:15:04 -07:00
zhijxu
63e9961637 fix typo 2019-10-30 09:57:56 -07:00
zhijxu
8dabe0502b merge two RUN instructions to avoid making the docker image too large 2019-10-30 09:57:56 -07:00
Changming Sun
7b11f05a97 Update version number 2019-10-30 08:13:09 -07:00
Changming Sun
2172a9e5ed Fix an issue in the nuget run tests scripts 2019-10-30 08:13:09 -07:00
Changming Sun
138a7f194e Add cleanup step 2019-10-30 08:13:09 -07:00
Scott McKay
002d9c335b Add node and op type info to error message if there's a type or shape inferencing exception thrown by the ONNX checker (#2277)
* Add node and op type info to error message if there's a type or shape inferencing exception thrown by the ONNX checker.

* Fix line break from auto format

* Remove unused param from unit test code.
2019-10-29 23:42:13 -07:00
Ashwini Khade
8d231a32f2 Remove the libc version check in C# code (#2282) 2019-10-29 21:31:38 -07:00
Scott McKay
983a616bda
Revert to using opset 7 as the default for OpTester. Add explanation as to why that is: (#2256) 2019-10-30 09:42:21 +10:00
Scott McKay
47f40ca204
Cleanup add/remove of initializer (#2274)
* Cleanup add/remove of initializer

* Address PR comments
2019-10-30 06:50:41 +10:00
Dmitri Smirnov
09eb8ff8b8
make sure samples run (#2276)
Adjust instructions for imagenet. Fix compile warning.
 Fix input/output names. Update build/run instructions.
2019-10-29 10:07:35 -07:00
Changming Sun
d1096b524f Fix fns candy build error on Linux 2019-10-29 02:54:25 -07:00
Nathan
b0359b5622
register where kernel (#2245) 2019-10-28 15:59:29 -07:00
KeDengMS
5611a528f6
onnxruntime_perf_test: Add -u option to save optimized model (#2227) 2019-10-28 12:36:31 -07:00
Ryan Hill
3ecdd985cb
MNIST Sample Fix (#2259)
* Fix Global variable initialization order

* Remove static initialization, and add error messages
2019-10-28 11:22:45 -07:00
zhijxu
0b88eff43a add docker file to build onnxruntime with different execution providers 2019-10-28 10:03:18 -07:00
zhijxu
3926ce33f4 add --parallel to speed up compiling source code 2019-10-28 10:03:18 -07:00
zhijxu
4eda08534f make smaller step in Dockerfile
so image layers can be reused as much as possible
2019-10-28 10:03:18 -07:00
zhijxu
ce23d628a5 fix bug in cmake/onnxruntime_server.cmake 2019-10-28 10:03:18 -07:00
zhijxu
be7c24247f register execution provider when onnxruntime server creates sessions 2019-10-28 10:03:18 -07:00
pulkittomar
1fa956fb3f Undo integration test skip (#1917) 2019-10-27 09:47:31 -07:00
Yuri
a2596b706b FreeBSD compatibility patch.
* Treat the 'amd64' architecture the same way as 'x86_64'
* Use thr_self() instead of gettid() on FreeBSD
2019-10-26 12:44:12 -07:00
Dmitri Smirnov
88c58c19d4
Improve code readability and performance. (#2257)
  Remove one time checks from loops.
  Move out GetType<>() calls from loop as they
  go through local function statics.
  Get rid of index calculations from input and output
  so we can simply advance ptrs and potentially do better pre-fetch.
  Improve code readability.
2019-10-25 16:19:59 -07:00
Changming Sun
ce14b07b1c
Fix the GPU nuget pipeline failure (#2255) 2019-10-25 13:55:38 -07:00
Sreekanth Yalachigere
c7599d0705 subgraph parse error fix (#2254) 2019-10-24 22:14:30 -07:00
shahasad
6a0ee7eff6
Fix model path marshalling in csharp, and re-enable the pretrained model tests (#2236) 2019-10-24 20:39:16 -07:00
edgchen1
8be48f47dd
Added Graph::ReplaceInitializedTensor() function. (#2230)
Added Graph::ReplaceInitializedTensor() function to support updating existing initializers.
2019-10-24 19:30:24 -07:00