Commit graph

1692 commits

Author SHA1 Message Date
Yufeng Li
a8efa42d68
Fix build break in macos (#3046) 2020-02-19 20:42:57 -08:00
Yufeng Li
5e594a070a Bump up the version to 1.1.2 2020-02-19 13:58:32 -08:00
Yufeng Li
9959cf8229 Logging in framework.cc should use the session logger 2020-02-19 13:58:32 -08:00
RyanUnderhill
2cec09a483 Disable zfnet512 test for x86 2020-01-22 18:12:11 -08:00
Yufeng Li
1a5a25331d Only fuse when output count of add is 1 2020-01-21 21:26:14 -08:00
RyanUnderhill
5ecabe1b15 Fix pipeline issue 2020-01-21 14:27:59 -08:00
Ryan Hill
9fec34ddab Update version number (#2868)
Update version number
2020-01-17 18:22:46 -08:00
stevenlix
4d8fca0a4e Fix memory leak in TRT (#2815)
* fix memory leak issue

* revert EP_FAIL on enueueV2
2020-01-17 15:18:02 -08:00
Changming Sun
b8114cf3ed Pass BUILD_BUILDNUMBER to linux docker 2020-01-17 15:18:02 -08:00
Changming Sun
a855db780a Fix nightly build version number issue 2020-01-17 15:18:02 -08:00
Yufeng Li
ef078db702 Initialize max of softmax with lowest of float (#2786) 2020-01-17 15:18:02 -08:00
Nathan
917273bfed add uint8 support to where op (#2792) 2020-01-17 15:18:02 -08:00
Yufeng Li
477aa7505c Implement a more stable softmax (#2715)
* Implement a more stable SoftMax
 e^x is represented as infinity if x is large enough, like 100.f. Infinity divided by Infinity is a NAN. Thus, softmax gets a NAN if one or more item are large enough.
A math transform as below is leveraged to get a stable softmax:
e^xi/(e^x1 + ...e^xn) = e^(xi - max) / (e^(x1 - max) + ... + e^(xn - max))

And for convenience, force max to 0.f if all xi are negative
2020-01-17 15:18:02 -08:00
Ashwini Khade
8d03dab7d9 update default optimization level + fix gemm_activation fusion (#2791)
* update defualt optimization level + fix gemm_activation fusion

* fix typo

* add unit test and incorporate review comments

* fix test comment
2020-01-17 15:18:02 -08:00
jignparm
d12cdff969 Fix C# handling of unicode strings (#2697)
* Fix C# handling of unicode strings

* more tests

* check for handle before freesing

* variable reuse efficiency

* refactor and cleanup utf8 o utf16 conversion block
2020-01-17 15:18:02 -08:00
Xavier Dupré
1b85a262fa Propagate documentation modification from rel-1.0.0 (#2713) 2020-01-17 15:18:02 -08:00
Yufeng Li
c33dab394f
fix the CUDNN_BN_MIN_EPSILON difference issue between cudnn7.3 and cudnn7.6 (#2680) 2019-12-17 15:39:39 -08:00
Ryan Hill
82d35ded39
Ryanunderhill/rel 1.1.0 (#2651)
* Add missig env variables for mac pipeline test (#2595)
* Java API for onnxruntime (#2215)
* Rename automl python tools folder to featurizer_ops. (#2593)
* Make sure fenced tensor could not reuse other tensor. (#2561)
* Add support for opset 11 in reshape fusion (#2592)
* Support opset 11 subgraph of Squad model in Embed Layer Normalization (#2605)
* Allow providers to be set for InferenceSession at construction (#2606)
* EmbedLayerNormalization Fusion For Dynamic Squad Model Opset 10 (#2613)
* Improve Embed Layer Norm Fusion for SQuAD with static input shape  (#2621)
* Improve cuda expand() opeator's performance. (#2624)
* Cuda pad optimize when no padding is needed. (#2625)
* Shortcut cuda Pad() when no padding is needed.
* Improve performance of resize() in Nearest mode (#2626)
* Optimize cuda scatter() on 2D compatible. (#2628)
* Optimize cuda scatter() on 2D compatible.
* fix float16 comparison in initializer (#2629)
* epsilon attribute for layernormalization fusion (#2639)
* Fix memory exception in Layer Norm Fusion (#2644)
2019-12-13 17:15:33 -08:00
Ryan Hill
6049de8d26
Ryanunderhill/rel 1.1.0 (#2615)
* Add missig env variables for mac pipeline test (#2595)
* Java API for onnxruntime (#2215)
* Rename automl python tools folder to featurizer_ops. (#2593)
2019-12-13 13:37:02 -08:00
Ryan Hill
36eb1771ba
Update version (#2584) 2019-12-08 18:00:12 -08:00
liuziyue
200f4b4ea6 EmbedLayerNormalization Fusion Improvement (#2553)
Embedding layer norm fusion improvements - add more checks
2019-12-07 23:14:26 -08:00
KeDengMS
0f12346d76
[Nuphar EP] fixes for some object detection models (#2581)
Update notebook tutorial with multi-threaded int8 GEMM from #2517
2019-12-07 13:37:00 -08:00
Ryan Hill
cbc398bb75
Ryanunderhill/packagename test (#2582) 2019-12-07 12:08:46 -08:00
Ashwini Khade
c06dbd8311 Add ConvTranspose1D (#2578) 2019-12-07 08:50:02 -08:00
Mark
79847f39b3 Fix file not found error during docker build. (#2569) 2019-12-07 08:49:47 -08:00
Yufeng Li
5575766a53
Add more check on SkipLayerNorm and BiasGelu fusion (#2574) 2019-12-06 15:36:02 -08:00
Changming Sun
262ee9dc5a Fix a warning found in the latest VS release 2019-12-06 15:07:21 -08:00
Yufeng Li
34beafc51c
make layernorm fusion to support opset 11 (#2545) 2019-12-06 13:06:36 -08:00
shahasad
eeb28a80c0
setup java ci mac (#2570) 2019-12-06 11:43:40 -08:00
Tianlei Wu
038ee91da5
Allow sequence length to be symbolic (#2559) 2019-12-06 10:13:56 -08:00
George Wu
73c682b97c
disable onnx_test_runner -x invocations for dnnl (#2568) 2019-12-05 23:05:34 -08:00
Changming Sun
7eddac16c2
Re-enable Windows C# tests (#2564) 2019-12-05 21:22:31 -08:00
Ryan Hill
854362cf05
Update win-x86-ci.yml (#2557)
Fix build pipeline break
2019-12-05 18:44:12 -08:00
Changming Sun
ace132f9aa
Fix android build (#2558) 2019-12-05 15:03:22 -08:00
Sreekanth Yalachigere
4c996a8699 DNNL CMAKE update (#2548) 2019-12-05 13:48:57 -08:00
Hariharan Seshadri
53a6bc2f07
Fix a bug handling negative begin pad values in Pad op (#2550)
* Fix bug in Pad op

* Update
2019-12-05 11:29:45 -08:00
Changming Sun
bec4abf074 Add back executable bit to build.py 2019-12-04 21:22:02 -08:00
Ashwini Khade
281933fa1c
Fix C API tests for centos and mac (#2544)
* change c++14 to c++11

* add ld lib path for centos

* enable csharp tests on macos

* fix C API test on MacOS + fix manylinux dotnet install

* fix manylinux dotnet install

* fix lib link
2019-12-04 18:01:35 -08:00
Dmitri Smirnov
d34fb62012
Introduce container type runtime checks and other improvements (#2522)
Rework TensorSeq in a manner consistent with Tensor and SparseTensor
  in terms of type system setup.
  Reduce templating. Introduce helpers to ensure the same
  data type.
  Make OrtValue __dtor not virtual.
  Introduce ContainerChecker
2019-12-04 16:04:17 -08:00
Yulong Wang
be56d77a66
Fix integer overflow in cuda NonMaxSuppression implementation (#2540)
* add test case that should pass but fail

* fix nms

* extract int_max_output_boxes_per_class
2019-12-04 13:27:04 -08:00
Xiang Zhang
3e7aaf8fa1 User/xianz/telemetry (#2458)
* enabme telemetry

* enable telemetry

* set enable telemetry as default

* for debugging

* remove log and set disable telemetry as default back

* delete private file while testing

* resolve comment: mainly add license header, rename macro and update docs

* rewording in privacy.md
2019-12-03 23:34:53 -08:00
stevenlix
293b15480b Add dynamic shape support in TensorRT execution provider (#2450)
* remove onnx-tensorrt submodule

* add new onnx-tensorrt submodule (experiment) for trt6

* update engine build for trt6

* update compile and compute for tensorrt6.0

* Update tensorrt_execution_provider.cc

* Update tensorrt_execution_provider.cc

* Update tensorrt_execution_provider.cc

* Update tensorrt_execution_provider.cc

* switch to onnx-tensorrt master for TensorRT6'

* Update tensorrt_execution_provider.cc

* Handle dynamic batch size and add memcpy in TensorRT EP

* update test cases

* Update tensorrt_execution_provider.cc

* update onnx-tensorrt submodule

* Update Dockerfile.ubuntu_tensorrt

* Update Dockerfile.ubuntu_tensorrt

* Update run_dockerbuild.sh

* Update run_dockerbuild.sh

* Update install_ubuntu.sh

* Update concat_op_test.cc

* Update tensorrt_execution_provider.cc

* Upgrade TensorRT to version 6.0.1.5

* Update onnxruntime_providers.cmake

* Update CMakeLists.txt

* Update reduction_ops_test.cc

* Update install_ubuntu.sh

* Update Dockerfile.ubuntu_tensorrt

* Update Dockerfile.tensorrt

* Update BUILD.md

* Update run_dockerbuild.sh

* Update install_ubuntu.sh

* Update onnxruntime_providers.cmake

* Update install_ubuntu.sh

* Update install_ubuntu.sh

* Update gemm_test.cc

* Update gather_op_test.cc

* Update CMakeLists.txt

* Removed submodule

* update onnx-tensorrt submodule

* update header file

* Removed submodule

* add submodule onnx-tensorrt kevin's branch shape-test'

* add debugging code

* Update tensorrt_execution_provider.cc

* Update tensorrt_execution_provider.cc

* merge master

* Removed submodule

* update onnx-tensorrt submodule

* add more changes for dynamic shapes

* Update tensorrt_execution_provider.cc

* update for dynamic shape

* update dynamic shape processing

* fix logger issue

* remove submodule onnx-tensorrt

* add submodule onnx-tensorrt

* add env variable min_subgraph_size

* remove redundency

* update document

* use onnxruntime::make_unique

* fix multi-run issue

* remove some tests to save CI build time

* Add dynamic shape test

* Update TensorRT-ExecutionProvider.md

* Add example of running Faster R-CNN model on TensorRT EP

* Add more details on env variables

* update environment variables

* Update tensorrt_basic_test.cc

* Update model tests

* Update tensor_op_test.cc

* remove --use_full_protobuf

* Update build.py
2019-12-03 23:18:33 -08:00
Yulong Wang
d748f891d8
Revert "Disable thread pool creation when enabled OpenMP (#2485)" (#2535)
This reverts commit 7c7d5a149c.
2019-12-03 22:09:02 -08:00
Hariharan Seshadri
5c2e474751
Add provision in ORT for session options to be parsed when available via model file (#2449)
* Initial commit

* Fix gitmodules

* Nits

* Nits

* Updates

* Update

* More changes

* Updates

* Update

* Some updates

* More changes

* Update

* Update

* Merge

* Update

* Updates

* More changes

* Update

* Fix nits

* Updates

* Fix warning

* Fix build

* Add comment

* PR feedback

* PR feedback

* Updates

* Updates

* Update

* More changes

* Fix build break

* Comment test for now

* Updates

* Updates

* PR feedback

* Updates

* Nits

* Add tests

* Fix build

* Fix build

* Fix build

* Fix build break

* Fix build

* Nits

* PR feedback

* More change

* Expose GetSessionOptions in pybind logic and add unit test for python

* Fix build

* PR feedback

* PR feedback
2019-12-03 16:56:07 -08:00
shahasad
178d059111 Setup java ci (#2528) 2019-12-03 14:21:51 -08:00
Tianlei Wu
b50878dcf0 Disable Attention fusion tests when DISABLE_CONTRIB_OPS is defined (#2529) 2019-12-03 14:21:21 -08:00
Ashwini Khade
e32eff826c
enable nuget package testing on centos7 (#2527)
* add centos tests to linux cpu ci pipeline

* Disable failing test

* use centos6 instead of centos7

* change back to centos7

* add dotnet runtime dependency

* fix dotnet runtime dependencies

* install dotnet sdk instead of runtimes

* add more dotnet dependencies

* temporary skip failing test

* ix lib path

* reenable failing test
2019-12-03 10:16:45 -08:00
RandySheriffH
85a4ed8cf7
fix cuda kernel causing invalid mem access (#2523) 2019-12-03 09:16:00 -08:00
Tianlei Wu
66254eb25a
Update BERT model optimization python script (#2521)
Add support of GPT2 model optimization:
* Match subgraph of Gelu Approximation (using Tanh).
* Fuse LayerNormalization if SkipLayerNormalization is not ready.
* Output model even if embedding layer is not fused.
* Improve Reshape Fusion to improve coverage.
* Refine constant input checking, and output fused op counter.

Update script according to latest op improvements:
* Fusion of Add Bias and Gelu.
* Fuse SkipLayerNormalization and Add Bias.

Other:
* Add ReduceSum for mask as intermediate step.
* Refactor verbose setting.
2019-12-03 08:40:51 -08:00
Sreekanth Yalachigere
31ea11a696 Renaming MKL-DNN as DNNL (#2515)
* DNNL: Moving Files to rename file names

* DNNL name change

* azure pipeline updated

* disable ceil/dialation and enable Opset10

* disable ceil/dialation tests in Python

* mlperf_ssd_resnet34_1200 disabled
2019-12-03 07:34:23 -08:00