Commit graph

1768 commits

Author SHA1 Message Date
Dmitri Smirnov
e37cdbed74 Add manifest missing comma 2020-01-10 16:02:19 -08:00
stevenlix
c4f6db7796 Fix memory leak in TRT (#2815)
* fix memory leak issue

* revert EP_FAIL on enueueV2
2020-01-10 14:07:40 -08:00
Dmitri Smirnov
afa48b7e13
Add timeseries imputer transformer featurizer kernel (#2813)
Make kernels non-template. Add input constraint for learnt data.
  Fixup tests.
  Add two more featurizers along with tests. Tests fail.
  min_max_scalar_transformer
  robust_scalar_transformer
  Fix tests serialized stream by prepending version bytes.
  Add inputation_marker_transfomer and the test.
  Fix up float/double type designations.
 Added label_encoder_transformer along with a test.
  string_throw case is broken at the momement.
  Fix labelencodertransfomer_test.cc string_throw case
  Rename maxabsscalertransformer_test.cc
  Add MissingDummiesTransformer along with the test.
  Update manifest.
  Add TimeSeriesImputerTransformer definition, implementation and tests
2020-01-10 13:27:51 -08:00
Changming Sun
48e042868f
Update test data (#2356) 2020-01-10 10:52:23 -08:00
George Wu
31200ed92c
speed up Windows TRT CI (#2811)
* don't run cuda tests if building with tensorrt

* remove unnecessary build options for win trt ci

* refactor win gpu tensorrt ci yml

* --numpy_version=1.17

* update

* update

* azcopy and cuda path
2020-01-10 08:40:40 -08:00
Ke Zhang
b0019ac7fe
add interface to copy batch tensors. (#2807)
* add interface to copy batch tensors.

* onnxruntime
2020-01-09 16:52:34 -08:00
Tracy Sharpe
7ef6570e27
MLAS: update SGEMM threading parameters (#2808) 2020-01-09 14:48:20 -08:00
Yufeng Li
71b5165ed3
Initialize max of softmax with lowest of float (#2786) 2020-01-09 13:48:18 -08:00
Dmitri Smirnov
2c8179bee4
ML.NET team needs featurizers within a package (#2789)
Add auto ml featurizers to Windows, MacOS as well as to GPU  packaging-pipelines.
2020-01-09 10:54:12 -08:00
George Wu
1978376e1e
add session creation time cost. (#2798) 2020-01-08 11:17:48 -10:00
Tianlei Wu
32c5e76a16
Improve bert optimization script: (#2712)
(1) Move input int64=>int32 conversion to embed layer fusion.
(2) Output epsilon attribute for LayerNormalization fusion.
2020-01-08 11:32:27 -08:00
Nathan
f84240db2b
add uint8 support to where op (#2792) 2020-01-08 09:59:42 -08:00
Hariharan Seshadri
ebfcad1c90
Add script for release Nuget validation (#2719)
* Initial commit

* Nits

* Disable a test temporarily

* Change working directory

* Test

* Add download python step

* Test update

* More changes

* Fix space issue

* Fix

* Verify nuget signing

* Fix

* Spaces

* PR feedback

* Nit

* Fix

* Fix

* Remove temporary changes
2020-01-08 18:42:22 +05:30
Andrews548
3e6f1836eb ACL EP convolution improvements (#2774)
Added the optimized implementation for depthwise convolution for both ACL v19.02 and ACL 19.05.
Also the pointwise convolution seems to be more optimal in the CPU implementation so we opted for that instead.
2020-01-07 06:42:03 -10:00
Andrews548
fdc0106f83 ACL EP GEMM improvements (#2780)
When it is posible we use a fully connected layer instead of the gemm implementation.
This will let the library use the best implementation based on the input data.
2020-01-07 06:35:18 -10:00
Maher Jendoubi
f22bffe0f6 Contributing: Fix a typo (#2784) 2020-01-07 06:32:13 -10:00
Yufeng Li
72bdfc8cd4
Implement a more stable softmax (#2715)
* Implement a more stable SoftMax
 e^x is represented as infinity if x is large enough, like 100.f. Infinity divided by Infinity is a NAN. Thus, softmax gets a NAN if one or more item are large enough.
A math transform as below is leveraged to get a stable softmax:
e^xi/(e^x1 + ...e^xn) = e^(xi - max) / (e^(x1 - max) + ... + e^(xn - max))

And for convenience, force max to 0.f if all xi are negative
2020-01-06 14:28:12 -08:00
Dmitri Smirnov
6f66260372
Import more featurizers (#2781)
Make kernels non-template. Add input constraint for learnt data.
  Add min_max_scalar_transformer, robust_scalar_transformer,
  inputation_marker_transfomer, label_encoder_transformer,
 missing_dummies_transformer along with tests.
 Advance Featurizers library commit.
2020-01-06 13:43:44 -08:00
Changming Sun
1b23118056 Fix nightly build version number issue 2020-01-06 11:16:44 -08:00
Changming Sun
e3f674b563 Disable featurizers in python packages 2020-01-06 11:16:44 -08:00
Changming Sun
7ace7a5bcd Pass BUILD_BUILDNUMBER to linux docker 2020-01-06 11:16:44 -08:00
Changyoung Koh
7666d130e5 Rename MKL-DNN to DNNL to fix broken link (#2730) 2020-01-06 08:50:42 -10:00
Takeshi Watanabe
3d75c1b6be Fix typo in error message (#2736) 2020-01-05 13:59:23 -08:00
Changming Sun
013642ed37 Revert "Change default optimization level to All (from Basic) (#2745)"
This reverts commit 56bb503c2f.
2020-01-03 15:28:23 -08:00
KeDengMS
2096f324f4 [NupharEP] fix a race condition when multiple sessions running different models concurrently (#2772) 2020-01-03 10:05:09 -08:00
Scott McKay
a2c8981a9e
Add basic stacktrace output for posix debug builds. (#2749) 2020-01-03 13:43:57 +10:00
Changming Sun
382fa86af8
Pipeline changes for python 3.8 (#2753)
1. Pipeline changes for python 3.8
2. Fix a regression in setup.py which was just introduced in the previous commit.

Please notice, we still haven't made python 3.8 + Windows + CUDA work.
2020-01-02 15:25:25 -08:00
Changming Sun
fd334aff44
Update numpy to 1.18 (#2758)
* Update numpy to 1.18
2019-12-30 14:51:01 -08:00
Ashwini Khade
56bb503c2f
Change default optimization level to All (from Basic) (#2745)
* change default optimization level to All (from Basic)

* fix test

* fix c# test
2019-12-30 12:31:44 -08:00
Changming Sun
90b708f8a9
Update protobuf to 3.11.2 (#1928)
Update protobuf to 3.11.2 (#1928)
2019-12-27 18:28:18 -08:00
Changyoung Koh
b40a85a0e8 Fix build for Python 3.8 (#2747)
* Fix build for Python 3.8
2019-12-27 12:17:06 -08:00
Changming Sun
c7a9c6b488
Split onnxruntime server to a separated folder (#2744) 2019-12-27 11:21:23 -08:00
zhanyi-ms
6e4ec9b049 fix ngraph wheel (#2737)
* fix ngraph wheel

1.1.0 onnxruntime_ngraph wheel doesn't work

* remove libdnnl.so in nGraph Libs

* make it easy to compare
2019-12-27 05:40:01 -08:00
Andrey
9013fe6340 Fix build on Mac OS (#2731)
mac os ld doesn't support --while-archive, correct option is -all_load
2019-12-25 20:10:39 -08:00
Faith Xu
f142b683a1 Fix broken python docs links (#2740) 2019-12-25 15:58:56 -08:00
Tracy Sharpe
ebf23744eb
MLAS: ARM64 build fix (#2734)
fix bad usage of vreinterpret to cast vector element types
2019-12-24 19:03:05 -08:00
George Wu
71ce0b8e0a
remove 64bit warning message from python validation. (#2727) 2019-12-23 17:06:55 -08:00
Andrey
abda478083 [server] Add supposed for model_name and model_version as cli parameter (#2708) 2019-12-23 12:52:09 -08:00
Ashwini Khade
cd6ff42442
Add schema for new Qops (#2611)
* Add schema for new Qops

* adding shape inference + qlinearaveragepool

* plus review comments

* plus review comments

* updates per review comments

* plus review comments
2019-12-23 10:19:28 -08:00
KeDengMS
71940c0915
Update Nuphar tutorial notebook (#2721)
1. Reflect int8 GEMV improvements for multi-threading from #2696
2. Add notes on multi-threading control using OpenMP
3. Add samples of running multi-isa AOT, and show int8 GEMM differences between AVX and AVX2
4. Add rnn_benchmark example to resolve #1993
2019-12-22 22:42:03 -08:00
Jeff
bbe32f0990
Extend DML kernels (#2641)
* Additional DML operators

* Check unsupported attributes and inputs

* Address PR comments

* Add kernel capability function used for partitioning, and re-enable stride-based int64 support based on value range

* Fix test failures

* Build fix

* PR comments
2019-12-20 23:28:15 -08:00
KeDengMS
4b900dc585 Simplify cache implementation and avoid static variables that may carry over between models 2019-12-20 21:04:17 -08:00
Changming Sun
da03ed4473 Tiny fix to codegen 2019-12-20 21:04:17 -08:00
Changming Sun
b42cb61904
Packaging pipeline changes for VS 2019 (#2711) 2019-12-20 19:53:51 -08:00
Ori Levari
f0046a1531
switch back order of session_state_ and execution_providers_ in InferenceSession.h (#2714) 2019-12-20 17:26:51 -08:00
Xavier Dupré
7c0235c15a
Propagate documentation modification from rel-1.0.0 (#2713) 2019-12-21 00:25:45 +01:00
Ori Levari
967ffc03cc
a subset of the models in testdata were missing an ir_version. update this ir_version to 3. (#2702) 2019-12-20 13:56:11 -08:00
Yufeng Li
72286d91e7
fix the CUDNN_BN_MIN_EPSILON difference issue between cudnn7.3 and cudnn7.6 (#2681) 2019-12-20 13:49:03 -08:00
jignparm
64112db346
Fix C# handling of unicode strings (#2697)
* Fix C# handling of unicode strings

* more tests

* check for handle before freesing

* variable reuse efficiency

* refactor and cleanup utf8 o utf16 conversion block
2019-12-19 21:02:54 -08:00
Dmitri Smirnov
233bdd268b
Advance Featurizers commits and fix up ORT lining issues. (#2705)
Update featurizers. Fix up constraint issue.
  Pass static VCRT library option down to Featurizers CMAKE.
  Make build Featurizers OFF by default.
  Rename registration call.
2019-12-19 20:27:36 -08:00