2102 commits

Author SHA1 Message Date
Hariharan Seshadri
1599562016 Fix BatchNorm CUDA kernel definition 2020-04-18 17:21:29 -07:00
Zhang Lei
c365822808
Refactor parts of calibate.py. Add QLinearAdd and QLinearMul support. Fix bugs when loading JPGs that are not strict RGB, and typos in the load_batch call. (#3542) 2020-04-18 17:10:55 -07:00
Dmitri Smirnov
db9566f70d
Implement Inverse(12) for CPU and CUDA (#3485) 2020-04-18 17:10:21 -07:00
Dmitri Smirnov
38a18023c7
Fix some overly common warnings. (#3578)
Some pointless and noisy warnings were either fixed or disabled.
2020-04-18 17:05:05 -07:00
Changming Sun
d68245853e
Disable downloading test data on Linux (#3581) 2020-04-18 15:54:58 -07:00
Sergii Dymchenko
3e884b4b6b
Fix some typos. (#3582)
* Fix some typos.

* Fix a typo.
2020-04-18 14:18:05 -07:00
suryasidd
6fe688c732
Disabled failed maxpool test on GPU (#3549) 2020-04-18 13:49:42 -07:00
Tianlei Wu
7f46f347db
Add GPT2 Attention Fusion in optimization script (#3488)
* Add Attention fusion for GPT2
* Support distilgpt2 in benchmark_gpt2.py
* Add options to disable Attention/SkipLayerNormalization/EmbedLayerNormalization/BiasGelu fusions
* Add logging at the beginning of each fusion
* Update notebooks: Add Gpt2OnnxModel.py to list of script files.
* Add test for gpt2 model optimization
* Add optional parameters (--input_ids --segment_ids --input_mask) for graph inputs
* Fuse BiasGelu
* Handle model that does not have segment_ids input.
* Allow fuse embed layer without mask
2020-04-17 16:23:53 -07:00
Tianlei Wu
5d3b217039
Update Attention operator for GPT2 (#3474)
Add unidirectional mask for Attention operator.
Change the broadcast from mask_index to the mask from B->BxS->BxNxSxS to B->BxSxS->BxNxSxS.
2020-04-17 16:20:40 -07:00
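The mask broadcast named in the commit above (B -> BxSxS -> BxNxSxS, with an optional unidirectional mask) can be pictured with a small pure-Python sketch. It assumes that mask_index[b] is the number of valid tokens for batch entry b and that 1 means "may attend". The function name `broadcast_mask` and the nested-list layout are hypothetical and only illustrate the shapes; this is not the operator's actual kernel.

```python
def broadcast_mask(mask_index, sequence_length, num_heads, unidirectional=False):
    """Expand a per-batch mask_index (shape B) into a BxNxSxS attention mask.

    Assumption: mask_index[b] is the count of valid (non-padding) tokens.
    """
    batch_mask = []
    for valid_length in mask_index:
        # B -> BxSxS: entry (i, j) is 1 when key j is a valid token and,
        # for a unidirectional mask, j does not come after query i.
        square = [
            [
                1 if j < valid_length and (not unidirectional or j <= i) else 0
                for j in range(sequence_length)
            ]
            for i in range(sequence_length)
        ]
        # BxSxS -> BxNxSxS: every head receives its own copy of the SxS mask.
        batch_mask.append([[row[:] for row in square] for _ in range(num_heads)])
    return batch_mask
```

With the older B -> BxS route, every query position in a batch entry would share one row of the mask; going through BxSxS is what leaves room for a per-query (lower-triangular) pattern.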
Hariharan Seshadri
b4457ecb7a
Fix gen_doc build option and refresh documentation (#3545)
* Support listing keys in custom metadata map via C/C++ API

* nit

* PR feedback

* Nit

* Initial commit

* More changes

* Add md files

* Doc changes

* Update

* revert cmake changes

* Update

* Doc change

* Update

* Update
2020-04-17 14:41:04 -07:00
Hector Li
5acd8dbe7d
remove option --enable_lto (#3515) 2020-04-17 14:18:56 -07:00
Yufeng Li
f822a54860
Make De/QuantizeLinear support half (#3531)
* Make QuantizeLinear support half

* remove unnecessary type constraint

* refine kernel definition

* add fp16 support for dequantizelinear

* disable QuantizeLinear_per_tensor_half_int8 for tensorrt

* refine unit test and fix saturate issue for MSDomain QuantizeLinear

* fix build break

* include tensorrt for half_uint8 test
2020-04-17 12:17:48 -07:00
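For readers unfamiliar with the operators touched by the commit above, the following is a minimal sketch of linear quantization with saturation, following the ONNX definition (divide by the scale, round to nearest with ties to even, add the zero point, clamp to the 8-bit range). It uses plain Python floats, so it does not model half precision; the function names and the `signed` flag are hypothetical, not the runtime's API.

```python
def quantize_linear(values, scale, zero_point, signed=False):
    """Quantize floats to 8-bit integers, saturating at the type limits."""
    low, high = (-128, 127) if signed else (0, 255)
    quantized = []
    for value in values:
        # Python's round() sends halves to the nearest even integer,
        # which matches the rounding rule in the ONNX operator definition.
        rounded = round(value / scale) + zero_point
        # Saturate instead of wrapping around when the result is out of range.
        quantized.append(max(low, min(high, rounded)))
    return quantized


def dequantize_linear(values, scale, zero_point):
    """Map quantized integers back to floats."""
    return [(value - zero_point) * scale for value in values]
```

The clamp is the "saturate" step: without it, an out-of-range value would wrap around when stored in an 8-bit type.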
Tracy Sharpe
c7b6fab29d
Fix build break in mlas\lib\quantize.cpp: missing nearbyintf (#3572) 2020-04-17 11:50:25 -07:00
Xiang Zhang
43c3a5edba
update onnxruntime version string for telemetry (#3526)
* update onnxruntime version string for telemetry

* use ORT_VERSION

* deleted version.h
2020-04-17 10:46:58 -07:00
Changming Sun
209b41a67d Update dependencies graph 2020-04-17 07:38:45 -07:00
Sheil Kumar
2717c178cc
Fork the WinML APIs into the Microsoft namespace (#3503)
* Migrate winml to Microsoft Namespace (packaging changes are pending)

* add ns_prefix toggle

* fix packaging

* Users/sheilk/add missing raw header (#3484)

* add dualapipartition

* wrong variable for repo root

Co-authored-by: Sheil Kumar <sheilk@microsoft.com>

* remove existence check to force failures

* extra paren

* dualapipartition needs to be referenced from the source

* add microsoft.ai.machinelearning.dll to the output dir

* rename the idl file so that assembly info is correctly added into the winmd

* fix namespaces

* update namespaces

* default to microsoft, and add namespace override as build argument

* update cmakesetings.json as well

* remove from cmakelists.txt

Co-authored-by: Sheil Kumar <sheilk@microsoft.com>
Co-authored-by: Changming Sun <chasun@microsoft.com>
2020-04-17 06:18:54 -07:00
ytaous
fcb27c4e8b
hotfix for skiplayernorm (#3543)
Co-authored-by: Ethan Tao <ettao@microsoft.com>
Co-authored-by: Changming Sun <chasun@microsoft.com>
2020-04-17 01:22:08 -07:00
liuziyue
92269ae409
perf tuning docs update (#3520) 2020-04-17 00:23:15 -07:00
Sheil Kumar
951484ba53
Dualapipartitionattibute.h header is missing in nuget package (#3350)
* add dualapipartition

* wrong variable for repo root

Co-authored-by: Sheil Kumar <sheilk@microsoft.com>
2020-04-16 22:21:57 -07:00
Changming Sun
1a222b3f6e
Disable downloading test data on Windows (#3551)
* Disable downloading test data on Windows
2020-04-16 22:15:20 -07:00
Andrews548
93b957a55a
Acl improvements (#3463)
* Fixed corner cases for acl ep gemm implementation by setting fully connected as the main layer

* Introduced versioned build for the acl ep. ACL versions supported are 1902, 1905 and 1908

* Added convolution-activation fusion optimization for acl ep. We see improvements of 12% for mobilenetv2 and 4% for resnet50

Co-authored-by: Andrei-Alexandru <andrei-alexandru.avram@nxp.com>
2020-04-16 03:14:37 -07:00
Adam Pocock
c91527235a
[Java] Add support for map and sequence information on output nodes (#3468) 2020-04-16 02:29:23 -07:00
Changming Sun
7c89f38a34
Fix static analysis warnings found by VC++ (#3530)
1. Fix static analysis warnings found by VC++
2. Add a new pipeline for static analysis
3. Merge all the Windows CI builds into one single yaml file (easier to queue them all).
4. Make DNNL build faster by disabling building the tests and examples.
5. Enable custom op unit test.
2020-04-16 01:46:47 -07:00
Ye Wang
ec4f6c099b
Resolve comments and make minor changes to Featurizer transformers (#3535) 2020-04-15 13:29:24 -07:00
Hariharan Seshadri
abfb275ac0
Support listing keys in custom metadata map via C/C++ API (#3477)
* Support listing keys in custom metadata map via C/C++ API

* nit

* PR feedback

* Nit
2020-04-15 12:14:03 -07:00
David Brownell
72cd61baae
Removed use of parameters in python wheel build scripts (#3524) 2020-04-15 10:31:14 -07:00
Yulong Wang
cf2fddf760
fix nuget build (#3532) 2020-04-15 10:30:11 -07:00
Changming Sun
b63349c8d6
Fix custom op test failure (#3525) 2020-04-14 20:36:42 -07:00
Adam Pocock
bc9a199b16 Renaming deviceNum to deviceId. 2020-04-14 20:35:03 -07:00
Adam Pocock
e9dc8954ac Adding support for ACL and DML to the Java API. 2020-04-14 20:35:03 -07:00
Changming Sun
a2feb29b0d
Fix build break (#3528)
Ignore some known test failures
Install ONNX package before running Windows CI builds
2020-04-14 18:07:56 -07:00
Negin Raoof
e303f458e4
Add int64 input type for ReduceProd (#3507)
* Add int64 input type

* Fix for cuda

* Fix linking

* Cuda

* Fixed missing registration

* Fix registration for opsets 1-11

* Adding reduce_matrix_rows for int64

* Update reduction_functions.cu

* Revert cuda
2020-04-14 15:09:28 -07:00
Ori Levari
f564569a80
Adapter Model and Environment tests (#3469)
* Adapter Model and Environment tests
* winml test macro clean up and extension
2020-04-14 13:36:31 -07:00
Tiago Koji Castro Shibata
560f4c5b16
Make GPUTEST macro consistent among TAEF/googletest (#3518) 2020-04-14 10:55:16 -07:00
Du Li
621b3ac03a
FFT contrib ops (#3381)
* add custom op skeleton

* Adding Rfft, Irfft kernels.

* Fix a few errors:
1. make kernel stateless to avoid race condition
2. reclaim cufft plan

* Adding MLFloat16 support

* Adding fp16 support for fft ops.

* Adding cufft plan cache.

* adding a util func

* adding copyright info.

* Accommodating PR comments.
2020-04-14 10:12:04 -07:00
Yufeng Li
baa86f181f
Handle the case that initializers are in graph input (#3449)
Warn when initializers are in graph input.
Provide a tool to move initializers out of graph input.
Motivation and Context
From IR_VERSION 4 onward, ONNX treats only the initializers that also appear in graph input as non-constant. Leaving them there can prevent some graph optimizations, such as constant folding and operator fusion. Warn about this case and provide a tool.
2020-04-14 09:06:04 -07:00
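The idea behind the tool mentioned in the commit above can be sketched as follows: drop every graph input whose name matches an initializer, so that the initializer is treated as a constant. This is an illustrative sketch, not the script shipped with the commit. The function name is hypothetical, and the graph is a stand-in built from `SimpleNamespace` objects; it is assumed, not verified here, that an ONNX `GraphProto` would behave the same way, since it exposes `input` and `initializer` sequences of named entries.

```python
from types import SimpleNamespace


def remove_initializers_from_inputs(graph):
    """Remove graph inputs that share a name with an initializer.

    Returns the names of the removed inputs.
    """
    initializer_names = {initializer.name for initializer in graph.initializer}
    removed = []
    # Iterate over a copy because entries are removed from graph.input in place.
    for graph_input in list(graph.input):
        if graph_input.name in initializer_names:
            graph.input.remove(graph_input)
            removed.append(graph_input.name)
    return removed


# Stand-in graph: "W" is both an input and an initializer, "X" is a real input.
graph = SimpleNamespace(
    input=[SimpleNamespace(name="X"), SimpleNamespace(name="W")],
    initializer=[SimpleNamespace(name="W")],
)
removed = remove_initializers_from_inputs(graph)
```

After the call, only the real input "X" remains in `graph.input`, while the initializer list is untouched.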
David Brownell
006c5be1b1
Optionally produce a python wheel that includes featurizers (#3491) 2020-04-14 09:00:13 -07:00
Changming Sun
040c28ff39 Remove dead code from HandleNegativeAxis 2020-04-14 01:01:15 -07:00
Colin Jermain
06db89cf13 Using logic for finding README.rst to find requirements.txt 2020-04-13 18:59:44 -07:00
Colin Jermain
43d9f9190e Removing unused six package 2020-04-13 18:59:44 -07:00
Colin Jermain
c2c3102aba Tying install_requires to requirements.txt 2020-04-13 18:59:44 -07:00
Ye Wang
66a79d2c9f
fix (#3512) 2020-04-13 18:30:58 -07:00
Dmitri Smirnov
efd9b92482
Handle Scalars in TernaryOps and Where. (#3509)
Handle Scalars in TernaryOps and Where.
2020-04-13 16:24:35 -07:00
Ye Wang
cbe30f3e19
update FeaturizersLibrary (#3511) 2020-04-13 15:47:51 -07:00
Tracy Sharpe
5aab2671f8
Fix crash in DequantizeLinear with scalar tensor (#3508) 2020-04-13 14:52:52 -07:00
Ye Wang
438353abcd
Fix TruncatedSVDFeaturizer's test failure and re-enable its kernel test (#3458)
* checkin

* fix linux & macos build

* fix test

* revert the changes for a single-aimed PR

* fix
2020-04-13 13:59:38 -07:00
Tianlei Wu
54bbbb78ae
Change mask_index input of Attention op to be optional (#3459)
Change Mask Index to optional
2020-04-12 22:55:37 -07:00
George Wu
7f6e407e09
fix python packaging manylinux1 build break. (#3482) 2020-04-11 06:58:22 +08:00
Ryan Lai
4223591043
Add automatic generation of tensors for Onnxruntime Perf Runner (#3448)
* Add flag to enable automatic generation of input for models with tensor inputs

* change wording of variable

* Naming convention changes to variables

* Handle free dimensions

* Comment with default allocator

* variable rename

* Remove input_count

* Cast to size_t to avoid warning

Co-authored-by: Ryan Lai <ryalai96@gamil.com>
2020-04-10 11:54:17 -07:00
stevenlix
56e85484ba
Handle optional inputs and remove more empty shape nodes in TensorRT EP (#3455)
* check optional inputs and remove more empty shape affected nodes

* fix some minor issues

* update code according to feedback
2020-04-10 11:13:38 -07:00