Commit graph

2112 commits

Author SHA1 Message Date
Changming Sun
911d125323 Remove openmp from gpu build 2020-04-20 17:13:54 -07:00
Sheil Kumar
31b6629e99
Fork WinML IDL Guids (#3591)
Co-authored-by: Sheil Kumar <sheilk@microsoft.com>
2020-04-20 09:17:07 -07:00
Prabhat
381fee47ab
Added support to build onnxruntime with ACL (#3586)
* Added support to build onnxruntime with ACL

* Added ACL build instructions
2020-04-20 13:35:28 +05:30
Changming Sun
75426a3091 Fix build break 2020-04-19 18:32:46 -07:00
Zhang Lei
422266c445
Support conv transpose 1D in cuda provider. (#3300)
* Support conv transpose 1D in cuda provider.

* Clear some old comments. Enable conv_transpose_1d onnx test for cuda.
2020-04-19 22:07:34 +08:00
Scott McKay
7d5348f87e
Add ability to batch device copy for graph inputs and outputs. (#3580)
* Add ability to batch device copy for graph inputs and outputs.
2020-04-19 17:51:07 +10:00
Prabhat
ea62b3435a
Clean up build.py code (#3466) 2020-04-18 20:48:30 -07:00
Maxim Kalinin
fcf0f6ee9f
Generalize reshape fusion (#3554)
* Generalize reshape fusion

* Allow arbitrary number of Concat arguments
* Apply fusion even when an output of an internal node is used elsewhere
* Fix a bug when an internal node's output is the subgraph output
* Simplify code
2020-04-18 20:47:23 -07:00
Tiago Koji Castro Shibata
14e387aa1a
Fix WinML namespace build break (#3583)
* Add missing winrt namespace

* Conditional compilation of dxcore code

* Fix TAEF macros
2020-04-18 20:46:01 -07:00
Sherlock
56b223bc60
Implement OneHot CUDA Kernels (#3390)
* Implement OneHot CUDA Kernels

* Support fp16

* Use HandleNegativeAxis

* Make MLFloat16 test GPU only
2020-04-18 17:41:39 -07:00
Hariharan Seshadri
1599562016 Fix BatchNorm CUDA kernel definition 2020-04-18 17:21:29 -07:00
Zhang Lei
c365822808
Refactor calibrate.py. Add QLinearAdd and QLinearMul support. Fix bugs when loading JPEGs that are not strictly RGB, and typos in the load_batch call. (#3542) 2020-04-18 17:10:55 -07:00
Dmitri Smirnov
db9566f70d
Implement Inverse(12) for CPU and CUDA (#3485) 2020-04-18 17:10:21 -07:00
Dmitri Smirnov
38a18023c7
Fix some too popular warnings. (#3578)
Some pointless and noisy warnings either fixed or disabled.
2020-04-18 17:05:05 -07:00
Changming Sun
d68245853e
Disable downloading test data on Linux (#3581) 2020-04-18 15:54:58 -07:00
Sergii Dymchenko
3e884b4b6b
Fix some typos. (#3582)
* Fix some typos.

* Fix a typo.
2020-04-18 14:18:05 -07:00
suryasidd
6fe688c732
Disabled failed maxpool test on GPU (#3549) 2020-04-18 13:49:42 -07:00
Tianlei Wu
7f46f347db
Add GPT2 Attention Fusion in optimization script (#3488)
* Add Attention fusion for GPT2
* Support distilgpt2 in benchmark_gpt2.py
* Add options to disable Attention/SkipLayerNormalization/EmbedLayerNormalization/BiasGelu fusions
* Add logging at the beginning of each fusion
* Update notebooks: Add Gpt2OnnxModel.py to list of script files.
* Add test for gpt2 model optimization
* Add optional parameters (--input_ids --segment_ids --input_mask) for graph inputs
* Fuse BiasGelu
* Handle model that does not have segment_ids input.
* Allow fuse embed layer without mask
2020-04-17 16:23:53 -07:00
Tianlei Wu
5d3b217039
Update Attention operator for GPT2 (#3474)
Add unidirectional mask for Attention operator.
Update mask_index to mask broadcast from B->BxS->BxNxSxS to B->BxSxS->BxNxSxS.
2020-04-17 16:20:40 -07:00
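The mask broadcast described in the commit above (B -> BxSxS -> BxNxSxS) can be illustrated with a small sketch. This is not the operator's actual kernel: the function names `build_mask` and `broadcast_heads` are hypothetical, and it assumes that `mask_index` holds one valid sequence length per batch entry and that a unidirectional mask lets position i attend only to positions j <= i.

```python
# Illustrative sketch only; names and semantics are assumptions, not the ORT kernel.

def build_mask(mask_index, seq_len, unidirectional):
    """Expand per-batch lengths (shape B) into a BxSxS 0/1 mask."""
    mask = []
    for valid_len in mask_index:
        rows = []
        for i in range(seq_len):
            row = []
            for j in range(seq_len):
                visible = j < valid_len              # padding positions are hidden
                if unidirectional:
                    visible = visible and j <= i     # no attention to future tokens
                row.append(1 if visible else 0)
            rows.append(row)
        mask.append(rows)
    return mask


def broadcast_heads(mask, num_heads):
    """Broadcast a BxSxS mask to BxNxSxS by repeating it for every head."""
    return [[per_batch for _ in range(num_heads)] for per_batch in mask]


mask_bss = build_mask([2], seq_len=3, unidirectional=True)
mask_bnss = broadcast_heads(mask_bss, num_heads=2)
```

Building the mask at BxSxS before broadcasting is what allows a per-position (causal) pattern; a BxS mask can only hide whole key positions.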
Hariharan Seshadri
b4457ecb7a
Fix gen_doc build option and refresh documentation (#3545)
* Support listing keys in custom metadata map via C/C++ API

* nit

* PR feedback

* Nit

* Initial commit

* More changes

* Support listing keys in custom metadata map via C/C++ API

* nit

* PR feedback

* Nit

* Initial commit

* More changes

* Add md files

* Doc changes

* Update

* revert cmake changes

* Update

* Doc change

* Update

* Update
2020-04-17 14:41:04 -07:00
Hector Li
5acd8dbe7d
remove option --enable_lto (#3515) 2020-04-17 14:18:56 -07:00
Yufeng Li
f822a54860
Make De/QuantizeLinear support half (#3531)
* Make QuantizeLinear support half

* remove unnecessary type constraint

* refine kernel definition

* add fp16 support for dequantizelinear

* disable QuantizeLinear_per_tensor_half_int8 for tensorrt

* refine unit test and fix saturate issue for MSDomain QuantizeLinear

* fix build break

* include tensorrt for half_uint8 test
2020-04-17 12:17:48 -07:00
Tracy Sharpe
c7b6fab29d
Fix build break in mlas\lib\quantize.cpp: missing nearbyintf (#3572) 2020-04-17 11:50:25 -07:00
Xiang Zhang
43c3a5edba
update onnxruntime version string for telemetry (#3526)
* update onnxruntime version string for telemetry

* use ORT_VERSION

* deleted version.h
2020-04-17 10:46:58 -07:00
Changming Sun
209b41a67d Update dependencies graph 2020-04-17 07:38:45 -07:00
Sheil Kumar
2717c178cc
Fork the WinML APIs into the Microsoft namespace (#3503)
* Migrate winml to Microsoft Namespace (packaging changes are pending)

* add ns_prefix toggle

* fix packaging

* Users/sheilk/add missing raw header (#3484)

* add dualapipartition

* wrong variable for repo root

Co-authored-by: Sheil Kumar <sheilk@microsoft.com>

* remove existence check to force failures

* extra paren

* dualapipartition needs to be referenced from the source

* add microsoft.ai.machinelearning.dll to the output dir

* rename the idl file so that assembly info is correctly added into the winmd

* fix namespaces

* update namespaces

* default to microsoft, and add namespace override as build argument

* update cmakesetings.json as well

* remove from cmakelists.txt

Co-authored-by: Sheil Kumar <sheilk@microsoft.com>
Co-authored-by: Changming Sun <chasun@microsoft.com>
2020-04-17 06:18:54 -07:00
ytaous
fcb27c4e8b
hotfix for skiplayernorm (#3543)
Co-authored-by: Ethan Tao <ettao@microsoft.com>
Co-authored-by: Changming Sun <chasun@microsoft.com>
2020-04-17 01:22:08 -07:00
liuziyue
92269ae409
perf tuning docs update (#3520) 2020-04-17 00:23:15 -07:00
Sheil Kumar
951484ba53
Dualapipartitionattibute.h header is missing in nuget package (#3350)
* add dualapipartition

* wrong variable for repo root

Co-authored-by: Sheil Kumar <sheilk@microsoft.com>
2020-04-16 22:21:57 -07:00
Changming Sun
1a222b3f6e
Disable downloading test data on Windows (#3551)
* Disable downloading test data on Windows
2020-04-16 22:15:20 -07:00
Andrews548
93b957a55a
Acl improvements (#3463)
* Fixed corner cases for acl ep gemm implementation by setting fully connected as the main layer

* Introduced versioned build for the acl ep. ACL versions supported are 1902, 1905 and 1908

* Added convolution-activation fusion optimization for acl ep. We see improvements of 12% for mobilenetv2 and 4% for resnet50

Co-authored-by: Andrei-Alexandru <andrei-alexandru.avram@nxp.com>
2020-04-16 03:14:37 -07:00
Adam Pocock
c91527235a
[Java] Add support for map and sequence information on output nodes (#3468) 2020-04-16 02:29:23 -07:00
Changming Sun
7c89f38a34
Fix static analysis warnings found by VC++ (#3530)
1. Fix static analysis warnings found by VC++
2. Add a new pipeline for static analysis
3. Merge all the Windows CI builds into one single yaml file (easier to queue them all).
4. Make DNNL build faster by disabling building the tests and examples.
5. Enable custom op unit test.
2020-04-16 01:46:47 -07:00
Ye Wang
ec4f6c099b
Resolve comments and make minor changes to Featurizer transformers (#3535) 2020-04-15 13:29:24 -07:00
Hariharan Seshadri
abfb275ac0
Support listing keys in custom metadata map via C/C++ API (#3477)
* Support listing keys in custom metadata map via C/C++ API

* nit

* PR feedback

* Nit
2020-04-15 12:14:03 -07:00
David Brownell
72cd61baae
Removed use of parameters in python wheel build scripts (#3524) 2020-04-15 10:31:14 -07:00
Yulong Wang
cf2fddf760
fix nuget build (#3532) 2020-04-15 10:30:11 -07:00
Changming Sun
b63349c8d6
Fix custom op test failure (#3525) 2020-04-14 20:36:42 -07:00
Adam Pocock
bc9a199b16 Renaming deviceNum to deviceId. 2020-04-14 20:35:03 -07:00
Adam Pocock
e9dc8954ac Adding support for ACL and DML to the Java API. 2020-04-14 20:35:03 -07:00
Changming Sun
a2feb29b0d
Fix build break (#3528)
Ignore some known test failures
Install ONNX package before running Windows CI builds
2020-04-14 18:07:56 -07:00
Negin Raoof
e303f458e4
Add int64 input type for ReduceProd (#3507)
* Add int64 input type

* Fix for cuda

* Fix linking

* Cuda

* Fixed missing registration

* Fix registration for opsets 1-11

* Adding reduce_matrix_rows for int64

* Update reduction_functions.cu

* Revert cuda
2020-04-14 15:09:28 -07:00
Ori Levari
f564569a80
Adapter Model and Environment tests (#3469)
* Adapter Model and Environment tests
* winml test macro clean up and extension
2020-04-14 13:36:31 -07:00
Tiago Koji Castro Shibata
560f4c5b16
Make GPUTEST macro consistent among TAEF/googletest (#3518) 2020-04-14 10:55:16 -07:00
Du Li
621b3ac03a
FFT contrib ops (#3381)
* add custom op skeleton

* Adding Rfft, Irfft kernels.

* Fix a few errors:
1. make kernel stateless to avoid race condition
2. reclaim cufft plan

* Adding MLFloat16 support

* Adding fp16 support for fft ops.

* Adding cufft plan cache.

* adding a util func

* adding copyright info.

* Accommodating PR comments.
2020-04-14 10:12:04 -07:00
Yufeng Li
baa86f181f
Handle the case that initializers are in graph input (#3449)
warn that initializers are in graph input
provide a tool to move initializer out of graph input
Motivation and Context
Starting from IR_VERSION 4, an ONNX model treats initializers that appear in graph input as non-constant. This may cause some graph optimizations to fail, such as constant folding and operator fusion. Warn about this case and provide a tool.
2020-04-14 09:06:04 -07:00
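The idea behind the tool mentioned in the commit above can be sketched as follows: remove every graph input whose name matches an initializer, so the initializer is treated as a constant. This is a minimal sketch, not the tool shipped in the PR. The function name `move_initializers_out_of_inputs` is hypothetical, and the example uses `SimpleNamespace` stand-ins rather than a real ONNX `GraphProto`; it assumes only that the graph exposes `input` and `initializer` sequences whose items have a `name` attribute.

```python
from types import SimpleNamespace


def move_initializers_out_of_inputs(graph):
    """Drop graph inputs that share a name with an initializer.

    Returns the names of the removed inputs.
    """
    initializer_names = {init.name for init in graph.initializer}
    duplicated = [inp for inp in graph.input if inp.name in initializer_names]
    for inp in duplicated:
        graph.input.remove(inp)
    return [inp.name for inp in duplicated]


# Stand-in graph (hypothetical data): "W" is both an input and an initializer.
graph = SimpleNamespace(
    input=[SimpleNamespace(name="X"), SimpleNamespace(name="W")],
    initializer=[SimpleNamespace(name="W")],
)
removed = move_initializers_out_of_inputs(graph)
remaining = [inp.name for inp in graph.input]
```

After the call, only the true runtime input `X` remains in the input list, so an optimizer is free to treat `W` as a constant.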
David Brownell
006c5be1b1
Optionally produce a python wheel that includes featurizers (#3491) 2020-04-14 09:00:13 -07:00
Changming Sun
040c28ff39 Remove dead code from HandleNegativeAxis 2020-04-14 01:01:15 -07:00
Colin Jermain
06db89cf13 Using logic for finding README.rst to find requirements.txt 2020-04-13 18:59:44 -07:00
Colin Jermain
43d9f9190e Removing unused six package 2020-04-13 18:59:44 -07:00