Commit graph

5838 commits

Author SHA1 Message Date
RandySheriffH
21eb747a0f
Custom thread creation and join hooks (#9426) 2021-11-12 19:10:31 -08:00
Edward Chen
9f69d8bbae
Disable partial runtime optimization implementation by default (#9748)
* Only serialize runtime optimization records container if non-empty.

* Remove runtime optimizations from onnxruntime/core/flatbuffers/schema/README.md as it's not completely implemented yet.

* Disable partial runtime optimization implementation by default.
2021-11-12 17:37:29 -08:00
Sheil Kumar
a17bdaf725
Enable JoinModels API in WinML+RT Experimental API (#9746)
* Dynamic onnx model fusion

* empty node names should remain empty

* comments and cleanup

* logic reversed for promoting_unlinked_outputs

* PR feedback

* type

* typo

* fix model outputs with promote unlinked output

* remove disembodied model

Co-authored-by: Sheil Kumar <sheilk@microsoft.com>
2021-11-12 16:56:31 -08:00
Guoyu Wang
5ad6dbb314
Remove experimental from ORT format namespace (#9729)
* schema change

* cc changes

* remove temp debug code

* Adding fbs namespace to session_state_flatbuffers_utils.h

* Add fbs namespace to all ort format utils
2021-11-11 19:46:30 -08:00
Aidan Beggs
f6edf13513
Implement a Gemm/Sum fusion pattern (#9699)
When the pattern Sum(Gemm(A, B), C) exists, we can convert it to
Gemm(A, B, C), assuming that the output of the original Gemm is
not used elsewhere, and this change does not break broadcasting.
2021-11-11 18:33:13 -08:00
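The equivalence behind the Gemm/Sum fusion above can be checked with a small sketch. It is not the onnxruntime transformer: `matmul`, `gemm` and `add` are hypothetical pure-Python helpers, Gemm is assumed to use alpha = beta = 1 with no transposes, and C is given the full output shape so that broadcasting does not come into play.

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def gemm(a, b, c=None):
    """Y = A @ B (+ C), assuming alpha = beta = 1 and no transposes."""
    y = matmul(a, b)
    if c is None:
        return y
    return [[v + c[i][j] for j, v in enumerate(row)] for i, row in enumerate(y)]


def add(x, y):
    """Element-wise sum of two matrices with identical shapes."""
    return [[p + q for p, q in zip(rx, ry)] for rx, ry in zip(x, y)]


A = [[1.0, 2.0], [3.0, 4.0]]
B = [[5.0, 6.0], [7.0, 8.0]]
C = [[0.5, 0.5], [1.0, 1.0]]

unfused = add(gemm(A, B), C)  # Sum(Gemm(A, B), C)
fused = gemm(A, B, C)         # Gemm(A, B, C)
```

The two results match only because the intermediate Gemm output has no other consumer and C already has the shape of the output, which are the two preconditions the commit message names.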
Edward Chen
997266a620
Add build.py option to disable ORT format model runtime optimization (#9723)
ORT format model runtime optimization implementation is in progress.
This change adds a build.py option to disable the partial runtime optimization implementation, adds CI builds to test it, and disables runtime optimizations in mobile package builds.
2021-11-11 18:05:45 -08:00
Gary Miguel
93e239747f
Construct valid graphs for ONNX checker for IR version < 4. (#9665)
* Construct valid graphs for ONNX checker for IR version < 4.

Previously the constructed graph was not guaranteed to have its
initializers be a subset of its inputs, which is required for IR
version < 4. This resulted in spurious failures.

Fixes #9663
2021-11-12 09:13:28 +10:00
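The requirement described above (for IR version < 4, every initializer must also be listed as a graph input) can be sketched on plain name lists. Both helper functions below are hypothetical and work on strings only; they do not use the ONNX API and are not the code from the commit.

```python
def missing_graph_inputs(input_names, initializer_names):
    """Return initializer names that are not also graph inputs, sorted."""
    return sorted(set(initializer_names) - set(input_names))


def add_initializers_as_inputs(input_names, initializer_names):
    """Return the input list extended so that it covers every initializer."""
    seen = set(input_names)
    result = list(input_names)
    for name in initializer_names:
        if name not in seen:
            result.append(name)
            seen.add(name)
    return result


inputs = ["X"]
initializers = ["W", "B"]

problems = missing_graph_inputs(inputs, initializers)
fixed_inputs = add_initializers_as_inputs(inputs, initializers)
```

A graph whose `problems` list is empty satisfies the subset rule; a non-empty list corresponds to the kind of spurious checker failure the commit describes.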
zhaoyang-intel
32c896df6d
Add DynamicQuantizeLinear to DNNL EP (#9620)
implement dynamicquantizelinear in DNNL EP

add debug log in EP for operator coverage

block gpu elementwise op with 5 dims or more

Signed-off-by: Wang <zhaoyang.wang@intel.com>
2021-11-11 14:01:09 -08:00
Nick Kreeger
559adbb534
Replace deprecated PyEval_CallObject() call with PyObject_CallObject(). (#9737)
This API was changed in 3.9:
https://docs.python.org/3/whatsnew/3.9.html#changes-in-the-c-api
https://docs.python.org/3/c-api/call.html#c.PyObject_Call
2021-11-11 15:37:55 -06:00
Scott McKay
724009289b
Fix Issue #9671 (#9691)
* Fix #9671 by running the level 1 rewrite rules first and allowing the transpose optimizer to run multiple times to ensure it completes in level 1.

Removed unnecessary call to GenerateRuleBasedGraphTransformer as there are no level 2 rewrite rules.
2021-11-12 07:18:16 +10:00
Yi-Hong Lyu
03f9d77e17
Fix index out of bound bug of generated Gather (#9673)
Handle NHWC support for quantized Resize when roi and scales are
rank 0
2021-11-11 11:32:50 -08:00
Suffian Khan
e6f0fdd653
Strip AMD libraries bundled with Python package due to libonnxruntime_providers_rocm.so change (#9679)
* remove AMD library dependence from libonnxruntime_providers_rocm.so

* fix flake error

* remove rocm dependency from original library as well
2021-11-11 09:32:09 -08:00
Sherlock
acb5459268
Random CPU kernels should respect global seed (#9726)
* Random CPU kernels should respect global seed
2021-11-10 22:28:48 -08:00
Viswanath Boga
9d84811fb6
fixing pypi pipeline for release (#9716)
* fixing pypi pipeline for release

* updated the script and correct python version

* updated the version correctly with script changes

* Remove 1.9.1
2021-11-10 17:33:51 -08:00
Edward Chen
962feff7f1
Update linux-ci-pipeline.yml to run debug unit tests. (#9722) 2021-11-10 16:43:40 -08:00
Yulong Wang
31dc768e07
update ONNX Runtime Web CI to use same script for package versioning (#9698)
* use ps script for package generating

* allow e2e test to have no common pkg
2021-11-10 12:52:34 -08:00
George Wu
1541784f6c
[python api] align api with other language bindings' treatment of explicit provider registrations. enforce use of providers param in python InferenceSession when execution providers other than default CPU are enabled. (#9712)
* remove default python ep registration. raise exception if providers are not explicitly set if there are available providers

* temporarily disable exception

* fix python tests

* explicitly set CUDAProvider for python iobinding tests

* explicitly set providers param for InferenceSession()

* onnxrt

* raise ValueError if providers are not explicitly set when creating InferenceSession

* add required providers param

* explicitly set providers

* typo
2021-11-10 12:17:53 -08:00
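A caller affected by the change above would pass `providers` explicitly when creating a session. The sketch below only builds that list: `select_providers` is a hypothetical helper, and the onnxruntime calls appear in comments (with a placeholder model path) so the snippet runs without the package installed.

```python
def select_providers(available, preferred):
    """Keep the preferred providers that are available, in preference order."""
    chosen = [name for name in preferred if name in available]
    if not chosen:
        raise ValueError("none of the preferred execution providers is available")
    return chosen


# With onnxruntime installed, the list of available providers would come from
#   onnxruntime.get_available_providers()
available = ["CUDAExecutionProvider", "CPUExecutionProvider"]
providers = select_providers(
    available,
    preferred=["TensorrtExecutionProvider", "CUDAExecutionProvider", "CPUExecutionProvider"],
)

# Explicit registration, as the commit requires ("model.onnx" is a placeholder):
#   session = onnxruntime.InferenceSession("model.onnx", providers=providers)
```

Keeping `CPUExecutionProvider` last in the preference list gives the session a fallback when an accelerator-specific provider is missing from the build.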
Guoyu Wang
517fff0a39
Add opset 15 support for NNAPI/CoreML EPs (#9711) 2021-11-09 17:21:42 -08:00
Vincent Wang
adf98feb2c
ATenOp Support for BCEWithLogitsLoss (#9670) 2021-11-10 08:36:57 +08:00
Tiago Koji Castro Shibata
1b70a14c51
Remove usage of wstring_convert (#9251)
* Remove usage of wstring_convert

* Use local codepage per PR feedback
2021-11-09 13:52:57 -08:00
Chi Lo
3db2390dab
QDQ tool modification (#9616)
* Add: 1. an option to enable qdq for a node's output; 2. forcing qdq to appear as a pair

* modify description

* modify description

* Revert the logic of variable

* Revert the logic of variable

* Code refactor based on review's suggestions

* Update init

* Code refactor to be able to specify nodes to exclude from output quantization

* rename variable

* Fix bug

* code refactor

* remove the exposure of APIs

* fix bug

* fix bug

* fix bug

* fix bug

* expose one API

Co-authored-by: Ubuntu <onnxruntime@ort-trt-ep-linux-t4.bxgbzpva45kedp3rhbsbit4phb.jx.internal.cloudapp.net>
Co-authored-by: Chi Lo <Chi.Lo@gmail.com>
2021-11-09 12:22:12 -08:00
Sunghoon
e65f284476
[js/web] Support WebGL for ort format models in benchmarks (#9661)
* add p50 in test

* Support FusedConv in WebGL

* resolve comments

* add a comment for longToNumber change

Co-authored-by: Yulong Wang <yulongw@microsoft.com>
2021-11-09 11:58:47 -08:00
Changming Sun
de018f58e8
Update manylinux build scripts (#9701) 2021-11-09 11:55:49 -08:00
Wei-Sheng Chin
bdc279a7ed
Use the same allocator following Pytorch (#9697)
* Use the same allocator following Pytorch

* Polish

* Fix AMD build
2021-11-09 11:25:16 -08:00
satyajandhyala
229c9a4e1c
Added Trilu CUDA kernel. (#9633)
* Added Trilu CUDA kernel.

* Added TriluGrad.

* Added a training testcase for Trilu.

* Added Trilu gradient checker test.
2021-11-09 11:20:17 -08:00
Tang, Cheng
6420530b3a
fix the mkl dependency for eager mode (#9702)
* explicitly link with libtorch instead of using cmake var to avoid introducing mkl dependency

* use find_lib to get libtorch lib name

* temp fix

* add missing libraries

Co-authored-by: Cheng Tang <chenta@microsoft.com>
2021-11-09 08:52:55 -08:00
Guoyu Wang
a70ae24475
Add QDQ::Selector::Select to use const GraphViewer instead of mutable Graph (#9621)
* Move qdq selector to use const GraphViewer

* minor update

* Move qdq logic from NodeSelector to QDQ Selectors

* Fix build break

* Move selector result to NodesToOptimizeIndexes

* fix build break

* address CR comments

* move indexes -> indices

* Pass  graph_viewer to avoid recreating many times

* Update after merge master

* update graph viewer remarks

* update comments

* Add ut for new qdq selector logic

* Increase minimal binary size limit

* UT minor update

* Address CR comments
2021-11-08 21:36:29 -08:00
Hariharan Seshadri
65590b049c
Expose an API to query the CUDA compute stream to launch a custom kernel (#9141) 2021-11-08 21:10:34 -08:00
mindest
c579ebfbc3
change a for iteration (#9678)
Co-authored-by: Min Lin <linmin@microsoft.com>
2021-11-09 08:33:50 +08:00
Changming Sun
53afaefe3b
Refactor Windows CI pipeline yaml files (#9672) 2021-11-08 11:11:49 -08:00
Ryan Hill
24e35fba32
Change TensorShape to typically not allocate heap memory (#9542) 2021-11-08 10:29:54 -08:00
Ginés Hidalgo
13e64f8ff7
Remove all warnings C4800: Implicit conversion from 'int32_t/int64_t' to bool. Possible information loss (#9535) 2021-11-08 10:12:27 -08:00
Xavier Dupré
7e207ba3be
Use ORTMODULE_ONNX_OPSET_VERSION to modify the opset version in OrtModule (#9529)
* Use environment variable to change the ONNX opset in ORTModule
* overwrite ONNX_OPSET_VERSION
* store envvar in module constant
2021-11-08 17:03:16 +01:00
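The pattern named in the bullets above (read an environment variable once and store it in a module constant) can be sketched as follows. Only the variable name `ORTMODULE_ONNX_OPSET_VERSION` comes from the commit; `DEFAULT_OPSET`, its value and `resolve_opset` are assumptions made for illustration.

```python
import os

# Hypothetical fallback; the real default used by ORTModule is not stated here.
DEFAULT_OPSET = 12


def resolve_opset(environ=os.environ, default=DEFAULT_OPSET):
    """Return the opset from ORTMODULE_ONNX_OPSET_VERSION, or the default."""
    return int(environ.get("ORTMODULE_ONNX_OPSET_VERSION", default))


# Evaluated once at import time and kept as a module-level constant.
ONNX_OPSET_VERSION = resolve_opset()
```

Passing the environment as a parameter keeps the lookup easy to test with a plain dictionary instead of mutating `os.environ`.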
ashari4
1151c661eb
Add gi overload (#9690) 2021-11-07 16:04:00 -08:00
Yulong Wang
c6fddb263f
Add Node.js binding support to packaging pipeline (#9577) 2021-11-05 15:29:40 -07:00
Changming Sun
1cbbafdbe0
Change the default value of onnxruntime_DISABLE_RTTI (#9674) 2021-11-05 15:27:04 -07:00
Edward Chen
3e5dbfd94f
Increase binary size limit and publish binary size data even if limit is exceeded. (#9675) 2021-11-05 09:41:20 -07:00
Abhishek Jindal
dfe4d0a330
Abjindal/eager windows ci pipeline (#9587)
* adding eager ci pipelines files

* adding import torch before onnxruntime

* finding os environ path

* finding os environ path corrected

* print OS environ path variables

* adding environ path for torch

* changing python version

* changing python python for torch libs

* removing import torch statements

* removing unnecessary torch path

* removing path variable

* add dll_path

* test for python 3.7

* adding dll directory path for python 3.8+

* print dll directory path for python 3.8+

* adding requirements file

* change requirements directory

* print more

* adding dll dir path

* removing setup eager file

* adding details for dll directory

* adding details for dll directory more

* adding import torch in onnxruntime init file

* removing dll dir path and moving requirements file

* enabling pipeline for py3.7

* remove enter

* removing debug build

* removing openmp

* adding comments for torch dll loading and cases of failure

* cleaning up the pipeline
2021-11-05 09:09:09 -07:00
Changming Sun
74dc48f61c
Remove python 3.6 from macos pipeline (#9602) 2021-11-04 19:36:44 -07:00
pengwa
ee167bd078
Optimize _TileKernel for non-memcpy case (#9648)
* optimize _TileKernel for non-memcpy case

* fallback shape_rank >MAX_DIMS
2021-11-05 09:22:09 +08:00
Edward Chen
a355bcbd73
Clarify cgmanifest.json update process. (#9664) 2021-11-04 16:23:52 -07:00
Ginés Hidalgo
90de3c8e7c
Update MLOperatorAuthorPrivate.h (#9575)
Updated MLOperatorAuthorPrivate.h to remove `enum DML_TENSOR_DATA_TYPE;` to avoid warning "C4471: 'DML_TENSOR_DATA_TYPE': a forward declaration of an unscoped enumeration must have an underlying type"
2021-11-04 16:19:22 -07:00
Hariharan Seshadri
bbeceb7541
Support optional type in ORT (#8339) 2021-11-04 15:01:42 -07:00
Ginés Hidalgo
2fb03769bc
Updated OperatorUtility to avoid C2672 and C2783 (#9651)
Updated OperatorUtility to avoid compiler errors C2672 and C2783.

- Error C2672: 'TryMapStringToIndex': no matching overloaded function found
- Error C2783: 'std::optional<_Ty> Dml::TryMapStringToIndex(std::string_view,gsl::span<const Dml::NameAndIndex>)': could not deduce template argument for 'T'. note: see declaration of 'Dml::TryMapStringToIndex'. 'TryMapStringToIndex': function declaration must be available as none of the arguments depend on a template parameter
2021-11-04 14:04:32 -07:00
Ginés Hidalgo
1128bf282d
Updated cpuid_info.cc to fix PVS-Studio error C1012 (#9581)
Updated cpuid_info.cc to fix PVS-Studio error: "fatal error C1012: unmatched parenthesis: missing ')'"
2021-11-04 14:03:49 -07:00
Edward Chen
ddb4c05852
Save graph runtime optimizations for minimal build (#9508)
Add support for saving graph runtime optimizations in an ORT format model. The idea is to allow some optimizations to be "replayed" at runtime in a minimal build. The replaying part will be in a future change.
2021-11-04 10:49:46 -07:00
Sheil Kumar
71a1a7b471
Enable building winml with --build_nuget (#9632)
* Enable building winml with --build_nuget

* Fix flake8 errors

* semicolon

Co-authored-by: Sheil Kumar <sheilk@microsoft.com>
2021-11-04 00:42:51 -07:00
Guoyu Wang
9cedb12c89
Add App Center e2e test to Android package (#9653)
* Add app center test for Android pipeline

* update android test path

* Add logging of system abi

* fix java spotless error
2021-11-03 22:56:46 -07:00
Chi Lo
c8151b4037
Add percentile method for PTQ (#9342)
* Add percentile method for calibration

* Update configuration
2021-11-03 22:18:51 -07:00
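The general idea of percentile-based calibration is to choose a quantization range that ignores the most extreme activation values. The sketch below illustrates that idea with a nearest-rank percentile over raw values. It is not the calibrator added in this commit; `nearest_rank` and `percentile_range` are hypothetical helpers, and the symmetric split of the discarded tail is an assumption.

```python
import math


def nearest_rank(sorted_values, pct):
    """Nearest-rank percentile of an ascending list, with pct in [0, 100]."""
    if not sorted_values:
        raise ValueError("no values to calibrate on")
    rank = max(1, math.ceil(pct / 100.0 * len(sorted_values)))
    return sorted_values[rank - 1]


def percentile_range(values, percentile=99.0):
    """Return (low, high) after cutting (100 - percentile) / 2 from each tail."""
    tail = (100.0 - percentile) / 2.0
    ordered = sorted(values)
    return nearest_rank(ordered, tail), nearest_rank(ordered, 100.0 - tail)


# 100 well-behaved activations plus a single large outlier.
activations = list(range(1, 101)) + [10_000]
low, high = percentile_range(activations, percentile=98.0)
```

With a plain min/max range the outlier would stretch the upper bound to 10000, whereas the percentile range stays close to the bulk of the data.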
Changming Sun
2406a425a7
Move the common part of Windows CPU CI pipeline to a template file (#9650) 2021-11-03 17:35:51 -07:00