Commit graph

7863 commits

Author SHA1 Message Date
Scott McKay
912e50f61c
Add CI minimal build with all options disabled. Fix python binding code if sparse tensors are disabled. (#9898)
* Add 2 builds to validate the cmake defines for excluding optional components work in both full and minimal builds.

* Create empty config for no-ops build

* Create empty config for no-ops build - attempt #2

* Create empty config for no-ops build - attempt #3

* Update python binding code to work when sparse tensors are disabled.
2021-12-03 06:56:51 +10:00
Justin Stoecker
5190953f6b Merged PR 6728025: Merge latest ORT master branch
Successful build: https://microsoft.visualstudio.com/WindowsAI/_build/results?buildId=42253044&view=logs&j=4e3dd845-80f6-5682-7592-d2eda4c1196a&t=b3ed071a-d172-5351-0dfc-02424ddb17a7

```
Printing log to show merge was successful.
1bbf04e75564b074a4c5a6fc051a357882fb2c76 Merge remote-tracking branch 'upstream/master' into dmldev_temp
3f5c1e1c58 Update to include the Xamarin targets for internal ORT builds so the managed nuget package is consistent as both CPU and GPU builds produce a package called Microsoft.ML.OnnxRuntime.Managed. (#9906)
```
2021-12-02 20:56:10 +00:00
Justin Stoecker
2e9dc8a0e2 Merge remote-tracking branch 'upstream/master' into tmp_merge 2021-12-02 10:32:14 -08:00
Dwayne Robinson
5ba4079546 Merge branch 'DmlDev' into user/dwayner/FnsCandyTolerance30696168 2021-12-02 01:04:49 -08:00
Dwayne Robinson
7a3abd863f Update WinML model test tolerances for tiny_yolov2 and FNS_Candy 2021-12-02 00:48:54 -08:00
Scott McKay
3f5c1e1c58
Update to include the Xamarin targets for internal ORT builds so the managed nuget package is consistent as both CPU and GPU builds produce a package called Microsoft.ML.OnnxRuntime.Managed. (#9906) 2021-12-02 18:43:34 +10:00
Jeff Daily
8d88a6ac7f
add --amdgpu-target=gfx90a (#9820) 2021-12-01 22:28:52 -08:00
Scott McKay
00c979db4d
Update doc for operators/opsets supported by mobile package (#9899) 2021-12-02 13:51:22 +10:00
RandySheriffH
078782ea3c
exclude test case from reduced-op build (#9895)
Co-authored-by: Randy Shuai <rashuai@microsoft.com>
2021-12-01 15:50:36 -08:00
Ryan Lai
3f82b2d5b1 Merged PR 6723735: Need to unsquash the last RI to maintain history
Related work items: #37220320
2021-12-01 22:13:25 +00:00
Ryan Lai
73d106e33a Merge commit '9345894c823de67f913d907da91dbc07ac322041' into HEAD 2021-12-01 14:11:29 -08:00
Sherlock
175acf08f4
ScatterND supports negative indices (#9739)
* ScatterND supports negative indices
2021-11-30 21:17:32 -08:00
Hariharan Seshadri
c161813217
Misc InstanceNorm CUDA kernel changes (#9879) 2021-11-30 19:15:45 -08:00
Sherlock
6de79d82c8
Fix Training Packaging pipeline (#9885)
* Fix Training Packaging pipeline
2021-11-30 15:26:10 -08:00
Ryan Lai
d8a7e1d159 Merged PR 6718335: RI 11/30 from github
Pipeline green https://microsoft.visualstudio.com/WindowsAI/_build/results?buildId=42142807&view=results

![image.png](https://microsoft.visualstudio.com/274e76ac-6b29-4f77-a85d-7914c77cabd5/_apis/git/repositories/853d2ddc-663c-4fe8-8036-dbf0d50db2d9/pullRequests/6718335/attachments/image.png)

Related work items: #37220320
2021-11-30 21:29:25 +00:00
Abhishek Jindal
740679d329
Abjindal/fix windows ci pipeline (#9883)
* switching to /wd4800 for eager mode

* fixing compile flags ignore warnings, previously it was only using the last one
2021-11-30 10:33:13 -08:00
RandySheriffH
9345894c82
Add build option to enable cuda profiling (#9875) 2021-11-29 22:44:50 -08:00
RandySheriffH
16bfd3c771
Cancel transpose optimizer for resize (#9870)
* cancel transpose optimizer for resize

* add UT

* addressing comments

* fix build err
2021-11-29 22:30:16 -08:00
Chi Lo
0baf687f2d
QDQ tool modification part2 (#9720)
* Add finetuned qdq options

* Add description

* Add unit tests

* Modify for channel axis

* Remove too specific feature. Move this implementation to e2e example

* Add OpTypesSupportPerChannelQuantization

* fix bug for unit test

* Keep flags OpTypesSupportPerChannelQuantization and QDQChannelAxis for internal use 

Will have a follow-up PR to fine tune the code

* remove unnecessary warning

Co-authored-by: stevenlix <38092805+stevenlix@users.noreply.github.com>
Co-authored-by: Yufeng Li <liyufeng1987@gmail.com>
2021-11-29 21:45:41 -08:00
Ryan Lai
57a6f7c205
Various fixes to fix WindowsAI RI build. (#9877)
* WAI RI fixes

* span changes

* Spaces

* Additional warnings to fix

* Fix redundant commment
2021-11-29 21:33:15 -08:00
Yufeng Li
a0afd7303d
add int8_t support for pool operators (#9852)
* add int8_t support for pool operators
2021-11-29 18:43:43 -08:00
George Wu
ec9b0ed800
[python manylinux package] emit warning if missing CUDA/TensorRT dependency causes ld_preload to fail and user tries to register either CUDA/TensorRT EP (#9872)
* add warning if ld_preload fails for CUDA or TRT when trying to register either provider

* refactor

* change wording from register to create
2021-11-29 16:04:23 -08:00
Yulong Wang
27e337ed7a
[js/node] npm audit fix (#9861) 2021-11-29 13:11:19 -08:00
Ye Wang
bf716e667c
layernorm throw error if input has no data (#9837) 2021-11-29 12:43:17 -08:00
cristei
9e75ebf0dc
Remove redundant inline specifiers, sync server IsLittleEndianOrder with runtime core (#9856)
* Remove redundant inline

* Make server IsLittleEndianOrder represent runtime core endianness implementation, make the endianness check constexpr
2021-11-29 08:32:16 -08:00
Tang, Cheng
37bf46eb19
support print in ort eager mode (#9825)
* fix reshape implementation in eager mode

* test code

* update opgen script to support fallback to cpu

* enhance the eager backend to support torch cpu fallback

* add more testes

* disable the printensor test for now, as we need to erge a PR to pytorch first
2021-11-29 08:03:57 -08:00
George Wu
1e9e57df3e
no fallback when enforcing explicit EP registration. (#9863)
* no fallback when enforcing explicit EP registration.

* add explicit ep registrations for python.
2021-11-25 07:26:51 -08:00
Yulong Wang
a3ebc5e082
[js/web] do not use nodejs type 'Buffer' in web (#9839)
* [js/web] do not use nodejs type 'Buffer' in web

* resolve comments and validate tests

* remove 'Buffer' in test
2021-11-24 14:14:42 -08:00
Yulong Wang
6eb0c8d420
fix build break in release pipeline for Node.js binding test (#9850) 2021-11-24 13:54:43 -08:00
Sheil Kumar
53c43e9949
WinML RT API: Add PixelRange Metadata to Bind() call PropertySet (#9827)
* Enable Normalization Binding Metadata

* copy paste error

* Small fix.

Co-authored-by: Sheil Kumar <sheilk@microsoft.com>
2021-11-24 13:44:25 -08:00
Hariharan Seshadri
18fd2cf457
Fix potential data race with OrtValue usage in Python (#9841) 2021-11-23 14:34:24 -08:00
Maajid khan
0ae0f29f14
[OpenVINO-EP] V3.4 Release with OpenVINO 2021.4.2 LTS Release (#9848)
* Changes to ensure openvino build go through in Windows

* Modified Hetero plugin Logic

*Modified Hetero Feature logic. In Hetero,
if the operator to be marked true in getcapability(),
it should be supported by either of the devices
specified with HETERO in the device_type.

Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com>

* OV updated to 2021.4.2 version

* OV updated to 2021.4.2 version

* Updated OV to 2021.4.2 version, mono download  link and dotnet version

* Copying Managed nugets in openvino c# docker file

*Copying Managed nuget to nugets artifacts
directory

Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com>

Co-authored-by: saharfraza <sfatima.3001@gmail.com>
Co-authored-by: mayavijx <mayax.vijayan@intel.com>
Co-authored-by: Aravind Gunda <aravindx.gunda@intel.com>
2021-11-23 13:12:08 -08:00
Ginés Hidalgo
926109bd63
Added algorithm for std::clamp (#9797) 2021-11-23 12:58:05 -08:00
Dmitri Smirnov
d012d9fb91
Fix memset size (#9840) 2021-11-23 09:19:55 -08:00
Ryan Hill
6749e9fd44
Cuda instance_norm fix (#9826)
* Fix allocation size & initial values
2021-11-22 22:59:20 -08:00
Vincent Wang
24f3d72b77
relax atol and rtol for einsum ut (#9842) 2021-11-23 14:39:14 +08:00
RajalakshmiSR
8564fc1933
POWER10: Add optimized dgemm kernel (#9652)
* POWER10: Add optimized dgemm kernel

This patch makes use of POWER10 matrix multiply assist feature and
adds new DGEMM kernel.

* Indentation update

Co-authored-by: Rajalakshmi Srinivasaraghavan <rajis@linux.ibm.com>
2021-11-22 20:28:21 -08:00
jingyanwangms
bf5e9a5044
bumping up ORT_API_VERSION to 10 (#9838)
Co-authored-by: Jingyan Wang <jingywa@OrtTrainingDev3.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>
2021-11-22 20:27:45 -08:00
Scott McKay
fb4a8e12fc
Limit inclusion of Xamarin mobile target frameworks. (#9834)
- Only set them as targets for the ORT nuget package
    - Use OrtPackageId as the condition for inclusion, if installed
      - need to do the nuget restore via msbuild so that this property is set correctly
  - Add desktop-only version of the C# sln as there is no way to exclude the mobile specific csproj's from an sln
    - use this when applicable if someone is running build.py with the `--build_nuget` flag

Other
  - remove attempt to include symbols in the nuget package as nuget doesn't support symbols in native packages
  - update build.py to use `nuget` and not a windows specific path and filename for a linux build with `--build_nuget`
2021-11-23 11:29:53 +10:00
Yulong Wang
74ca417c0e
[js/web] optimize bundle file size (#9817)
* es2017 by default for ort-common

* add visualizer and define plugin

* es2017 for ort-web. also add build target for es5

* add multiple reduced size build for ort-web

* resolve comments, add e2e tests and add docs
2021-11-22 13:56:55 -08:00
Edward Chen
bcc6ab29f6
Trim DataTypeImpl binary size (#9813)
* De-virtualize DataTypeImpl::AsXType() functions.
* Refactor helpers.
2021-11-22 12:06:24 -08:00
Dmitri Smirnov
567749b2dc
Expose IOBinding SynchronizeInputs/Outputs via C/C++/C# And Python APIs (#9823)
Add C/C++ APIs for SynchronizeBoundInputs/Outputs
 Add python bindings
 Expose SynchronizeBoundInputs/Outputs to C# API
2021-11-22 09:45:31 -08:00
dependabot[bot]
8dcd388ec5 Bump ansi-regex from 5.0.0 to 5.0.1 in /js
Bumps [ansi-regex](https://github.com/chalk/ansi-regex) from 5.0.0 to 5.0.1.
- [Release notes](https://github.com/chalk/ansi-regex/releases)
- [Commits](https://github.com/chalk/ansi-regex/compare/v5.0.0...v5.0.1)

---
updated-dependencies:
- dependency-name: ansi-regex
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
2021-11-22 16:04:10 +00:00
dependabot[bot]
c779f2cad5 Bump path-parse from 1.0.6 to 1.0.7 in /js
Bumps [path-parse](https://github.com/jbgutierrez/path-parse) from 1.0.6 to 1.0.7.
- [Release notes](https://github.com/jbgutierrez/path-parse/releases)
- [Commits](https://github.com/jbgutierrez/path-parse/commits/v1.0.7)

---
updated-dependencies:
- dependency-name: path-parse
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
2021-11-22 16:03:45 +00:00
Changming Sun
88d8d2d02a
Remove Copy NuGet Package to Azure Blob Store build step (#8991) 2021-11-22 08:03:02 -08:00
Yi-Hong Lyu
efcdbac4e7
Fix ONNX Runtime Mobile link in js/web/README.md (#9828) 2021-11-21 17:58:03 -08:00
Chi Lo
3a85ade511
Avoid segmentation fault when creating session with TRT EP or OpenVINO EP using python (#9814)
* fix segmentation fault

* fix typo

* fix bug

* make logic the same as CUDA ep

* Modify for OpenVINO

* Add env variable check for OpenVIO

* refine the code

* refine EP failed registration warning messages.

* update OpenVINO exception message.

Co-authored-by: George Wu <jywu@microsoft.com>
2021-11-20 07:52:50 -08:00
Dwayne Robinson
7396689c2b
Merge pull request #9765 from microsoft/user/dwayner/DML1.8forORT1.10
Update DirectML 1.5.1 to 1.8.0 for ORT1.10
2021-11-19 22:39:07 -08:00
Ye Wang
6856619b18
Decoder Attention CUDA Op (#9792)
* add kernel interface

* register kernel

* add self/cross qkv projection without cache

* add LaunchTransQkv2 for (S,B,X,N,H) -> (X,B,N,S,H)

* refactor ConcatPastToPresent

* DecoderQkvToContext interface

* q,k,v buffer and cache as output

* qk, pv and transctx

* fix compiler error on linux machine

* key_padding_mask

* add test_parity file. However not runnable

* add partial unittest

* made partial attributes to inputs

* --gen_doc

* change kernel interface, add more tests

* morre parity tests

* fix test

* fix typo

* transpose optimizer has bug. remove it temporarily

* add input shape checks

* add type/shape inference

* fix cache shape check

* fix rocm build failure

* fix rocm build error

* review comments

* review comments
2021-11-19 19:25:36 -08:00
Zhang Lei
16ddaf564c
Fix uninitialized warning by some compiler. (#9822)
Co-authored-by: Lei Zhang <phill.zhang@gmail.com>
2021-11-19 17:47:14 -08:00