Commit graph

7863 commits

Author SHA1 Message Date
RandySheriffH
058108bef9
Execution Provider Profiler (#8406)
* implement cuda provider

* define profiler common

* call start after register

* add memcpy event

* add cuda correlation

* format code

* add cupti to test path

* switch to CUpti_ActivityKernel3

* reset cupti path

* fix test case

* fix trt pipeline

* add namespace

* format code

* exclude training from testing

* remove mutex
2021-09-28 13:59:52 -07:00
Suffian Khan
6f580f07de
Switch AMD CI pipeline to use environment image from onnxruntimecibuildenvironment (#9206)
* shift docker image reference for amd ci pipeline

* fix service endpoint

* reduce perf tolerance
2021-09-28 13:06:16 -07:00
Changming Sun
1104e8d3e5
Linux Anybuild build pipeline (#9091) 2021-09-28 11:22:27 -07:00
ytaous
d3f859fe30
Dropout Vectorized Kernel (#9157)
* vectorized kernel

* fix build

* re-calibrate expected loss

* fix build

* re-calibrate convergence results

* more re-calibrate on loss

* divide kernels

* adress comments

* more calibration

* calibration

* per comments

* enable sync

Co-authored-by: Ethan Tao <ettao@OrtTrainingDev4.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>
2021-09-27 17:19:12 -07:00
Wei-Sheng Chin
1b0816859f
Only wrap sub-modules which can be wrapped as ORTModule (#9021) 2021-09-27 17:18:22 -07:00
Yufeng Li
1a71687102
Add QDQ for output of node (#9134)
* Add QDQ for output of node

* keep output of removable activation
2021-09-27 16:55:23 -07:00
Sheil Kumar
f16bb37fae
make uwp store apps link to statically-linked crt desktop builds (#9182)
Co-authored-by: Sheil Kumar <sheilk@microsoft.com>
2021-09-27 15:37:15 -07:00
baijumeswani
c30cc9190a
Change the agent pool for orttraining-distributed pipeline (#9179) 2021-09-26 21:26:44 -07:00
Chi Lo
9fda95fec9
Change keepdims of ReduceMax/ReduceMin to always 1 when using quatization calibration MinMax approach (#9167)
* Change keepdims to always 1

* fix typo

* Refine code
2021-09-25 10:13:54 -07:00
baijumeswani
fd91bf91c9
Print full stacktrace exception when exporter fails (#9169) 2021-09-24 10:24:37 -04:00
Vincent Wang
39dc6ea8a3
Fix to_dlpack Failure on PyTorch-1.10 (#9151)
* workaround to_dlpack fail in new pt version

* add torch code link
2021-09-24 09:48:07 +08:00
George Wu
0888c6cc59
fix bug introduced by PR9130 (#9166) 2021-09-23 10:58:33 -07:00
Guoyu Wang
ddafe50199
Fix Android build break after Virtual Environment update to 20210919 (#9163) 2021-09-23 10:07:18 -07:00
Chi Lo
bde16eea68
Cleanup USE_TENSORRT macro (#8593)
* Remove USE_TENSORRT macro and disable TRT EP at runtime if not support

* Remove USE_TENSORRT macro and disable TRT EP at runtime if not support

* Remove USE_TENSORRT macro and disable TRT EP at runtime if not support

* handle unused parameters

* Remove USE_TENSORRT macro and disable TRT EP at runtime if not support

* Remove USE_TENSORRT macro and disable TRT EP at runtime if not support

* handle unused parameters

* Disable some testcases

* only include opset13 for testing and add a keyword filter set

* rename variable

* add back code which was accidentally commented on previous commit

* Adjust model test filter for opset14
2021-09-22 21:04:44 -07:00
ke1337
6e83392ff1
Bump up TVM version to avoid conflict with existing one (#9159)
* Bump up tvm version

* Bump up onnxruntime-tvm version

There are some c++17 related fixes in TVM

Co-authored-by: KeDengMS <kedeng@microsoft.com>
2021-09-22 17:39:19 -07:00
Ye Wang
4934455ab6
Bumping up to 1.10 (#9006)
* bump to 1.10

* Update Versioning.md

* Update README.rst

* Change opset version to 15
2021-09-22 16:34:28 -07:00
Jason
4e5bc8365b
Add Paddle2ONNX to Versioning.md (#9067)
* Add Paddle2ONNX to Versioning.md
2021-09-22 13:38:14 -07:00
chethanpk
267fb898e3
Added code to support Softmaxgrad for DNNL EP (#9022)
* Added code to support Softmaxgrad

Signed-off-by: Chethan Palangotu Keshava <chethan.palangotu.keshava@intel.com>

* Bringing back the opset filters for softmax that I had removed.
This will fix the test failures from onnnx repo.

Signed-off-by: Chethan Palangotu Keshava <chethan.palangotu.keshava@intel.com>
2021-09-21 23:30:58 -07:00
Dwayne Robinson
675ea9b305 Merged PR 6486412: Register ORT DML ONNX operators for int64
Now that DML has int64 support directly, register the related operators for uint64/int64 (rather than the hack in the ORT DML EP with doubled strides).

## Remaining work
- Not implemented in DML: CumSum, Range, MaxPool/MaxUnpool, TopK, ReduceProd/Sum/SumSquare/L1
- Implemented in DML but need DML EP kernel work: Clip, Pad, Neg, Range, ConstantOfShape

```
te.exe OnnxConformanceTests.dll
Summary: Total=4454, Passed=4147, Failed=0, Blocked=0, Not Run=0, Skipped=307
```

Corresponding PR: https://microsoft.visualstudio.com/WindowsAI/_git/WindowsAI/pullrequest/6486426

Related work items: #28761231, #33883294
2021-09-22 00:49:18 +00:00
sfatimar
1db21da5ae
Replaced onnx build with pypi installation (#9139)
Co-authored-by: mayavijx <mayax.vijayan@intel.com>
2021-09-21 15:57:28 -07:00
Thiago Crepaldi
153767bab4
Add internal determinism flag configuration for ORTModule (#9074) 2021-09-21 15:11:41 -04:00
Changming Sun
b175f98dcc
Do not generate nuget symbol packages on Linux (#9131) 2021-09-21 02:45:20 -07:00
Guoyu Wang
4df94a631d
[NNAPI EP] Fix MaxPool error using uint8 (#9129)
* fix issue in maxpool running using uint8

* Minor update
2021-09-21 00:44:43 -07:00
Hariharan Seshadri
f7dedc9002
Fix default initialization value in C API header (#9126)
* fix default initialization value in C API header

* Fix conflicts

* Nits
2021-09-20 20:58:13 -07:00
Changming Sun
02b9213736
Fix a bug for Openvino Python binding (#9130) 2021-09-20 20:39:17 -07:00
Pranav Sharma
dae37dc946
Fix S360 issue by using "use strict" for javascript code. (#9128) 2021-09-20 20:32:44 -07:00
Yulong Wang
49b329e266
[js/api] add typedoc and revise comments (#9077)
* [js/api] add typedoc and revise comments

* update document

* fix lint error

* use config file for typedoc
2021-09-20 17:54:46 -07:00
Yulong Wang
750e2e0481
[js/web] check session ID in releaseSession() (#9105) 2021-09-20 17:49:53 -07:00
Tianlei Wu
5e197698b8
Fuse attention for BERT without num_heads, hidden_size (#9108)
* fuse attention given wrong num_heads, hidden_size
* update comments
2021-09-20 12:14:33 -07:00
Suffian Khan
47888392ab
Fix nightly CI pipeline to generate ROCm 4.2 wheels and add ROCm 4.3.1 wheels (#9101)
* make work for both rocm 4.2 and rocm 4.3.1

* fix rocm 4.3.1 docker image reference

* fix CUDA_VERSION to ROCM_VERSION

* fix ReduceConsts conflict def

* add ifdef to miopen_common.h as well

* trailing ws
2021-09-19 23:36:03 -07:00
Yufeng Li
23e9c0a7f1
move quantization examples to onnxruntime-inference-examples (#9107) 2021-09-19 21:50:18 -07:00
Ryan Hill
6ae5f7a244
C API Docs - Add build instructions (#9106)
* Update Doxyfile, add build instructions to header
* Update paths in README.md
2021-09-17 18:40:27 -07:00
Ryan Hill
b876e5675b
C API Enum Name Fixes (#9092) 2021-09-17 15:11:26 -07:00
Yulong Wang
0183492931
simplify Web CI pipeline (#9094) 2021-09-17 13:00:07 -07:00
Guoyu Wang
438175cb34
Build shared host protoc in iOS CI pipelines (#9087)
* iOS build, share host protoc build

* Change android ci

* ios packaging ci

* checkout submodule

* revert

* update package pipeline

* minor update
2021-09-16 17:16:50 -07:00
Maajid khan
7fc28cd539
[OpenVINO-EP] UEP v3.1 Release with OpenVINO 2021.4.1 (#9081)
* 2021.4.1 Docker and ci changes

* OV version change

* Removing Imagescaler op from the op's list

Reverting this change which was added in last
PR. Imagescaler is now deprecated. so removing
it from the supported list. Also this
op is causing regression in the performance
of the FP16 models.

Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com>

* Re-writing the help message for num_of_threads

Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com>

Co-authored-by: Aravind Gunda <aravindx.gunda@intel.com>
2021-09-16 17:09:07 -07:00
Ryan Hill
280e79463a
FIll in more documentation (#9088)
Fix plural values with %s
Fix more symbol links
Add custom header for web metrics
2021-09-16 17:08:27 -07:00
Ryan Hill
26509465f0
Add default C++ initialization to OrtCUDAProviderOptions (#9064)
* Add default C++ initialization to OrtCUDAProviderOptions
2021-09-16 15:03:58 -07:00
Tiago Koji Castro Shibata
12515552d1
Remove cpuinfo from WCOS builds (#9076) 2021-09-16 12:05:47 -07:00
Yulong Wang
498461485f
optimize WebAssembly CI pipeline (#9058) 2021-09-16 01:07:36 -07:00
Tracy Sharpe
4828d2ebb1
MLAS: port aarch64 sgemv kernel to Windows ARM64 (#9071) 2021-09-15 18:40:40 -07:00
Guoyu Wang
bee5c26580
Add CPU_ONLY runtime option to NNAPI EP (#9066)
* Add NNAPI cpu only option

* update java

* Update comments
2021-09-15 15:50:18 -07:00
Suffian Khan
e758870b18
Upgrade ROCm CI pipeline for ROCm 4.3.1 and permit run inside container (#9070)
* try to run inside 4.3.1 container

* no \ in container run command

* remove networking options

* try with adding video render groups

* add job to build docker image

* try without 1st stage

* change alpha, beta to float

* try adding service connection

* retain huggingface directory

* static video and render gid

* use runtime expression for variables

* install torch-ort

* pin sacrebleu==1.5.1

* update curves for rocm 4.3.1

* try again

* disable determinism and only check tail of loss curve and with a much larger threshold of 0.05

* disable RoBERTa due to high run variablity on ROCm 4.3.1

* put reduction unit tests back in
2021-09-15 12:32:02 -07:00
austinpagan
a05e32803a
Fixing MORE mlas unittest failures in POWER (#8673) 2021-09-15 11:39:46 -07:00
Sheil Kumar
273494ee9e
Ensure ms-experimental domain Audio Ops build in mac pipeline (#8857)
* Globally enable ms-experimental ops

* change meaning of ms_experimental to mean *all* ms_experimental ops. Some experimental ops will still be enabled globally without this flag like audio ops.

* add cmath

* add cmath to signal_defs.cc

* move audio back into experimental, verify on mac

* remove experimental from mac builds

Co-authored-by: Sheil Kumar <sheilk@microsoft.com>
2021-09-15 10:59:32 -07:00
ashbhandare
98ac341c5b
Filter nones from ctx saved tensors (#9063)
Co-authored-by: Aishwarya Bhandare <aibhanda@5cb7a9c3931a4b19a66ae028b49221a6000001.ahkw4qp232huflxlm4gmpq4nbh.jx.internal.cloudapp.net>
2021-09-15 10:13:45 -07:00
Changming Sun
4930320647
Delete linux-pytorch-custom-ops-ci-pipeline.yml (#9023) 2021-09-14 21:51:21 +00:00
Changming Sun
0270ab17c5
Set onnxruntime_DISABLE_RTTI to default OFF (#9049) 2021-09-14 13:53:02 -07:00
Edward Chen
32366fea02
[Objective-C API] WIgnore clang documentation warnings from C/C++ header usage. (#9057) 2021-09-14 13:03:48 -07:00
Tianlei Wu
3ec3e9f705
Add t-test to compare experiments in GPT-2 mixed precision conversion (#9042)
* Add t-test to compare two experiments
* Ranking based on pair-wise T-test results and a custom scoring function
2021-09-14 12:40:25 -07:00