Commit graph

11997 commits

Author SHA1 Message Date
Jason
4e5bc8365b
Add Paddle2ONNX to Versioning.md (#9067)
* Add Paddle2ONNX to Versioning.md
2021-09-22 13:38:14 -07:00
chethanpk
267fb898e3
Added code to support Softmaxgrad for DNNL EP (#9022)
* Added code to support Softmaxgrad

Signed-off-by: Chethan Palangotu Keshava <chethan.palangotu.keshava@intel.com>

* Bringing back the opset filters for softmax that I had removed.
This will fix the test failures from onnnx repo.

Signed-off-by: Chethan Palangotu Keshava <chethan.palangotu.keshava@intel.com>
2021-09-21 23:30:58 -07:00
Dwayne Robinson
675ea9b305 Merged PR 6486412: Register ORT DML ONNX operators for int64
Now that DML has int64 support directly, register the related operators for uint64/int64 (rather than the hack in the ORT DML EP with doubled strides).

## Remaining work
- Not implemented in DML: CumSum, Range, MaxPool/MaxUnpool, TopK, ReduceProd/Sum/SumSquare/L1
- Implemented in DML but need DML EP kernel work: Clip, Pad, Neg, Range, ConstantOfShape

```
te.exe OnnxConformanceTests.dll
Summary: Total=4454, Passed=4147, Failed=0, Blocked=0, Not Run=0, Skipped=307
```

Corresponding PR: https://microsoft.visualstudio.com/WindowsAI/_git/WindowsAI/pullrequest/6486426

Related work items: #28761231, #33883294
2021-09-22 00:49:18 +00:00
sfatimar
1db21da5ae
Replaced onnx build with pypi installation (#9139)
Co-authored-by: mayavijx <mayax.vijayan@intel.com>
2021-09-21 15:57:28 -07:00
Thiago Crepaldi
153767bab4
Add internal determinism flag configuration for ORTModule (#9074) 2021-09-21 15:11:41 -04:00
Changming Sun
b175f98dcc
Do not generate nuget symbol packages on Linux (#9131) 2021-09-21 02:45:20 -07:00
Guoyu Wang
4df94a631d
[NNAPI EP] Fix MaxPool error using uint8 (#9129)
* fix issue in maxpool running using uint8

* Minor update
2021-09-21 00:44:43 -07:00
Hariharan Seshadri
f7dedc9002
Fix default initialization value in C API header (#9126)
* fix default initialization value in C API header

* Fix conflicts

* Nits
2021-09-20 20:58:13 -07:00
Changming Sun
02b9213736
Fix a bug for Openvino Python binding (#9130) 2021-09-20 20:39:17 -07:00
Pranav Sharma
dae37dc946
Fix S360 issue by using "use strict" for javascript code. (#9128) 2021-09-20 20:32:44 -07:00
Yulong Wang
49b329e266
[js/api] add typedoc and revise comments (#9077)
* [js/api] add typedoc and revise comments

* update document

* fix lint error

* use config file for typedoc
2021-09-20 17:54:46 -07:00
Yulong Wang
750e2e0481
[js/web] check session ID in releaseSession() (#9105) 2021-09-20 17:49:53 -07:00
Tianlei Wu
5e197698b8
Fuse attention for BERT without num_heads, hidden_size (#9108)
* fuse attention given wrong num_heads, hidden_size
* update comments
2021-09-20 12:14:33 -07:00
Suffian Khan
47888392ab
Fix nightly CI pipeline to generate ROCm 4.2 wheels and add ROCm 4.3.1 wheels (#9101)
* make work for both rocm 4.2 and rocm 4.3.1

* fix rocm 4.3.1 docker image reference

* fix CUDA_VERSION to ROCM_VERSION

* fix ReduceConsts conflict def

* add ifdef to miopen_common.h as well

* trailing ws
2021-09-19 23:36:03 -07:00
Yufeng Li
23e9c0a7f1
move quantization examples to onnxruntime-inference-examples (#9107) 2021-09-19 21:50:18 -07:00
Ryan Hill
6ae5f7a244
C API Docs - Add build instructions (#9106)
* Update Doxyfile, add build instructions to header
* Update paths in README.md
2021-09-17 18:40:27 -07:00
Ryan Hill
b876e5675b
C API Enum Name Fixes (#9092) 2021-09-17 15:11:26 -07:00
Yulong Wang
0183492931
simplify Web CI pipeline (#9094) 2021-09-17 13:00:07 -07:00
Guoyu Wang
438175cb34
Build shared host protoc in iOS CI pipelines (#9087)
* iOS build, share host protoc build

* Change android ci

* ios packaging ci

* checkout submodule

* revert

* update package pipeline

* minor update
2021-09-16 17:16:50 -07:00
Maajid khan
7fc28cd539
[OpenVINO-EP] UEP v3.1 Release with OpenVINO 2021.4.1 (#9081)
* 2021.4.1 Docker and ci changes

* OV version change

* Removing Imagescaler op from the op's list

Reverting this change which was added in last
PR. Imagescaler is now deprecated. so removing
it from the supported list. Also this
op is causing regression in the performance
of the FP16 models.

Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com>

* Re-writing the help message for num_of_threads

Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com>

Co-authored-by: Aravind Gunda <aravindx.gunda@intel.com>
2021-09-16 17:09:07 -07:00
Ryan Hill
280e79463a
FIll in more documentation (#9088)
Fix plural values with %s
Fix more symbol links
Add custom header for web metrics
2021-09-16 17:08:27 -07:00
Ryan Hill
26509465f0
Add default C++ initialization to OrtCUDAProviderOptions (#9064)
* Add default C++ initialization to OrtCUDAProviderOptions
2021-09-16 15:03:58 -07:00
Tiago Koji Castro Shibata
12515552d1
Remove cpuinfo from WCOS builds (#9076) 2021-09-16 12:05:47 -07:00
Yulong Wang
498461485f
optimize WebAssembly CI pipeline (#9058) 2021-09-16 01:07:36 -07:00
Tracy Sharpe
4828d2ebb1
MLAS: port aarch64 sgemv kernel to Windows ARM64 (#9071) 2021-09-15 18:40:40 -07:00
Guoyu Wang
bee5c26580
Add CPU_ONLY runtime option to NNAPI EP (#9066)
* Add NNAPI cpu only option

* update java

* Update comments
2021-09-15 15:50:18 -07:00
Suffian Khan
e758870b18
Upgrade ROCm CI pipeline for ROCm 4.3.1 and permit run inside container (#9070)
* try to run inside 4.3.1 container

* no \ in container run command

* remove networking options

* try with adding video render groups

* add job to build docker image

* try without 1st stage

* change alpha, beta to float

* try adding service connection

* retain huggingface directory

* static video and render gid

* use runtime expression for variables

* install torch-ort

* pin sacrebleu==1.5.1

* update curves for rocm 4.3.1

* try again

* disable determinism and only check tail of loss curve and with a much larger threshold of 0.05

* disable RoBERTa due to high run variablity on ROCm 4.3.1

* put reduction unit tests back in
2021-09-15 12:32:02 -07:00
austinpagan
a05e32803a
Fixing MORE mlas unittest failures in POWER (#8673) 2021-09-15 11:39:46 -07:00
Sheil Kumar
273494ee9e
Ensure ms-experimental domain Audio Ops build in mac pipeline (#8857)
* Globally enable ms-experimental ops

* change meaning of ms_experimental to mean *all* ms_experimental ops. Some experimental ops will still be enabled globally without this flag like audio ops.

* add cmath

* add cmath to signal_defs.cc

* move audio back into experimental, verify on mac

* remove experimental from mac builds

Co-authored-by: Sheil Kumar <sheilk@microsoft.com>
2021-09-15 10:59:32 -07:00
ashbhandare
98ac341c5b
Filter nones from ctx saved tensors (#9063)
Co-authored-by: Aishwarya Bhandare <aibhanda@5cb7a9c3931a4b19a66ae028b49221a6000001.ahkw4qp232huflxlm4gmpq4nbh.jx.internal.cloudapp.net>
2021-09-15 10:13:45 -07:00
Changming Sun
4930320647
Delete linux-pytorch-custom-ops-ci-pipeline.yml (#9023) 2021-09-14 21:51:21 +00:00
Changming Sun
0270ab17c5
Set onnxruntime_DISABLE_RTTI to default OFF (#9049) 2021-09-14 13:53:02 -07:00
Edward Chen
32366fea02
[Objective-C API] WIgnore clang documentation warnings from C/C++ header usage. (#9057) 2021-09-14 13:03:48 -07:00
Tianlei Wu
3ec3e9f705
Add t-test to compare experiments in GPT-2 mixed precision conversion (#9042)
* Add t-test to compare two experiments
* Ranking based on pair-wise T-test results and a custom scoring function
2021-09-14 12:40:25 -07:00
G. Ramalingam
7d28b596f4
Add function-body to opschema of FastGeluGrad (#9028)
* Add function body to FastGeluGrad

* Add test case
2021-09-14 12:27:55 -07:00
Suffian Khan
4322f7e647
Fix ROCm wheels CI pipeline break by installing latest protobuf from source (#9047)
* install protobuf from source

* fix rm command in Dockerfile

* fix options on rm command

* fix cd into protobuf source directory

* try again

* remove strip step

* debug list the files

* ls on /usr

* more debug

* more debug

* adjust LD_LIBRARY_PATH

* try remove protobuf before ORT build
2021-09-14 12:07:00 -07:00
Guoyu Wang
cf70635d2a
Add Android executable drop in the Package pipeline (#9050)
* add copy executable for android job

* minor fix

* Variable fix

* Move to use tgz because zip is not part of the docker image

* update compression
2021-09-14 11:45:33 -07:00
Yulong Wang
be80698698
[js/web] a bugfix and add tests for wasm proxy worker (#9048)
* [js/web] add tests for wasm proxy worker

* fix script src override
2021-09-14 10:38:58 -07:00
Edward Chen
e574be4a53
[C API Docs] Add docs for run options tag/log level accessors/modifiers. (#9045)
Add documentation for these C API functions:
RunOptionsGetRunLogSeverityLevel
RunOptionsGetRunLogVerbosityLevel
RunOptionsGetRunTag
RunOptionsSetRunLogSeverityLevel
RunOptionsSetRunLogVerbosityLevel
RunOptionsSetRunTag

Update some existing documentation.
2021-09-14 08:53:35 -07:00
mindest
6036a6b915
Add type int64 for Equal, float types for ReduceSum (ROCm) (#9010) 2021-09-14 00:07:30 -07:00
Sherlock
9174cbe3d5
Optimize CUDA Kernel for 3D and 4D Transpose (#8928)
* Optimize Transpose120 and Transpose102

* Generalize Transpose0123 for more input shapes

* Add Transpose3D test cases

* update rocm kernel
2021-09-13 23:00:53 -07:00
Tianlei Wu
5969d576e5
Revert "disable half2 kernel by dfault (#9034)" (#9044)
This reverts commit 289999af35.
2021-09-13 17:25:25 -07:00
baijumeswani
34f37d2920
Disable fallback for ortmodule api tests (#9018) 2021-09-13 16:00:13 -07:00
Guoyu Wang
c709380c52
Add full iOS job in package pipeline (#9036)
* Add full ios xcframework job

* create zip file of the xcframework
2021-09-13 15:54:11 -07:00
baijumeswani
1422a9ba6b
Remove previous temporary fixes and address TODOs (#9020) 2021-09-13 10:10:07 -07:00
Edward Chen
011cb8fd48
Fix Where op type reduction processing (#9033)
* Update type reduction script to track Where Op's second input type.

* Clean up op_kernel_type_control.h includes.

* Use more maintainable include.
2021-09-13 08:37:58 -07:00
mindest
a1021a1cf4
Add BatchNorm kernel for ROCm (#9014)
* Add BatchNorm kernel for ROCm, update BN test

* correct epsilon_ setting; limit min epsilon
2021-09-13 15:15:05 +08:00
Rajalakshmi Srinivasaraghavan
e83cc534d4 Fix cmake POWER10 detection
Recent commit 60c98a8 changed variable mlas_common_srcs which affects
POWER10 detection.
2021-09-12 11:56:55 -07:00
Hariharan Seshadri
c674343d94
Remove document text from error message in a couple of ops (#9003) 2021-09-11 08:37:52 -07:00
Ryan Hill
c3321b1778
Fix NVTX profiling so it can run in the shared CUDA provider (#9035)
* Move NVTX profiling so it can run in the shared provider properly
2021-09-11 00:35:54 -07:00