Commit graph

5097 commits

Author SHA1 Message Date
Scott McKay
cccd61e3bc
Add int64 as a required type to ConstantOfShape as it's used by the pytorch converter for Pad. (#8128)
It's also used pointlessly for torch.tensor.repeat (although that usage should always be able to be constant folded).
2021-06-23 14:53:06 +10:00
Edward Chen
b1e21312b5
[Mobile package] Update required operator config with additional ops for newer version of Wav2Vec 2. (#8123)
This is an update to https://github.com/microsoft/onnxruntime/pull/8079
The sample application motivating the original update changed to use an updated version of the model. Now, fewer ops are required. This change removes the previously added ops which are no longer needed.
2021-06-22 19:19:46 -07:00
Evgenii Indenbom
664e548e31 Col2im optimization by eliminating integer multiplications:
1. No padding branch performance is improved 8 times
2. Symmetric padding branch is generalized for asymmetric padding case (padding symmetry was not actually used) and further optimized by eliminating integer multiplications.
2021-06-22 18:44:20 -07:00
Changming Sun
6e2b064aec
Delete some unused code in run_dockerbuild.sh and Enable Nuget CUDA tests (#8089)
1. Remove some unused code and simplify tools/ci_build/github/linux/run_dockerbuild.sh.
2. Enable Nuget CUDA tests. The original design was that we could leverage Directory.Build.props and let cmake generate the required properties (USE_CUDA/...) there. However, in the nuget packaging pipeline we test the package on a different host that doesn't run the cmake command and doesn't have the auto-generated Directory.Build.props file.
2021-06-22 18:43:33 -07:00
Guoyu Wang
f6292d9b38
[Android] Output error message to android log instead of stderr (#8114)
* Output error message to android log instead of stderr

* Address CR comments, move macro to a helper function

* Address CR comments

* Fix ort minimal build break
2021-06-22 17:50:06 -07:00
Guoyu Wang
9003df5d87
Fix 32bit Android java API crash (#8122)
* Fix 32bit Android java API crash

* fix code formatting
2021-06-22 17:41:11 -07:00
Yufeng Li
4bb0e29d0e
initialize generated_value_names with graph input (#8085)
* initialize generated_value_names with graph input
* use set for following usage
2021-06-22 15:08:54 -07:00
Ryan Lai
839f69d249
Implement WINRT_IMPL_LoadLibraryW to avoid calling LoadLibraryW directly (#8065)
* Override load library w in cppwinrt

* Add comment
2021-06-22 14:31:20 -07:00
Shucai Xiao
e7d7fa8fa2
Update migraphx to rocm4.2 (#7994)
* update dockerfile for migraphx ep

* update to rocm4.2

* code cleanup

* fix error related to onnx unit tests
2021-06-22 13:39:51 -07:00
Changming Sun
5809890ba2
Fix a compile error in InferenceTest.cs (#8119) 2021-06-22 13:01:35 -07:00
Sunghoon
8cacb26946
remove debug.keystore from repository due to a credential issue report (#8113) 2021-06-22 10:15:10 -07:00
Chi Lo
27d1784d44
Add TRT 7.1 Pipeline (#8073)
* Revert for testing TensorRT 7.1

* change to original googletest version

* change machine

* remove build arg

* change back machine

* revert back googletest version

* Make it ready to merge to master

* revert onnx-tensorrt to v7.1

* rename yml

* use [[ ]] in bash command

* add sudo

* add chmod

* add correct path

* use another way to revert onnx-tensorrt

* change docker image to manylinux build
2021-06-21 20:57:04 -07:00
chethanpk
3cd06cb38c
Added support for ReduceMean on DNNL EP for CPU and GPU (#7902)
* Added support for ReduceMean on DNNL EP for CPU and GPU

Signed-off-by: Chethan Palangotu Keshava <chethan.palangotu.keshava@intel.com>

* Added fix for a resnet model failure where it was failing to create dst shape for reducemean when it was part of a subgraph with other ops

Signed-off-by: Chethan Palangotu Keshava <chethan.palangotu.keshava@intel.com>

* Removing the DNNL EP from these unit tests. This is in anticipation of two changes:
- DNNL EP unit tests would be added in a different location later on, so addition of EP individually to these tests will not be necessary
- This was causing a memory leak failure in the debug build. The bug is in the EP itself and not in the code added for reducemean. The fix for this is in the i/o handling overhaul which will be added later.

* Update reduction_ops_test.cc

Had accidentally deleted a new line. Making sure there are no unnecessary changes in this file
2021-06-21 17:15:46 -07:00
Du Li
352d560fd5
Adding Conv+Clip fusion (#8102) 2021-06-21 16:30:12 -07:00
Chandru Ramakrishnan
10b7ed6430
Added op_name to message when we are missing a kernel. (#8110)
* Added op_name to message when we are missing a kernel.

* Added domain and version

* Added missing ,
2021-06-21 14:45:53 -04:00
Changming Sun
cba4bc11c7
Split Linux CPU CI pipeline (#8097) 2021-06-21 10:52:30 -07:00
Bowen Bao
51c12a715b
Add NGramRepeatBlock contrib op (#8078)
**Description**: 
Enforce no repetition of n-grams. Scores are set to `-inf` for tokens that form a repeated n-gram if added to the back of the input_ids.

**Motivation and Context**
Needed by transformer models in sequence generation algorithms (greedy search and beam search). This module has a heavy impact on performance, and can be highly parallelized.
2021-06-21 10:21:48 -07:00
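The n-gram blocking rule described in 51c12a715b can be sketched in plain Python. This is a minimal single-sequence illustration of the stated behaviour, not the contrib op's implementation; the function name `block_repeated_ngrams` and its parameters are assumptions made for this example.

```python
import math


def block_repeated_ngrams(input_ids, scores, ngram_size):
    """Return a copy of `scores` with -inf for tokens that would repeat an n-gram.

    input_ids:  token ids generated so far (one sequence).
    scores:     next-token scores, indexed by token id.
    ngram_size: length n of the n-grams that must not repeat.
    (All names here are hypothetical, chosen for illustration.)
    """
    blocked = list(scores)
    n = ngram_size
    # With fewer than n tokens there is no complete n-gram to repeat yet.
    if n <= 0 or len(input_ids) < n:
        return blocked

    # The last n-1 tokens are the prefix of whatever n-gram the next token completes.
    prefix = input_ids[len(input_ids) - (n - 1):] if n > 1 else []

    # Scan every existing n-gram; if its first n-1 tokens equal the prefix,
    # its last token would recreate that n-gram, so ban it.
    for start in range(len(input_ids) - n + 1):
        if input_ids[start:start + n - 1] == prefix:
            blocked[input_ids[start + n - 1]] = -math.inf
    return blocked


if __name__ == "__main__":
    # Sequence 1 2 3 1 2 with n=3: appending 3 would repeat the trigram (1, 2, 3).
    print(block_repeated_ngrams([1, 2, 3, 1, 2], [0.0] * 5, 3))
```

Each sequence in a batch is scanned independently, which is consistent with the commit's remark that the work can be highly parallelized.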
Sherlock
5ac06bad61
Relax test tolerance to make CI more reliable (#8100) 2021-06-21 07:41:54 -07:00
Tang, Cheng
059d705988
support pass in custom op registry for eager mode (#8087)
* support pass in custom op registry for eager mode

* fix the comments
2021-06-20 13:38:09 -07:00
pengwa
9f5969693a
clean up builds for interop_torch (#8017)
* clean up builds for interop_torch

* add python dependency for executables

* disable onnxruntime_ENABLE_TRAINING_TORCH_INTEROP by default; enable it in ortmodule GPU training pipeline only

* disable training unrelated tests when torch interop is enabled

* simplify the python dependency.

* clean up and fix
2021-06-19 13:41:07 +08:00
Thiago Crepaldi
5c2e1bbb0a
Fix input schema extractor for ORTModule (#8098) 2021-06-18 21:47:49 -07:00
baijumeswani
7701c8703e
Add module attribute to ORTModule to support HuggingFace Trainer save_model (#8088) 2021-06-18 13:13:45 -07:00
Hariharan Seshadri
08eeb8763d
Loosen validation checks in Concat to unblock execution of model in #8020 (#8080) 2021-06-18 11:14:36 -07:00
Olivia Jain
b2247ece25
Make Perf Test Configurable (#7836)
- Allow anyone to kick off a perf test here. Customize: branch, eps, model selection, cuda version.
- Only run shape inference when required.
- Kill errored out memory processes.
- Remove warmup run.
- Clean up script.
- Standalone_TRT is its own "EP" rather than an additional run with the TRT EP
2021-06-18 11:11:19 -07:00
Edward Chen
aa68157c3d
[Mobile package] Update required operator config with additional ops for wav2vec2. (#8079)
Add some additional ops to the mobile package that are needed for the wav2vec2 model.
2021-06-17 13:08:15 -07:00
Guoyu Wang
d83f7fd4aa
[NNAPI EP] Enable Slice support (#8031)
* Enable slice for NNAPI EP

* Add ANEURALNETWORKS_STRIDED_SLICE support

* Addressed CR comments

* Addressed CR comments, rename PrepareForCompute to PrepareForComputeHelper to avoid confusion
2021-06-17 12:36:12 -07:00
Changming Sun
96989b83ee
Create python packages for DML (#8061) 2021-06-16 16:59:12 -07:00
Nick Kreeger
d924fd205b
Create and move quantization tests to a shared Quantized utils file. (#8054)
* Create a shared quantization util for all unit tests.

* Cleanup qlinear_binary_op_test.cc

* save

* save

* save

* cleanup

* save

* cleanup for linux build
2021-06-16 17:00:36 -05:00
Guoyu Wang
32ef39be58
[Android] Move adding header files into AAR to Gradle (#8068)
* Move adding header files into AAR to Gradle

* fix gradle format violation
2021-06-16 12:03:42 -07:00
Ryan Hill
1d8edd0b5b
Fix missing files on linux (#8066) 2021-06-16 11:05:03 -07:00
Wei-Sheng Chin
c76172fab6
Fix PythonOp with input which has no gradient (#8011)
* Fix PythonOp with input which has no gradient

* Fix another bug which happens when inputs require gradient

* Remove comments

Co-authored-by: Peng Wang <pengwa@microsoft.com>
2021-06-17 00:19:41 +08:00
Vincent Wang
de8f2ecda9
Reduce Kernel Optimization (#8067)
* reduce optimization

* bug fix

* add a check

* add ut

* refactor

* add ut cases for keepdims=true
2021-06-16 19:53:46 +08:00
Ryan Hill
0ebaa71f49
Improve Windows Platform system error messages (#8063) 2021-06-15 22:17:35 -07:00
Chen Fu
32e118bef0
Fix microbenchmark build failure (#8064)
Co-authored-by: Chen Fu <fuchen@microsoft.com>
2021-06-15 20:49:39 -07:00
Tang, Cheng
e31784b6cf
decouple the python module construction from pybind_state (#8060)
* fix broken tests

* decouple the module construction to a separate file
2021-06-15 18:52:26 -07:00
Changming Sun
96cf533c76 Remove DML from Windows GPU CUDA 10.2 pipeline 2021-06-15 16:53:24 -07:00
George Wu
25c49a5fe0
fix issue with cmake path (#8055) 2021-06-15 15:09:15 -07:00
iperov
07b166bb1b fix PATH addition in windows
should set PATH, not append a copy of PATH to its tail
2021-06-15 14:18:00 -07:00
Sunghoon
887c3149e3
[js/react_native] Use a mobile ORT instead of a full ORT (#8042)
* Change full ort to mobile ort

* Update Android example to load mobile ort

* Change the format of test models to ort

* update ios to use mobile ort

* revise README

* use onnxruntime-mobile-c CocoaPods in a npm package
2021-06-15 13:36:05 -07:00
Nick Kreeger
6a1b000125
Fix unit test typo in test_op_embed_layernorm.py (#8056) 2021-06-15 15:27:44 -05:00
Changming Sun
07788e082e
Enable python GPU tests (#7854) 2021-06-15 10:24:58 -07:00
G. Ramalingam
8079c76383
Create ORT opschema library (#7903)
* Op schema library

* Create ORT opschema library and sample app

* delete message in cmake

* Fix cmake

* Address PR feedback and add dependency

* Add cmake dependency

* Cmake fix

* Add dependency for nsync

* Add dependency for nsync

* Reorder dependencies

* Testing for dependencies on all platforms

* Resolve dependencies on GetStackTrace, floatToHalf

* Compiler strict-aliasing warning

* Merge with master

* Minor cleanup
2021-06-14 14:02:33 -07:00
Olivia Jain
c72a8c7ff4
Upgrade tf 2.4.1 to 2.4.2 for component governance (#8036)
* Upgrade tf 2.4.1 to 2.4.2 for component governance

* Trial run with tf 2.5.0
2021-06-14 09:30:58 -07:00
George Nash
9acf93b90a
Take graph topology into account when creating dnnl subgraphs (#7910)
Check the inputs of all nodes are part of the subgraph for all
operators.  Previously the code assumed all operators only had
a single input except for the "Sum" operator.

This resolves an issue seen when adding new operators: a subgraph
was incorrectly accepting a node that it should have rejected,
because it was not following the topology of the nodes.

Signed-off-by: George Nash <george.nash@intel.com>
2021-06-13 19:23:37 -07:00
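The check described in 9acf93b90a (every input of a node must belong to the subgraph, rather than assuming a single input) can be illustrated with a small Python sketch. The `Node` type, the tensor names, and both helper functions are hypothetical; the DNNL EP's actual data structures are not shown in this log, and the "first input only" function is just a simplified stand-in for the earlier single-input assumption.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    # Hypothetical minimal node: an operator name plus the tensor names it reads.
    op_type: str
    inputs: list = field(default_factory=list)


def first_input_only(node, subgraph_tensors):
    """Simplified stand-in for the old assumption: look at one input only."""
    return bool(node.inputs) and node.inputs[0] in subgraph_tensors


def all_inputs_in_subgraph(node, subgraph_tensors):
    """Topology-aware check: every input must already be part of the subgraph."""
    return all(name in subgraph_tensors for name in node.inputs)


if __name__ == "__main__":
    subgraph_tensors = {"relu_out"}
    add = Node("Add", ["relu_out", "side_input"])
    # The single-input view accepts the node even though "side_input"
    # comes from outside the subgraph; the full check rejects it.
    print(first_input_only(add, subgraph_tensors))
    print(all_inputs_in_subgraph(add, subgraph_tensors))
```

The second function reflects the design choice stated in the commit: the decision depends on the node's full set of inputs, so a new multi-input operator does not need a special case like the one "Sum" previously had.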
Xavier Dupré
6d7461795f
Update Version.md (#8021)
Correct the supported opset for 1.8.0.
2021-06-13 18:52:40 +02:00
Pranav Sharma
ad6a306a7f
Add pragma once (#8040) 2021-06-11 23:47:26 -07:00
Scott McKay
96ead2be91
Avoid hashing the operator type in the GraphViewer priority node check unless the string has a chance of matching. (#7972)
* Avoid hashing the operator type in the GraphViewer priority node check unless the string has a chance of matching.

Below are perf numbers from a test that loads 16 models multiple times. I was checking that some unrelated changes didn't have unexpected perf cost and found the PriorityNodeCompare overwhelmed any contribution the other changes were making.

*Before*

CPU Time:74.678s

CPU Time for relevant Top Hotspots
std::_Hash_array_representation<char> 20.834s
onnxruntime::PriorityNodeCompare::IsHighPri 7.589s
onnxruntime::Graph::KahnsTopologicalSort 4.487s

*After*

CPU Time:47.103s

CPU Time for relevant Top Hotspots
onnxruntime::Graph::KahnsTopologicalSort 4.465s
onnxruntime::PriorityNodeCompare::IsHighPri 2.873s
2021-06-12 14:11:33 +10:00
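The idea in 96ead2be91, skipping the hash unless the string could match, can be sketched as follows. This is only an illustration under assumptions: the set contents ("Shape", "Size") and the function name are chosen for the example, and the real change is in C++, where `std::hash` of a string is recomputed on every lookup. CPython caches string hashes, so this sketch shows the control flow and not a measurable speedup.

```python
# Assumed set of high-priority operator types, for illustration only.
HIGH_PRI_OPS = frozenset({"Shape", "Size"})


def is_high_pri(op_type):
    """Return True if `op_type` is in HIGH_PRI_OPS, rejecting cheaply first."""
    # Cheap rejection: a length comparison and one character comparison.
    # Every member of the assumed set starts with "S" and has 4 or 5 characters,
    # so anything else cannot match and never reaches the hashed lookup.
    if len(op_type) not in (4, 5) or op_type[0] != "S":
        return False
    # Only plausible candidates pay for hashing the whole string.
    return op_type in HIGH_PRI_OPS


if __name__ == "__main__":
    for name in ("Shape", "Size", "Slice", "Conv", "MatMul"):
        print(name, is_high_pri(name))
```

In the profile quoted above, the `std::_Hash_array_representation<char>` hotspot (20.834s) is absent from the "After" list, which is the effect a pre-check of this kind is meant to produce.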
Edward Chen
6e134c2cc3
[Objective-C API] Add support for documentation generation (#7999)
Adding support for generating API documentation with the Jazzy tool.
It's a manual process now, but we can eventually make it a part of the release pipeline.
2021-06-11 17:49:00 -07:00
Nick Kreeger
1d7f44a832
Add unit test for EmbedLayerNormalization quantization op. (#8033) 2021-06-11 17:33:55 -05:00
Ye Wang
e6225c62a5
transformers test CI pipeline fix (#8016)
* init checkin

* Restore initial environment

* -y

* testtest

* fix

* fix indent
2021-06-11 12:57:52 -07:00