Commit graph

4288 commits

Author SHA1 Message Date
M. Zeeshan Siddiqui
cb8d8464bc
Do not create compute stream when external CUDA allocator is used. (#6833) 2021-02-26 20:13:02 -08:00
Ye Wang
b4b87ac7a0
update (#6827) 2021-02-26 13:58:41 -08:00
Pranav Prakash
d5175795d2
Improvements to quantizer: Removed unused qType field, add reshape op (#6179)
* Handle case where bias_name is already quantized

If bias is shared between multiple nodes and we've already quantized it, just return the quantized name from the map

* Remove qType attribute from QuantizedValue and QuantizedInitializer

These are unused (and were incorrectly set in the case of int8 quantization)

* Add Reshape op to quantizer

* Add test for Reshape quant
2021-02-26 10:21:37 -08:00
Surya Siddharth Pemmaraju
3426108739
Fixed issue in python cmake to update wheel package (#6384)
* Fixed issue in python cmake to update wheel package

* Fixes python cmake issue for OV EP

Added post build step for libonnxruntime_providers_openvino
that copies the updated libonnxruntime_providers_openvino.so file
to /onnxruntime/capi directory every time this target is rebuilt.

Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com>

* Removed post_build step from onnxruntime_python.cmake

Now that we have added the post build step to copy
onnxruntime_providers_openvino.so and providers_shared.so
to /onnxruntime/capi directory in onnxruntime_providers.cmake file.
so removing the duplication of the same from here.

Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com>

* Fixed python cmake issue for OpenVINO-EP

->Fixed issue for both Linux and windows

Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com>

Co-authored-by: MaajidKhan <n.maajidkhan@gmail.com>
2021-02-26 06:34:43 -08:00
Chi Lo
9b3171e95c
Make keepdims to its default value when adding ReduceMin/ReduceMax for quantization calibration (#6788)
* Make keepdims to its default value when adding ReduceMin/ReduceMax

* Fix bug for adding ReduceMin/ReduceMax with keepdims=1
2021-02-25 09:47:59 -08:00
Olivia Jain
db05d53b94
Setup perf in docker and add features (#6582)
* setup scripts to run in docker 
* percent threshold for accuracy 
* branch testing
2021-02-25 09:31:03 -08:00
stevenlix
d5f292ab73
fix issues caused by quantize/calibrate changes (#6802) 2021-02-25 05:41:21 -08:00
Maajid khan
7465673e33
[OpenVINO-EP] Find package changes (#6801)
* Find package changes to cmake

* Removing unwanted code from cmake

Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com>

Co-authored-by: suryasidd <surya.siddharth.pemmaraju@intel.com>
2021-02-25 05:12:57 -08:00
Suffian Khan
8a148e44fb
make ci pipeline also run batch and convergence test (#6798) 2021-02-24 20:18:03 -08:00
Hariharan Seshadri
ab1713f5cc
Fix regression in constant folding optimizer (#6795) 2021-02-24 19:10:14 -08:00
Weixing Zhang
40fa40f3ce
Enable more unit tests for ROCM EP (#6776)
* enable more ops and unit tests for ROCM EP
2021-02-24 15:20:50 -08:00
Tianlei Wu
f4acdb2ecd
Update transformers benchmark for transformers 4.3.* and ORT 1.7 (#6796)
* update benchmark for transformers 4.* and ORT 1.7

* Fix gpt2 onnx conversion for transformers 4.3.*. Add a check of transformer version >= 3.1.

* remove code related to openmp

* update pretrain model list: keep representitive models only
2021-02-24 12:52:35 -08:00
fthielke
71a70ecf6e
Allow 3D ConvTranspose in CUDA execution provider (#6794)
Co-authored-by: Felix Thielke <felix.thielke@mevis.fraunhofer.de>
2021-02-24 11:53:31 -08:00
Adam Pocock
5a473216b7
[Java] Adds extra providers (#6770)
Add providers for CoreML, ROCM, NNAPI, ArmNN
 Adding the structs for OrtCUDAProviderOptions and OrtOpenVINOProviderOptions
 Updating NNAPI flags.
 Adding the new CoreML flag.
 Adding hooks to the build system to tell Java about the new providers.
2021-02-24 10:25:05 -08:00
Ye Wang
47c8e9ad28
Adding fp16 support for Einsum Cuda kernel (#6775)
* checkin einsum fp16 support

* remove unnecessary code

* add tests

* add another test
2021-02-24 01:15:29 -08:00
jingyanwangms
c02ec38f8a
[Running CI now] Remove duplicate tests to speed up CI (#6768)
* remove tests to speed up CI

* add back _into_data_parallelism tests to see how long the CI test takes

* remove unnecessary save calls

* add back data_parallelism_full_precision_bart_path

* add data_parallelism_full_precision_path

* remove data parallelism tests

Co-authored-by: Jingyan Wang <jingywa@OrtTrainingDev3.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>
2021-02-23 23:21:06 -08:00
Scott McKay
7aa69a1f2a
Support building python bindings when building unit tests is manually disabled. (#6771)
* If unit tests are manually excluded via `--cmake_extra_defines onnxruntime_BUILD_UNIT_TESTS=OFF` (e.g. testing changes to binary size where you want to keep the build time as quick as possible) it should still be possible to create the python bindings.

Update CMakeLists.txt to decouple the inclusion of onnxruntime_python.cmake from unit tests being enabled.

Update onnxruntime_python.cmake so it works when unit tests are disabled. Also skip copying of test py files when unit tests are disabled.
2021-02-24 14:21:57 +10:00
Ori Levari
b8b41e3775
Update DirectML 1.4.1 to 1.4.2 for ORT 1.7 (#6780)
Co-authored-by: Ori Levari <orlevari@microsoft.com>
2021-02-23 10:52:10 -08:00
Edward Chen
09a5d6a9dc
Update docs/ONNX_Runtime_for_Mobile_Platforms.md with info about op type reduction. (#6747) 2021-02-23 10:25:23 -08:00
Edward Chen
5db0c9c648
Enable CI to cover globally allowed types (#6778)
Add test to CI build to cover type reduction with globally allowed types.
2021-02-23 10:24:12 -08:00
Tianlei Wu
8703e2c778
update benchmark_longformer for default test suite (#6772) 2021-02-22 22:00:59 -08:00
Tianlei Wu
8a6f6bc38b
add --enable_cuda_line_info to build.py (#6773) 2021-02-22 22:00:21 -08:00
Guoyu Wang
f5313cc4ce
[CoreML EP] Add options to enable CoreML EP only on hardware with Apple Neural Engine (#6765) 2021-02-22 18:55:27 -08:00
Edward Chen
ee35be0129
Support specifying globally allowed types from build script (#6677)
Add initial support for constraining operator kernel implementations (which support this type-granularity) to a set of allowed types from scripts.
2021-02-22 14:05:00 -08:00
Ivan Stojiljkovic
c91f314217
Add robust dependency check for Python package (#6436)
* Add robust dependency check for Python package

* Add version_info.py to .gitignore

* Fix Linux build

* Fix Windows CPU build

* Fix Windows 32-bit build

* Minor tweak

* Generate version_info.py earlier in onnxruntime_python.cmake

* Print a user-friendly message if cuDNN is not found in

* Relax version requirements for CUDA 11 - only the major version has to match

* Fix PATH environment variable to include CUDA 11 in 'Python packaging pipeline' (Windows/GPU)

* Fix the build with cuDNN 7
2021-02-21 15:11:28 -08:00
Scott McKay
9f7dffb109
Make it easier to fold MakeString usages to reduce binary size. (#6754)
* Decay char[n] to char* so that we get better re-use of MakeString invocations.
2021-02-20 20:41:19 +10:00
Tianlei Wu
3bda7f4d36
Fix longformer parity and perf regression (#6760)
* add fast kernel back, update benchmark and conversion scripts
2021-02-19 21:47:36 -08:00
Guoyu Wang
47519623cd
Fix iOS/macOS build warning for inconsistent symbol visibility settings (#6750)
* Fix ios/macos build warning

* Corrent a typo

* Fix test failure

* Fix custom op test failure
2021-02-19 12:19:32 -08:00
Nat Kershaw (MSFT)
c170061998
Removed BUILD.md from master as source now lives in gh-pages (#6709) 2021-02-19 11:34:21 -08:00
Faith Xu
20d6613efb
Add direct link to build instructions on readme (#6729) 2021-02-19 10:56:50 -08:00
Yulong Wang
a25c8e52a9
fix link to samples in nodejs README (#6746) 2021-02-19 10:56:22 -08:00
liqunfu
79b966b01a
. (#6751)
make ort training pert test green. remove unneeded yaml
2021-02-19 09:03:58 -08:00
Chi Lo
67c478ede4
Entropy method for calibration-based quantization (#6619)
* Add entropy method

* Update pre/post-preprocessing of yolov3

* Code refactor

* Code refactor
2021-02-18 05:50:59 -08:00
S. Manohar Karlapalem
3722dd2692
Update docs MCR image publishing policy change (#6743)
ONNX Runtime 1.7 will the last release that will publish MCR
container images for ONNX Runtime with OpenVINO EP. From ONNX
Runtime 1.8 onwards, this will be discontinued. Users are advised
to switch to using PyPi packages or build their own containers
using dockerfiles.
2021-02-18 05:45:46 -08:00
stevenlix
53eb948f4c
Upgrade TensorRT to v7.2.2 (#6452)
* upgrade to TensorRT 7.2.2

* extend GPU tensorrt CI timeout to 150 minutes

* update docker image name

* disable user interaction to avoid tensorrt container stuck when install tzdata

* upgrade to libssl1.1 for ubuntu20.04

* remove libicu60 from ubuntu20.04

* add libicu66 for ubuntu20.04

* debug

* llvm

* llvm

* disable ReverseSequenceTest.InvalidInput

* disable ReverseSequenceTest.InvalidInput

* fix issues

* fix issues

* Update linux-gpu-tensorrt-ci-pipeline.yml

* disable warning 4458 for TensorRT parser

* update onnx-tensorrt submodule

* disable warnings for TensorRT parser

* update onnx-tensorrt submodule to include latest bug fixes

* update setup_env_trt

* update pool for win trt ci pipeline'

Co-authored-by: George Wu <jywu@microsoft.com>
2021-02-18 04:30:47 -08:00
Yufeng Li
b1a12b49b7
Avoid removing constant weight that is graph output (#6735) 2021-02-17 19:55:19 -08:00
Olivia Jain
ea3aee4d5f
Bumping up version to 1.7 (#6736)
* bumping up version to 1.7

* Windows AI should align with ORT Version
2021-02-17 19:07:38 -08:00
M. Zeeshan Siddiqui
e44ac6524f
Plug n Allocate with external CUDA allocator via PyBind. (#6679) 2021-02-17 18:59:38 -08:00
liqunfu
dd8ef4409a
Liqun/migrate perf test (#6733)
move ort training perf tests to azure devops
2021-02-17 17:48:47 -08:00
liqunfu
2c5e603bad
Liqun/nuphar nuget (#6656)
create nuphar nuget with correct name
2021-02-17 16:13:07 -08:00
Ramakrishnan Sivakumar
a5bef6886b
Threading support for Hybrid core architecture (#6728) 2021-02-17 15:35:07 -08:00
Guoyu Wang
6810d98ea3
Update links to gh-pages for ORT minimal documents (#6721)
* Fix broken link in ort minimal docs

* Update link of build.md to gh-pages
2021-02-17 14:34:50 -08:00
Justin Stoecker
af4e5c0c6e
Minor WinML model test skip name change 2021-02-17 14:27:58 -08:00
Maajid khan
b41e9b5d4c
[OpenVINO-EP] Fixes OpenVINO-EP build on windows (#6726)
* Fixes OpenVINO-EP windows build

Openvino EP build is broken on windows. The issue
is wchar_t is UTF-16 on windows while on other platforms
such as Linux and MacOS, wchar_t is UTF-32.

so wide Unicode string has to be converted to an UTF8 string
for sure on windows.

This commit fixes this issue.
2021-02-17 13:49:03 -08:00
Yulong Wang
9a9202a218
[Node.js binding] update dependency typedoc (#6720) 2021-02-17 10:22:05 -08:00
Changming Sun
0be5475de6
Update packaging pipelines(#6664) 2021-02-17 09:53:36 -08:00
Changming Sun
46c06f6ac7
Change Windows GPU CI pipeline to CUDA11 (#6616) 2021-02-17 09:44:44 -08:00
Changming Sun
eefeacd828
Skip running gpt2 model in C# x86 (#6722) 2021-02-17 09:37:16 -08:00
Derek Murray
b8d5fa812c
Fix typo in README.md (#6713)
Fixes #6710.
2021-02-17 09:29:30 -08:00
Wei-Sheng Chin
9e67b88c83
Use local rank as GPU ID (#6719) 2021-02-17 22:42:54 +08:00