Commit graph

3358 commits

Author SHA1 Message Date
Wenbing Li
2a456d16c0
Enable onnxruntime iOS shared library build. (#5148) 2020-09-14 10:32:39 -07:00
ashbhandare
cc3212f9d5
Add fp16 pow kernel (#5016)
* Add fp16 pow kernel

* Fix test added for non-cuda runs
2020-09-14 10:01:39 -07:00
Moshe David
1d6a21fd08
[TensorRT] Add slightly faster hash computation for vector<int> (#5142)
* w

* w

Co-authored-by: modav <modav@microsoft.com>
2020-09-14 09:01:59 -07:00
sfatimar
0c7e9fb52a
changes to ensure compilation issues in windows is fixed by disabling the level 3 warning 4267 (#5147)
while a more permanent fix is found

Co-authored-by: sfatimar <sahar.fatima@intel/com>
2020-09-14 08:59:41 -07:00
RandySheriffH
9392aa2f64
Promote Cuda version to 10.2 for windows pipelines (#5138) 2020-09-13 20:32:06 -07:00
Guoyu Wang
6fcd99f6ed
Some minor updates for ORT mobile (#5146)
* Minor update ios build instructions and other comments

* Create shared string for nodearg name
2020-09-13 17:28:12 -07:00
Guoyu Wang
e5892dd6f3
Change the version check of ort format save/load (#5140)
* change the version check of ort format save/load

* Address PR comments, update the unit test ort models

* Update some variable names to code convention

* Move IsOrtModelVersionSupported inside of #if defined(ENABLE_ORT_FORMAT_LOAD)
2020-09-11 23:11:31 -07:00
Scott McKay
323a1ba8a4
Add option to exclude support for loading ORT format models in full build. (#5129)
* Add ability to exclude support for loading ORT format models.
Disable support for ORT format models in packages
2020-09-12 12:21:30 +10:00
Ye Wang
5302fe4079
A fix in load_pretrained_model() (#5137)
* Fix in load_pretrained_model

* Update onnx_exporter.py
2020-09-11 17:23:02 -07:00
Hariharan Seshadri
849bb8653f
Exclude a few python tests for the DML EP (#5135) 2020-09-11 16:40:02 -07:00
Guoyu Wang
698eccf15e
Add iOS build instruction (#5125)
* ios build instruction

* fix logger issue in onnx_model_info

* Revert "fix logger issue in onnx_model_info"

This reverts commit 72f2b88256ccf29c75fefbcd1daf6b4dcf7e0c61.

* Address comments and fix small issue in iOS build
2020-09-11 16:10:36 -07:00
Tianlei Wu
7511021e0e
Save Gpt2 test data (#5132)
(1) Save gpt2 test data during test generation.
(2) Use torch fp32 model as baseline when onnx model is fp16.
(3) Refine logic to compose onnx model path
2020-09-11 14:31:49 -07:00
RandySheriffH
120e3cda74
fix path (#5131)
Co-authored-by: Randy Shuai <rashuai@microsoft.com>
2020-09-11 12:18:07 -07:00
Rayan-Krishnan
92a8c650ad
[Debuggability] Add feature to ORTTrainer Frontend (#5124)
* add option, feature to orttrainer and test

* address comments

* minor fixes

* further address comments

* minor changes

Co-authored-by: Rayan Krishnan <t-rakr@OrtDevTest2v100.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>
2020-09-11 12:16:07 -07:00
Ye Wang
89509f256a
Not fuse SkipLayerNorm when add has initializer input (#5123) 2020-09-11 11:46:31 -07:00
Ashwini Khade
cd56ab197c
csharp build documentation (#5121) 2020-09-11 11:46:10 -07:00
dependabot[bot]
15d431f39b Bump node-fetch from 2.6.0 to 2.6.1 in /nodejs
Bumps [node-fetch](https://github.com/bitinn/node-fetch) from 2.6.0 to 2.6.1.
- [Release notes](https://github.com/bitinn/node-fetch/releases)
- [Changelog](https://github.com/node-fetch/node-fetch/blob/master/docs/CHANGELOG.md)
- [Commits](https://github.com/bitinn/node-fetch/compare/v2.6.0...v2.6.1)

Signed-off-by: dependabot[bot] <support@github.com>
2020-09-11 11:45:37 -07:00
Tianlei Wu
ccfbc56388
Handle dummy mask in Attention operators (#5108)
* Handle dummy mask with shape (1, 1) or (batch_size, 1).
2020-09-11 09:31:03 -07:00
stevenlix
c794c88ae0
Solve name conflict in TensorRT engine caching (#5128)
* fix hash conflict

* Add verbose for engine deserialization and destroy old engine memory if new engine is generated

* update parser

* Update tensorrt_execution_provider.cc

* use a better hash algorithm

* Update tensorrt_execution_provider.cc
2020-09-11 09:12:56 -07:00
Guoyu Wang
51f3d3af72
Enable onnxruntime_perf_test for ORT minimal build (#5126)
* Enable onnxruntime_perf_test for ort minimal build

* Add error message

Co-authored-by: gwang0000 <62914304+gwang0000@users.noreply.github.com>
2020-09-11 01:58:11 -07:00
Scott McKay
59ee8ffb17
Remove SparseTensor support from minimal build. (#5114)
* Remove SparseTensor support from minimal build.

Currently the only valid usage of a SparseTensor is as an attribute of a Constant node. That would have been lifted to a dense tensor initializer when loading the onnx model, so would not exist when saving the ORT format model. Due to that there can be no SparseTensors in an ORT format model.

Co-authored-by: gwang <wanggy@outlook.com>
2020-09-11 17:56:54 +10:00
Ye Wang
879751f3b7
Support Tensorflow benchmarking and onnx export in transformers tool (#5068)
* init checkin for tf export and tf benchmark

* small fix on argparse

* refactor

* review comments

* review comments
2020-09-11 00:47:37 -07:00
Changming Sun
c5efb0085d
Update Linux GPU build pipelines to CUDA 10.2 (#5120)
* Update Linux GPU build pipelines to CUDA 10.2
2020-09-10 17:40:51 -07:00
Ashwini Khade
a8557b3f0f
skip tests when model opset > released opset (#5096)
* skip tests when model opset > released opset

* remove multiple model load

* nit fixes

* plus some comments
2020-09-10 17:25:32 -07:00
Hariharan Seshadri
782ccff207
Add dll probe path so that the right DirectML.dll is loaded while running C# tests (#5104) 2020-09-10 16:19:21 -07:00
Wei-Sheng Chin
5618b9dddc
Use CMake built-in function to compare NCCL version (#5118)
* Use CMake built-in function to compare version

* Address comment
2020-09-10 15:59:47 -07:00
Tianlei Wu
c5d4ae0401
Add transformers tools to python package (#5090)
* Add transformers to onnxruntime python package
2020-09-10 15:42:15 -07:00
Moshe David
61051396e8
[TensorRT] Align naming convention and remove redundant code (#5094) 2020-09-10 15:03:34 -07:00
Scott McKay
fae5915d76
CMake fixes/tweaks for minimal builds and MinSizeRel builds (#5112)
* Fix places where MinSizeRel wasn't having relevant flags added in the same way as Release and RelWithDebInfo
Enable LTO for minimal build. Cleanups onnx_minimal.cmake to remove some things handled when LTO is enabled in CMakeLists.txt

* Only enable LTO for MSVC in a minimal build
2020-09-11 06:50:28 +10:00
Changming Sun
a5530358c9
Fix a path problem in Dockerfile.manylinux2014_cuda10_2 (#5106) 2020-09-10 10:30:13 -07:00
Changming Sun
47554a0422
Disable some tests (#5103) 2020-09-10 08:15:18 -07:00
Ryan Hill
3207de276c
Remove IDeviceAllocator class as it doesn't extend IAllocator in any way. (#5067) 2020-09-10 00:46:35 -07:00
Guoyu Wang
5b6643cefb
Move ort flatbuffers header to use enum class instead of enum (#5105)
* change fbs to scoped enum

* modify ort code to use new fbs header

Co-authored-by: gwang0000 <62914304+gwang0000@users.noreply.github.com>
2020-09-10 17:17:49 +10:00
Guoyu Wang
433061531e
Enable onnx_test_runner for ort format (#5100)
* Enable onnx_test_runner using ort format, for ort minimal build only

Co-authored-by: gwang0000 <62914304+gwang0000@users.noreply.github.com>
2020-09-10 17:15:19 +10:00
Tiago Koji Castro Shibata
62848c4de5
Add store builds to nuget packaging (#5040)
* Nuget store packaging

* Move DNNL workaround to EP

* Fix warning as error

* Disable store tests

* Skip store tests

* msbuild target

* Cross compile protoc in Store

* Disable DML in store

* Move store builds to CPU queue

* Copy uap10 to final nuget

* Fix pip8 error

* Remove extra dml copies

* Fix argparse

* pep8

* Forward IsStoreBuild

* Apply is_store_build to duplicate generate_nuspec

* runtimes

* Refactor uap10

* Store .NET

* uap

* PR feedback
2020-09-09 21:38:14 -07:00
Wei-Sheng Chin
9ba56dcfed
Support Send and Recv for old NCCL versions (#5097)
If NCCL version < 2.7, MPI is sued. Otherwise, we use NCCL Send and Recv.
2020-09-09 20:58:05 -07:00
Changming Sun
09a6ce6bc0
Add re2 to memory leak checker whitelist (#5101)
* Add re2 to memory leak checker whitelist
2020-09-09 20:08:37 -07:00
Wei-Sheng Chin
934f30fc38
Not to call NVTX when not available (#5095)
* Not to call NVTX when not available

* fix syntax

* Fix a syntax error
2020-09-09 20:01:45 -07:00
Scott McKay
4b7aa16ed2
Fix a few more signed/unsigned warnings. (#5098) 2020-09-10 10:39:56 +10:00
RandySheriffH
5e10cde006
PipelinesForCuda11Cudnn8 (#4938)
* cancel night build on pyop

* setup win cuda11 pipeline

* add debug build

* test base gpu settings

* setup pipelines to test cuda 10.2 and 11

* rename linux docker images

* rename docker image tag and add clean up job

* fix typo in cuda 11 config

* set cuda11 env

* update linux cuda 11 pipeline

* reset docker image name

* disable uninitialized warning from linux build

* change the way to silence uninitialized warning

* add flags to linux gpu pipeline

* switch docker image for linux cuda 10.2

* switch linuc cuda 10.2 image

* test cuda11 with devtool8

* try latest built images

Co-authored-by: Randy Shuai <rashuai@microsoft.com>
2020-09-09 16:13:58 -07:00
Xueyun Zhu
a90fae8c71
unify error handling in pipeline transformer (#5039) 2020-09-09 14:52:04 -07:00
Hariharan Seshadri
61151af321
Fix typo in DML native method call from the C# API (#5083) 2020-09-09 14:47:50 -07:00
Tiago Koji Castro Shibata
f7c3e4fa99
Store/containerized apps support (#4651)
* Initial containerized/Store build

* Remove unsupported APIs

* Remove usage of STL ifstream

* Revert CMake changes

* Link to app runtime

* WCOS/Store cmake

* Update CMakeSettings.json

* Fix winapi family support

* Fix downlevel

* Downlevel build

* Remove downlevel workaround

* pep8 compliance

* Workaround WinRT headers bug

https://github.com/microsoft/cppwinrt/issues/584 in older SDK

* Always cross compile to avoid warnings as errors

* PR feedback

* More CI fixes

* PR feedback

* aiinfra build fix

* Win8 store
2020-09-09 14:36:35 -07:00
Changming Sun
924ecb0623
Use manylinux2014 for Linux CPU build (#5091) 2020-09-09 10:09:52 -07:00
Thiago Crepaldi
6594d6672f
Move onnxruntime.experiment to onnxruntime.training namespace (#5045) 2020-09-09 09:46:06 -07:00
Wei-Sheng Chin
4ccca20def
Replace MPI Send and Recv with NCCL Send and Recv (#5054)
* Prototype NCCL P2P

* Clean code

* Fix NCCL path and some minor bugs

* Add path

* Fix path

* Try fix path

* Add missed files

* Address some comments

* Clean code

* Rename files

* Add MPI path back and fix a path

* Put MPI path under USE_NCCL flag

* not to build Send and Recv when MPI is not installed
2020-09-09 09:39:56 -07:00
Scott McKay
dbf4e7019d
Add ability to generate configuration file with required operators. (#5089)
* Add ability to generate configuration file with required operators.
2020-09-09 21:39:17 +10:00
Scott McKay
80ada0291f
Improve the minimal build size on android and linux (#5086)
Fix bug where linux build fails when python is enabled and rtti is disabled
Update doco for new build settings
2020-09-09 21:38:34 +10:00
Guoyu Wang
5019b2f3b9
fix for x86 android build break (#5088) 2020-09-09 21:38:22 +10:00
gwang-msft
a1a81470e3
Add minimal build binary size verification (arm64) to Android CI (#5087)
* Add minimal build binary size verification (arm64) to Android CI

* Add comments in the CI ymal
2020-09-09 19:06:20 +10:00