Commit graph

3029 commits

Author SHA1 Message Date
Changming Sun
5eec4f66ed
Refactor manylinux docker image and the related pipelines (#4751)
1. Publish the image ACR, instead of building it every time for every PR
2. Make USE_MKLML and USE_OPENMP be able to co-exist. Currently both of them are enabled in our Linux CI build but indeed only one of them is taking effect.
3. Split nuphar and DNNL to separated pipelines.
4. Fix two warnings in onnxruntime/core/optimizer/matmul_scale_fusion.cc and onnxruntime/test/tvm/tvm_basic_test.cc.
5. Update the manylinux2010_x86_64 image to the latest.
2020-08-17 09:40:31 -07:00
Hariharan Seshadri
ea3b4e1f8d
Fix bug in DispatchOnTensorType macro (#4808) 2020-08-17 01:16:01 -07:00
Ori Levari
5899c1197a
add telemetry for named dimension overrides (#4794)
Co-authored-by: Ori Levari <orlevari@microsoft.com>
2020-08-16 17:09:55 -07:00
sfatimar
0a0ac70eec
Sahar/csharp support openvino (#4703)
* Temp changes and include openvino to ensure nuget package is created with linux till we configure azure ci pipeline

* string id change

* native nuget indentation changes

* documentation changes

* Update Openvino_execution_provider.md

Documentation includes openvino execution provider

* Update OpenVino-ExecutionProvider.md

update details to build csharp api for openvino execution provider .

* vadm backend revert

* Update Openvino-Execution-Provider.md

updated for review comments

* Update OpenVino-Execution-Provider.md

* Update OpenVINO-ExecutionProvider.md

* nuget package custome support for openvino
change in native nuget spec python script for including linux runtime

* change to make path to boolean flag

* removed the tab

* Update OpenVINO-ExecutionProvider.md

updated for review comments

* chnages to include pep8 warnings
modification to documentation

Co-authored-by: saharfraza <sfatima.3001@gmail.com>
Co-authored-by: sfatimar <sahar.fatima@intel/com>
2020-08-16 17:07:26 -07:00
Tang, Cheng
1b1a6a4ca9
Bump onnx to get bfloat16 in ops, and some update in ort to support bfloat16 (#4791)
* bump onnx to support bfloat16

* sign test code

* fix ut failures

* add bfloat type in gradient schema

* add bfloat16 to gathernd

* add bfloat16 into grad op defs

* temp disable gpu fusing transformers

* bfloat16 support fix

* more fix to bfloat

* bug ifx

* add bfloat16 to transpose matmul

* fix sce loss

* fix cast opset13 and other missing part of bfloat16

* Revert "temp disable gpu fusing transformers"

This reverts commit b627bc9019.

* add SCEloss back

* fix build break

* fix gpu failure due to missing kernel in opset13

* add tile opset 13 kernel

* Revert "fix gpu failure due to missing kernel in opset13"

This reverts commit 661d63d0599029757f240d29afd64b197b76b880.

* fix comments in pr

* fix cuda break due to opset13

* fix missing msdomain

* add nll loss tests into android build's broken list; disable bfloat16 cast tests due to the wrong type saved in onnx test data, will fix it in onnx first

Co-authored-by: Cheng Tang <chenta@microsoft.com>
2020-08-16 17:05:40 -07:00
Bogdan Bugaev
8ba6b6a21e
Support usage of C API with C++ standards older than C++11 (#4257)
* Use throw() in C API if noexcept is not supported
2020-08-15 11:39:28 -07:00
George Wu
8d2e22558d
unattended-upgrades (#4804) 2020-08-14 18:12:27 -07:00
ashbhandare
5a8962d327
Make grad name unique (#4788)
* Make grad name unique

* Modify for review comment
2020-08-14 15:17:17 -07:00
Weixing Zhang
afa89566d7
Using cublasGemmBatchedEx/cublasGemmStridedBatchedEx for training (#4731)
* use cublas extenstion API for fp16

* Using cublasGemmBatchedEx/cublasGemmStridedBatchedEx for training

To avoid accuracy, the accumulation needs to be done in FP32 for training.

Co-authored-by: Weixing Zhang <wezhan@microsoft.com>
2020-08-14 02:12:14 -07:00
Maxim Kalinin
ec36c793e8
Eliminate redundant subexpressions (#3047)
* Eliminate redundant subexpressions

Apply local value numbering to merge graph nodes that will always
evaluate to the same value.

* Rename cpp->cc

* Handle optional arguments

* Add test models

* Add more tests with optional arguments

* Fix processing of subgraphs

Also, be resilient to possible mixture of optional and variadic
parameters

* Fix random operators

* Address PR comments

* Minor changes and a test

* Move CSE before constant folding

* Random* operators are always non-deterministic

Even when seed is provided.

* Fix a CSE test

* Reuse the list of non-deterministic operators with constant folding pass

* Address PR comments

* Fix formatting

* Address PR comment

* Minor cleanup / comments

* Fix build failure in Linux

* Reuse existing optimizer/utils file.

Also, check for graph outputs when removing a node.

* Add a test

* Fix compiler warnings

* Fix build in older compilers

* More compatibility with old STL versions
2020-08-14 01:13:05 -07:00
Marcus Turewicz
ce65275edf
C# samples: Faster R-CNN (#4733)
* C# sample: Faster R-CNN

* Add link to new sample in samples README

* Remove duplicate image
2020-08-13 17:05:01 -07:00
Sergii Dymchenko
de2685261b
Install AzureML support and commonly used packages in the training image. (#4790) 2020-08-13 16:48:48 -07:00
stevenlix
7acef875bb
Fix bugs in TensorRT (#4780)
* fix bugs

* Move -Wno-deprecated-declarations to target compile flag
2020-08-13 16:09:27 -07:00
Yulong Wang
aa993e95c9
enable build flag '--use_openmp' on MacOS (#4774)
* enable build flag '--use_openmp' on MacOS

* cmake 3.16.1 to enable find_package(OpenMP) on mac
2020-08-13 15:56:42 -07:00
George Wu
f12e9de111
build fixes for https://github.com/microsoft/onnxruntime/pull/4721 (#4784)
* test

* test

* add missing CUDA header include

* debug

* fix

* fix python package for dnnl and tensorrt.

* fix

* fix windows build.

* revert

* target_link_directories for tensorrt shared lib.
2020-08-14 06:24:44 +08:00
James Yuzawa
aca34352a5
Java API: Documentation cleanup (#4395)
* update java API docs

* fix link

* rearrange

* update platforms, use table

* use javadoc.io

* craigacp tested it in java 14

* update link

* fix broken link

* fix testdata link
2020-08-13 12:06:42 -07:00
Sheil Kumar
722602f32d
replace namespace reference with alias (#4786)
Co-authored-by: Sheil Kumar <sheilk@microsoft.com>
2020-08-13 11:14:55 -07:00
ashbhandare
5e7a6e78e3
Changes for BART dynamic shapes in reduction (#4730)
* Modify to hit row reduction over cudnn

* kernel overflow fix

* Cleanup

* fix for mainz/zcode model

* revert

* Review comments

* Review comments
2020-08-13 11:14:01 -07:00
edgchen1
74b3b8448c
Fix MatmulTransposeFusion::ApplyImpl() setting of modified flag (#4775)
Update MatmulTransposeFusion::ApplyImpl() to set modified flag whenever a fusion is performed.
2020-08-13 09:51:52 -07:00
Scott McKay
8fb743f767
Refactor Cast to reduce binary size. (#4765)
* Refactor Cast to reduce binary size.
82.5 -> 60.8KB on Windows

* Address PR comments.
Fix build issue.
2020-08-13 20:43:22 +10:00
Tim Harris
9cec98ec1b
Honor allow_spinning at barrier at end of parallel sections (#4767)
This commit means that when the thread pool is configured to spin, then we spin at the barrier at the end of parallel sections in the main thread, in addition to having workers spin waiting for work. 

The change updates Barrier.h to take an additional boolean to select spin/block, and passes this in based on the thread pool configuration. 

It adds an additional test case for barriers, although no problems were identified by the test case.
2020-08-13 09:40:40 +01:00
Faith Xu
61b2a663a3
Update Python version support (#4778) 2020-08-12 23:48:23 -07:00
Changming Sun
cddddc4d55
Add missing header file to MNIST.cpp (#4773)
Resolve #4766
2020-08-12 21:46:11 -07:00
Tianlei Wu
a69ca63895
add --no_attention_mask option (#4750)
output producer name and version in optimized model.
avoid removing initializer that existed in graph output
2020-08-12 15:56:25 -07:00
jingyanwangms
adda8c66d9
Docker image release pipeline (#4682)
* create orttraining-1p-linux-gpu-ci-pipeline.yml

* fix syntax

* fix file path

* fix template path

* publish docker image to test acr

* use right task name

* change parameter list

* use variables

* use python.version

* remove --enable_onnx_tests due to segfault

* add back --enable_onnx_tests

* fix docker push command line

* change docker login command

* login differently

* fix docker tag script

* create password.txt

* add ortrelease docker image

* enable test in build.sh

* add pipeline parameter

* add pipeline parameter

* change timeout

* change timeout

* fix run_dockerbuild.sh

* use PR checkin build docker

* fix strategy syntax

* fix strategy syntax

* change dockerfile

* change run_dockerbuild.sh

* change tag name

* build with root user

* use build id for docker image tag

* remove all user lines

* change docker tag

* add mpi, mellanox

* add missing args

* use release dockerfile for ci build

* remove install wheel

* use release docker image

* fix syntax

* use different pool

* add Dockerfile.training

* remove sudo to run on Linux-Multi-GPU-V100

* change docker file path

* update dockerfile

* use latest dockerfile

* change agent pool

* remove --preserve-env

* add back parameter

* Add test_flag

* use azuredevops docker

* change repository

* use cmd for docker login

* echo build script

* use ortrelrease ACR

* change key vault connection

* Move --build flag

* change build command

* add paramter for image tag

* clean up for PR

* remove unnecessary changes

* whitespace changes

* whitespace changes

* change build flag

* change flag name

* change flag

* use latest dockerfile

* enable build tests

* build builder stage and run test

* Add back python.version

* change build directory

* always run build entire dockerfile

* fix yml syntax

* fix syntax

* add en-UTF8 locale

* rename

* remove unused template

* Update orttraining-linux-gpu-docker-release-pipeline.yml for Azure Pipelines

* Update orttraining-linux-gpu-docker-release-pipeline.yml for Azure Pipelines

* Test commit sha1 in pipeline

* fix parameter

* update docker file

* fix --from=build

* remove commented blocks

* PR comments

* fix syntax

* fix syntax

* use timestamp as build number

* remove latest tag

* add build_timestamp variable

* remove wrong property

* fix docker run command

* test build id

* Use datestamp build id

* change build tags

* add no-cache to docker build

* rename BUILD_VERSION -> BUILD_CONFIG

Co-authored-by: Jingyan Wang <jingywa@OrtDevTest2v100.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>
Co-authored-by: Jingyan Wang <jingywa@OrtTrainingDev3.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>
2020-08-12 13:29:37 -07:00
Sheil Kumar
8a66ad79a6
Add Experimental WinRT API IDL as placeholder for adding new winrt features (#4736)
* Add experimental winrt api idl with dummy type to satisfy the build

* remove experimental from the api_lib target

* make experimental api available on windows builds also

* remove /y /d

* revert some pathing changes

* remove experimental api call from tests

* revert cppwinrt cmake changes

* switch to stdapi

Co-authored-by: Sheil Kumar <sheilk@microsoft.com>
2020-08-12 12:45:19 -07:00
Vincent Wang
7e955960f1
Optimize Slice Kernel by Removing If-statement (#4753)
* Slice kernel optimization.

* remove space

Co-authored-by: Vincent Wang <weicwang@AiFramework2080ti2.corp.microsoft.com>
2020-08-12 16:36:03 +08:00
Josh Bradley
b7254551f0
Add new api function At() (#4457)
* add modern standards to function arguments
* add first version of At for better tensor element access
2020-08-11 18:34:03 -07:00
Scott Bonebrake
38c804a048
Fix broken link to ScoreMNIST.java in Java_API.md (#4213) 2020-08-11 17:36:19 -07:00
Ryan Hill
ac725b53f6
Convert TensorRT provider into a shared library (#4721)
Lots of changes to shared library interfaces, new lighter weight design.
2020-08-10 21:17:16 -07:00
Dmitri Smirnov
ac4997665a
Make Java Publishing and Java GPU pipelines to run nightly (#4749)
Schedule Java daily
  Bump up iInux GPU build timeout
2020-08-10 17:38:45 -07:00
Yang Chen
f51385fd1e
Yanchen/nuphar/clip 11 (#4737)
* [WIP] log unsupported ops in Nuphar

* [Nuphar] added support for clip-11

also added some log information for unsupported ops in Nuphar
2020-08-10 15:45:21 -07:00
Dmitri Smirnov
3530ce541c
Expose IOBinding features via C/C++/C# language bindings. (#4646)
Expose I/O Binding in C/C++/C#
  Expose OrtAllocator, OrtMemoryAllocation, OrtMemoryInfo and OrtIoBinding
2020-08-10 13:33:49 -07:00
Scott McKay
6c33d7f5df
Fix bug in Loop optimization (#4210)
* Fix bug where an optimization to avoid a copy resulted in the iteration num for a Loop subgraph

* Update comments to clarify
2020-08-11 06:31:29 +10:00
Tiago Koji Castro Shibata
082a741636
Move DNNL workaround to EP (#4738) 2020-08-10 13:06:22 -07:00
edgchen1
487665c21f
Transpose MatMul fusion fixes (#4728)
Fix Transpose MatMul fusion handling of existing TransposeScaleMatMul node's attributes and enable support for missing Transpose perm attribute.
Update expected test data to account for floating point calculation differences resulting from the fusion.
2020-08-10 13:00:22 -07:00
Tianlei Wu
316d1a9e69
Update benchmark for large model or model name with non-alphanumeric. (#4743)
* Export model > 2GB using external data format
2020-08-10 12:58:01 -07:00
Vagif
6499a38b7d
Add the missing onnx_proto import (#4705)
* add missing onnx_proto import
* Fix TensorProto usage in calibrate.py
* remove unused imports
2020-08-10 12:46:21 -07:00
Scott McKay
2e3ccc7518
Change order of some checks to workaround a linker issue when /LTCG:incremental is set. (#4713) 2020-08-10 17:54:11 +10:00
Nat Kershaw (MSFT)
24d4f76436
Added explicit instructions to build for Jetson (#4714)
* Added explicit instructions to build for Jetson.

* Update after review
2020-08-09 20:28:20 -07:00
Bowen Bao
abbb7f6f5c
Avoid duplicated calls of postprocess in training frontend (#4579) 2020-08-07 21:34:11 -07:00
stevenlix
77c69a0325
Upgrade TensorRT to v7.1.3.4 (#4704)
* upgrade to TensorRT 7.1.3.4

* Upgrade onnx-tensorrt parser for TensorRT 7.1.3.4

* fix format issue

* fix format issue

* fix format issue

* Update tensorrt_execution_provider.cc

* change cmake version to 3.14

* Remove --msvc_toolset 14.16

* change to onnxruntime::make_unique

* use onnxruntime::make_unique

* disable some tests for TensorRT

* disable some tests for TensorRT

* Update upsample_op_test.cc

* Update tile_op_test.cc

* disable some tests for TensorRT

* Update constant_of_shape_test.cc

* update parser

* Update Dockerfile.ubuntu_tensorrt
2020-08-07 17:43:56 -07:00
Oliver Rausch
9c3153acd6
Improve shape inference for OneHot (#4452)
* Improve shape inference for OneHot

Attempt to get the depth parameter before adding a new symbolic dimension.

* Update symbolic shape infer

* Nit
2020-08-07 14:05:20 -07:00
Tianlei Wu
9c729d1719
Update notebook for mac since onnxruntime 1.3 or 1.4 in mac does not have openmp (#4732) 2020-08-07 14:01:48 -07:00
Marcus Turewicz
37c45c3d6b
C# ResNet50 v2 sample/tutorial (#4722)
C# ResNet50 v2 sample
  Update samples README
2020-08-07 13:36:36 -07:00
Ye Wang
61726e58f0
fix (#4697) 2020-08-07 13:08:41 -07:00
Sergii Dymchenko
c334b5738e
Remove docstring for removed parameter (#4734) 2020-08-07 11:43:36 -07:00
Yufeng Li
b22091dc91
Add the framework to support prepack (#4413)
* add support of prepack
* add support for QAttention and DynamicQuantizeMatMul
* add an use_prepacking option
* add use_prepacking in c_sharp api
2020-08-07 09:39:19 -07:00
zhijxu-MS
33fe770037
Support log sigmoid gradient (#4719)
* add log's gradient op and its related gradient test

* support sigmoid's gradient op

* resolve review comments
2020-08-07 11:21:36 +08:00
Wei-Sheng Chin
7905c57f43
Revert "Remove code which is not thread-safe. (#4454)" (#4712)
* Revert "Remove code which is not thread-safe. (#4454)"

This reverts commit 5222b2c6c0.

* Resolve race condition

* More thread-safe changes

* Remove unused lock

Polish comments
2020-08-06 18:42:05 -07:00