Commit graph

527 commits

Author SHA1 Message Date
manashgoswami
bdc2bbb207 Build details for TensorRT execution provider. (#632)
* Fixed typos in docs for 'onnx_test_runner'

* TensorRT Execution Provider (preview) release

Updated build instructions and component governence and third party notices for TensorRT execution provider release.

* test runner option for tensorrt

updated to add option for tensorrt.

* Introduction to TensorRT Execution Provider

Intro README for TensorRT Execution Provider.

* Update BUILD.md

* Update TensorRT-ExecutionProvicer.md

* corrected typo in the filename

* corrected typos

* updated with corrections.

* removed conflicting edits.

* Update BUILD.md
2019-03-15 11:29:00 -07:00
KeDengMS
a362c3bbdf
Fix node def replacement by checking memory location (#623)
Fix node def replacement by checking memory location
2019-03-15 01:12:21 -07:00
stevenlix
5c09aa863c
python binding test updates for TensorRT (#635)
* Disable unsupported python tests for TensorRT

* Rename python wheel for TensorRT
2019-03-14 22:25:16 -07:00
stevenlix
639aa6d97a
Update run_dockerbuild.sh (#631) 2019-03-14 16:06:21 -07:00
Changming Sun
71b6445967
Large Model: Support offset (#615)
* Support offset
2019-03-14 15:19:24 -07:00
stevenlix
e8b0ae8923
Trt execution provider (#382)
* updated cmake files for trt

* added trt execution provider

* added trt basic test

* removed trt_path action attribute

* Add files via upload

* Update build.py

* Update trt_allocator.h

* fixed issues found by reviewers

* changed cast operator

* added comment for custom kernel implementation

* changed auto to auto&

* changed to function compile APIs for TRT execution provider

* changed to function compile APIs for TRT execution provider

* added new DType DInt64

* adapted to the changes of onnxruntime_c_api

* removed trt kernel (use function compile instead)

* updated onnx-tensorrt submodule

* set default memory type to TRT fused kernel

* resolve merge conflict

* fixed the issue that USE_CUDA conflicts with USE_TRT

* construct graph by adding nodes in topological order

* made changes for Windows

* change buffers type

* bypass HasImplementationOf check for TRT XP because TRT kernel is not registered

* added domain to version info in rebuilt model proto

* added trt to test option list

* added DomainToVersionMap() to GraphViewer

* removed Copy()

* fixed broken code

* format the code to clang format

* used local reference to the frequently used values

* fixed a couple of issues according to reviewers feedback

* fixed a couple of issues according to reviewers feedback

* added python binding for TRT and enable use_cuda when use_trt is on

* fixed a redefinition issue

* changed shared_ptr to unique_ptr on trt engines, and made a few changes required by reviewers

* enabled trtexecution provider for unit tests

* renamed trt to tensorrt

* added tesorrt to python binding

* update submodule onnx and onnx-tensorrt

* made a couple of minor changes based on reviewer's feedback

* added CUDA_CHECK

* removed test code

* fixed broken code after merge

* updated onnx-tensorrt submodule

* added post processing to align trt inputs/outputs with graph inputs/outputs

* updated onnx submodule

* added CUDA fallback for TensorRT and fixed TensorRT cmake issue

* added ci pipeline for tensorrt and removed some redundent code from trt xp

* fixed syntax issue

* updated onnx-tensorrt submodule

* fix trt build problem by: (#602)

1. Add additional /wd for debug build
2. Add io.h for additional targets
3. Bring back mb version of getopt

* Update install_ubuntu.sh

* Update linux-gpu-tensorrt-ci-pipeline.yml

* Update linux-gpu-tensorrt-ci-pipeline.yml

* Update run_build.sh

* Update run_build.sh

* Update run_build.sh

* Update run_build.sh

* fixed the issue that GetKernelRegistry returns nullptr

* merged master to this branch

* moved some data types to private

* fixed tensorrt CI pipeline issue

* customized test data for TensorRT pipeline

* added onnx-tensorrt in json file and fixed an issue in ci script

* added comments
2019-03-14 12:00:39 -07:00
Konstantinos Karanasos
37f7ed156e
renaming utils namespace related to graph operations to graph_utils (#618) 2019-03-13 17:44:28 -07:00
Dmitri Smirnov
fba98bb4de
Rework tokenexp to match tokens instead of separators. (#617)
Adjust tests.
2019-03-13 17:43:37 -07:00
Konstantinos Karanasos
2ae83c580c
Constant folding (#168)
Constant folding rewrite rule computes nodes that have only constant inputs at compile time and avoids these computations at run time.
2019-03-13 15:44:26 -07:00
Changming Sun
ab734ec5a6 Fix a bug in ExternalDataInfo 2019-03-13 12:57:32 -07:00
Hariharan Seshadri
cfb08c4848
TopK op: Promote onnx to a newer commit and handle changed TopK spec for opset 10 (#611)
* Initial commit

* Nit fix
2019-03-13 10:21:58 -07:00
Xavier Dupré
7dd9bc4d78
Restore changes removed by PR #571 (numpy version) (#603)
add options numpy_version, skip-keras-test to build.py, gives more options to build onnxruntime
2019-03-13 10:55:56 +01:00
Hariharan Seshadri
eab1f5463c
Disable benign warning that shows up in Windows cross-compiled ARM builds (#607)
* Disable warning that shows up in ARM builds

* PR feedback
2019-03-12 20:53:42 -07:00
Ke Zhang
5bb842538d
sync onnx and maintain old version history for removed exp ops (#588)
* sync onnx and maintain old version history for removed exp ops in onnx runtime.

* update

* updating to specific onnx commit - remove exp ops.

* update

* disable the 3 failures to push the change as it's blocking folks.

* update test
2019-03-12 18:48:27 -07:00
Xavier Dupré
c6d39b60cd remove OnnxTransformer (#554) 2019-03-12 18:15:39 -07:00
shahasad
bf43ac41aa
fix version number for tarball packages (#600)
* add variables for version number and git commit hash

* fix typo

* fix typo

* some logging

* some logging

* some logging

* some logging

* some logging

* some logging

* some logging

* some logging

* some more edits to see generic scripts can print

* working

* fixing windows git hash

* try quoted echo

* fix git rev-parse

* echo without quotes

* removed commit hash from artifact filename, added long commit hash as a file inside

* added the missing commit id parameter

* fix windows pipeline

* keep only win 64, others disabled

* remove disabling conditions
2019-03-12 17:55:08 -07:00
Randy
b452151baf Rashuai/restore yml2 (#604)
* restore capi yml

* format file
2019-03-12 12:27:46 -07:00
Du Li
3f507fa9f8 fix a shape inference bug. (#605) 2019-03-12 12:04:17 -07:00
jignparm
de9f1ff1ff Add new C function OrtOnnxTypeFromTypeInfo (#585) 2019-03-12 10:11:14 -07:00
Randy
f048fc5fb0 cross compile x86 linux (#562)
* cross compile x86 linux

* fix comments

* install multilib for ubuntu cross compile

* remove tailing slash

* fix -fPIC relocations for x86 target too

* add asm make flag

* fix x86 compile err

* test x86 with zlib and png

* Disable zlib from x86

* install x86 python header

* remove cross-compiling changes

* test 32bit ubuntu

* add x86 ubuntu docker file

* add x86 as arch parametr for docker build

* config pipeline

* avoid dotnet install

* install cmake

* skip dep install

* use latest ubuntu

* install latest cmake

* install x86 deps

* configure cmake

* install ninja

* correct ninja dir

* apt get re2c

* install onnx

* set processor x86

* disable warning

* skip test

* disable test

* disable test

* find lib

* fix typo

* restore test

* disable backend model test

* disable test

* fix test err

* stop installing onnx

* disable onnx test on x86

* restore yml

* mergef with master yml

* cancel needless config setting

* enable x86 flag

* restore all onnx tests

* fix yml typo

* install onnx

* add back x86 flag

* disable cases

* disable case

* disable cases

* add macro to disable cases

* fix typo

* print platform

* remove condition
2019-03-12 09:47:45 -07:00
Changming Sun
3ef273b84b Support memory mapping on Linux 2019-03-11 19:39:02 -07:00
liqunfu
7218be4f1f
support protoc version < 3.2.0 (#594)
* support protoc version < 3.2.0

* add comment to old protobuf code block

* update according to code reviewer's comment
2019-03-11 17:20:44 -07:00
Dmitri Smirnov
59cfafbb9e
Restore tolerance fix and make test_gru_seq_length_cpu run (#595) 2019-03-11 16:28:30 -07:00
Raymond Yang
02ad7daa8b
Add component detection (#592) 2019-03-11 15:30:49 -07:00
Changming Sun
93bbb42e75 Change the way of linking dl lib 2019-03-11 10:46:05 -07:00
Changming Sun
b183df7143
Remove InsertFusedRules from CPUExecutionProvider (#573)
Talked with the original author @souptc , this function is not needed anymore.
2019-03-11 10:45:44 -07:00
Hariharan Seshadri
867eda5262
Support Windows cross-compiling for ARM(64) in ORT build scripts (#549)
* Initial commit

* More changes

* More changes

* More changes

* More changes

* PR feedback

* Commiting Azure build config file

* Fix build pipeline

* Cleanup build dir template addition

* Remove conda modules download step

* PR feedback

* Revert x86 arguments to as they are currently

* More changes
2019-03-08 17:42:20 -08:00
Dmitri Smirnov
6136efc0c0
Promote to ONNX commit that has StringNormalizer (#499)
Promote to ONNX commit that has StringNormalizer
  Adjust implementation to match ONNX spec.
  Remove test exclusions.
  Override BackendTest class to handle strings  comparision
  properly until this is fixed in ONNX.
  Restore test_gru_seq_length_cpu exclusion
2019-03-08 17:15:26 -08:00
Weixing Zhang
530748ad5c Fix the broken link. 2019-03-08 15:16:23 -08:00
Changming Sun
d23edeb8c5
Fix a warning in env.cc (#568)
* Fix a warning in env.cc
2019-03-08 15:09:31 -08:00
Scott McKay
b7298a6deb
Add python script to export subgraphs from Scan/Loop/If nodes in an ONNX model. (#583) 2019-03-08 14:51:30 -08:00
Scott McKay
fb6f091073
Create derived class from onnx BackendTest so we can use the same tolerance values as in onnx_test_runner. Enable GRU test that was failing. (#584) 2019-03-08 14:51:13 -08:00
Scott McKay
b742c3a965
Fix for Scan 8 bug where the sequence length was > 2 (#580) 2019-03-08 13:58:03 -08:00
Jesse Benson
c785fe26d3 Update CreateExecutionProviderFactory_BrainSlice() definition. 2019-03-08 12:53:45 -08:00
Tracy Sharpe
44cb4c4f74
Handle x86_64 -fPIC relocations correctly (#566)
Thanks to cbecker for highlighting the problem in PR#565. That change used the GOT to access MlasMaskMoveAvx, but was incomplete: the GOT has a pointer to the global data, not the data directly. I updated the kernels to use a free register to load the address and then use that for the data. I ran through onnxruntime_mlas_test for AVX and FMA3 to verify the changes.
2019-03-08 10:30:20 -08:00
Pranav Sharma
9fa7b570da Fix publishing of Linux and MacOSX artifacts. (#579) 2019-03-08 05:57:41 -08:00
Ryan Hill
1180c60035 Add link to custom ops (#576)
* Add link to custom ops

* Wording
2019-03-08 00:49:10 -08:00
shahasad
51273b48f6 Use cuda9 1 in c api packaging (#571)
* use CUDA 9.1 for both linux and windows

* added powershell scripts for cuda props setup/cleanup

* fix yml syntax

* set path to cuda9.1 bin

* correct label

* ad --cuda_version

* added some log to browse the directory

* disabled jobs other than win gpu to save some resource while testing

* add msvc_toolset=14.11

* added more logs

* log the props file

* remove setting vcvarsall

* try some modificationi n build.py

* fix typo

* let the config Step modify envoronment

* set some more env vars manually

* try reordering vcvars after cuda props copying

* use single script for build and test

* single line script

* remove extra quote

* cleanup trial changes
2019-03-08 00:48:18 -08:00
Changming Sun
3cd448e05a Fix: IExecutionProvider::GetCapability returns redundant subgraphs 2019-03-07 17:25:31 -08:00
Faith Xu
e272feb60d Readme updates (#570)
* Update readme

* Update table

* Formatting updates

* Formatting

* Formatting

* Update README.md

* Minor updates

* Update README.md

* Update README.md
2019-03-07 16:19:11 -08:00
jignparm
a79c09388f
Fix GPu package testing for CAPI (#569) 2019-03-07 14:51:18 -08:00
Changming Sun
d40a9f894f
Enable Component Detection (#559)
* Enable Component Detection
2019-03-07 11:07:35 -08:00
Xavier Dupré
b4ffcf8258
Fixes #31, add option numpy_version, skip_keras_test to the parser of build.py, add flag PRIVATE for the python bindings (#544)
* add option numpy_version to build against the installed numpy version and not 1.15.0 (hardcoded version number), default is still 1.15.0
* add option skip_keras_test to skip keras test even if keras is installed (still enabled by default)
disable unnecessary warnings about ubuntu
* enable option PRIVATE for the compilation of the Python bindings (settings recommended on pybind11 documentation)
* test on debian 9
2019-03-07 13:08:02 +01:00
jignparm
4635bcc624 Updating C_API end-to-end test and user samples (#564)
* Updating user sample and C_API unit test

* remove debugging info

* remove precompiled headers

* header file location changed in master...updating
2019-03-07 00:28:15 -08:00
David Fan
b68079fe5d
Support int32_t for Split op (#563)
* Support int32_t for Split op

* Support int32_t for Split op
2019-03-07 00:13:11 -08:00
Ryan Hill
af9c554dd3
Ryanunderhill/custom op (#550)
* Prototype version that demonstrates it can work
* Switched to OrtValue and removed the OrtCustomOpTensor code.
* Support multiple outputs and reading of attributes
* Add custom domain handling to custom ops
* Update documentation
* more wording changes
2019-03-06 19:09:55 -08:00
jignparm
0b143d0703
Fix parentheses and commas (#560) 2019-03-06 18:01:49 -08:00
shahasad
b247fced3b
Linux and MacOS C api packaging (#555)
* added linux packaging template and pipeline

* Update linux-packaging-pipeline.yml for Azure Pipelines

* fix path seperator

* update copy command for linux

* fixed linux gpu artifact name, added mac build

* fixed linux gpu artifact name, added mac build

* fixed vmImage syntax

* use 1 model at a time for macos

* added onnx test on Mac CI

* some refactor of the pipeline scripts

* try fixing the tensorproto for x86 build

* try __cdecl

* try C-style cast

* use ORTAPICALL

* put the deleter under the namespace
2019-03-06 14:56:53 -08:00
Faith Xu
98a9c0e715 Fix json error (#557) 2019-03-06 14:53:07 -08:00
Scott McKay
0e65bfe7ae
Remove caching from InferenceSession::Run (#547)
* Remove caching from InferenceSession::Run

* Fix automatic merge of one file

* trigger rerunning checks
2019-03-06 14:29:42 -08:00