Commit graph

556 commits

Author SHA1 Message Date
Raymond Yang
12ecd77ffb
Update README.md (#686) 2019-03-22 14:47:40 -07:00
stevenlix
af389593be
Add Windows CI pipeline for TensorRT (#687)
* Update win-gpu-tensorrt-ci-pipeline.yml

* Update win-gpu-tensorrt-ci-pipeline.yml

* Update symbols.txt

* Update CMakeLists.txt

* Update build.py

* Update win-gpu-tensorrt-ci-pipeline.yml

* Update win-gpu-tensorrt-ci-pipeline.yml

* Update win-gpu-tensorrt-ci-pipeline.yml

* Update tensorrt_execution_provider.cc

* Update CMakeLists.txt

* Update win-gpu-tensorrt-ci-pipeline.yml
2019-03-22 14:46:57 -07:00
utsabsingharoy
21dde6fd16 Clang build failure in test (#683)
* fixing clang build failure
2019-03-22 13:17:56 -07:00
jywu-msft
8d782582f4
fix build_wheel option (#684) 2019-03-22 11:18:23 -07:00
Sherlock
25f45cb2db
Introduce Rowwise/Colwise Sum to math util (#656) 2019-03-22 10:12:58 -07:00
jignparm
600dc9ecc5
Remove licenseurl and add licensefile, to fix issue 664 (#669)
* Remove licenseurl and add licensefile, to fix issue 664

* Added back LICENSE file, instead of LICENSE.txt
2019-03-21 20:27:57 -07:00
Changming Sun
9b0d56dbed Fix a warning in GraphTransformerManager 2019-03-21 15:49:08 -07:00
Ryan Hill
cd52431b8f
Custom op interface to the C API to remove shared library dependency (#668)
* Adding a custom op interface to the C API to remove shared library dependency.

* Fixup const issues

* Renaming to make things a little simpler

* Add a comment
2019-03-21 15:46:50 -07:00
RandySheriffH
6c40aed95c
Rashuai/build x86 (#676)
* add how-to for x86 build

* formatting

* formatting
2019-03-21 15:45:33 -07:00
Pranav Sharma
5d452b3029
Use protobuf-lite to reduce onnxruntime.dll size. (#639)
* Test protobuf-lite

* Test protobuf-lite

* Test protobuf-lite

* Optimize protobuf usage for LITE_RUNTIME to reduce the binary size of
onnxruntime.dll. More details can be found here https://developers.google.com/protocol-buffers/docs/proto.
The reduction is significant. For commit id: 4873b452151bafe49da332aaeab639ef0318fc1ca28d728, the size
reduced by ~700K; from 4873728 to 4172800.

* Add LITE_RUNTIME flag in in.proto files

* Fix merge conflict.

* Address PR comments

* Forgot to add 2 files + fix linux and gpu build errors.

* Fix build errors + test failures

* Fix cuda tests

* Fix tensor rt build

* Use full protobuf for trt

* Address PR comments

* Print tensor shape proto as text string for easier debugging
2019-03-21 14:06:38 -07:00
utsab
a624e1091e fixing clang build failure 2019-03-21 13:55:47 -07:00
RandySheriffH
0995e853fa
Rashuai/unify version (#653)
* unify version logic

* merge function to build.py

* read file to get version

* fix path issue

* format python

* use path join

* fix __init__.py

* move function to a separate file

* remove redundant
2019-03-21 13:44:13 -07:00
Scott McKay
a3499083da
Add iterator traits aliases to ConstPointerContainer::ConstIterator (#634)
* Add iterator traits aliases.

* Add a few more pieces to make more compliant with the input iterator requirements.
2019-03-21 15:45:58 +10:00
Ashwini Khade
2f1c3028b7
add capi to set graph optimization level (#657)
* add capi to set graph optimization level

* remove 1 unnecessary check + review comment

* plus updates
2019-03-20 17:14:46 -07:00
jignparm
819457dd45
Added netframework test (#658) 2019-03-20 10:37:09 -07:00
Scott McKay
17af8e9ba7
Add subgraph check/update to node removal logic. Fix a few minor issues with Graph that came up during testing of the changes. (#651)
* Check usage of node output as implicit input in any subgraphs.

* Add logic to check/update subgraphs when removing a node.
Fix some issues with Graph
  - Include local outer scope variables when validating. Required if calling Resolve on a subgraph
  - Include outer scope variables in the value info so the type information is captured. Also required to Resolve a subgraph but will detect a type mismatch (previously we threw the type information away).
  - Fix GraphNodes iterator so it can be used with std::find_if. Needed to be assignable so the end_ value can't be const.
2019-03-20 14:57:45 +10:00
jignparm
c366647262
Add missing probit function for treeregressor (#619)
* added missing probit function for treeregressor

* corrected ml_logit to ml_logistic to reflect update of output

* Updated function names to comply with style guide

* use write_scores() to simplify code block
2019-03-19 17:55:26 -07:00
Changming Sun
9e323901b2
Upgrade mkldnn to 0.18 (#650)
Upgrade mkldnn to 0.18, to fix a build issue with GCC 8.
2019-03-19 16:23:55 -07:00
Hariharan Seshadri
1aa24cbbf3
executable size reduction: cleaned up slice op to get savings (#621)
* Initial commit

* More changes

* More changes

* Fix build break
2019-03-19 15:00:16 -07:00
Casey Carter
3f52de07c7 Add missing include to status.h
status.h must include <ostream> to use std::ostream.
2019-03-19 11:59:41 -07:00
Xavier Dupré
4cc7121368
Fixes #626, remove posix option for regular expression in Tokenizer operator (#627)
* remove posix option
* add unit test for regular expression
2019-03-19 14:18:27 +01:00
Ryan Hill
da9af592d9 Remove OrtAppendCustomOpLibPath (#642)
* Remove OrtAppendCustomOpLibPath

* Fix parameter mismatch

* More parameter fixes
2019-03-18 19:44:32 -07:00
Ashwini Khade
481eb971ec
graph transformers update (#608)
* graph transformers update

* some updates

* plus changes

* more updates

* fixes per review comments

* enable tests

* adding more tests

* more changes

* update api in inference session

* changes per review

* Linux CI fix

* fix linux CI failure

* fix MAC CI failure

* more updates

* add more documentation and add level param to register transformer
2019-03-18 14:52:16 -07:00
Scott McKay
541b3149dd
Minor fix to disallow an execution provider registering a nullptr. This matches the expected behavior of GetKernelRegistriesByProviderType to not return any nullptrs. (#646) 2019-03-19 06:56:15 +10:00
Scott McKay
971058fc38
Avoid copy of pre-existing value to subgraph output (#637)
* Add AllocKind::kShare to allow copying the MLValue for a pre-existing value to a graph output when an Identity node is involved. Ideally we can make this handling for an Identity node more general purpose, however the current logic to free an MLValue during execution doesn't take into account a re-use point also needing a free. Due to that, limit the scope and start with a somewhat ugly hardcoded approach.

Migrate some changes from PR497

The existing Loop unit tests exercise the new code. Also manually stepped through the problematic model to verify the unnecessary copy was avoided.

* Fix build error

* Fix missing switch case in debug output of allocation plan

* Limit optimization to Loop
2019-03-19 06:55:59 +10:00
Changming Sun
14d9a2bdc7
Remove an unnecessary check in GetDirNameFromFilePath function (#616)
* Remove an unnecessary check in GetDirNameFromFilePath function

* Remove PathTest.invalid_double_slash
2019-03-17 23:55:25 -07:00
Yufeng Li
8faccfcd92
Remove parallel logic for backward and forward GPU (#641) 2019-03-16 02:37:51 -07:00
Dmitri Smirnov
4c2b1c3018
Rework Transpose as a generic type agnostic implementation (#561)
Make Transpose op impl generic and add std::string support.
  Un-templatize implementation functions that make use of memcpy().
  Support all types per spec. Add string tests.
2019-03-15 14:03:15 -07:00
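As a rough illustration of the type-agnostic idea in the Transpose commit above, the sketch below transposes a row-major 2-D buffer by copying fixed-size runs of bytes, so the same routine serves any fixed-width element type. It is not the onnxruntime implementation: the function name `transpose_bytes` and its parameters are hypothetical, it handles only the 2-D case, and variable-length elements such as strings would need separate handling.

```python
import struct

def transpose_bytes(buf, rows, cols, elem_size):
    """Transpose a row-major rows x cols matrix held in `buf`.

    Each element is treated as an opaque run of `elem_size` bytes, so the
    routine never needs to know the element type (hypothetical helper).
    """
    out = bytearray(len(buf))
    for r in range(rows):
        for c in range(cols):
            src = (r * cols + c) * elem_size   # offset of element (r, c) in the input
            dst = (c * rows + r) * elem_size   # offset of element (c, r) in the output
            out[dst:dst + elem_size] = buf[src:src + elem_size]
    return bytes(out)

# A 2 x 3 float32 matrix [[1, 2, 3], [4, 5, 6]] packed as little-endian bytes.
floats = struct.pack("<6f", 1, 2, 3, 4, 5, 6)
transposed = transpose_bytes(floats, rows=2, cols=3, elem_size=4)
print(struct.unpack("<6f", transposed))  # (1.0, 4.0, 2.0, 5.0, 3.0, 6.0)
```

The same call works unchanged for 8-byte integers by passing `elem_size=8`, which is the point of keeping the copy loop free of type parameters.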
Changming Sun
4bd8463228 Update docs (#633) 2019-03-15 13:56:58 -07:00
manashgoswami
bdc2bbb207 Build details for TensorRT execution provider. (#632)
* Fixed typos in docs for 'onnx_test_runner'

* TensorRT Execution Provider (preview) release

Updated build instructions, component governance, and third-party notices for TensorRT execution provider release.

* test runner option for tensorrt

updated to add option for tensorrt.

* Introduction to TensorRT Execution Provider

Intro README for TensorRT Execution Provider.

* Update BUILD.md

* Update TensorRT-ExecutionProvicer.md

* corrected typo in the filename

* corrected typos

* updated with corrections.

* removed conflicting edits.

* Update BUILD.md
2019-03-15 11:29:00 -07:00
KeDengMS
a362c3bbdf
Fix node def replacement by checking memory location (#623)
Fix node def replacement by checking memory location
2019-03-15 01:12:21 -07:00
stevenlix
5c09aa863c
python binding test updates for TensorRT (#635)
* Disable unsupported python tests for TensorRT

* Rename python wheel for TensorRT
2019-03-14 22:25:16 -07:00
stevenlix
639aa6d97a
Update run_dockerbuild.sh (#631) 2019-03-14 16:06:21 -07:00
Changming Sun
71b6445967
Large Model: Support offset (#615)
* Support offset
2019-03-14 15:19:24 -07:00
stevenlix
e8b0ae8923
Trt execution provider (#382)
* updated cmake files for trt

* added trt execution provider

* added trt basic test

* removed trt_path action attribute

* Add files via upload

* Update build.py

* Update trt_allocator.h

* fixed issues found by reviewers

* changed cast operator

* added comment for custom kernel implementation

* changed auto to auto&

* changed to function compile APIs for TRT execution provider

* changed to function compile APIs for TRT execution provider

* added new DType DInt64

* adapted to the changes of onnxruntime_c_api

* removed trt kernel (use function compile instead)

* updated onnx-tensorrt submodule

* set default memory type to TRT fused kernel

* resolve merge conflict

* fixed the issue that USE_CUDA conflicts with USE_TRT

* construct graph by adding nodes in topological order

* made changes for Windows

* change buffers type

* bypass HasImplementationOf check for TRT XP because TRT kernel is not registered

* added domain to version info in rebuilt model proto

* added trt to test option list

* added DomainToVersionMap() to GraphViewer

* removed Copy()

* fixed broken code

* format the code to clang format

* used local reference to the frequently used values

* fixed a couple of issues according to reviewers feedback

* fixed a couple of issues according to reviewers feedback

* added python binding for TRT and enable use_cuda when use_trt is on

* fixed a redefinition issue

* changed shared_ptr to unique_ptr on trt engines, and made a few changes required by reviewers

* enabled trtexecution provider for unit tests

* renamed trt to tensorrt

* added tensorrt to python binding

* update submodule onnx and onnx-tensorrt

* made a couple of minor changes based on reviewer's feedback

* added CUDA_CHECK

* removed test code

* fixed broken code after merge

* updated onnx-tensorrt submodule

* added post processing to align trt inputs/outputs with graph inputs/outputs

* updated onnx submodule

* added CUDA fallback for TensorRT and fixed TensorRT cmake issue

* added ci pipeline for tensorrt and removed some redundant code from trt xp

* fixed syntax issue

* updated onnx-tensorrt submodule

* fix trt build problem by: (#602)

1. Add additional /wd for debug build
2. Add io.h for additional targets
3. Bring back mb version of getopt

* Update install_ubuntu.sh

* Update linux-gpu-tensorrt-ci-pipeline.yml

* Update linux-gpu-tensorrt-ci-pipeline.yml

* Update run_build.sh

* Update run_build.sh

* Update run_build.sh

* Update run_build.sh

* fixed the issue that GetKernelRegistry returns nullptr

* merged master to this branch

* moved some data types to private

* fixed tensorrt CI pipeline issue

* customized test data for TensorRT pipeline

* added onnx-tensorrt in json file and fixed an issue in ci script

* added comments
2019-03-14 12:00:39 -07:00
Konstantinos Karanasos
37f7ed156e
renaming utils namespace related to graph operations to graph_utils (#618) 2019-03-13 17:44:28 -07:00
Dmitri Smirnov
fba98bb4de
Rework tokenexp to match tokens instead of separators. (#617)
Adjust tests.
2019-03-13 17:43:37 -07:00
Konstantinos Karanasos
2ae83c580c
Constant folding (#168)
Constant folding rewrite rule computes nodes that have only constant inputs at compile time and avoids these computations at run time.
2019-03-13 15:44:26 -07:00
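The constant folding rule described in the commit above can be sketched on a toy graph. This is a minimal illustration under stated assumptions, not the onnxruntime rewrite rule: the node representation (tuples of output name, op, input names), the `OPS` table, and `fold_constants` are all hypothetical, and the nodes are assumed to be in topological order.

```python
import operator

# Hypothetical op table for the toy graph.
OPS = {"Add": operator.add, "Mul": operator.mul}

def fold_constants(nodes, constants):
    """Evaluate nodes whose inputs are all constants; return the rest unchanged."""
    constants = dict(constants)          # avoid mutating the caller's mapping
    remaining = []
    for output, op, inputs in nodes:     # assumes topological order
        if all(name in constants for name in inputs):
            args = [constants[name] for name in inputs]
            constants[output] = OPS[op](*args)   # computed once, ahead of run time
        else:
            remaining.append((output, op, inputs))
    return remaining, constants

nodes = [
    ("c", "Add", ["a", "b"]),   # a and b are constants, so c is folded
    ("d", "Mul", ["c", "x"]),   # x is a run-time input, so d stays in the graph
]
remaining, constants = fold_constants(nodes, {"a": 2, "b": 3})
print(remaining)   # [('d', 'Mul', ['c', 'x'])]
print(constants)   # {'a': 2, 'b': 3, 'c': 5}
```

After folding, only the `Mul` node is left to execute at run time, and it reads the precomputed value of `c` as a constant.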
Changming Sun
ab734ec5a6 Fix a bug in ExternalDataInfo 2019-03-13 12:57:32 -07:00
Hariharan Seshadri
cfb08c4848
TopK op: Promote onnx to a newer commit and handle changed TopK spec for opset 10 (#611)
* Initial commit

* Nit fix
2019-03-13 10:21:58 -07:00
Xavier Dupré
7dd9bc4d78
Restore changes removed by PR #571 (numpy version) (#603)
add options numpy_version, skip-keras-test to build.py, gives more options to build onnxruntime
2019-03-13 10:55:56 +01:00
Hariharan Seshadri
eab1f5463c
Disable benign warning that shows up in Windows cross-compiled ARM builds (#607)
* Disable warning that shows up in ARM builds

* PR feedback
2019-03-12 20:53:42 -07:00
Ke Zhang
5bb842538d
sync onnx and maintain old version history for removed exp ops (#588)
* sync onnx and maintain old version history for removed exp ops in onnx runtime.

* update

* updating to specific onnx commit - remove exp ops.

* update

* disable the 3 failures to push the change as it's blocking folks.

* update test
2019-03-12 18:48:27 -07:00
Xavier Dupré
c6d39b60cd remove OnnxTransformer (#554) 2019-03-12 18:15:39 -07:00
shahasad
bf43ac41aa
fix version number for tarball packages (#600)
* add variables for version number and git commit hash

* fix typo

* fix typo

* some logging

* some logging

* some logging

* some logging

* some logging

* some logging

* some logging

* some logging

* some more edits to see generic scripts can print

* working

* fixing windows git hash

* try quoted echo

* fix git rev-parse

* echo without quotes

* removed commit hash from artifact filename, added long commit hash as a file inside

* added the missing commit id parameter

* fix windows pipeline

* keep only win 64, others disabled

* remove disabling conditions
2019-03-12 17:55:08 -07:00
Randy
b452151baf Rashuai/restore yml2 (#604)
* restore capi yml

* format file
2019-03-12 12:27:46 -07:00
Du Li
3f507fa9f8 fix a shape inference bug. (#605) 2019-03-12 12:04:17 -07:00
jignparm
de9f1ff1ff Add new C function OrtOnnxTypeFromTypeInfo (#585) 2019-03-12 10:11:14 -07:00
Randy
f048fc5fb0 cross compile x86 linux (#562)
* cross compile x86 linux

* fix comments

* install multilib for ubuntu cross compile

* remove trailing slash

* fix -fPIC relocations for x86 target too

* add asm make flag

* fix x86 compile err

* test x86 with zlib and png

* Disable zlib from x86

* install x86 python header

* remove cross-compiling changes

* test 32bit ubuntu

* add x86 ubuntu docker file

* add x86 as arch parameter for docker build

* config pipeline

* avoid dotnet install

* install cmake

* skip dep install

* use latest ubuntu

* install latest cmake

* install x86 deps

* configure cmake

* install ninja

* correct ninja dir

* apt get re2c

* install onnx

* set processor x86

* disable warning

* skip test

* disable test

* disable test

* find lib

* fix typo

* restore test

* disable backend model test

* disable test

* fix test err

* stop installing onnx

* disable onnx test on x86

* restore yml

* merged with master yml

* cancel needless config setting

* enable x86 flag

* restore all onnx tests

* fix yml typo

* install onnx

* add back x86 flag

* disable cases

* disable case

* disable cases

* add macro to disable cases

* fix typo

* print platform

* remove condition
2019-03-12 09:47:45 -07:00
Changming Sun
3ef273b84b Support memory mapping on Linux 2019-03-11 19:39:02 -07:00