Commit graph

573 commits

Author SHA1 Message Date
jignparm
36f5d008de
Add /Qspectre compiler flags (#671)
* Add /Qspectre compiler flags

* Added /Zi. Seems required for binskim to pass

* reset cmake min to 3.11 for nuget pipeline

* Reset CMake min to 3.13
2019-03-28 11:08:05 -07:00
Changming Sun
f6a77617c1 update test data 2019-03-27 21:56:20 -07:00
KeDengMS
deaea702ff
Bump up cmake_minimum_required to 3.13 (#722)
This is consistent with CI version. cmake 3.11 has issues with CUDA build in Linux.
2019-03-27 14:45:24 -07:00
Konstantinos Karanasos
f299104a19
Enable constant folding in L1 transformers (#720) 2019-03-27 13:26:47 -07:00
Changming Sun
fb2a44f642 Remove header_files_test.cc 2019-03-27 13:16:56 -07:00
Raymond Yang
c35b605b8d
Support updated opschema with functionbody (#640)
* Update onnx

* Support updated function schema in ORT

* Update onnx related commit hash

* Check out an older commit in ONNX

* Add support for subgraph attribute

* Add comments
2019-03-27 11:38:10 -07:00
shahasad
83ae641425
add documentation for custom ops (#708)
* added tools for doc gen, added doc

* doc updated

* some fixes

* hooked up with build.py

* hooked up with build.py and fail on nonupdated doc

* update
2019-03-26 21:58:01 -07:00
Ashwini Khade
77b981824a
fix graph transformers and refactor tests (#696)
* fix graph transformers and refactor tests

* fix merge master

* Set default optimization level to Level1

* fix build warnings for Linux

* try root cause tensorrt test failures

* try root cause tensorrt test failure

* Test level2 transformers with  all CI builds

* remove ConvActivation fusion transformer

* change default level back to level1

* remove providers from apply api

* more changes
2019-03-26 20:38:12 -07:00
Konstantinos Karanasos
a872ba7894
Convert Unsqueeze elimination to rewrite rule + improvements in graph utils and graph transformer utils (#670)
* Convert unsqueeze elimination to rewrite rule

* Simplify the way we register predefined transformers and rules in the inference session (all details are now moved to the graph transformer utils)

* Some reorganization and renaming of methods in graph_utils

* Updates in graph transformers test

* Update in edge removal to not perform unnecessary check of node args that led to race conditions when updating the graph

* Improve documentation for rewrite rules

* Remove top-down rule-based transformer (given we currently have only one type of rule-based transformer)
2019-03-26 13:58:15 -07:00
Tao Qin
a28b42a42c Fix path_lib.h for Mac and refine #include in InferenceSesssion.h (#711)
* Fix path_lib.h for macox py build

* don't have to include path_lib.h for model path in InferenceSession
2019-03-26 13:15:59 -07:00
Pranav Sharma
bcf1ce94be
Provide an option to disable contrib ops. (#707) 2019-03-26 12:31:36 -07:00
Tao Qin
39fb68b761
Refactor InferenceSession class (#654)
* Refactor InferenceSession interface

* Make some member and func private

* more protected members

* more protected

* reorder class members

* reordering

* reordering

The InferenceSession was implemented in the pImpl idiom, which hides the actual implementation. There are requirements to expose the implementation to other new classes, so this change is to pave the way.

The main changes are: abandon the pImpl idiom of InferenceSession
2019-03-25 14:09:33 -07:00
Hariharan Seshadri
c8f1da28c4
tile op: make implementation type-agnostic (and support a few more types) (#688)
* Initial commit

* PR feedback

* PR feedback
2019-03-25 11:55:51 -07:00
Changming Sun
6497f0c133 build python only when onnxruntime_BUILD_UNIT_TESTS is ON (#694) 2019-03-22 19:25:49 -07:00
Tracy Sharpe
21dacdd4d6
fix epilogue code (#695)
Repair the epilogue code for the aarch64 SGEMM kernel to restore registers from the correct stack location.
2019-03-22 18:13:41 -07:00
Changming Sun
179afe1594
Remove onnxruntime_USE_PREBUILT_PB (#692)
* Remove onnxruntime_USE_PREBUILT_PB
2019-03-22 17:41:21 -07:00
Changming Sun
a26696fb0e Enable LTO on Linux 2019-03-22 15:30:37 -07:00
Raymond Yang
12ecd77ffb
Update README.md (#686) 2019-03-22 14:47:40 -07:00
stevenlix
af389593be
Add Windows CI pipeline for TensorRT (#687)
* Update win-gpu-tensorrt-ci-pipeline.yml

* Update win-gpu-tensorrt-ci-pipeline.yml

* Update symbols.txt

* Update CMakeLists.txt

* Update build.py

* Update win-gpu-tensorrt-ci-pipeline.yml

* Update win-gpu-tensorrt-ci-pipeline.yml

* Update win-gpu-tensorrt-ci-pipeline.yml

* Update tensorrt_execution_provider.cc

* Update CMakeLists.txt

* Update win-gpu-tensorrt-ci-pipeline.yml
2019-03-22 14:46:57 -07:00
utsabsingharoy
21dde6fd16 Clang build failure in test (#683)
* fixing clang build failure
2019-03-22 13:17:56 -07:00
jywu-msft
8d782582f4
fix build_wheel option (#684) 2019-03-22 11:18:23 -07:00
Sherlock
25f45cb2db
Introduce Rowwise/Colwise Sum to math util (#656) 2019-03-22 10:12:58 -07:00
jignparm
600dc9ecc5
Remove licenseurl and add licensefile, to fix issue 664 (#669)
* Remove licenseurl and add licensefile, to fix issue 664

* Added back LICENSE file, instead of LICENSE.txt
2019-03-21 20:27:57 -07:00
Changming Sun
9b0d56dbed Fix a warning in GraphTransformerManager 2019-03-21 15:49:08 -07:00
Ryan Hill
cd52431b8f
Custom op interface to the C API to remove shared library dependency (#668)
* Adding a custom op interface to the C API to remove shared library dependency.

* Fixup const issues

* Renaming to make things a little simpler

* Add a comment
2019-03-21 15:46:50 -07:00
RandySheriffH
6c40aed95c
Rashuai/build x86 (#676)
* add how-to for x86 build

* formatting

* formatting
2019-03-21 15:45:33 -07:00
Pranav Sharma
5d452b3029
Use protobuf-lite to reduce onnxruntime.dll size. (#639)
* Test protobuf-lite

* Test protobuf-lite

* Test protobuf-lite

* Optimize protobuf usage for LITE_RUNTIME to reduce the binary size of
onnxruntime.dll. More details can be found here https://developers.google.com/protocol-buffers/docs/proto.
The reduction is significant. For commit id: 4873b452151bafe49da332aaeab639ef0318fc1ca28d728, the size
reduced by ~700K; from 4873728 to 4172800.

* Add LITE_RUNTIME flag in in.proto files

* Fix merge conflict.

* Address PR comments

* Forgot to add 2 files + fix linux and gpu build errors.

* Fix build errors + test failures

* Fix cuda tests

* Fix tensor rt build

* Use full protobuf for trt

* Address PR comments

* Print tensor shape proto as text string for easier debugging
2019-03-21 14:06:38 -07:00
utsab
a624e1091e fixing clang build failure 2019-03-21 13:55:47 -07:00
RandySheriffH
0995e853fa
Rashuai/unify version (#653)
* unify version logic

* merge function to build.py

* read file to get version

* fix path issue

* format python

* use path join

* fix __init__.py

* move function to a separate file

* remove redundant
2019-03-21 13:44:13 -07:00
Scott McKay
a3499083da
Add iterator traits aliases to ConstPointerContainer::ConstIterator (#634)
* Add iterator traits aliases.

* Add a few more pieces to make more compliant with the input iterator requirements.
2019-03-21 15:45:58 +10:00
Ashwini Khade
2f1c3028b7
add capi to set graph optimization level (#657)
* add capi to set graph optimization level

* remove 1 unnecessary check + review comment

* plus updates
2019-03-20 17:14:46 -07:00
jignparm
819457dd45
Added netframework test (#658) 2019-03-20 10:37:09 -07:00
Scott McKay
17af8e9ba7
Add subgraph check/update to node removal logic. Fix a few minor issues with Graph that came up during testing of the changes. (#651)
* Check usage of node output as implicit input in any subgraphs.

* Add logic to check/update subgraphs when removing a node.
Fix some issues with Graph
  - Include local outer scope variables when validating. Required if calling Resolve on a subgraph
  - Include outer scope variables in the value info so the type information is captured. Also required to Resolve a subgraph but will detect a type mismatch (previously we threw the type information away).
  - Fix GraphNodes iterator so it can be used with std::find_if. Needed to be assignable so the end_ value can't be const.
2019-03-20 14:57:45 +10:00
jignparm
c366647262
Add missing probit function for treeregressor (#619)
* added missing probit function for treeregressor

* corrected ml_logit to ml_logistic to reflect update of output

* Updated function names to comply with style guide

* use write_scores() to simplify code block
2019-03-19 17:55:26 -07:00
Changming Sun
9e323901b2
Upgrade mkldnn to 0.18 (#650)
Upgrade mkldnn to 0.18, to fix a build issue with GCC 8.
2019-03-19 16:23:55 -07:00
Hariharan Seshadri
1aa24cbbf3
executable size reduction: cleaned up slice op to get savings (#621)
* Initial commit

* More cahnges

* More changes

* Fix build break
2019-03-19 15:00:16 -07:00
Casey Carter
3f52de07c7 Add missing include to status.h
status.h must include <ostream> to use std::ostream.
2019-03-19 11:59:41 -07:00
Xavier Dupré
4cc7121368
Fixes #626, remove posix option for regular expression in Tokenizer operator (#627)
* remove posix option
* add unit test for regular expression
2019-03-19 14:18:27 +01:00
Ryan Hill
da9af592d9 Remove OrtAppendCustomOpLibPath (#642)
* Remove OrtAppendCustomOpLibPath

* Fix parameter mismatch

* More parameter fixes
2019-03-18 19:44:32 -07:00
Ashwini Khade
481eb971ec
graph transformers update (#608)
* graph transformers update

* some updates

* plus changes

* more updates

* fixes per review comments

* enable tests

* adding more tests

* more changes

* update api in inference sesion

* changes per review

* Linux CI fix

* fix linux CI failure

* fix MAC CI failure

* more updates

* add more documentation and add level param to register transformer
2019-03-18 14:52:16 -07:00
Scott McKay
541b3149dd
Minor fix to disallow an execution provider registering a nullptr. This matches the expected behavior of GetKernelRegistriesByProviderType to not return any nullptrs. (#646) 2019-03-19 06:56:15 +10:00
Scott McKay
971058fc38
Avoid copy of pre-existing value to subgraph output (#637)
* Add AllocKind::kShare to allow copying the MLValue for a pre-existing value to a graph output when an Identity node is involved. Ideally we can make this handling for an Identity node more general purpose, however the current logic to free an MLValue during execution doesn't take into account a re-use point also needing a free. Due to that, limit the scope and start with a somewhat ugly hardcoded approach.

Migrate some changes from PR497

The existing Loop unit tests exercise the new code. Also manually stepped through the problematic model to verify the unnecessary copy was avoided.

* Fix build error

* Fix missing switch case in debug output of allocation plan

* Limit optimization to Loop
2019-03-19 06:55:59 +10:00
Changming Sun
14d9a2bdc7
Remove an unnecessary check in GetDirNameFromFilePath function (#616)
* Remove an unnecessary in GetDirNameFromFilePath function

* Remove PathTest.invalid_double_slash
2019-03-17 23:55:25 -07:00
Yufeng Li
8faccfcd92
Remove parallel logic for backward and forward GPU (#641) 2019-03-16 02:37:51 -07:00
Dmitri Smirnov
4c2b1c3018
Rework Transpose as a generic type agnostic implementation (#561)
Make Transpose op impl generic and add std::string support.
  Un-templatize implementation functions that make use of memcpy().
  Support all types per spec. Add string tests.
2019-03-15 14:03:15 -07:00
Changming Sun
4bd8463228 Update docs (#633) 2019-03-15 13:56:58 -07:00
manashgoswami
bdc2bbb207 Build details for TensorRT execution provider. (#632)
* Fixed typos in docs for 'onnx_test_runner'

* TensorRT Execution Provider (preview) release

Updated build instructions and component governence and third party notices for TensorRT execution provider release.

* test runner option for tensorrt

updated to add option for tensorrt.

* Introduction to TensorRT Execution Provider

Intro README for TensorRT Execution Provider.

* Update BUILD.md

* Update TensorRT-ExecutionProvicer.md

* corrected typo in the filename

* corrected typos

* updated with corrections.

* removed conflicting edits.

* Update BUILD.md
2019-03-15 11:29:00 -07:00
KeDengMS
a362c3bbdf
Fix node def replacement by checking memory location (#623)
Fix node def replacement by checking memory location
2019-03-15 01:12:21 -07:00
stevenlix
5c09aa863c
python binding test updates for TensorRT (#635)
* Disable unsupported python tests for TensorRT

* Rename python wheel for TensorRT
2019-03-14 22:25:16 -07:00
stevenlix
639aa6d97a
Update run_dockerbuild.sh (#631) 2019-03-14 16:06:21 -07:00