Commit graph

62 commits

Author SHA1 Message Date
Raymond Yang
ec8ac04f30
Update cast op to support string <-> numeric (#379)
* Update cast kernel to support to/from string

* Update namespace

* Add support for literal numeric case

* Update to support -INF test

* Update kernel registration for cast

* Update ONNX to 1.4.1

* Update registy api

* Resolve some comments

* Update cast kernel implementation

* Resolve comments

* Fixed test data in onnx

* Update cast kernel implementation

* Resolve PR comments

* Update cast_op.cc

* Update onnx commits info

* Update comments
2019-02-12 10:10:56 -08:00
Yufeng Li
7b37dc6105
Enable USE_MKLML_FOR_BLAS (#387)
* Enable USE_MKLML_FOR_BLAS

* add mklml include directory for onnxruntime_provider and onnxruntime_provider_cuda

* add mklml_include_dir to include_directories
2019-02-08 07:14:37 -08:00
Pranav Sharma
db0fde9add
Make USE_MLAS macro conditional on cmake flag for consistency with other options and make it ON by default. It was already enabled by default today. (#454) 2019-02-07 18:33:00 -08:00
shahasad
8a8d1b0cea
Fix MacOS shared library build (#447)
* try removing the --version-script

* remove --no-undefined flag

* remove the -rpath linker flag

* remove the -rpath linker flag, including the -Wl

* remove the --whole-archive flags

* added -all_load -noall_load flags in place of --whole-archive and --no-whole-archive

* spell correct all-load

* set the MacOS specific cmake configs with if(APPLE) condition

* added --build_shared_lib to mac CI
2019-02-06 15:27:37 -08:00
Weixing Zhang
696ab8a194
Create a separate component for graph optimization. (#421)
* Create a project for graph optimizer.

Move optimizer related code to the folder optimizer.

* Fix build failures.

* rebase and fix build failures.

* fix build failure.

* fix build failure with cuda path.

* fix python build failure.

* Move two transformers(memcpy and insert_cast) from framework to optimizer.

* rebase.

* SessionState should not depend on optimizer.
2019-02-04 15:45:12 -08:00
Changming Sun
9f0298261d Fix a build warning in onnxruntime python extension (#416) 2019-02-01 00:19:41 -08:00
Randy
2f73d7abf8
compile with GL/LTCG (#391)
* compile with GL/LTCG

* apply the change to release build

* remove GL/LTCG from release build

* exclude cuda from using GL/LTCG
2019-01-29 14:18:23 -08:00
Xavier Dupré
439dbbada9
Adds OnnxTransformer to plug onnxruntime in sckit-learn's pipeline (#389)
Useful for transfer learning
2019-01-29 18:51:24 +01:00
jignparm
68881fadcd Delay load cudart64 for cpu execution 2019-01-29 08:50:37 -08:00
Changming Sun
6349114583 Revert "Rashuai/link with ltcg (#378)" (#383)
This reverts commit f53cc032db.
2019-01-25 19:00:23 -08:00
jignparm
ccca1e9402
Update property file for Nuget Linux package (#369)
* Copy mkldnn to output folder for linux. Nuget doesn't resolve dll dependency correctly within a package

* Modify to copy all dlls to output folder

* update rpath for shared library

* Simplified linker flags for RPATH

* Removing copying of dlls to output folder, since setting RPATH works fine now
2019-01-25 10:45:39 -08:00
Randy
f53cc032db Rashuai/link with ltcg (#378)
* compile with GL&LTCG

* remove tab

* restrict flag to only relwithdebinfo

* enable all OPT flags for relwithdebinfo
2019-01-24 19:29:05 -08:00
Scott McKay
bca8daf762
Update ONNX. Implement Scan 9 changes (#366)
* Update ONNX version to pickup Scan spec change that adds scan_output_axes.
Add logic to transpose an output
  - write to temporary buffer when executing subgraph
  - transpose temporary buffer into Scan output when execution completes
Add unit tests

* Update to ONNX dbf3581835e3a05716e10587511d7ab3b2cdc386 to pickup inferencing bugfix.
Update test to match.

* Disable some tests for opset 9 operators that haven't been implemented yet.
2019-01-24 08:10:39 +10:00
stevenlix
8ea7197b82 trt (#361)
* updated cmake files for tensorrt
2019-01-23 13:28:13 -08:00
Changming Sun
c87929e949 Use nsync for implementing condition variable 2019-01-21 22:59:42 -08:00
Tracy Sharpe
6f30bec040 Implement MLAS convolution+activation fusion (#354)
* conv+activation fusion
2019-01-18 14:16:28 -08:00
Changming Sun
948cc03490 upgrade onnx 2019-01-17 13:10:30 -08:00
Changming Sun
34afa0a598 Delete onnxruntime_exec 2019-01-16 11:18:44 -08:00
Changming Sun
d23f01dcd9 Suppress warnings for gemmlowp 2019-01-15 22:29:30 -08:00
Ashwin Kumar
492d9fd6cc
Use Eigen ThreadPool in OnnxRuntime (#323)
* switch to nonblocking threadpool in inference session and sessions state

* switch to eigen threadpool - first draft

* refine

* refine

* add a switch to easily revert back to windows thread pool

* switch thread pool in test runner and turn on leak checker

* remove unncessary files

* fix build error

* more build fixes

* catch exceptions in parallel executor

* fix mac build error

* fix mac build error

* more build fixes

* more mac build fixes

* fix cv issue

* change macro to include cuda compiler for  disabled compiler warning

* try switching the macro to win32 only

* test #error

* move #disable warning to the top

* Update onnxruntime_framework.cmake

* move eigen include to public scope

* turn off eigenthreadpool by default and add todo comment
2019-01-15 15:19:30 -08:00
Ke Zhang
139abda393
convinteger implementation based on gemmlowp (#294)
* update

* cmake change

* rename

* update

* update

* add cmake

* fix build warnings.

* fix comments

* update cmake to avoid run gemmlowp tests

* update cmake

* update

* fix build break

* update

* fix comments

* fix test failure

* add one more test case with padding.

* fix conv implementation of mkldnn and cuda to use updated computekernelshape function.

* fix linux ci build break
2019-01-15 14:39:50 -08:00
Edward Chen
677918cd9a Added generation of C# project properties file containing actual build directory. 2019-01-14 16:02:13 -08:00
jignparm
3b83f062fc remove delayload from mkldnn (#276) 2019-01-14 14:13:28 -08:00
Changming Sun
260639c327 Add missing EXCLUDE_FROM_ALL keyword to nsync submodule 2019-01-11 16:34:55 -08:00
Changming Sun
751eb60819 Fix a build issue in onnxruntime_unittests.cmake (#307) 2019-01-09 22:53:53 -08:00
Changming Sun
84be07114b Disable nsync test 2019-01-09 18:56:12 -08:00
Changming Sun
8cfe8d33a3 Add nsync (#292)
* Add nsync

* nsync2

* nsync3

* fix build

* update comments

* fix build option
2019-01-09 10:40:55 -08:00
Pranav Sharma
31bbb4598e
Enable tvm in CI builds. (#285)
* Enable tvm in CI builds

* Fix tvm dll path issue
2019-01-07 19:37:06 -08:00
Ke Zhang
386b60e35a
update gemmlowp which carries a windows build break fix. (#291) 2019-01-07 18:24:46 -08:00
Yang Chen
4929ddde43 update tvm submodule (#287) 2019-01-07 14:32:58 -08:00
Changming Sun
5e113661a9 Build system upgrades (#281)
* update

* runas normal user
2019-01-07 13:15:24 -08:00
xkszltl
5b6f1823af Use CUDA libraries found by CMake. (#190)
* Link cudart and use CMake predefined variables for cublas.

* Patch for old cmake.

* Remove static cudnn.
Fix file name for cuBLAS DLL delay loading.
2019-01-02 16:51:42 -08:00
Yuan Yu
bd2ace7619 Tweak linker flags to minimize the binary size. (#270) 2019-01-02 14:50:34 -08:00
xkszltl
bddeb3d001 Fix "invalid escape" issue when eigen_SOURCE_PATH contains "\". (#258) 2019-01-02 14:00:21 -08:00
Raymond Yang
8f3a492682 Remove static linked cuda libs (#239) 2019-01-02 10:39:37 -08:00
Dmitri Smirnov
7af1887b33
Introduce basic BFloat16 runtime support (#235)
* Add basic support for BFloat16 type.

* Advance onnx submodule for bfloat16 support.

* Update install_deps for linux.

* Address review comments.
2018-12-21 12:40:59 -08:00
jywu-msft
eb867be331
update mkldnn to 0.17.2 (#231) 2018-12-20 07:32:49 -08:00
Tang, Cheng
0dca080238 remove useless internal schema file (#226)
* placeholder for internal contrib ops

* remove useless internal file

* fix build break
2018-12-19 18:17:20 -08:00
Changming Sun
e97caa7787 change mkldnn so path (#210) 2018-12-19 14:45:57 -08:00
Changming Sun
dc8b37f4c4
update onnx (#209)
* update onnx
2018-12-18 14:50:28 -08:00
Ke Zhang
37b74c771a
add gemmlowp as submodule. (#206) 2018-12-18 13:57:53 -08:00
Changming Sun
c0ec7d56c4
Clean up garbage files (#208) 2018-12-18 13:51:01 -08:00
KeDengMS
b0f27ba0a7
Allow using MKLML header/libs when use_mklml is specified (#178)
Allow using MKLML header/libs when use_mklml is specified
2018-12-17 14:41:42 -08:00
jignparm
5fd9024139
[WIP] Initial checking for CSharp GPU support (#176)
* Initial checking for CSharp GPU support

* Enabled C# for GPU build

* Update Onnxruntime to Ort

* Add runtime check for cuda dlls for windows

* Update pretrained model test, for models where name!=model.onnx

* lowered tolerance for float checks to pass new models

* ignore extra ._resnet34v2.onnx file in pretrained test
2018-12-17 21:18:48 +00:00
xkszltl
383315a7e0 Upgrade Eigen to 3.3.7 (#185) 2018-12-16 00:26:08 -08:00
Tongliang Liao
16b3a9fd63 onnxruntime_CUDA_HOME should only be used when available. 2018-12-16 00:25:12 -08:00
Tracy Sharpe
47551da994
Optimize Tanh/Sigmoid activations (#162)
* optimized tanh/sigmoid

* fix /W4 warnings from alternate build environment

* use MLAS for tanh/sigmoid

* fix my broken C++ templates

* add x86_64 files
2018-12-13 22:53:40 -08:00
edgchen1
c5a0119d42 Added Environment::IsInitialized() and added check to InferenceSession constructor. (#169) 2018-12-13 13:34:49 -08:00
Ashwini Khade
b054646ddd
Askhade/implement erf (#137)
* erf implementation for op9

* enable erf node tests + review comment fixes

* update CMAKE flag

* plus erf to execution provider
2018-12-10 17:26:01 -08:00
Ashwini Khade
c5a691d67d
update onnx to latest commit (#132)
* update onnx to latest commit

* fixes

* Add not implemented operator tests to whitelist

* fix test name
2018-12-10 11:17:52 -08:00