Randy
f53cc032db
Rashuai/link with ltcg ( #378 )
...
* compile with GL<CG
* remove tab
* restrict flag to only relwithdebinfo
* enable all OPT flags for relwithdebinfo
2019-01-24 19:29:05 -08:00
Scott McKay
bca8daf762
Update ONNX. Implement Scan 9 changes ( #366 )
...
* Update ONNX version to pickup Scan spec change that adds scan_output_axes.
Add logic to transpose an output
- write to temporary buffer when executing subgraph
- transpose temporary buffer into Scan output when execution completes
Add unit tests
* Update to ONNX dbf3581835e3a05716e10587511d7ab3b2cdc386 to pickup inferencing bugfix.
Update test to match.
* Disable some tests for opset 9 operators that haven't been implemented yet.
2019-01-24 08:10:39 +10:00
stevenlix
8ea7197b82
trt ( #361 )
...
* updated cmake files for tensorrt
2019-01-23 13:28:13 -08:00
Changming Sun
c87929e949
Use nsync for implementing condition variable
2019-01-21 22:59:42 -08:00
Tracy Sharpe
6f30bec040
Implement MLAS convolution+activation fusion ( #354 )
...
* conv+activation fusion
2019-01-18 14:16:28 -08:00
Changming Sun
948cc03490
upgrade onnx
2019-01-17 13:10:30 -08:00
Changming Sun
34afa0a598
Delete onnxruntime_exec
2019-01-16 11:18:44 -08:00
Changming Sun
d23f01dcd9
Suppress warnings for gemmlowp
2019-01-15 22:29:30 -08:00
Ashwin Kumar
492d9fd6cc
Use Eigen ThreadPool in OnnxRuntime ( #323 )
...
* switch to nonblocking threadpool in inference session and sessions state
* switch to eigen threadpool - first draft
* refine
* refine
* add a switch to easily revert back to windows thread pool
* switch thread pool in test runner and turn on leak checker
* remove unncessary files
* fix build error
* more build fixes
* catch exceptions in parallel executor
* fix mac build error
* fix mac build error
* more build fixes
* more mac build fixes
* fix cv issue
* change macro to include cuda compiler for disabled compiler warning
* try switching the macro to win32 only
* test #error
* move #disable warning to the top
* Update onnxruntime_framework.cmake
* move eigen include to public scope
* turn off eigenthreadpool by default and add todo comment
2019-01-15 15:19:30 -08:00
Ke Zhang
139abda393
convinteger implementation based on gemmlowp ( #294 )
...
* update
* cmake change
* rename
* update
* update
* add cmake
* fix build warnings.
* fix comments
* update cmake to avoid run gemmlowp tests
* update cmake
* update
* fix build break
* update
* fix comments
* fix test failure
* add one more test case with padding.
* fix conv implementation of mkldnn and cuda to use updated computekernelshape function.
* fix linux ci build break
2019-01-15 14:39:50 -08:00
Edward Chen
677918cd9a
Added generation of C# project properties file containing actual build directory.
2019-01-14 16:02:13 -08:00
jignparm
3b83f062fc
remove delayload from mkldnn ( #276 )
2019-01-14 14:13:28 -08:00
Changming Sun
260639c327
Add missing EXCLUDE_FROM_ALL keyword to nsync submodule
2019-01-11 16:34:55 -08:00
Changming Sun
751eb60819
Fix a build issue in onnxruntime_unittests.cmake ( #307 )
2019-01-09 22:53:53 -08:00
Changming Sun
84be07114b
Disable nsync test
2019-01-09 18:56:12 -08:00
Changming Sun
8cfe8d33a3
Add nsync ( #292 )
...
* Add nsync
* nsync2
* nsync3
* fix build
* update comments
* fix build option
2019-01-09 10:40:55 -08:00
Pranav Sharma
31bbb4598e
Enable tvm in CI builds. ( #285 )
...
* Enable tvm in CI builds
* Fix tvm dll path issue
2019-01-07 19:37:06 -08:00
Ke Zhang
386b60e35a
update gemmlowp which carries a windows build break fix. ( #291 )
2019-01-07 18:24:46 -08:00
Yang Chen
4929ddde43
update tvm submodule ( #287 )
2019-01-07 14:32:58 -08:00
Changming Sun
5e113661a9
Build system upgrades ( #281 )
...
* update
* runas normal user
2019-01-07 13:15:24 -08:00
xkszltl
5b6f1823af
Use CUDA libraries found by CMake. ( #190 )
...
* Link cudart and use CMake predefined variables for cublas.
* Patch for old cmake.
* Remove static cudnn.
Fix file name for cuBLAS DLL delay loading.
2019-01-02 16:51:42 -08:00
Yuan Yu
bd2ace7619
Tweak linker flags to minimize the binary size. ( #270 )
2019-01-02 14:50:34 -08:00
xkszltl
bddeb3d001
Fix "invalid escape" issue when eigen_SOURCE_PATH contains "\". ( #258 )
2019-01-02 14:00:21 -08:00
Raymond Yang
8f3a492682
Remove static linked cuda libs ( #239 )
2019-01-02 10:39:37 -08:00
Dmitri Smirnov
7af1887b33
Introduce basic BFloat16 runtime support ( #235 )
...
* Add basic support for BFloat16 type.
* Advance onnx submodule for bfloat16 support.
* Update install_deps for linux.
* Address review comments.
2018-12-21 12:40:59 -08:00
jywu-msft
eb867be331
update mkldnn to 0.17.2 ( #231 )
2018-12-20 07:32:49 -08:00
Tang, Cheng
0dca080238
remove useless internal schema file ( #226 )
...
* placeholder for internal contrib ops
* remove useless internal file
* fix build break
2018-12-19 18:17:20 -08:00
Changming Sun
e97caa7787
change mkldnn so path ( #210 )
2018-12-19 14:45:57 -08:00
Changming Sun
dc8b37f4c4
update onnx ( #209 )
...
* update onnx
2018-12-18 14:50:28 -08:00
Ke Zhang
37b74c771a
add gemmlowp as submodule. ( #206 )
2018-12-18 13:57:53 -08:00
Changming Sun
c0ec7d56c4
Clean up garbage files ( #208 )
2018-12-18 13:51:01 -08:00
KeDengMS
b0f27ba0a7
Allow using MKLML header/libs when use_mklml is specified ( #178 )
...
Allow using MKLML header/libs when use_mklml is specified
2018-12-17 14:41:42 -08:00
jignparm
5fd9024139
[WIP] Initial checking for CSharp GPU support ( #176 )
...
* Initial checking for CSharp GPU support
* Enabled C# for GPU build
* Update Onnxruntime to Ort
* Add runtime check for cuda dlls for windows
* Update pretrained model test, for models where name!=model.onnx
* lowered tolerance for float checks to pass new models
* ignore extra ._resnet34v2.onnx file in pretrained test
2018-12-17 21:18:48 +00:00
xkszltl
383315a7e0
Upgrade Eigen to 3.3.7 ( #185 )
2018-12-16 00:26:08 -08:00
Tongliang Liao
16b3a9fd63
onnxruntime_CUDA_HOME should only be used when available.
2018-12-16 00:25:12 -08:00
Tracy Sharpe
47551da994
Optimize Tanh/Sigmoid activations ( #162 )
...
* optimized tanh/sigmoid
* fix /W4 warnings from alternate build environment
* use MLAS for tanh/sigmoid
* fix my broken C++ templates
* add x86_64 files
2018-12-13 22:53:40 -08:00
edgchen1
c5a0119d42
Added Environment::IsInitialized() and added check to InferenceSession constructor. ( #169 )
2018-12-13 13:34:49 -08:00
Ashwini Khade
b054646ddd
Askhade/implement erf ( #137 )
...
* erf implementation for op9
* enable erf node tests + review comment fixes
* update CMAKE flag
* plus erf to execution provider
2018-12-10 17:26:01 -08:00
Ashwini Khade
c5a691d67d
update onnx to latest commit ( #132 )
...
* update onnx to latest commit
* fixes
* Add not implemented operator tests to whitelist
* fix test name
2018-12-10 11:17:52 -08:00
Tracy Sharpe
3c7c1068e7
refactor threading ( #110 )
2018-12-06 09:20:32 -08:00
jywu-msft
47a6992e1b
update mkldnn to 0.17.1 and address assumptions related to tensor padding that come with new mkldnn version. ( #79 )
2018-12-03 17:07:44 -08:00
Scott McKay
4dd1e50aa7
Merge remote-tracking branch 'origin/master' into scmckay/UpdateCudaInfoInBuildMd
2018-11-29 14:53:11 +10:00
George Wu
3feeb9de3e
Merge branch 'master' of https://github.com/Microsoft/onnxruntime into jywu/incr_fixup
2018-11-28 20:16:06 -08:00
Yulong Wang
846044e282
[Mac] support mkldnn for macOS ( #56 )
2018-11-28 20:01:21 -08:00
Raymond Yang
6371025860
Add flag for mac compliance ( #45 )
...
* Refine windows CI configs
* Add flag for mac compliance
2018-11-28 19:22:29 -08:00
Yulong Wang
d60507d2e9
[Mac] fix python binding ( #54 )
2018-11-28 18:29:16 -08:00
George Wu
6179ddb2a1
fix for possible incremental build breaks.
2018-11-28 18:14:20 -08:00
Scott McKay
2f61926390
Update build.md section on CUDA builds to clarify and add CUDA 10.0 info.
...
Add --msvc_toolset param to build.py
Tweak CMakeLists.txt to set the CUDA toolset path from onnxruntime_CUDA_HOME.
2018-11-29 10:37:21 +10:00
Pranav Sharma
7aef8a1cca
Sync with internal master.
2018-11-22 20:56:43 -08:00
Pranav Sharma
9fbcb7fa51
Update onnx to commit to pickup shape inference fix for broadcast, concat and scan
2018-11-20 19:41:30 -08:00