Commit graph

2053 commits

Author SHA1 Message Date
Thiago Crepaldi
e2afe5e054 Revert Session and InferenceSession implementation 2020-04-02 11:47:44 -07:00
Thiago Crepaldi
0b1e3f1e10 Revert _SliceKernel cuda implementation 2020-04-01 14:28:17 -07:00
Thiago Crepaldi
28ff88ce52 Disable tests (temporary) 2020-04-01 14:28:07 -07:00
Thiago Crepaldi
6d769d47c4 Fix InferenceSession API 2020-03-31 20:10:06 -07:00
Thiago Crepaldi
83c3da3fc0 Fix code-base after breaking API changes 2020-03-31 17:59:20 -07:00
Thiago Crepaldi
759818f2c1 Merge remote-tracking branch 'origin/master' into thiagofc/ort_training_merge_from_master 2020-03-31 10:53:22 -07:00
Scott McKay
ace741680d
Constant-12 support (#3304)
1. Support the new fields for Constant in opset 12
2. Support SparseTensor in the Constant node by converting to dense tensor when lifting the Constant to an initializer. Will make a model with a sparse tensor in a Constant work but isn't an overly efficient approach.
2020-03-30 23:13:52 -07:00
stevenlix
2332a93db0
Update onnx-tensorrt parser (#3369)
* sync onnx-tensorrt parser and update TensorRT doc

* remove --msvc_toolset 14.16 in tensorrt ci pipeline
2020-03-30 20:31:59 -07:00
Jan Scholz
ce9acf0c21
iOS crosscompilation under linux (#3298)
* added support for ios crosscompilation under linux

* reverted cmake generator change

* if --ios is added protoc can be compiled for host system

* accidently reverted change to compile protoc for host system for ios if protoc exe is not set

* wdata is now used

* accidentally pasted CMAKE_OSX_ARCHITECTURES into CmakeLists.txt, also made bad merge on build.py previously

* removed print

* fixed typeo, deleted commented statements for earlier debugging

* reverted accidental delete

* added asmmacro.h for aarch64 asm
now MlasSgemmKernel**** gets underscore added if needed
no need anymote to differentiate between iOS arm64 and normal amr64 build
onnxruntime.cmake: added check if iOSCross is set to properly set RPATH

* removed 2 spaces

* fix: logcial error fixed, now protoc gets compiled if not supplied with --path_to_protoc_exe

* removed unecessarily added spaces

* removed some more spaces
2020-03-30 19:39:17 -07:00
edgchen1
fb2f97a002
Address master merge PR comments (#3348)
Address some comments from https://github.com/microsoft/onnxruntime/pull/3174.

- https://github.com/microsoft/onnxruntime/pull/3174#discussion_r396855459
- https://github.com/microsoft/onnxruntime/pull/3174#discussion_r396855630
- https://github.com/microsoft/onnxruntime/pull/3174#discussion_r396857140
- https://github.com/microsoft/onnxruntime/pull/3174#discussion_r398094858
- https://github.com/microsoft/onnxruntime/pull/3174#issuecomment-599024924
2020-03-30 18:52:48 -07:00
Yufeng Li
af618278f6
fix bugs in quantization and calibration tools (#3329)
Fix 3 bugs:
node names duplicate in calibration augment_graph if the name of node to quantize is empty.
If output nodes are quantized, output value are quantized and not dequantized back
Gather with data type int64 should not be quantized
2020-03-30 17:50:25 -07:00
Maxim Kalinin
f2ca2b2981
Avoid "infinite" loop in optimizer (#3321)
* Avoid "infinite" loop in optimizer

When symbolic dimensions are present and can be overridden,
FreeDimensionOverrideTransformer always sets modified flag to true. As a
consequence, the optimizer loops until the iteration limit is reached.
2020-03-31 08:37:00 +10:00
Changming Sun
06fc9506fd
Thread pool changes (#3153)
1. Copy tensorflow's thread pool class to ORT, so that we can get a better implementation of thread pool based parallelfor
2. Copy Eigen's thread pool class to ORT
3. Support thread affinity
4. Remove RNN kernel’s private thread pool
5. Modify pool kernels to use the thread pool when openmp is disabled.
2020-03-30 12:18:40 -07:00
Yulong Wang
0494036006
fix tensor location mismatch in allocation planner (#3249) 2020-03-30 11:20:43 -07:00
Cassie
2b10e625f9
added public value varibale to NamedOnnxValue (#3347)
Co-authored-by: cassieview <cassie.siljander@microsoft.com>
2020-03-30 10:45:39 -07:00
George Wu
355f39ddee
fix cuda build for cmake >= 3.17.0 (#3362) 2020-03-30 00:38:57 -07:00
ytaous
d8f0a0f223
Address PR comments (#3352)
* PR comments

* revert code for a couple comments

* add negative test case

Co-authored-by: Ethan Tao <ettao@microsoft.com>
2020-03-29 12:34:54 -07:00
Weixing Zhang
1bbc421884
Don't cast to fp16 in LayernormGrad (#3328)
Co-authored-by: Weixing Zhang <wezhan@microsoft.com>
2020-03-28 19:07:32 -07:00
Yang Chen
33b5010e62
skip optional inputs for scan subgraphs (#3349)
* skip optional inputs for scan subgraphs

We may have cases where the subgraph has optionial inputs that appear
in both subgraph's input and initializer, but not in the node's input.
In such cases, the input model might be invalid, but let's not choke
on it. Instead, let's issue a warning, skip the optional inputs,
and keep going forward.

* address CR feedback
2020-03-28 16:15:45 -07:00
Sherlock
ffb2a3359e
Implement WhereGrad (#3343) 2020-03-27 19:10:40 -07:00
Tiago Koji Castro Shibata
c3cea486d0
Port ConcurrencyTests from TAEF (#3086)
* Add ConcurrencyTests

* Make ConcurrencyTests compatible with TAEF

* Use test PCH in concurrency tests

* Fix include header

* Ignore unused code warnings on WINML_SKIP_TEST

* Remove BOM

* Remove conflicting namespace in older SDK

* Refactor duplicate code

* Fix unused DELAYLOAD

* Fix unused DELAYLOAD

* Remove link to internal bug

* Address code style fixes

* Add new concurrency tests
2020-03-27 17:39:22 -07:00
Tixxx
49e6043d07
support Huggingface's adamw (#3318)
* add weight decay mode to support both pytorch and huggingface's adamw
2020-03-27 08:04:27 -07:00
Yang Chen
5278f73202
Fixed two issues in symbolic_shape_infer script (#3332)
* Fixed two issues in symbolic_shape_infer script

This change addressed #3293

There were two issues in the script:

* We need to handle a special case for infer_Reshape, where input_shape
is empty and target shape_value is [-1]. In such case, we need to
get sympy data for the output dim (or create one if it doesn't exist).

* We need to update computed dims for newly-created shape for Range op

* also call _update_computed_dims for _infer_Expand

addressed CR feedback

* added ai.onnx into opset list

* instead of manipulating _infer_Reshape, call _update_computed_dims
from _infer_Expand to update newly-computed dims
2020-03-26 23:27:37 -07:00
Xiang Zhang
810a10b230
Enable Onnxruntime Telemetry by Default for 1.3 (#3338) 2020-03-26 20:57:39 -07:00
ytaous
131c65d23d
Addressing PR comments (#3334)
* PR comments

* PR comments

* PR comments

* error out bad shape

Co-authored-by: Ethan Tao <ettao@microsoft.com>
2020-03-26 18:43:30 -07:00
Faith Xu
2e875f4e67
Delete outdated page (#3320) 2020-03-26 18:24:02 -07:00
Pranav Sharma
497e83eda5
Minor update to the issue template. Add a line to attach model where applicable. (#3339) 2020-03-26 14:28:27 -07:00
Xueyun Zhu
0a6ec0df56
Merge pull request #3285 from microsoft/xuzhu/merge_from_master
Merge from master to ort_training
2020-03-26 12:10:13 -07:00
Sherlock
d143b41b81
Expose frozen_weights in PyTorch Frontend (#3317) 2020-03-26 11:26:54 -07:00
Hector Li
0e81962e98
correct the cmake version to 3.13 for Arm build (#3333) 2020-03-26 10:20:18 -07:00
ytaous
66c7579c93
address PR comments (#3312)
* address PR comments

* PR comments

* PR comments

* disable logging

* typo

Co-authored-by: Ethan Tao <ettao@microsoft.com>
2020-03-25 19:35:12 -07:00
Changming Sun
5f6ec8ea6d Fix a bug in Maxpool v8 2020-03-25 16:27:43 -07:00
Scott McKay
dee4fc8b8a
Apply the same check for no_transpose from the Reduce* ops to ArgMin and ArgMax (#3315) 2020-03-26 07:41:16 +10:00
Sheil Kumar
51e95ea946
Make ort errors appear in winml exceptions (#3316)
Co-authored-by: Sheil Kumar <sheilk@microsoft.com>
2020-03-25 12:20:40 -07:00
Xueyun Zhu
a031af8cfc fix windows build 2020-03-25 19:18:50 +00:00
Xueyun Zhu
9eb792a5b3 move env to .cc file 2020-03-25 16:57:05 +00:00
Wei-Sheng Chin
b38fc0d541
Add bias correction in Adam & Lamb for C++ frontend & python frontend (#3301) 2020-03-25 09:46:44 -07:00
Xueyun Zhu
e9877850a4 fix python error 2020-03-25 01:59:37 +00:00
Scott McKay
4db01309cb
Use GEMM for SVMRegressor. (#3305) 2020-03-25 11:49:44 +10:00
Bowen Bao
6474801ceb
Update ort_trainer.py with lazy onnx export (#3244)
* Delay onnx export to avoid extra info

* handle cases where onnx model is provided at initialization

* address comments

* fix rebase error
2020-03-24 13:34:15 -07:00
Tianlei Wu
19edad132c
Move AzureML Bert notebook from onnx tutorial (#3302) 2020-03-24 12:31:02 -07:00
Weixing Zhang
fef7989866
Replacing CudaAsyncBuffer with TArray to improve perf (#3303)
* removing using CudaAsyncBuffer

* Keep CudaAsyncBuffer for these ops: non_max_suppression, cudnn_rnn_base, concat, split

* fix windows build error

* fix windows build error.

* fix build error

* fix windows build error

Co-authored-by: Weixing Zhang <wezhan@microsoft.com>
2020-03-24 12:13:27 -07:00
Li-Wen Chang
98c28060b0
Aggregated Send/Recv (#3232)
* Aggregated Send/Recv

* fix typos

* CR refine

* CR refine

* CR refine

* Add scalar check.

* typo

* reformat

* CR refine

* Forgot to swap order in the implementation after spec changed

* CR refine

* Cr refine

* add Send's input type checking
2020-03-24 10:20:11 -07:00
Hariharan Seshadri
ef7b98f988
Support DisposableNamedOnnxValue inputs in c# Run() (#3175)
* Initial commit

* Update error message

* Update

* Updates to support holding onto onnxValue and pinnedmemoryBuffer

* Updates

* Minor updates

* Comment out a portion of the tests

* PR feedback

* Minor nit update

* Resolve comments

* PR feedback

* PR updates

* PR feedback
2020-03-23 18:36:12 -07:00
Faith Xu
fb5ab858d2
Update BUILD instructions (#3282)
Include guidance for building release packages per question from #3251
2020-03-23 18:35:22 -07:00
KeDengMS
d15c74e713
Implement pipeline event generator (#3206)
Implement pipeline event generator with OneFWOneBW schedule in timeline. Each stage of pipeline contains FW and BW of a subset of the model and are scheduled in one worker thread for each microbatch.
2020-03-23 17:32:54 -07:00
Xueyun Zhu
8f7bd51f7a fix pybind issue introduced by merge 2020-03-23 23:23:34 +00:00
Sheil Kumar
b72fe13941
Update WinML Projection to accept sequence of tensors (#3287)
* Enable sequence of tensor

* add tests

* small updates

* There should only be 2 elements returned

* CR feedback, and another 6->2 check update in the test.

* missing semicolon...

* Add explicit to constructor taking pointer paramter

Co-authored-by: Sheil Kumar <sheilk@microsoft.com>
2020-03-23 15:55:20 -07:00
Tixxx
7f610caca0
Make gradient clipping configurable. (#3243)
* Make gradient clipping configurable.
add control flag to c++ and python frontend
2020-03-23 12:21:48 -07:00
Weixing Zhang
843ee346a8
Implement struct TArray and simplify code. (#3291)
* Implement operator[] for TArray and simplify the code.

* fix a build error.

* add a constructor with std::vector input

* fix build error

* update based on code review feedback

Co-authored-by: Weixing Zhang <wezhan@microsoft.com>
2020-03-23 10:51:54 -07:00