edgchen1
2cb8cb816f
Disable or update flaky tests, improve test random seed accessibility. ( #3495 )
...
- Add output of test random seed
- Allow setting of test random seed with environment variable
- Disable / relax tolerance for flaky tests
2020-04-17 15:57:32 -07:00
edgchen1
de543c0308
Add SafeInt include to WinML targets ( #3558 )
...
Fixing Windows builds on the ort_training branch in preparation for the merge to master.
SafeInt (included via onnxruntime/core/common/safeint.h) was recently made a dependency of onnxruntime/core/framework/bfc_arena.h. That requires consumers of bfc_arena to compile with the SafeInt include directory.
2020-04-17 09:54:01 -07:00
manashgoswami
9fc2b6482b
Ort training README ( #3404 )
...
Added README for ORT Training
2020-04-16 14:51:33 -07:00
M. Zeeshan Siddiqui
6c1ccb659f
SoftmaxCrossEntropyLoss-12 forward and backward kernel implementation. ( #3465 )
...
* Update ONNX submodule commit to the latest.
* build break.
* SoftmaxCrossEntropyLoss: Forward and backward kernel implementation.
* Revert "build break."
This reverts commit 847cb50d294efbe6c09fa760e7cacf25bfb6146d.
* Add more tests and misc clean up.
* revert unintended changes.
* PR feedback.
* cleanup.
* PR feedback.
2020-04-16 12:27:07 -07:00
edgchen1
0ec90f7019
Put safeint_interface include directory into onnxruntime_common interface include directories to simplify usage by other targets. ( #3546 )
2020-04-16 10:34:32 -07:00
edgchen1
2f16172e69
Address PR comments and clean up. ( #3536 )
...
Address PR comments and clean up.
- https://github.com/microsoft/onnxruntime/pull/3174#discussion_r408549886
- https://github.com/microsoft/onnxruntime/pull/3174#discussion_r408551151
2020-04-15 15:51:52 -07:00
Jesse Benson
2536e80602
Rename API to onnxruntime.set_seed(<seed>)
2020-04-15 09:44:48 -07:00
Jesse Benson
644bc05830
Add Python API to set random seed: onnxruntime.seed(<seed>)
2020-04-15 09:44:48 -07:00
pengwa
2c7c45076b
MaxBatchSize E2E Test ( #3454 )
...
* max batch size e2e test
*update test data snapshot
2020-04-15 09:50:44 +08:00
edgchen1
4fa88a0a23
Remove cast to OpKernelContextInternal to get threadpool and directly use OpKernelContext. ( #3523 )
2020-04-14 14:30:26 -07:00
Tixxx
06b63975c0
Fix fp16 type mismatch when graph output is an fp32-only node ( #3411 )
...
* verify output node before changing its type in mixed precision mode
2020-04-14 09:35:19 -07:00
edgchen1
ba7225f986
Update Graph SetInputs and SetOutputs for training ( #3446 )
...
Fix training modification of Graph SetInputs() and SetOutputs(). Originally there were distinct code paths in Graph based on whether the graph was loaded from a GraphProto or created from scratch. The training modifications made that distinction a bit ambiguous - i.e., even though the Graph is loaded from a GraphProto for training, sometimes we rely on the other code path, e.g., to deduce the graph inputs after modifying it. Consequently, there was some odd behavior when using SetInputs(). For correctness, this change separates the cases where the graph is loaded from a GraphProto and where it is created from scratch.
2020-04-13 19:10:44 -07:00
M. Zeeshan Siddiqui
5d99f179b9
Merge pull request #3486 from microsoft/sedymche/merge_master_ort_training
...
Merge from master into ort_training
2020-04-13 10:55:36 -07:00
Sergii Dymchenko
bf3df41424
Put back SubmoduleCheckoutMode parameter into mac-ci.yml.
2020-04-12 21:49:38 -07:00
Sergii Dymchenko
b670cdc11d
Remove usage of DeviceProp (which is removed in ort_training) from cudnn_common.cc.
2020-04-12 21:40:54 -07:00
Sergii Dymchenko
7b2fc196fe
Get cudnn_common.cc from master.
2020-04-12 21:10:36 -07:00
Sergii Dymchenko
571a6d52f5
Add to list of failing backend tests from master.
2020-04-12 19:16:53 -07:00
edgchen1
cffdff6702
Publish unit test results from Linux and Mac builds ( #3480 )
...
* Added publish test results step to Linux and Mac builds.
* Fix test result file pattern.
2020-04-10 14:51:56 -07:00
Tixxx
f5ba9c922d
fix internal loss scale ( #3483 )
...
* Changed internal loss scale to 1-D
* added test
Co-authored-by: root <root@525204a066204ea794f942530b05ae7f000000.axlncovkyjne5caro2tmz3zryb.xx.internal.cloudapp.net>
2020-04-10 14:13:48 -07:00
Sergii Dymchenko
4b5f66ad17
Put dropout_default, dropout_random, celu back in the list of broken tests.
2020-04-10 13:27:00 -07:00
edgchen1
20c7dd9f5c
Remove orttraining/docker directory. ( #3476 )
...
The docker images are not publicly available yet.
Addressing PR comment: https://github.com/microsoft/onnxruntime/pull/3174#discussion_r390761308
2020-04-10 09:41:22 -07:00
Vincent Wang
03996c7c08
Fixes for Where, ConcatGrad and ReduceSumGrad ( #3415 )
...
* Fixes for Expand, Where, ConcatGrad ReduceSumGrad.
* Roll back expand, fix, add tests for reduce grad.
* Roll back CPU Expand change.
* Fix after merge.
Co-authored-by: Vincent Wang <weicwang@microsoft.com>
2020-04-10 19:35:32 +08:00
Sergii Dymchenko
bb2f427990
Remove FastGelu from activations.
2020-04-09 19:17:54 -07:00
Sergii Dymchenko
507d2bb9b9
Get onnxruntime/core/providers/cuda/math/matmul_integer.cc from ort_training.
2020-04-09 18:09:46 -07:00
Sergii Dymchenko
1b465ba820
Get onnxruntime/core/providers/cuda/cu from ort_training.
2020-04-09 18:03:17 -07:00
Sergii Dymchenko
c5176087bf
Get onnxruntime/contrib_ops/cuda/bert/fast_gelu.cc from ort_training.
2020-04-09 17:55:52 -07:00
Sergii Dymchenko
6bbc80951d
Get onnxruntime/core/providers/cuda/tensor/slice.h from ort_training.
2020-04-09 17:03:58 -07:00
Sergii Dymchenko
0e4080f1d6
Get cuda_common.h from master.
2020-04-09 16:56:52 -07:00
Sergii Dymchenko
84773c61c6
Rename ONNX OPTIONAL to OPTIONAL_VALUE.
2020-04-09 16:22:30 -07:00
liqunfu
e7297e6c9d
create pipeline for ci frontend tests ( #3422 )
...
create pipeline for nightly python front-end e2e tests
2020-04-09 15:31:22 -07:00
Sergii Dymchenko
eaa3f652df
Fix dynamicslice.cc after merge.
2020-04-09 15:17:21 -07:00
Sergii Dymchenko
8ea0e596ec
Fix onnxruntime_unittests.cmake after merge.
2020-04-09 13:14:15 -07:00
Sergii Dymchenko
6ba7c99e50
Merge branch 'master' into ort_training
2020-04-09 12:42:04 -07:00
ytaous
a08f16471a
Address comments around bfc arena ( #3460 )
...
* rename setting
* todo comments
* fix build
Co-authored-by: Ethan Tao <ettao@microsoft.com>
2020-04-08 19:35:32 -07:00
liqunfu
a298556600
raid rtol to unblock CI ( #3457 )
...
raise rtol to avoid expected CI test failure in onnxruntime_test_ort_trainer.py
2020-04-08 17:17:44 -07:00
ytaous
f73008483a
safeint for region bytes in bfc arena and code clean up ( #3447 )
...
* PR comments
* remove build issue workaround
* SafeInt for region bytes
* fix build
* fix build
Co-authored-by: Ethan Tao <ettao@microsoft.com>
2020-04-08 13:54:42 -07:00
liqunfu
1ddfe1249b
frontend test to use random seed ( #3209 )
...
frontend test to use random seed
2020-04-08 10:03:07 -07:00
Yufeng Li
4d71958ccf
Use IMMA for int8 matmul to leverage Turing Tensor Core ( #3413 )
...
Use IMMA for int8 matmul to leverage Turing Tensor Core
Format files under onnxruntime/core/providers/cude
2020-04-07 15:22:04 -07:00
Tracy Sharpe
de60a14c16
Fix output range for int8_t QuantizeLinear op ( #3445 )
2020-04-07 15:01:20 -07:00
Yulong Wang
aabf47b107
Fix Split CUDA implementation for zero sized input ( #2942 )
...
* Fix Split CUDA implementation for zero sized input
* resolve comments
* add case
* test case update: split into 2 tensors
2020-04-07 14:44:20 -07:00
Scott McKay
48e96ea65f
Reduce binary size of Slice implementation ( #3238 )
...
* Make the Slice implementation based on type sizes and reduce templatized code to a minimum.
* Remove using 'dynamic' as a template param to Slice as well.
2020-04-08 07:19:29 +10:00
ytaous
b35468289a
View Op - new unit tests and add support for tensor memcpy by offset/size ( #3439 )
...
* view ops UTs
* update per comments
* PR comments - code clean up
* code clean up per comments
Co-authored-by: Ethan Tao <ettao@microsoft.com>
2020-04-07 13:07:11 -07:00
Dmitri Smirnov
53b9d52fc6
Rework TensorToTensorProto. Do not put string data to raw_string. Eliminate redundant argument. ( #3438 )
...
Rework TensorToTensorProto. Eliminate redundant argument.
Do not put string data into raw_data.
2020-04-07 11:42:10 -07:00
Andrews548
43d6c464fc
Fix ACL EP pooling build breakage ( #3429 )
...
The commit 06fc9506fd which refactored cpu Pool class broke ACL EP build.
Also worked on the commit a4fe60c4d3 as it also affects the new class.
Move the declaration of the new MaxPoolV8 cpu class in the header file. Implement MaxPool 8-11 in ACL EP.
Co-authored-by: Andrei-Alexandru <andrei-alexandru.avram@nxp.com>
2020-04-07 07:03:52 -07:00
Tianlei Wu
4bdb5cc8e2
Add CPU implementation for FastGelu operator ( #3398 )
...
* Add CPU implementation for FastGelu operator
* Update optimization script to fuse Gelu or FastGelu according to Elf or Tanh is used in graph.
* Merge BiasGelu and FastGelu into one class
* Enable FastGelu Fusion optimizer for CPU Execution Provider.
2020-04-07 00:19:30 -07:00
Changming Sun
9e65298d7a
Re-enable tests ( #3437 )
...
Re-enable some tests that was recently fixed.
2020-04-06 20:13:34 -07:00
Thiago Crepaldi
15e32b44fd
Merge pull request #3383
...
Merge from master into ort_training
2020-04-06 19:05:01 -07:00
Tianlei Wu
8ab09186b7
Bert Optimization Script Improvements ( #3387 )
...
Add opt_level option for graph optimization level in bert perf test.
Support BERT models that output each layer, where SkipLayerNormalization has more than 4 children.
Check weight and bias are 1D for layer norm fusion.
Add a dummy class Gpt2OnnxModel for further changes of GPT2 model.
2020-04-06 16:55:40 -07:00
Edward Chen
95707d22a5
Disable gradient clipping for E2E test.
2020-04-06 23:07:28 +00:00
Dmitri Smirnov
c8f5e6e632
Implement Min/Max/Clip(12) ( #3410 )
...
Implement Max/Min for opset 12.
Add CLip(12) CPU impl.
Implement Clip(12) for CPU and CUDA add tests
2020-04-06 14:24:59 -07:00