Commit graph

5033 commits

Author SHA1 Message Date
Ryan Hill
5da718a114 Formatting 2021-05-17 22:32:11 -07:00
ashbhandare
3c1cf82ff1 Bump to rel-1.9.1 (#7684) 2021-05-16 23:24:16 -07:00
Ryan Hill
8d911833ef Merge error 2021-05-16 20:50:27 -07:00
Ryan Hill
9ddbb046a2 Merge related issues and code review feedback. 2021-05-16 19:22:03 -07:00
Ryan Hill
67adb7bfe0 Merge with master 2021-05-16 18:41:57 -07:00
Tracy Sharpe
943ab9dcef
undo clang-format corruption (#7720) 2021-05-16 15:32:30 -07:00
liqunfu
d604281a86
Liqun/training pkg to run tests (#7662) 2021-05-16 09:10:57 -07:00
liqunfu
3ead2f2f39
update pt lightning version (#7711)
Co-authored-by: liqun <liqun@OrtTrainingDev4.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>
2021-05-15 21:46:16 -07:00
Yulong Wang
017b94ab36
Specify correct dependency for CI pipeline of nodejs binding (#7717) 2021-05-15 08:56:58 -07:00
Yufeng Li
6b0a7905ed
fix quant weight cleanup bug (#7707) 2021-05-14 22:04:35 -07:00
Sergii Dymchenko
9ba8da65d2
Fix BiasDropoutFusion when there are multiple Dropout consumers. (#7708)
* Add tests for multiple consumers bias dropout fusion.

* Don't fuse residual Add for multiple Dropout consumers.

* Remove duplicate code.

* Fix comment typo.

Co-authored-by: Derek Murray <Derek.Murray@microsoft.com>
2021-05-14 21:03:23 -07:00
Hariharan Seshadri
53d1d55ea8
Add ability for pre-packed weights of shared initializers to be shared across sessions (#7421) 2021-05-14 20:44:42 -07:00
Ashwini Khade
c78a40c0c7
add opset14 registrations for cuda provider (#7699)
* add opset14 registrations for cuda provider

* update macros

* fix typo

* updates to rocm provider
2021-05-14 19:08:45 -07:00
Yulong Wang
97d9bcd644
[js/web] fix bundle for multi-thread, add e2e test and support nodejs (#7688)
* fix bundle for multi-thread, add e2e test and support nodejs

* add copyright banner

* resolve comments

* add comments for isMultiThreadSupported()
2021-05-14 18:15:38 -07:00
Yufeng Li
a74e41e47d
Add non-zero zp support for quant matmul and attention (#7570)
* add non-zero zp support
* support A and B scale with any dimensions
2021-05-14 16:50:31 -07:00
Yulong Wang
c53b5be509
force multi steps to use the same commit in CI (#7697) 2021-05-14 15:13:38 -07:00
Zhang Lei
0f7721a019
Fix bug for not checking original float value names when treat it as not existing. (#7695) 2021-05-14 12:50:30 -07:00
Changming Sun
1d403ba03b
Fix a compile warning in EigenNonBlockingThreadPool.h (#7638) 2021-05-14 11:38:34 -07:00
Zhang Lei
033f0b3b7c
fix typo. (#7690) 2021-05-14 10:25:34 -07:00
Sunghoon
a9b47ca8e7
update debug.keystore for public usage (#7696) 2021-05-14 09:54:54 -07:00
liqunfu
359fe1d197
Liqun/ort training version (#7620) 2021-05-14 09:54:19 -07:00
ashbhandare
bfbcc89db1
Add MLFloat16 support for SoftmaxCrossEntropyLoss for CUDA EP (#7679)
* Forward op changes

* Add tests, improve kernel

* add opset 13 registration, remove unnecessary changes

* Add fp16 grad for SCELoss, review comments
2021-05-14 09:00:27 -07:00
Changming Sun
39fac6d304
Fix a buffer reuse bug in allocation_planner.cc (#7645) 2021-05-14 07:31:35 -07:00
Ashwini Khade
442c7300eb
add opset14 rnn ops (#7687)
* add opset14 rnn ops

* update kernel hashes
2021-05-14 05:52:54 -07:00
Yufeng Li
3725d0211f
support maxpool QDQ fusion from opset 12 (#7693) 2021-05-13 23:58:15 -07:00
Zhang Lei
50c5edcf13
Add nhwc support for QLinearAveragePool operator (#7656)
* Add nhwc support for QLinearAveragePool operator

* Update ContribOperators.md

* Update OperatorKernels.md with cpu,dnnl and cuda enabled.
2021-05-13 22:05:30 -07:00
baijumeswani
37f69fcee5
Regain performance by caching initializer names in ORTModule (#7685) 2021-05-13 20:54:49 -07:00
Edward Chen
19704aedbb
Update Objective-C API (#7675)
- Add session/run configuration
- Add additional supported tensor data types
- Clean up
2021-05-13 18:47:22 -07:00
ashbhandare
56e993a434
Bump to rel-1.9.1 (#7684) 2021-05-13 18:41:28 -07:00
Xueyun Zhu
32d8278c2f
reshape fix (#7678) 2021-05-13 16:34:03 -07:00
raviskolli
4b37901f10
Aten support for rocm (#7680)
* Aten support for rocm

* Removed aten_ops.cc as it is reused from cuda version
2021-05-13 15:56:03 -07:00
Aswin John Mathews
4afdc19958
ROCm optimized layernorm for MI100 (#7682)
* layernorm optimizations

* Changed HIP flag from HIP_VERSION to __HIP_PLATFORM_HCC__
2021-05-13 15:54:06 -07:00
satyajandhyala
d90a99aad5
Fix the build on dev machines by replacing std::tuple with two arguments with std::pair (#7683) 2021-05-13 15:11:51 -07:00
Hariharan Seshadri
7bb3f243ff
Revert (#7663) 2021-05-13 14:11:17 -07:00
Sunghoon
1ab8a95eb6
Bind existing SessionOptions and RunOptions in Javascript API with WebAssembly (#7621)
* support session options and run options. use onnxruntime c api.

* fix lint errors

* add an error code on throwing an exception

* resolve comments. change remaining C++ APIs to C API
2021-05-13 10:50:04 -07:00
Guoyu Wang
333318af04
[CoreML EP] add clip support (#7666) 2021-05-13 10:41:30 -07:00
Ryan Hill
5f62d4bb3b Oops, scope DistributedRunContext to just NCCL 2021-05-13 10:34:28 -07:00
Zhang Lei
46246f1bbd
Add nhwc transformer support and unittest for qlinear concat. (#7587) 2021-05-13 09:50:37 -07:00
satyajandhyala
3a407b40dd
Add the missing source file to the target onnxruntime_test_debug_node… (#7676)
* Add the missing source file to the target onnxruntime_test_debug_node_inputs_outputs
2021-05-13 09:14:12 -07:00
harshithapv
31ca21b782
Replace Where Grad "Mul" with "Where" (#7672)
* replace where grad mul with where

* clean up

* auto formatting

* remove not for second input
2021-05-13 08:54:43 -07:00
Ryan Hill
374ae71739 Missing static 2021-05-13 02:28:54 -07:00
Ryan Hill
8b538e7918 Test passed, now with fix 2021-05-13 02:28:49 -07:00
Ryan Hill
0a59bc3902 Test moving DistributedRunContext instance into shared provider layer
(with a deliberate error to verify it's being built properly)
2021-05-13 02:28:44 -07:00
Ryan Hill
741e09a882 Convert aten_op to shared (due to merge with master) 2021-05-12 22:32:51 -07:00
Ryan Hill
86cb70a118 Sync with Master 2021-05-12 21:28:07 -07:00
George Nash
b4e8e9b004
Add DnnlOpManager (#7521)
* Add DnnlOpManager

The DnnlOpManager is able to more accurately check if a node is
supported by the DNNLExecutionProvider.

The DNNLExecutionProvider::GetCapability function has been updated
to use the DnnlOpManager.

This commit adds the ability to check if data type, attributes,
and tensor dimensions of the node are supported.

The IsDimensionSupported function is no longer needed since the checks
it was doing have been moved into the individual implementations of
the virtual class DnnlNodeCapability.

Signed-off-by: George Nash <george.nash@intel.com>

* Fix AveragePool entry in the DnnlOpManager

Added check for ceil_mode attribute in the PoolNodeCapability
check.  DnnlExecutionProvider does not support ceil_mode other
than the default value.

Signed-off-by: George Nash <george.nash@intel.com>
2021-05-12 22:04:26 -05:00
Ryan Hill
06128b9677 Missed a logging reference in mpi_context.h 2021-05-12 19:39:52 -07:00
Ryan Hill
3263241baa Update comments 2021-05-12 18:38:43 -07:00
Ryan Hill
1a20b6d13e Add in a couple more missing GetType methods 2021-05-12 18:36:05 -07:00
Ryan Hill
b1924a9323 Remove LOGS statements as a test 2021-05-12 18:36:01 -07:00