stevenlix
557b94637d
Add more TensorRT env variables to provider options ( #7698 )
...
* add all trt env variables to provider options
* add python test
* Update onnxruntime_c_api.h
* fix issues
* validate values for options
2021-05-16 22:09:52 -07:00
Tracy Sharpe
943ab9dcef
undo clang-format corruption ( #7720 )
2021-05-16 15:32:30 -07:00
liqunfu
d604281a86
Liqun/training pkg to run tests ( #7662 )
2021-05-16 09:10:57 -07:00
liqunfu
3ead2f2f39
update pt lightning version ( #7711 )
...
Co-authored-by: liqun <liqun@OrtTrainingDev4.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>
2021-05-15 21:46:16 -07:00
Yulong Wang
017b94ab36
Specify correct dependency for CI pipeline of nodejs binding ( #7717 )
2021-05-15 08:56:58 -07:00
Yufeng Li
6b0a7905ed
fix quant weight cleanup bug ( #7707 )
2021-05-14 22:04:35 -07:00
Sergii Dymchenko
9ba8da65d2
Fix BiasDropoutFusion when there are multiple Dropout consumers. ( #7708 )
...
* Add tests for multiple consimers bias dropout fusion.
* Don't fuse resudial Add for multiple Dropout consumers.
* Remove duplicate code.
* Fix comment typo.
Co-authored-by: Derek Murray <Derek.Murray@microsoft.com>
Co-authored-by: Derek Murray <Derek.Murray@microsoft.com>
2021-05-14 21:03:23 -07:00
Hariharan Seshadri
53d1d55ea8
Add ability for pre-packed weights of shared initializers to be shared across sessions ( #7421 )
2021-05-14 20:44:42 -07:00
Ashwini Khade
c78a40c0c7
add opset14 registrations for cuda provider ( #7699 )
...
* add opset14 registrations for cuda provider
* update macros
* fix typo
* updates to rocm provider
2021-05-14 19:08:45 -07:00
Yulong Wang
97d9bcd644
[js/web] fix bundle for multi-thread, add e2e test and support nodejs ( #7688 )
...
* fix bundle for multi-thread, add e2e test and support nodejs
* add copyright banner
* resolve comments
* add comments for isMultiThreadSupported()
2021-05-14 18:15:38 -07:00
Yufeng Li
a74e41e47d
Add non-zero zp support for quant matmul and attention ( #7570 )
...
* add non-zero zp support
* support A and B scale with any dimensions
2021-05-14 16:50:31 -07:00
Yulong Wang
c53b5be509
force multi steps to use the same commit in CI ( #7697 )
2021-05-14 15:13:38 -07:00
Zhang Lei
0f7721a019
Fix bug for not checking original float value names when treat it as not existing. ( #7695 )
2021-05-14 12:50:30 -07:00
Changming Sun
1d403ba03b
Fix a compile warning in EigenNonBlockingThreadPool.h ( #7638 )
2021-05-14 11:38:34 -07:00
Zhang Lei
033f0b3b7c
fix typo. ( #7690 )
2021-05-14 10:25:34 -07:00
Sunghoon
a9b47ca8e7
update debug.keystore for a public usage ( #7696 )
2021-05-14 09:54:54 -07:00
liqunfu
359fe1d197
Liqun/ort training version ( #7620 )
2021-05-14 09:54:19 -07:00
ashbhandare
bfbcc89db1
Add MLFloat16 support for SoftmaxCrossEntropyLoss for CUDA EP ( #7679 )
...
* Forward op changes
* Add tests, improve kernel
* add opset 13 registration, remove unnecessary changes
* Add fp16 grad for SCELoss, review comments
2021-05-14 09:00:27 -07:00
Changming Sun
39fac6d304
Fix a buffer reuse bug in allocation_planner.cc ( #7645 )
2021-05-14 07:31:35 -07:00
Ashwini Khade
442c7300eb
add opset14 rnn ops ( #7687 )
...
* add opset14 rnn ops
* update kernel hashes
2021-05-14 05:52:54 -07:00
Yufeng Li
3725d0211f
support maxpool QDQ fusion from opset 12 ( #7693 )
2021-05-13 23:58:15 -07:00
Zhang Lei
50c5edcf13
Add nhwc support for QLinearAveragePool operator ( #7656 )
...
* Add nhwc support for QLinearAveragePool operator
* Update ContribOperators.md
* Update OperatorKernels.md with cpu,dnnl and cuda enabled.
2021-05-13 22:05:30 -07:00
Dwayne Robinson
a27ef39d8f
Merged PR 6047693: DML EP fix GetSupportedDeviceDataTypeMask for int64 (and some float64 issues)
...
Fix one issue with int64/uint64 not being returned when getting the supported data types. Also add various fixes for float64 (do not actually *enable* any float64 kernels in this CR, just fix some things that enable it to be enabled later.)
Related work items: #28761231
2021-05-14 04:16:12 +00:00
baijumeswani
37f69fcee5
Regain performance by caching initializer names in ORTModule ( #7685 )
2021-05-13 20:54:49 -07:00
Edward Chen
19704aedbb
Update Objective-C API ( #7675 )
...
- Add session/run configuration
- Add additional supported tensor data types
- Clean up
2021-05-13 18:47:22 -07:00
ashbhandare
56e993a434
Bump to rel-1.9.1 ( #7684 )
2021-05-13 18:41:28 -07:00
Xueyun Zhu
32d8278c2f
reshape fix ( #7678 )
2021-05-13 16:34:03 -07:00
raviskolli
4b37901f10
Aten support for rocm ( #7680 )
...
* Aten support for rocm
* Removed aten_ops.cc as it is resued from cuda version
2021-05-13 15:56:03 -07:00
Aswin John Mathews
4afdc19958
ROCm optimized layernorm for MI100 ( #7682 )
...
* layernorm optimizations
* Changed HIP flag from HIP_VERSION to __HIP_PLATFORM_HCC__
2021-05-13 15:54:06 -07:00
satyajandhyala
d90a99aad5
Fix the build on dev machines by replacing std::tuple with two arguments with std::pair ( #7683 )
2021-05-13 15:11:51 -07:00
Hariharan Seshadri
7bb3f243ff
Revert ( #7663 )
2021-05-13 14:11:17 -07:00
Sunghoon
1ab8a95eb6
Bind existing SessionOptions and RunOptions in Javascript API with WebAssembly ( #7621 )
...
* support session options and run options. use onnxruntime c api.
* fix lint errors
* add an error code on throwing an exception
* resolve comments. change remaining C++ APIs to C API
2021-05-13 10:50:04 -07:00
Guoyu Wang
333318af04
[CoreML EP] add clip support ( #7666 )
2021-05-13 10:41:30 -07:00
Zhang Lei
46246f1bbd
Add nhwc transformer support and unittest for qlinear concat. ( #7587 )
2021-05-13 09:50:37 -07:00
satyajandhyala
3a407b40dd
Add the missing source file to the target onnxruntime_test_debug_node… ( #7676 )
...
* Add the missing source file to the target onnxruntime_test_debug_node_inputs_outputs
2021-05-13 09:14:12 -07:00
harshithapv
31ca21b782
Replace Where Grad "Mul" with "Where" ( #7672 )
...
* replace where grad mul with where
* clean up
* auto formatting
* remove not for second input
2021-05-13 08:54:43 -07:00
George Nash
b4e8e9b004
Add DnnlOpManager ( #7521 )
...
* Add DnnlOpManager
The DnnlOpManager is able to more accurately check if a node is
supported by the DNNLExecutionProvider.
The DNNLExecutionProvider::GetCapability function has been updated
to use the DnnlOpManager.
This commit adds the ability to check if data type, attributes,
and tensor dimensions of the node are supported.
The IsDimensionSupported function is no longer needed since the checks
it was doing have been moved into the individual implementations of
the virtual class DnnlNodeCapability.
Signed-off-by: George Nash <george.nash@intel.com>
* Fix AveragePool entry in the DnnlOpManager
Added check for ceil_mode attribute in the PoolNodeCapability
check. DnnlExecutionProvider does not support ceil_mode other
than the default value.
Signed-off-by: George Nash <george.nash@intel.com>
2021-05-12 22:04:26 -05:00
Vincent Wang
dac24f7d63
Add ATenOp and call aten::embedding and its Backward Op from ORT ( #7590 )
...
* build with libtorch and impl torchembedding
* fix op shape infer
* local commit
* atenfunctionop
* call aten operator from online extension
* rollback build.py
* resolve comments
* bugfix
* fix build
* fix ortmodule test
* remove external outputs, resolve comments
* resolve comments
* export embedding to microsoft::atenop
* bugfix
2021-05-13 09:24:27 +08:00
ashari4
c808621e9d
Better detection of MPI using FindMPI ( #7653 )
...
* Better detection of MPI using FindMPI.
* Remove REQUIRED from find_package(MPI)
2021-05-12 17:54:20 -07:00
Guoyu Wang
a47a234b7e
Add minsdkver for AAR and AndroidTest ( #7669 )
2021-05-12 16:01:25 -07:00
Chi Lo
760828b2d4
Add FromProviderOptions()/ToProviderOptions() for TensorRT EP ( #7654 )
...
* integrate existed provider option configuration method
* add GetProviderOptions()
* fix bug
* Add tests
* Update test
2021-05-12 15:26:27 -07:00
Zhang Lei
1c7e683a95
Add Squeeze and Unsqueeze support for quantizaton tools. ( #7673 )
2021-05-12 14:56:46 -07:00
Zhang Lei
31d4413919
fix quantization tool bug when existing pass through only input ( #7674 )
2021-05-12 14:54:42 -07:00
Faith Xu
7cb9077043
Fix readme page ( #7659 )
...
* Delete mobile page
Moved to: https://www.onnxruntime.ai/docs/how-to/deploy-on-mobile.html
* Delete ONNX_Runtime_Mobile_NNAPI_perf_considerations.md
Moved to: https://www.onnxruntime.ai/docs/reference/execution-providers/NNAPI-ExecutionProvider.html#performance-tuning
* Fix links to website docs
* Update some summary text
* Add space
2021-05-12 14:30:23 -07:00
Weixing Zhang
9241f62e4c
enable MatMulScale and cast propagation for ROCm EP. ( #7657 )
2021-05-12 13:43:24 -07:00
M. Zeeshan Siddiqui
5d9885f706
Fix BadNames. ( #7658 )
2021-05-11 16:06:10 -07:00
Scott McKay
8deca24b1a
Don't remove an unused initializer if it is overridable. ( #7649 )
2021-05-12 07:03:54 +10:00
baijumeswani
c5aeaa9419
Support for unused model initializers ( #7631 )
...
* Support for unused model initializers
* Change graph_info.initializer* to sets
2021-05-11 12:26:56 -07:00
Sunghoon
88d2fc8f1e
ONNX Runtime React Native Library ( #7564 )
...
* onnxruntime react native binding
* add react native backend
* fix lint comments
* fix react native backend for ios
* remove unnecessary files to check in
* move onnxruntime-common to devDependency
* create two podspec files for iphoneos and iphonesimulator
* revise README.md and add third party notices for react native
* rename a package
* rename a package and revise README
* add a license into package.json
* revise README and comments
* fix typo
* fix lint errors
* fix lint errors
* add a prepack script. touch index.tsx and App.tsx to resolve CI issue
* remove a unsupported tsx format from clang-format
* fix a type and add steps tp publish a react native npm package
* resolve comments
* fix clang format
* remove promise wrap. change prepack to typescript
2021-05-11 10:34:40 -07:00
Olivia Jain
29172d8f54
Setup EP Dashboard ( #7321 )
...
* setting up dashboard
* posting to ort dashboard
* creating separate docker file
* including common deps
* tracking latency over time
2021-05-11 10:33:39 -07:00