Tiago Koji Castro Shibata
13ff79d1e4
Merged PR 6839685: Merge public onnxruntime
...
Related work items: #37239556
2022-01-14 18:52:23 +00:00
Vincent Wang
44e2db9397
CUDA BFloat16 Refactor ( #10085 )
2022-01-14 19:38:56 +08:00
Xavier Dupré
e38e51ea8e
Improve iobinding, faster name search ( #10005 )
...
* Improve iobinding, faster name search
2022-01-14 12:18:18 +01:00
Vincent Wang
3ea7fb0f9f
fix mem leak ( #10272 )
2022-01-14 14:54:19 +08:00
dependabot[bot]
2a55bc2c21
Bump engine.io from 4.1.1 to 4.1.2 in /js/web
...
Bumps [engine.io](https://github.com/socketio/engine.io ) from 4.1.1 to 4.1.2.
- [Release notes](https://github.com/socketio/engine.io/releases )
- [Changelog](https://github.com/socketio/engine.io/blob/4.1.2/CHANGELOG.md )
- [Commits](https://github.com/socketio/engine.io/compare/4.1.1...4.1.2 )
---
updated-dependencies:
- dependency-name: engine.io
dependency-type: indirect
...
Signed-off-by: dependabot[bot] <support@github.com>
2022-01-13 18:26:02 -08:00
Baiju Meswani
2affd6e71e
orttraining packaging and ci pipelines to use cuda 11.3 ( #10252 )
2022-01-13 13:36:33 -08:00
Tiago Koji Castro Shibata
6cb18fc686
Merge remote-tracking branch 'upstream/master' into dmldev_temp
2022-01-13 10:36:07 -08:00
dependabot[bot]
4b205eb2b3
Bump follow-redirects from 1.13.3 to 1.14.7 in /js/web ( #10266 )
...
Bumps [follow-redirects](https://github.com/follow-redirects/follow-redirects ) from 1.13.3 to 1.14.7.
- [Release notes](https://github.com/follow-redirects/follow-redirects/releases )
- [Commits](https://github.com/follow-redirects/follow-redirects/compare/v1.13.3...v1.14.7 )
---
updated-dependencies:
- dependency-name: follow-redirects
dependency-type: indirect
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-01-13 09:05:22 -08:00
dependabot[bot]
943a1aa2d6
Bump follow-redirects from 1.14.5 to 1.14.7 in /js/node ( #10265 )
...
Bumps [follow-redirects](https://github.com/follow-redirects/follow-redirects ) from 1.14.5 to 1.14.7.
- [Release notes](https://github.com/follow-redirects/follow-redirects/releases )
- [Commits](https://github.com/follow-redirects/follow-redirects/compare/v1.14.5...v1.14.7 )
---
updated-dependencies:
- dependency-name: follow-redirects
dependency-type: indirect
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-01-13 09:03:41 -08:00
Edward Chen
d43ef67d2b
Move binary size check to separate pipeline ( #10254 )
...
Move binary size check(s) to a separate pipeline. In the future, other binary size-related builds can go here.
Add publishing of build artifacts for easier analysis.
Add optional build with debug info.
2022-01-12 19:21:20 -08:00
dependabot[bot]
3d9d8e20cc
Bump numpy from 1.19.2 to 1.21.0 in /tools/ci_build
...
Bumps [numpy](https://github.com/numpy/numpy ) from 1.19.2 to 1.21.0.
- [Release notes](https://github.com/numpy/numpy/releases )
- [Changelog](https://github.com/numpy/numpy/blob/main/doc/HOWTO_RELEASE.rst.txt )
- [Commits](https://github.com/numpy/numpy/compare/v1.19.2...v1.21.0 )
---
updated-dependencies:
- dependency-name: numpy
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
2022-01-12 17:45:35 -08:00
Ryan Lai
8aad46c09b
Merged PR 6836425: Merge RI without squash
...
Related work items: #37712737
2022-01-13 01:24:21 +00:00
Ryan Lai
a44d24d8b5
Merge remote-tracking branch 'origin/dmldev' into user/rylai/ri_01_12_22
2022-01-12 16:26:27 -08:00
Ryan Lai
c07e251cec
Merged PR 6835169: RI 12/9/21 - 01/12/22
...
Build is green https://microsoft.visualstudio.com/WindowsAI/_build/results?buildId=43713985&view=results

Related work items: #37712737
2022-01-13 00:25:51 +00:00
Yi-Hong Lyu
499f1d5fd7
Quantization of Argmax ( #10213 )
...
This patch includes:
* int8/uint8 support for Argmax
* Quantization tool support for Argmax
2022-01-12 14:12:56 -08:00
Ryan Lai
6f7389bc03
Merge commit '4af232df0ce89f0f0227ca8bd10ed8808d4398e5' into HEAD
2022-01-12 11:48:43 -08:00
Tiago Koji Castro Shibata
98f85ae05b
Bump winrt version ( #10243 )
2022-01-12 10:52:27 -08:00
ashari4
aff96ce081
remove hardcoded type ( #10251 )
2022-01-12 10:00:34 -08:00
CarlPoirier
4af232df0c
Fix props file overwriting AdditionalIncludeDirectories ( #10124 )
...
Co-authored-by: Carl Poirier <carl.poirier@vab-solutions.com>
2022-01-11 23:30:40 -08:00
Rachel Guo
a099bd454b
[QDQ] Add shared qdq selectors ( #10178 )
...
* wip
* wip
* wip
* wip
* wip
* save
* minor changes
* update test graph name
* address pr comments
* update
* address pr comments
* address pr comments
* fix warning
* minor include fix
* update to nodegroupselectors
* delete unnecessary includes
Co-authored-by: rachguo <rachguo@rachguos-Mini.attlocal.net>
2022-01-11 19:41:45 -08:00
RandySheriffH
79d2a0d185
Dynamic cost model to mitigate high E2E perf variance ( #9833 )
...
* commit dyamic block size
* summarize granularity
* add configure
* add test case
* call std stoi
* add comments
* fix typo
* rename var
* update comment
* reset default
* better comments
* extend LoopCounter for dynamic blocking
* fix comments and add more UT
* update comments
* swtich type to std::ptrdiff_t
* format code with better indention
* cast ptrdiff_t
* fix typo
2022-01-11 17:26:41 -08:00
Chen Fu
fb4dea39e2
Tolerate cpuinfo init failure ( #10199 )
...
Tolerate pytorch cpuinfo library init failure.
2022-01-11 16:31:56 -08:00
Olivia Jain
4048ed326c
Update EP Perf Pipeline ( #10149 )
...
* migrate to 1ES Hosted Pool
* migrate to Kusto database
* refactor and organize ep names with ORT prefix
* standardize TRT benchmarking with save/load engine, input binding, and workspace
* Add TRT 8.2 to ep perf pipeline
* update model_list.json with full onnx zoo
* add anubis credentials
* add anubis credentials
* clarify trt variables
* get system info from docker image
* remove unwanted commenting
2022-01-11 16:12:32 -08:00
Yufeng Li
cb9b0275b6
Set default quantization weight type to int8 ( #10209 )
...
* Set default quantization weight type to int8
2022-01-11 14:24:57 -08:00
Shucai Xiao
ce103ace93
Amdmigraphx fix build error ( #9272 )
...
* fix build error
* rename a missing api for the MIGraphX EP
2022-01-10 15:18:43 -08:00
Chi Lo
be9cc40aa5
Make some parameters configurable for calibration ( #10204 )
2022-01-10 13:00:58 -08:00
Jeff Bloomfield
32ee379f50
GraphPartitioner.cpp: fixed Merge function to handle duplicated partitions. ( #9929 )
...
Co-authored-by: Achira <achiras@microsoft.com>
2022-01-10 12:01:08 -08:00
Jingqiao Fu
5cd57bb726
add load from buffer ( #10162 )
...
* Add LoadFromBuffer API
2022-01-10 10:51:48 -08:00
Edward Chen
edd1a2cf61
Add more Java test logging. ( #10221 )
2022-01-10 09:42:46 -08:00
Dwayne Robinson
0f5e82c294
DirectML EP remove stale code for int64 via int32 double strides ( #9959 )
2022-01-10 02:07:22 -08:00
Dwayne Robinson
1f5b073508
Minor DirectML EP provider factory comments ( #9965 )
2022-01-10 02:06:31 -08:00
PeixuanZuo
7d93498e0e
[FIX] register softmaxgrad_13/logsoftmaxgrad_13 for rocm ( #10177 )
...
* [FIX] register softmaxgrad_13/logsoftmaxgrad_13 for rocm
* [FIX] update softmaxgrad_13/logsoftmaxgrad_13 implementation for rocm
2022-01-10 11:33:46 +08:00
Scott McKay
6e88c11cae
Refactor QDQ node group selection infrastructure ( #10195 )
...
* Separate out the QDQ node group selection from the SAT specific NodeSelector to make re-use in NNAPI etc. cleaner.
* Make MatMulIntegerToFloat matching optional.
Add move ctor to BaseSelector. Required now that it has a unique_ptr member.
* Avoid Guardian warning by using rvalue unique_ptr created with make_unique
2022-01-10 10:57:50 +10:00
Nat Kershaw (MSFT)
d52d3c0052
Update C/C++ API docs automation to create a PR (instead of push to publish branch) ( #10093 )
2022-01-07 16:16:47 -08:00
Ye Wang
5ebb857501
Update onnxruntime_unittests.cmake ( #10215 )
2022-01-07 16:14:15 -08:00
vade
bacae967a2
Update Cuda to 11.4.2, update architectures, support Ubuntu 20.04 ( #10169 )
2022-01-07 13:00:44 -08:00
Zhang Lei
2bbf1ac1e0
Using better words. ( #10210 )
2022-01-07 09:17:23 -08:00
Jeff Daily
e7efcc93fe
[ROCm] update hipify-perl location ( #10102 )
...
* [ROCm] update hipify-perl location
Depending on the ROCm version installed, hipify-perl might not always
live in the hard-coded path of /opt/rocm/bin. Use python 3.3's
shutil.which to locate the script.
* provide alternative locations for hipify-perl if not in PATH
* implement hipify-perl search as a function
This avoids running the logic during module import since all builds
import the amd_hipify module.
* fix flake8 errors
2022-01-06 17:21:02 -08:00
Abhishek Jindal
4ac3277743
adding definition of concat operator for mapping it to onnx ( #10062 )
...
* adding definition of concat operator for mapping it to onnx
* adding the opgen generator file to include tensorlist type for eager mode
2022-01-06 14:56:35 -08:00
Chris Hua
cab4579b83
remove six references ( #9941 )
...
Python 2 compatibility is no longer necessary and helps unblock upgrades to mypy and others.
2022-01-06 13:52:20 -08:00
Hariharan Seshadri
0552a47ec2
Enable CUDA provider option configuration for C# ( #10188 )
2022-01-06 11:03:14 -08:00
Ye Wang
08f512b25e
Fix a Win GPU reduced ops pipeline ( #10202 )
2022-01-06 09:46:34 -08:00
ashari4
4ab891999a
fix hardcoded type ( #10205 )
2022-01-06 09:28:22 -08:00
ashari4
7b5464ed7b
aten add_ op supports bf16 ( #10084 )
...
* hand implemented add_
2022-01-05 09:33:28 -08:00
Edward Chen
34c025109c
Exclude graph_runtime_optimization_test.cc from reduced ops build. ( #10191 )
2022-01-05 09:22:38 -08:00
Ye Wang
2803a9465d
Add example of registering custom cuda op as shared lib ( #10025 )
2022-01-05 09:22:15 -08:00
yz
2078210a1c
Improve logging for symbolic shape inference
2022-01-04 13:17:07 -08:00
Edward Chen
792db33f01
Enable loading of ORT format model graph runtime optimizations ( #9901 )
...
Initial implementation of load/replay of runtime optimizations in an ORT format model.
2022-01-04 12:09:07 -08:00
Tang, Cheng
97659495d9
fix aten view op ( #10050 )
...
* fix aten view op
* add test case
* fix signature
* fix the build
Co-authored-by: Cheng Tang <chenta@microsoft.com@orttrainingdev9.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>
2022-01-04 08:29:30 -08:00
George Wu
91f85dfdad
update Dockerfile.manylinux2014_cuda11_4_tensorrt8_2 to TensorRT 8.2.2.1 ( #10167 )
2022-01-03 20:38:37 -08:00