Commit graph

6215 commits

Author SHA1 Message Date
Sunghoon
b038f4e56f
Add a build option to create a WebAssembly static library (#10184)
* add p50 in test

* Add a build option to create a WebAssembly static library

Co-authored-by: Yulong Wang <yulongw@microsoft.com>
2022-01-18 18:05:04 -08:00
Yi-Hong Lyu
62eab67f79
Fuse DQ -> ArgMax into ArgMax (#10274) 2022-01-18 14:47:33 -08:00
Yi-Hong Lyu
e27f2dc932
int8/uint8 support for Argmax for opset 1, 11, 12 (#10296) 2022-01-18 14:37:34 -08:00
Yulong Wang
712f4e403d
[js/common] upgrade marked@4.0.10 (Dependbot warning) (#10313) 2022-01-18 14:00:10 -08:00
Scott McKay
c1c9fa18bf
C#: Avoid inefficient DenseTensor ctor in ToTensor extensions (#10240)
* Update extension helpers to avoid inefficient construction of DenseTensor.
Add tests for extension helpers.
2022-01-19 07:43:44 +10:00
Guoyu Wang
6ae22d562b
[QDQ] Move NNAPI EP to use NodeUnitIODef for non-QDQ ops (#10237) 2022-01-18 12:54:58 -08:00
Chen Fu
33dd2f8f5e
fix mac compilation error (#10268)
Fix Mac compilation error in new cpuinfo changes
2022-01-18 08:09:27 -08:00
Vincent Wang
c12cafa524
Optimize Transpose CUDA Kernel (#10230)
* optimize transpose cuda

* fix comment typo
2022-01-15 15:39:06 +08:00
RandySheriffH
a757bd7186
Render summarized ort perf with tree map in browser (#10189)
* render summarized ort perf with tree map

* add readme

* add comment

* Update readme.md

* Update readme.md
2022-01-14 15:45:32 -08:00
RandySheriffH
ab5fd42ed4
reset MIN for float/double (#10284) 2022-01-14 13:57:29 -08:00
pengwa
e365ad7f3a
fix deadlock in model.train mode forward run only (#9960)
* fix deadlock in model.train model forward run only

* fix tests

* clear the grad_fns before every forward run

* add clean up on exit

* fix

* refine code comments
2022-01-14 13:53:29 -08:00
Thiago Crepaldi
6a7d3deb22
Update pytorch-lightning (#10276) 2022-01-14 16:49:10 -05:00
Vincent Wang
44e2db9397
CUDA BFloat16 Refactor (#10085) 2022-01-14 19:38:56 +08:00
Xavier Dupré
e38e51ea8e
Improve iobinding, faster name search (#10005)
* Improve iobinding, faster name search
2022-01-14 12:18:18 +01:00
Vincent Wang
3ea7fb0f9f
fix mem leak (#10272) 2022-01-14 14:54:19 +08:00
dependabot[bot]
2a55bc2c21 Bump engine.io from 4.1.1 to 4.1.2 in /js/web
Bumps [engine.io](https://github.com/socketio/engine.io) from 4.1.1 to 4.1.2.
- [Release notes](https://github.com/socketio/engine.io/releases)
- [Changelog](https://github.com/socketio/engine.io/blob/4.1.2/CHANGELOG.md)
- [Commits](https://github.com/socketio/engine.io/compare/4.1.1...4.1.2)

---
updated-dependencies:
- dependency-name: engine.io
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
2022-01-13 18:26:02 -08:00
Baiju Meswani
2affd6e71e
orttraining packaging and ci pipelines to use cuda 11.3 (#10252) 2022-01-13 13:36:33 -08:00
dependabot[bot]
4b205eb2b3
Bump follow-redirects from 1.13.3 to 1.14.7 in /js/web (#10266)
Bumps [follow-redirects](https://github.com/follow-redirects/follow-redirects) from 1.13.3 to 1.14.7.
- [Release notes](https://github.com/follow-redirects/follow-redirects/releases)
- [Commits](https://github.com/follow-redirects/follow-redirects/compare/v1.13.3...v1.14.7)

---
updated-dependencies:
- dependency-name: follow-redirects
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-01-13 09:05:22 -08:00
dependabot[bot]
943a1aa2d6
Bump follow-redirects from 1.14.5 to 1.14.7 in /js/node (#10265)
Bumps [follow-redirects](https://github.com/follow-redirects/follow-redirects) from 1.14.5 to 1.14.7.
- [Release notes](https://github.com/follow-redirects/follow-redirects/releases)
- [Commits](https://github.com/follow-redirects/follow-redirects/compare/v1.14.5...v1.14.7)

---
updated-dependencies:
- dependency-name: follow-redirects
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-01-13 09:03:41 -08:00
Edward Chen
d43ef67d2b
Move binary size check to separate pipeline (#10254)
Move binary size check(s) to a separate pipeline. In the future, other binary size-related builds can go here.
Add publishing of build artifacts for easier analysis.
Add optional build with debug info.
2022-01-12 19:21:20 -08:00
dependabot[bot]
3d9d8e20cc Bump numpy from 1.19.2 to 1.21.0 in /tools/ci_build
Bumps [numpy](https://github.com/numpy/numpy) from 1.19.2 to 1.21.0.
- [Release notes](https://github.com/numpy/numpy/releases)
- [Changelog](https://github.com/numpy/numpy/blob/main/doc/HOWTO_RELEASE.rst.txt)
- [Commits](https://github.com/numpy/numpy/compare/v1.19.2...v1.21.0)

---
updated-dependencies:
- dependency-name: numpy
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <support@github.com>
2022-01-12 17:45:35 -08:00
Yi-Hong Lyu
499f1d5fd7
Quantization of Argmax (#10213)
This patch includes:
* int8/uint8 support for Argmax
* Quantization tool support for Argmax
2022-01-12 14:12:56 -08:00
Tiago Koji Castro Shibata
98f85ae05b
Bump winrt version (#10243) 2022-01-12 10:52:27 -08:00
ashari4
aff96ce081
remove hardcoded type (#10251) 2022-01-12 10:00:34 -08:00
CarlPoirier
4af232df0c
Fix props file overwriting AdditionalIncludeDirectories (#10124)
Co-authored-by: Carl Poirier <carl.poirier@vab-solutions.com>
2022-01-11 23:30:40 -08:00
Rachel Guo
a099bd454b
[QDQ] Add shared qdq selectors (#10178)
* wip

* wip

* wip

* wip

* wip

* save

* minor changes

* update test graph name

* address pr comments

* update

* address pr comments

* address pr comments

* fix warning

* minor include fix

* update to nodegroupselectors

* delete unnecessary includes

Co-authored-by: rachguo <rachguo@rachguos-Mini.attlocal.net>
2022-01-11 19:41:45 -08:00
RandySheriffH
79d2a0d185
Dynamic cost model to mitigate high E2E perf variance (#9833)
* commit dyamic block size

* summarize granularity

* add configure

* add test case

* call std stoi

* add comments

* fix typo

* rename var

* update comment

* reset default

* better comments

* extend LoopCounter for dynamic blocking

* fix comments and add more UT

* update comments

* swtich type to std::ptrdiff_t

* format code with better indention

* cast ptrdiff_t

* fix typo
2022-01-11 17:26:41 -08:00
Chen Fu
fb4dea39e2
Tolerate cpuinfo init failure (#10199)
Tolerate pytorch cpuinfo library init failure.
2022-01-11 16:31:56 -08:00
Olivia Jain
4048ed326c
Update EP Perf Pipeline (#10149)
* migrate to 1ES Hosted Pool

* migrate to Kusto database

* refactor and organize ep names with ORT prefix

* standardize TRT benchmarking with save/load engine, input binding, and workspace

* Add TRT 8.2 to ep perf pipeline

* update model_list.json with full onnx zoo

* add anubis credentials

* add anubis credentials

* clarify trt variables

* get system info from docker image

* remove unwanted commenting
2022-01-11 16:12:32 -08:00
Yufeng Li
cb9b0275b6
Set default quantization weight type to int8 (#10209)
* Set default quantization weight type to int8
2022-01-11 14:24:57 -08:00
Shucai Xiao
ce103ace93
Amdmigraphx fix build error (#9272)
* fix build error

* rename a missing api for the MIGraphX EP
2022-01-10 15:18:43 -08:00
Chi Lo
be9cc40aa5
Make some parameters configurable for calibration (#10204) 2022-01-10 13:00:58 -08:00
Jeff Bloomfield
32ee379f50
GraphPartitioner.cpp: fixed Merge function to handle duplicated partitions. (#9929)
Co-authored-by: Achira <achiras@microsoft.com>
2022-01-10 12:01:08 -08:00
Jingqiao Fu
5cd57bb726
add load from buffer (#10162)
* Add LoadFromBuffer API
2022-01-10 10:51:48 -08:00
Edward Chen
edd1a2cf61
Add more Java test logging. (#10221) 2022-01-10 09:42:46 -08:00
Dwayne Robinson
0f5e82c294
DirectML EP remove stale code for int64 via int32 double strides (#9959) 2022-01-10 02:07:22 -08:00
Dwayne Robinson
1f5b073508
Minor DirectML EP provider factory comments (#9965) 2022-01-10 02:06:31 -08:00
PeixuanZuo
7d93498e0e
[FIX] register softmaxgrad_13/logsoftmaxgrad_13 for rocm (#10177)
* [FIX] register  softmaxgrad_13/logsoftmaxgrad_13 for rocm
* [FIX] update softmaxgrad_13/logsoftmaxgrad_13 implementation for rocm
2022-01-10 11:33:46 +08:00
Scott McKay
6e88c11cae
Refactor QDQ node group selection infrastructure (#10195)
* Separate out the QDQ node group selection from the SAT specific NodeSelector to make re-use in NNAPI etc. cleaner.

* Make MatMulIntegerToFloat matching optional.
Add move ctor to BaseSelector. Required now that it has a unique_ptr member.

* Avoid Guardian warning by using rvalue unique_ptr created with make_unique
2022-01-10 10:57:50 +10:00
Nat Kershaw (MSFT)
d52d3c0052
Update C/C++ API docs automation to create a PR (instead of push to publish branch) (#10093) 2022-01-07 16:16:47 -08:00
Ye Wang
5ebb857501
Update onnxruntime_unittests.cmake (#10215) 2022-01-07 16:14:15 -08:00
vade
bacae967a2
Update Cuda to 11.4.2, update architectures, support Ubuntu 20.04 (#10169) 2022-01-07 13:00:44 -08:00
Zhang Lei
2bbf1ac1e0
Using better words. (#10210) 2022-01-07 09:17:23 -08:00
Jeff Daily
e7efcc93fe
[ROCm] update hipify-perl location (#10102)
* [ROCm] update hipify-perl location

Depending on the ROCm version installed, hipify-perl might not always
live in the hard-coded path of /opt/rocm/bin. Use python 3.3's
shutil.which to locate the script.

* provide alternative locations for hipify-perl if not in PATH

* implement hipify-perl search as a function

This avoids running the logic during module import since all builds
import the amd_hipify module.

* fix flake8 errors
2022-01-06 17:21:02 -08:00
Abhishek Jindal
4ac3277743
adding definition of concat operator for mapping it to onnx (#10062)
* adding definition of concat operator for mapping it to onnx

* adding the opgen generator file to include tensorlist type for eager mode
2022-01-06 14:56:35 -08:00
Chris Hua
cab4579b83
remove six references (#9941)
Python 2 compatibility is no longer necessary and helps unblock upgrades to mypy and others.
2022-01-06 13:52:20 -08:00
Hariharan Seshadri
0552a47ec2
Enable CUDA provider option configuration for C# (#10188) 2022-01-06 11:03:14 -08:00
Ye Wang
08f512b25e
Fix a Win GPU reduced ops pipeline (#10202) 2022-01-06 09:46:34 -08:00
ashari4
4ab891999a
fix hardcoded type (#10205) 2022-01-06 09:28:22 -08:00
ashari4
7b5464ed7b
aten add_ op supports bf16 (#10084)
* hand implemented add_
2022-01-05 09:33:28 -08:00