Commit graph

3893 commits

Author SHA1 Message Date
Yufeng Li
3cae28699b
work around of the build break in mac (#6069)
* Fix the build break in macos release

* revert android change
2020-12-07 20:39:36 -08:00
Ye Wang
fa06be2133
Support export >2G model when using optimizer.py only (#6014)
* checkin

* add warning if user specify same inut and output path
2020-12-07 17:18:49 -08:00
Edward Chen
b348538c8a
Update build docker image cache cleanup (#6048)
The current image cache cleanup is not removing many images. Upon examining the cache container registry logs, it appears there are some infrequent pulls of old images which may be made by something other than CI builds (perhaps some automated scan of the registry).
This change adds a minimum access count for images in the cache so that infrequently but periodically accessed images can be removed. The idea is that images used by CI builds that are worth caching will have a higher volume of accesses.
2020-12-07 13:07:19 -08:00
Tianlei Wu
51fbe87b9b
Update profiler tool to support gpt2 and longformer models (#6011)
* support gpt2 and longformer in profiler tool
* rename bert_profiler to profiler
* Add --basic_optimization to allow user to use basic level of graph optimization
* Add --kernel_time_only to filter kernel time and exclude fence time
* Add --threshold to filter nodes that with low run time percentage.
2020-12-07 10:33:41 -08:00
Changming Sun
925879a8b0
Remove python 3.8 Windows GPU build from python packaging pipeline (#6054)
Revert the last a few changes to get the pipeline back to a normal state.
2020-12-07 10:23:07 -08:00
George Wu
020efc9002
fix windows cuda support for python 3.8 + (#6046)
* fix

* noqa

* fix.

* remove unused import
2020-12-07 10:09:22 -08:00
ashbhandare
7cebf76a46
Improve checkpointing for Zero stage 1 (#5478)
* Initial running changes

* Checkpointing aggregation changes

* compare with older version

* initial cleanup

* Add zero test, minor fix

* Fix zero test, transform, formatting

* Review comments

* add more unit tests

* review comments

* Try fix CI

* Add additional check on just aggregation code

* Try fix ckpt gen

* Add pregenerated ckpt for CI, enable zero test in e2e

* Moving test to nightly, removing ckpt files

* Add tests to dist GPU CI

* Fix dist test

* Review comments

* Fix test
2020-12-07 09:16:01 -08:00
Hariharan Seshadri
a046ef133a
Update api_summary.rst (#6038) 2020-12-04 17:59:56 -08:00
dependabot[bot]
d5e8c48e54 Bump highlight.js from 10.2.1 to 10.4.1 in /nodejs
Bumps [highlight.js](https://github.com/highlightjs/highlight.js) from 10.2.1 to 10.4.1.
- [Release notes](https://github.com/highlightjs/highlight.js/releases)
- [Changelog](https://github.com/highlightjs/highlight.js/blob/master/CHANGES.md)
- [Commits](https://github.com/highlightjs/highlight.js/compare/10.2.1...10.4.1)

Signed-off-by: dependabot[bot] <support@github.com>
2020-12-04 16:45:07 -08:00
Edward Chen
d8139814fd
Clean up builds (#6015)
Update training Python packaging build to use get_docker_image.py.
Remove BUILD_EXTR_PAR docker build argument.
Update get_docker_image.py to check again for the image in the cache after building and before pushing to reduce the chance of a redundant push.
2020-12-04 15:13:17 -08:00
Sheil Kumar
00f43a3a68
add missing iclosable interface (#6036)
Co-authored-by: Sheil Kumar <sheilk@microsoft.com>
2020-12-04 13:21:03 -08:00
Jesse Benson
14f6eb14b1 Use __launch_bounds__ workaround, rather than limiting threads to 256 on AMD. 2020-12-03 13:06:34 -08:00
Jesse Benson
98ea7372d3 Re-enable Lamb unit tests for AMD 2020-12-03 13:06:34 -08:00
Jesse Benson
245d43615d Fix AMD multi-tensor implementation. 2020-12-03 13:06:34 -08:00
Edward Chen
6572a4d306
Disable Python 3.9 for training Python packaging build. (#6012)
Disable Python 3.9 for training Python packaging build. Python 3.9 is not supported by the PyTorch dependency.
2020-12-03 11:42:28 -08:00
Tianlei Wu
cdb91208a3
longformer onnx conversion and benchmark tools (#6007)
* initial implementation of longformer tools for onnx conversion and benchmark

* Support ONNX conversion for transformers 4.0
Add an option to optimize onnx model, and export fp16 model
2020-12-03 11:37:30 -08:00
Cecilia Liu
3b198c9614
Support Fusion for 1 and 2 Inputs Bert Models Converted From tf (#5993)
Support fusion for 1 and 2 inputs Bert models converted from tf
2020-12-03 10:52:33 -08:00
Sherlock
c86a1e5c13
Fix Flaky orttraining tests (#5977)
* Fix Flacky orttraining  tests
2020-12-03 10:24:25 -08:00
Ryan Lai
2878e8eb2e
Fix nuget build error (#6009) 2020-12-03 09:28:39 -08:00
baijumeswani
2b35f7d4f6
Fix build.py bug which prevents running some unit tests (#5990)
Also ignore an exception occurred for execution providers which generate compiled nodes
2020-12-03 08:57:55 -08:00
Xavier Dupré
0acc3837ee
Make operator TreeEnsemble 5x faster for batches of size 100.000 (#5965)
* improves processing time by 10
* extend coverage unit test coverage
* better implementation for the multi regression case
* better comment, keep parallelization by trees when not enough trees
2020-12-03 14:36:42 +01:00
Xavier Dupré
524b9fa899
Initialize a structure in operator ReduceSum (#6005)
* fix initialisation issue
2020-12-03 12:41:26 +01:00
Zhang Lei
648c9c7789
Fix bugs for 1: Calibrator should check model inputs; 2: (#6017)
quantize_inupts forgot to use parameter initializer_use_weight_qtyp.
2020-12-03 00:00:16 -08:00
Xavier Dupré
bdd06f6310
Fix PR #5550 reverted in #5911 (performance improvment for operator Transpose) (#5916)
* Improves implementation of transpose operator
* Fix issue mentioned in #5911
* adding unit test for function DoTransposeImpl
2020-12-03 00:38:18 +01:00
Yufeng Li
f2dcba7afe
Fuse MatMulIntegerToFloat only when scales are scalar (#6008)
MatMulIntegerToFloat fusion fuses per-row and per-column MatMulInteger, which is not supported by the MatMulIntegerToFloat kernel now. Limit the fusion to per-matrix only before we supporting the per-channel fully.
2020-12-02 14:40:17 -08:00
Yufeng Li
4fdfbfd4b4
Add int32_t support for DeQuantizeLinear (#5994)
* Add int32_t support for DeQuantizeLinear

* DequantizeLinear with int32 should have not zero point
2020-12-02 12:35:41 -08:00
Olivia Jain
c727a28735
include gemm_helper.h (#5988) 2020-12-02 11:28:28 -08:00
Xiang Zhang
b4e6cc59c7
skip the check for A channel (#5989) 2020-12-02 11:23:54 -08:00
Guoyu Wang
cdacee6696
[NNAPI] Support non-1d tensor for C of Gemm op (#5982)
* Add support for non-1d tensor for C of Gemm

* check android api level before add squeeze

* Minor update

* Fix to accept c only in format of {1,1,...,1,n}
2020-12-02 00:22:38 -08:00
Guoyu Wang
6846c665ff
Use loose version in build.py (#5998) 2020-12-01 20:57:44 -08:00
Ryan Lai
897310f6fb
Add suspend handler with new telemetry event for UWP scenarios (#5907)
* Add suspend handler with new telemetry event

* Fix build warning

* Use cppwinrt from nuget

* Restore nuget packages

* add dependencies

* Add nuget_helpers

* Cleaned up

* Clean up

* Comment

* Add dependencies for the rest

* Remove unused line

* Update activation string

* PR comment to remove ALL
2020-12-01 20:26:18 -08:00
Edward Chen
6d642a3dba
Replace direct pulls from image cache container registry with get_docker_image.py, build definition clean up. (#5906) 2020-12-01 19:10:23 -08:00
Chun-Wei Chen
c63e8cf7d7
Remove chronological starttime assertion in InferenceTest.cs because it is not determined (#5976)
* remove chronological starttime assertion because it is not determined

* use different vars
2020-12-01 15:58:12 -08:00
Adam Pocock
396074d2a8
Fixing OrtEnvironment.getEnvironment() so it doesn't print a warning if the environment already exists with a non-default name. (#5973) 2020-12-01 15:21:06 -08:00
Scott McKay
30c7fffbab
Expand the documentation on using compiling EPs with a minimal build (#5893)
* Expand the documentation on using compiling EPs with a minimal build to call out a 'simple' option that is easier to use. Provide more background on what happens to help users choose the best option for them.
Tweak conversion script to be noisier about attempted usage of 'all' optimization level.

Co-authored-by: manashgoswami <magoswam@microsoft.com>
2020-12-02 09:12:36 +10:00
Nat Kershaw (MSFT)
46277bfd09
Remove survey link from README (#5979) 2020-12-01 11:36:00 -08:00
Ye Wang
5f516899bf
optimize a bert model converted using tf2onnx (#5492)
* optimize a bert model converted using tf2onnx

* add test data

* update

* remove comments

* format

* Revert "format"

This reverts commit f8ae88cb564bce5caf4780e56561403f3ba3d524.

* Revert "remove comments"

This reverts commit 59d8a693581a731fd0291b70fe2c9cec6c4950fe.

* add a squeeze node to convert a 3-d mask to 2-d

* update

* update

* verify and add comments
2020-12-01 11:19:16 -08:00
Changming Sun
3323fb6082
Update docker files to put 'unattended-upgrades' in a right place(#5983) 2020-12-01 10:45:03 -08:00
Wenbing Li
2ec211ea7b
Support the cross compiling for Apple Silicon (#5974)
* support macos_arm64 cross compiling

* update the build docs

* update as commented.

* Update BUILD.md
2020-12-01 10:00:06 -08:00
Alberto Magni
fb310fba0c
Avoid adding non-existent inputs to new Event nodes (#5915)
During graph resolve non-existent nodes cause shape-inference failures.
2020-12-01 08:21:05 -08:00
Changming Sun
2d9dcc4576
Add python 3.9 support (#5874)
1. Add python 3.9 support(except Linux ARM)
2. Add Windows GPU python 3.8 to our packaging pipeline.
2020-11-30 12:02:48 -08:00
Wenbing Li
1852ade75d
Enable the xcode build for Apple Silicon (arm64 MacOS) (#5924)
* fix the build script for macos/xcode

* add the version check

* correct the osx-arch configuration

* typo
2020-11-30 11:22:08 -08:00
Jesse Benson
45966d878a Code review feedback 2020-11-30 09:24:22 -08:00
Jesse Benson
86e30a2db6 Update CUDA IsAllFinite kernel 2020-11-30 09:24:22 -08:00
Jesse Benson
bd96f60888 Use CUDA's IsAllFinite kernel for ROCm 2020-11-30 09:24:22 -08:00
Moshe David
06ad516a5d
w (#5947)
Co-authored-by: modav <modav@microsoft.com>
2020-11-30 10:35:44 +10:00
Moshe David
5f5d4a10bd
[OpenVINO]Fix memory leak upon exception throwing (#5954)
1. Make sure to free the output_shape vector even if Output names mismatch between OpenVINO and ONNX exception is thrown
2. Piggy back this PR to remove un-needed call to fstream close method

Authored-by: modav <modav@microsoft.com>
2020-11-26 19:44:51 -08:00
Ivan Stojiljkovic
015fbb3dbb
Add support for Python 3.8+ on Windows when CUDA is enabled (#5956) 2020-11-26 15:52:30 -08:00
Moshe David
e207589631
[OpenVINO]Fix memory leak in IsDebugEnabled() under Windows (#5948)
* w

* w

Co-authored-by: modav <modav@microsoft.com>
2020-11-26 15:10:37 -08:00
Guoyu Wang
4afdced775
[NNAPI EP] Update squeeze ops (#5946)
* [NNAPI EP] Update squeeze ops
2020-11-26 21:00:54 +10:00