Commit graph

5069 commits

Author SHA1 Message Date
Ryan Hill
9ef92f352f Test Java pipeline fix 2021-05-19 16:33:55 -07:00
Ryan Hill
6a889ee8bf Remove cupti library dependency from core library 2021-05-19 13:00:37 -07:00
Ryan Hill
c569dee80a Merge branch 'master' of https://github.com/microsoft/onnxruntime into ryanunderhill/cuda_shared 2021-05-18 19:00:28 -07:00
Ryan Hill
23cad06809 Code review feedback for Java build on non Windows 2021-05-18 18:28:55 -07:00
Yulong Wang
077e8c6b40
allow update_version.py to update new npm packages (#7746)
* update versions for npm packages

* remove package-lock.json in list
2021-05-18 16:15:19 -07:00
Rachel Guo
3f204d191b
[CoreML EP] Add Squeeze Op support (#7730)
* add squeeze op builder initial

* fix mistakes

* modify

* enable UT passed and minor refine

* minor formatting

* address comment

Co-authored-by: rachguo <rachguo@rachguos-Mac-mini.local>
2021-05-18 15:12:41 -07:00
Taewoo Kim
d1c531058a Add elseif statement for arm64e 2021-05-18 14:58:58 -07:00
Taewoo Kim
1e6ad669cf Support arm64e for osx
Add arm64e to choices variable
2021-05-18 14:58:58 -07:00
Thiago Crepaldi
e05b15175d
Add cpp ext lock file check during ORTModule init (#7740)
* Add cpp ext lock file check during ORTModule init

* Address comments
2021-05-18 12:57:05 -07:00
Xiaoyu Liu
224a664811
GPT-2 one step search tutorial (#7718)
* GPT2 with one step search tutorial
* remove quantization section

Co-authored-by: Xiaoyu Liu <xiaoyu@xiaoyu-VM.z4vh1dzj5eoevgybsksdpz2izh.jx.internal.cloudapp.net>
2021-05-18 12:31:39 -07:00
Ashwini Khade
7834ca983c
update optimizers for opset14 (#7722)
* update optimizers for opset14

* plus 1 more

* fix reshape fusion
2021-05-18 11:58:14 -07:00
Changming Sun
26a472c948
Increase test timeout from 1 hour to 2 hours (#7735)
I saw a test timeout in our nodejs packaging pipeline. I'm not sure if it is because it ran slower than before or it's a deadlock issue. Increasing the timeout will be helpful for investigating such issues.
2021-05-18 10:51:58 -07:00
baijumeswani
e161213f8e
Handle model with no parameters (#7736)
* Handle model with no parameters

* Set the minimum module_output_grads as 0 to handle parameterless models
2021-05-18 09:33:57 -07:00
Yulong Wang
96deec596f
fix npm test for webgl without wasm artifacts (#7742) 2021-05-18 09:19:19 -07:00
Xueyun Zhu
e92b3c1394
bumping up version number to 1.8 (#7733)
* bump to 1.8

* fix windows AI
2021-05-18 09:03:37 -07:00
Du Li
e4a985ff17
[JS/Web] WebGL Profiling Tool (#7724) 2021-05-18 06:31:00 -07:00
Hariharan Seshadri
43e2ee37f2
Some cosmetic changes (#7741) 2021-05-18 00:02:07 -07:00
stevenlix
a6972c8782
Fix issues in TensorRT provider options (#7738)
* add legacy env variable support in pybind

* formating code
2021-05-17 23:07:27 -07:00
Ryan Hill
70f52251d9 Merge with master 2021-05-17 22:33:34 -07:00
Ryan Hill
5da718a114 Formatting 2021-05-17 22:32:11 -07:00
Young Jin Kim
e9057d2e49
ZCode FastFormers changes (#5827)
* Add FBGEMM submodule

* Add fbgemm based per-channel quantization

* Add missing logic for pre-layernorm transformer model fusion

* add support for structured pruning architecture -fastformers

* Fix windows build

* Add a default behavior when head_size is not present for the backward compatibility

* Remove FBGEMM and default to tensor-wise quantization, column-wise quantization will be enabled later

* Fixed some unit test errors

* Fix windows compile error and unit test errors

* delete the option removed from the upstream

* Addresses review comments and fixes a merge error

* Remove commented out code

* add non-zero zp support

* support A and B scale with any dimensions

* fix build breaks

* fix warning in MSVC

* Fix bug for not checking original float value names when treat it as not existing.

* Clean up head size

* Clean up python tools

* Enable per column quantization

* fix quant weight cleanup bug

* A few code clean up

* Some code clean-up

* Some code clean-up

* Change option name

* update default value

* Rename option and parameter names

* Missing argument name change

* Add tests for quantization options for attention and matmul

Co-authored-by: Yufeng Li <liyufeng1987@gmail.com>
Co-authored-by: Lei Zhang <zhang.huanning@hotmail.com>
2021-05-17 21:12:21 -07:00
Changming Sun
38d90b0f15
Cleanup install_deps.sh (#7734) 2021-05-17 19:27:47 -07:00
Yufeng Li
9075488368
Add per-column support for QLinearMatMul (#7729)
* Add per-column support for QLinearMatMul
2021-05-17 18:57:42 -07:00
Du Li
d3c4b70ede
[Web/JS] Fixing two bugs in reshape_pack and im2col_pack (#7689)
* fixing two bugs in reshape_pack and im2col_pack

* minor fix

* fix lint complaints
2021-05-17 18:28:09 -07:00
Xueyun Zhu
79854dda8f
bumping up verson (#7731) 2021-05-17 17:55:42 -07:00
Jesse Benson
f977644324 ROCM support int reductions 2021-05-17 16:42:06 -07:00
Sunghoon
da5f24bd2d
Support additional session options and run options in WebAssembly (#7712)
* add all session options and run options in C API except AddInitializer and AddFreeDimensionOverride

* remove unnecessary comment

* change extra session and run options to object notation

* resolve comments

* use an optional chaining for options

* resolve comments
2021-05-17 14:57:19 -07:00
Tixxx
6d9f541442
[JS]moved logging level flag to global env (#7700)
* moved logging level flag to global env

* added setter and getter for loggingLevel in Env

* moved implementation of env to a separate file
2021-05-17 14:16:59 -07:00
Ye Wang
5e8086ad8e
Support fusions inside subgraphs in optimizer tool (#7701)
* skip subgraph when updating model

* intreim checkin

* interim checkin 2

* support transformers optimizations in subgraph

* change more files

* fix comments typo
2021-05-17 12:43:55 -07:00
Zhang Lei
ad95b19322
Help build pass on ubuntu 16.04 with old g++. (#7706) 2021-05-17 11:37:00 -07:00
Yufeng Li
0e48187b4e
Add type checks for QDQ transformer (#7715) 2021-05-17 10:48:20 -07:00
baijumeswani
c873f5589d
Fix bug where the output names were sorted lexicographically (#7709) 2021-05-17 10:27:20 -07:00
Thiago Crepaldi
6c41ed597b
Add custom autograd function to prevent input passthrough on ORTModule (#7694)
* Changes for investigation

* Gradient for Identity

* Keep Identity betwen YieldOp and GraphOutput

* Revert debugging changes

* Add custom autograd fn to prevent input passthrough on ORTModule

* Add comment

Co-authored-by: Sherlock Huang <bahuang@OrtTrainingDev3.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>
2021-05-17 09:56:02 -07:00
Thiago Crepaldi
4fe2ffae16
Fix ORTModule python doc generation (#7704)
* Fix ORTModule python doc generation

* Address comment
2021-05-17 09:55:49 -07:00
Hariharan Seshadri
ebee380911
Partially revert PR #7348 (#7702) 2021-05-16 23:56:13 -07:00
Scott McKay
2b73163690
Exclude training specific logic from BatchNormalization to reduce binary size. (#7703) 2021-05-17 16:42:33 +10:00
ashbhandare
3c1cf82ff1 Bump to rel-1.9.1 (#7684) 2021-05-16 23:24:16 -07:00
stevenlix
557b94637d
Add more TensorRT env variables to provider options (#7698)
* add all trt env variables to provider options

* add python test

* Update onnxruntime_c_api.h

* fix issues

* validate values for options
2021-05-16 22:09:52 -07:00
Ryan Hill
8d911833ef Merge error 2021-05-16 20:50:27 -07:00
Ryan Hill
9ddbb046a2 Merge related issues and code review feedback. 2021-05-16 19:22:03 -07:00
Ryan Hill
67adb7bfe0 Merge with master 2021-05-16 18:41:57 -07:00
Tracy Sharpe
943ab9dcef
undo clang-format corruption (#7720) 2021-05-16 15:32:30 -07:00
liqunfu
d604281a86
Liqun/training pkg to run tests (#7662) 2021-05-16 09:10:57 -07:00
liqunfu
3ead2f2f39
update pt lightning version (#7711)
Co-authored-by: liqun <liqun@OrtTrainingDev4.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>
2021-05-15 21:46:16 -07:00
Yulong Wang
017b94ab36
Specify correct dependency for CI pipeline of nodejs binding (#7717) 2021-05-15 08:56:58 -07:00
Yufeng Li
6b0a7905ed
fix quant weight cleanup bug (#7707) 2021-05-14 22:04:35 -07:00
Sergii Dymchenko
9ba8da65d2
Fix BiasDropoutFusion when there are multiple Dropout consumers. (#7708)
* Add tests for multiple consimers bias dropout fusion.

* Don't fuse resudial Add for multiple Dropout consumers.

* Remove duplicate code.

* Fix comment typo.

Co-authored-by: Derek Murray <Derek.Murray@microsoft.com>

Co-authored-by: Derek Murray <Derek.Murray@microsoft.com>
2021-05-14 21:03:23 -07:00
Hariharan Seshadri
53d1d55ea8
Add ability for pre-packed weights of shared initializers to be shared across sessions (#7421) 2021-05-14 20:44:42 -07:00
Ashwini Khade
c78a40c0c7
add opset14 registrations for cuda provider (#7699)
* add opset14 registrations for cuda provider

* update macros

* fix typo

* updates to rocm provider
2021-05-14 19:08:45 -07:00
Yulong Wang
97d9bcd644
[js/web] fix bundle for multi-thread, add e2e test and support nodejs (#7688)
* fix bundle for multi-thread, add e2e test and support nodejs

* add copyright banner

* resolve comments

* add comments for isMultiThreadSupported()
2021-05-14 18:15:38 -07:00