Ryan Hill
9ef92f352f
Test Java pipeline fix
2021-05-19 16:33:55 -07:00
Ryan Hill
6a889ee8bf
Remove cupti library dependency from core library
2021-05-19 13:00:37 -07:00
Ryan Hill
c569dee80a
Merge branch 'master' of https://github.com/microsoft/onnxruntime into ryanunderhill/cuda_shared
2021-05-18 19:00:28 -07:00
Ryan Hill
23cad06809
Code review feedback for Java build on non Windows
2021-05-18 18:28:55 -07:00
Yulong Wang
077e8c6b40
allow update_version.py to update new npm packages ( #7746 )
...
* update versions for npm packages
* remove package-lock.json in list
2021-05-18 16:15:19 -07:00
Rachel Guo
3f204d191b
[CoreML EP] Add Squeeze Op support ( #7730 )
...
* add squeeze op builder initial
* fix mistakes
* modify
* enable UT passed and minor refine
* minor formatting
* address comment
Co-authored-by: rachguo <rachguo@rachguos-Mac-mini.local>
2021-05-18 15:12:41 -07:00
Taewoo Kim
d1c531058a
Add elseif statement for arm64e
2021-05-18 14:58:58 -07:00
Taewoo Kim
1e6ad669cf
Support arm64e for osx
...
Add arm64e to choices variable
2021-05-18 14:58:58 -07:00
Thiago Crepaldi
e05b15175d
Add cpp ext lock file check during ORTModule init ( #7740 )
...
* Add cpp ext lock file check during ORTModule init
* Address comments
2021-05-18 12:57:05 -07:00
Xiaoyu Liu
224a664811
GPT-2 one step search tutorial ( #7718 )
...
* GPT2 with one step search tutorial
* remove quantization section
Co-authored-by: Xiaoyu Liu <xiaoyu@xiaoyu-VM.z4vh1dzj5eoevgybsksdpz2izh.jx.internal.cloudapp.net>
2021-05-18 12:31:39 -07:00
Ashwini Khade
7834ca983c
update optimizers for opset14 ( #7722 )
...
* update optimizers for opset14
* plus 1 more
* fix reshape fusion
2021-05-18 11:58:14 -07:00
Changming Sun
26a472c948
Increase test timeout from 1 hour to 2 hours ( #7735 )
...
I saw a test timeout in our nodejs packaging pipeline. I'm not sure if it is because it ran slower than before or it's a deadlock issue. Increasing the timeout will be helpful for investigating such issues.
2021-05-18 10:51:58 -07:00
baijumeswani
e161213f8e
Handle model with no parameters ( #7736 )
...
* Handle model with no parameters
* Set the minimum module_output_grads as 0 to handle parameterless models
2021-05-18 09:33:57 -07:00
Yulong Wang
96deec596f
fix npm test for webgl without wasm artifacts ( #7742 )
2021-05-18 09:19:19 -07:00
Xueyun Zhu
e92b3c1394
bumping up version number to 1.8 ( #7733 )
...
* bump to 1.8
* fix windows AI
2021-05-18 09:03:37 -07:00
Du Li
e4a985ff17
[JS/Web] WebGL Profiling Tool ( #7724 )
2021-05-18 06:31:00 -07:00
Hariharan Seshadri
43e2ee37f2
Some cosmetic changes ( #7741 )
2021-05-18 00:02:07 -07:00
stevenlix
a6972c8782
Fix issues in TensorRT provider options ( #7738 )
...
* add legacy env variable support in pybind
* formating code
2021-05-17 23:07:27 -07:00
Ryan Hill
70f52251d9
Merge with master
2021-05-17 22:33:34 -07:00
Ryan Hill
5da718a114
Formatting
2021-05-17 22:32:11 -07:00
Young Jin Kim
e9057d2e49
ZCode FastFormers changes ( #5827 )
...
* Add FBGEMM submodule
* Add fbgemm based per-channel quantization
* Add missing logic for pre-layernorm transformer model fusion
* add support for structured pruning architecture -fastformers
* Fix windows build
* Add a default behavior when head_size is not present for the backward compatibility
* Remove FBGEMM and default to tensor-wise quantization, column-wise quantization will be enabled later
* Fixed some unit test errors
* Fix windows compile error and unit test errors
* delete the option removed from the upstream
* Addresses review comments and fixes a merge error
* Remove commented out code
* add non-zero zp support
* support A and B scale with any dimensions
* fix build breaks
* fix warning in MSVC
* Fix bug for not checking original float value names when treat it as not existing.
* Clean up head size
* Clean up python tools
* Enable per column quantization
* fix quant weight cleanup bug
* A few code clean up
* Some code clean-up
* Some code clean-up
* Change option name
* update default value
* Rename option and parameter names
* Missing argument name change
* Add tests for quantization options for attention and matmul
Co-authored-by: Yufeng Li <liyufeng1987@gmail.com>
Co-authored-by: Lei Zhang <zhang.huanning@hotmail.com>
2021-05-17 21:12:21 -07:00
Changming Sun
38d90b0f15
Cleanup install_deps.sh ( #7734 )
2021-05-17 19:27:47 -07:00
Yufeng Li
9075488368
Add per-column support for QLinearMatMul ( #7729 )
...
* Add per-column support for QLinearMatMul
2021-05-17 18:57:42 -07:00
Du Li
d3c4b70ede
[Web/JS] Fixing two bugs in reshape_pack and im2col_pack ( #7689 )
...
* fixing two bugs in reshape_pack and im2col_pack
* minor fix
* fix lint complaints
2021-05-17 18:28:09 -07:00
Xueyun Zhu
79854dda8f
bumping up verson ( #7731 )
2021-05-17 17:55:42 -07:00
Jesse Benson
f977644324
ROCM support int reductions
2021-05-17 16:42:06 -07:00
Sunghoon
da5f24bd2d
Support additional session options and run options in WebAssembly ( #7712 )
...
* add all session options and run options in C API except AddInitializer and AddFreeDimensionOverride
* remove unnecessary comment
* change extra session and run options to object notation
* resolve comments
* use an optional chaining for options
* resolve comments
2021-05-17 14:57:19 -07:00
Tixxx
6d9f541442
[JS]moved logging level flag to global env ( #7700 )
...
* moved logging level flag to global env
* added setter and getter for loggingLevel in Env
* moved implementation of env to a separate file
2021-05-17 14:16:59 -07:00
Ye Wang
5e8086ad8e
Support fusions inside subgraphs in optimizer tool ( #7701 )
...
* skip subgraph when updating model
* intreim checkin
* interim checkin 2
* support transformers optimizations in subgraph
* change more files
* fix comments typo
2021-05-17 12:43:55 -07:00
Zhang Lei
ad95b19322
Help build pass on ubuntu 16.04 with old g++. ( #7706 )
2021-05-17 11:37:00 -07:00
Yufeng Li
0e48187b4e
Add type checks for QDQ transformer ( #7715 )
2021-05-17 10:48:20 -07:00
baijumeswani
c873f5589d
Fix bug where the output names were sorted lexicographically ( #7709 )
2021-05-17 10:27:20 -07:00
Thiago Crepaldi
6c41ed597b
Add custom autograd function to prevent input passthrough on ORTModule ( #7694 )
...
* Changes for investigation
* Gradient for Identity
* Keep Identity betwen YieldOp and GraphOutput
* Revert debugging changes
* Add custom autograd fn to prevent input passthrough on ORTModule
* Add comment
Co-authored-by: Sherlock Huang <bahuang@OrtTrainingDev3.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>
2021-05-17 09:56:02 -07:00
Thiago Crepaldi
4fe2ffae16
Fix ORTModule python doc generation ( #7704 )
...
* Fix ORTModule python doc generation
* Address comment
2021-05-17 09:55:49 -07:00
Hariharan Seshadri
ebee380911
Partially revert PR #7348 ( #7702 )
2021-05-16 23:56:13 -07:00
Scott McKay
2b73163690
Exclude training specific logic from BatchNormalization to reduce binary size. ( #7703 )
2021-05-17 16:42:33 +10:00
ashbhandare
3c1cf82ff1
Bump to rel-1.9.1 ( #7684 )
2021-05-16 23:24:16 -07:00
stevenlix
557b94637d
Add more TensorRT env variables to provider options ( #7698 )
...
* add all trt env variables to provider options
* add python test
* Update onnxruntime_c_api.h
* fix issues
* validate values for options
2021-05-16 22:09:52 -07:00
Ryan Hill
8d911833ef
Merge error
2021-05-16 20:50:27 -07:00
Ryan Hill
9ddbb046a2
Merge related issues and code review feedback.
2021-05-16 19:22:03 -07:00
Ryan Hill
67adb7bfe0
Merge with master
2021-05-16 18:41:57 -07:00
Tracy Sharpe
943ab9dcef
undo clang-format corruption ( #7720 )
2021-05-16 15:32:30 -07:00
liqunfu
d604281a86
Liqun/training pkg to run tests ( #7662 )
2021-05-16 09:10:57 -07:00
liqunfu
3ead2f2f39
update pt lightning version ( #7711 )
...
Co-authored-by: liqun <liqun@OrtTrainingDev4.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>
2021-05-15 21:46:16 -07:00
Yulong Wang
017b94ab36
Specify correct dependency for CI pipeline of nodejs binding ( #7717 )
2021-05-15 08:56:58 -07:00
Yufeng Li
6b0a7905ed
fix quant weight cleanup bug ( #7707 )
2021-05-14 22:04:35 -07:00
Sergii Dymchenko
9ba8da65d2
Fix BiasDropoutFusion when there are multiple Dropout consumers. ( #7708 )
...
* Add tests for multiple consimers bias dropout fusion.
* Don't fuse resudial Add for multiple Dropout consumers.
* Remove duplicate code.
* Fix comment typo.
Co-authored-by: Derek Murray <Derek.Murray@microsoft.com>
Co-authored-by: Derek Murray <Derek.Murray@microsoft.com>
2021-05-14 21:03:23 -07:00
Hariharan Seshadri
53d1d55ea8
Add ability for pre-packed weights of shared initializers to be shared across sessions ( #7421 )
2021-05-14 20:44:42 -07:00
Ashwini Khade
c78a40c0c7
add opset14 registrations for cuda provider ( #7699 )
...
* add opset14 registrations for cuda provider
* update macros
* fix typo
* updates to rocm provider
2021-05-14 19:08:45 -07:00
Yulong Wang
97d9bcd644
[js/web] fix bundle for multi-thread, add e2e test and support nodejs ( #7688 )
...
* fix bundle for multi-thread, add e2e test and support nodejs
* add copyright banner
* resolve comments
* add comments for isMultiThreadSupported()
2021-05-14 18:15:38 -07:00