baijumeswani
e161213f8e
Handle model with no parameters ( #7736 )
...
* Handle model with no parameters
* Set the minimum module_output_grads as 0 to handle parameterless models
2021-05-18 09:33:57 -07:00
Yulong Wang
96deec596f
fix npm test for webgl without wasm artifacts ( #7742 )
2021-05-18 09:19:19 -07:00
Xueyun Zhu
e92b3c1394
bumping up version number to 1.8 ( #7733 )
...
* bump to 1.8
* fix windows AI
2021-05-18 09:03:37 -07:00
Du Li
e4a985ff17
[JS/Web] WebGL Profiling Tool ( #7724 )
2021-05-18 06:31:00 -07:00
Hariharan Seshadri
43e2ee37f2
Some cosmetic changes ( #7741 )
2021-05-18 00:02:07 -07:00
stevenlix
a6972c8782
Fix issues in TensorRT provider options ( #7738 )
...
* add legacy env variable support in pybind
* formatting code
2021-05-17 23:07:27 -07:00
Young Jin Kim
e9057d2e49
ZCode FastFormers changes ( #5827 )
...
* Add FBGEMM submodule
* Add fbgemm based per-channel quantization
* Add missing logic for pre-layernorm transformer model fusion
* add support for structured pruning architecture -fastformers
* Fix windows build
* Add a default behavior when head_size is not present for the backward compatibility
* Remove FBGEMM and default to tensor-wise quantization, column-wise quantization will be enabled later
* Fixed some unit test errors
* Fix windows compile error and unit test errors
* delete the option removed from the upstream
* Addresses review comments and fixes a merge error
* Remove commented out code
* add non-zero zp support
* support A and B scale with any dimensions
* fix build breaks
* fix warning in MSVC
* Fix bug for not checking original float value names when treat it as not existing.
* Clean up head size
* Clean up python tools
* Enable per column quantization
* fix quant weight cleanup bug
* A few code clean up
* Some code clean-up
* Some code clean-up
* Change option name
* update default value
* Rename option and parameter names
* Missing argument name change
* Add tests for quantization options for attention and matmul
Co-authored-by: Yufeng Li <liyufeng1987@gmail.com>
Co-authored-by: Lei Zhang <zhang.huanning@hotmail.com>
2021-05-17 21:12:21 -07:00
Changming Sun
38d90b0f15
Cleanup install_deps.sh ( #7734 )
2021-05-17 19:27:47 -07:00
Yufeng Li
9075488368
Add per-column support for QLinearMatMul ( #7729 )
...
* Add per-column support for QLinearMatMul
2021-05-17 18:57:42 -07:00
Du Li
d3c4b70ede
[Web/JS] Fixing two bugs in reshape_pack and im2col_pack ( #7689 )
...
* fixing two bugs in reshape_pack and im2col_pack
* minor fix
* fix lint complaints
2021-05-17 18:28:09 -07:00
Xueyun Zhu
79854dda8f
bumping up version ( #7731 )
2021-05-17 17:55:42 -07:00
Jesse Benson
f977644324
ROCM support int reductions
2021-05-17 16:42:06 -07:00
Sunghoon
da5f24bd2d
Support additional session options and run options in WebAssembly ( #7712 )
...
* add all session options and run options in C API except AddInitializer and AddFreeDimensionOverride
* remove unnecessary comment
* change extra session and run options to object notation
* resolve comments
* use an optional chaining for options
* resolve comments
2021-05-17 14:57:19 -07:00
Tixxx
6d9f541442
[JS]moved logging level flag to global env ( #7700 )
...
* moved logging level flag to global env
* added setter and getter for loggingLevel in Env
* moved implementation of env to a separate file
2021-05-17 14:16:59 -07:00
Ye Wang
5e8086ad8e
Support fusions inside subgraphs in optimizer tool ( #7701 )
...
* skip subgraph when updating model
* interim checkin
* interim checkin 2
* support transformers optimizations in subgraph
* change more files
* fix comments typo
2021-05-17 12:43:55 -07:00
Zhang Lei
ad95b19322
Help build pass on ubuntu 16.04 with old g++. ( #7706 )
2021-05-17 11:37:00 -07:00
Yufeng Li
0e48187b4e
Add type checks for QDQ transformer ( #7715 )
2021-05-17 10:48:20 -07:00
baijumeswani
c873f5589d
Fix bug where the output names were sorted lexicographically ( #7709 )
2021-05-17 10:27:20 -07:00
Thiago Crepaldi
6c41ed597b
Add custom autograd function to prevent input passthrough on ORTModule ( #7694 )
...
* Changes for investigation
* Gradient for Identity
* Keep Identity between YieldOp and GraphOutput
* Revert debugging changes
* Add custom autograd fn to prevent input passthrough on ORTModule
* Add comment
Co-authored-by: Sherlock Huang <bahuang@OrtTrainingDev3.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>
2021-05-17 09:56:02 -07:00
Thiago Crepaldi
4fe2ffae16
Fix ORTModule python doc generation ( #7704 )
...
* Fix ORTModule python doc generation
* Address comment
2021-05-17 09:55:49 -07:00
Hariharan Seshadri
ebee380911
Partially revert PR #7348 ( #7702 )
2021-05-16 23:56:13 -07:00
Scott McKay
2b73163690
Exclude training specific logic from BatchNormalization to reduce binary size. ( #7703 )
2021-05-17 16:42:33 +10:00
stevenlix
557b94637d
Add more TensorRT env variables to provider options ( #7698 )
...
* add all trt env variables to provider options
* add python test
* Update onnxruntime_c_api.h
* fix issues
* validate values for options
2021-05-16 22:09:52 -07:00
Tracy Sharpe
943ab9dcef
undo clang-format corruption ( #7720 )
2021-05-16 15:32:30 -07:00
liqunfu
d604281a86
Liqun/training pkg to run tests ( #7662 )
2021-05-16 09:10:57 -07:00
liqunfu
3ead2f2f39
update pt lightning version ( #7711 )
...
Co-authored-by: liqun <liqun@OrtTrainingDev4.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>
2021-05-15 21:46:16 -07:00
Yulong Wang
017b94ab36
Specify correct dependency for CI pipeline of nodejs binding ( #7717 )
2021-05-15 08:56:58 -07:00
Yufeng Li
6b0a7905ed
fix quant weight cleanup bug ( #7707 )
2021-05-14 22:04:35 -07:00
Sergii Dymchenko
9ba8da65d2
Fix BiasDropoutFusion when there are multiple Dropout consumers. ( #7708 )
...
* Add tests for multiple consumers bias dropout fusion.
* Don't fuse residual Add for multiple Dropout consumers.
* Remove duplicate code.
* Fix comment typo.
Co-authored-by: Derek Murray <Derek.Murray@microsoft.com>
2021-05-14 21:03:23 -07:00
Hariharan Seshadri
53d1d55ea8
Add ability for pre-packed weights of shared initializers to be shared across sessions ( #7421 )
2021-05-14 20:44:42 -07:00
Ashwini Khade
c78a40c0c7
add opset14 registrations for cuda provider ( #7699 )
...
* add opset14 registrations for cuda provider
* update macros
* fix typo
* updates to rocm provider
2021-05-14 19:08:45 -07:00
Yulong Wang
97d9bcd644
[js/web] fix bundle for multi-thread, add e2e test and support nodejs ( #7688 )
...
* fix bundle for multi-thread, add e2e test and support nodejs
* add copyright banner
* resolve comments
* add comments for isMultiThreadSupported()
2021-05-14 18:15:38 -07:00
Yufeng Li
a74e41e47d
Add non-zero zp support for quant matmul and attention ( #7570 )
...
* add non-zero zp support
* support A and B scale with any dimensions
2021-05-14 16:50:31 -07:00
Yulong Wang
c53b5be509
force multi steps to use the same commit in CI ( #7697 )
2021-05-14 15:13:38 -07:00
Zhang Lei
0f7721a019
Fix bug for not checking original float value names when treat it as not existing. ( #7695 )
2021-05-14 12:50:30 -07:00
Changming Sun
1d403ba03b
Fix a compile warning in EigenNonBlockingThreadPool.h ( #7638 )
2021-05-14 11:38:34 -07:00
Zhang Lei
033f0b3b7c
fix typo. ( #7690 )
2021-05-14 10:25:34 -07:00
Sunghoon
a9b47ca8e7
update debug.keystore for a public usage ( #7696 )
2021-05-14 09:54:54 -07:00
liqunfu
359fe1d197
Liqun/ort training version ( #7620 )
2021-05-14 09:54:19 -07:00
ashbhandare
bfbcc89db1
Add MLFloat16 support for SoftmaxCrossEntropyLoss for CUDA EP ( #7679 )
...
* Forward op changes
* Add tests, improve kernel
* add opset 13 registration, remove unnecessary changes
* Add fp16 grad for SCELoss, review comments
2021-05-14 09:00:27 -07:00
Changming Sun
39fac6d304
Fix a buffer reuse bug in allocation_planner.cc ( #7645 )
2021-05-14 07:31:35 -07:00
Ashwini Khade
442c7300eb
add opset14 rnn ops ( #7687 )
...
* add opset14 rnn ops
* update kernel hashes
2021-05-14 05:52:54 -07:00
Yufeng Li
3725d0211f
support maxpool QDQ fusion from opset 12 ( #7693 )
2021-05-13 23:58:15 -07:00
Zhang Lei
50c5edcf13
Add nhwc support for QLinearAveragePool operator ( #7656 )
...
* Add nhwc support for QLinearAveragePool operator
* Update ContribOperators.md
* Update OperatorKernels.md with cpu,dnnl and cuda enabled.
2021-05-13 22:05:30 -07:00
baijumeswani
37f69fcee5
Regain performance by caching initializer names in ORTModule ( #7685 )
2021-05-13 20:54:49 -07:00
Edward Chen
19704aedbb
Update Objective-C API ( #7675 )
...
- Add session/run configuration
- Add additional supported tensor data types
- Clean up
2021-05-13 18:47:22 -07:00
ashbhandare
56e993a434
Bump to rel-1.9.1 ( #7684 )
2021-05-13 18:41:28 -07:00
Xueyun Zhu
32d8278c2f
reshape fix ( #7678 )
2021-05-13 16:34:03 -07:00
raviskolli
4b37901f10
Aten support for rocm ( #7680 )
...
* Aten support for rocm
* Removed aten_ops.cc as it is reused from cuda version
2021-05-13 15:56:03 -07:00
Aswin John Mathews
4afdc19958
ROCm optimized layernorm for MI100 ( #7682 )
...
* layernorm optimizations
* Changed HIP flag from HIP_VERSION to __HIP_PLATFORM_HCC__
2021-05-13 15:54:06 -07:00