RandySheriffH
d3b684cd9e
Drop nuphar ( #11555 )
...
* drop nuphar code and configs
* refactor test case
* format python
* remove nuphar from training test
* remove commented nuphar logics
* restore llvm setting
* drop nuphar ci
* fix compile err
* fix compile err
Co-authored-by: Randy Shuai <rashuai@microsoft.com>
2022-09-07 15:11:18 -07:00
Jian Chen
acc8bdc6c5
Splitting quantize_tensor and quantize_input ( #12873 )
...
* Splitting quantize_tensor and quantize_input
* Reformat code
* Reformat code
* Update is_input_a_weight to is_input_a_initializer
2022-09-07 18:05:42 -04:00
Sheil Kumar
535b0835f2
User/sheilk/dft fixes ( #12862 )
...
* DirectML DFT Tests and Fixes
* Dynamicaly allocate temporaries using the allocator...
* Allocate during compute
* wrong dims
* CR feedback
2022-09-07 13:21:56 -07:00
sophies927
f63bd0765d
New GitHub templates ( #12777 )
...
* Create 01-build.yml
* Create 02-documentation.yml
* Create 03-mobile.yml
* Create 04-web.yml
* Create 05-performance.yml
* Create 06-training.yml
* Create 07-feature_request.yml
* Create 08-general.yml
* Create config.yml
* Delete bug-performance-issue.md
* Delete feature_request.md
* Create labeler.yml
* Create labeler.yml
* Update Performance template to make model info optional.
* Update feature request description placeholder
2022-09-07 11:59:56 -07:00
Hariharan Seshadri
ad69aac491
Introduce ordered quantization ops for the CUDA EP [1/n] ( #12582 )
...
Initial core small set for the ordered quantization ops for cuda EP.
2022-09-07 11:58:15 -07:00
petermcaughan
69f7cc6494
Add pybind support for all memory config options in OrtArenaCfg ( #12658 )
...
* Add support for initial_growth_chunk_size_bytes setting in OrtArenaCfg pybind
* Add overloaded constructor for KVP, UT still in progress
* Fix class member access in pybind, fix unit test
* Resolve linter warnings
* Improve formatting
* Simplify UT
* Fix linter formatting
Co-authored-by: Peter Mcaughan <petermca@microsoft.com>
2022-09-07 11:15:00 -07:00
Chen Fu
8004db4bf1
fix python import sequence warning ( #12864 )
...
fix python import sequence warning
2022-09-07 09:53:39 -07:00
Xavier Dupré
400195a10a
raise an exception when TreeEnsemble request a feature out of boundaries ( #12859 )
...
* Catch a potential error when the number of featues is low than the features referenced in TreeEnsemble
* add unit test
* remove extra spaces
2022-09-07 10:05:32 +02:00
Guenther Schmuelling
f856be162e
fix xnnpack wasm build ( #12845 )
2022-09-06 19:20:07 -07:00
Jan Tilly
437409c343
Add DONT_VECTORIZE flag to cmake ( #12169 )
...
Add DONT_VECTORIZE flag.
2022-09-07 12:14:14 +10:00
Scott McKay
706e03c63d
Add azp run helper ( #12832 )
...
* Add helper to add azp run comments to a PR.
2022-09-07 11:48:31 +10:00
Yi Zhang
c571b99336
Refactor setup_test_data ( #12818 )
...
* refactory setup_test_data
* mv setup test data to test stage
* model link for C# test
* add comment
2022-09-07 08:33:27 +08:00
Yulong Wang
726251609a
increase max memory to 4G for wasm ( #12798 )
2022-09-06 17:07:13 -07:00
Tianlei Wu
d19955fd89
fix transformers script issues ( #12802 )
...
Fix a few obvious issues:
(1) bert_perf_test.py create session without provider in line 65.
(2) compare_bert_results.py miss a parameter in create_session in line 37
(3) onnx_exporter.py returns value mismatch in lines 667, 690.
(4) remove some imports not used in the scripts.
(5) fusion_utils need not print "Removed 0 cast nodes" or "Removed 0 Identity nodes"...
(6) update requirements for numpy version since gpt2 parity tool use equal_nan in numpy v1.19+
2022-09-06 16:15:16 -07:00
Xavier Dupré
54360c88d2
Disable two warnings raised by tensorboard on Visual Studio ( #12773 )
2022-09-06 20:42:52 +02:00
Chen Fu
9ad5b95e4f
Fix math domain error with log10 ( #12841 )
...
fix math domain error with log10
2022-09-06 08:54:41 -07:00
Cheng
8cedafe250
[xnnpack] Have Initializer in Mobile related EPs in Minimal_build and creating EP specific dynamic-schema ( #12555 )
...
* Remove the dependence of Qlinearsoftmax schema
* refactor initializerview && create shared schema
* Dynamic Create EP specific schema
* Have Initializer in minimal_build
* address comments
* remove CancelFuseSubGraph
2022-09-06 14:32:15 +08:00
Scott McKay
ac4f1bf960
Update max opset for NNAPI and CoreML. ( #12831 )
...
Update max opset for NNAPI and CoreML. Changes in opsets 16 and 17 don't require any updates.
2022-09-05 09:37:14 +10:00
Baiju Meswani
9e47eb68e0
Remove unused orttraining amd dockerfiles and scripts ( #12707 )
2022-09-02 18:43:21 -07:00
Cheng
76d17b0f48
Add java API for xnnpack ( #12788 )
...
* Add java API for xnnpack
* provider option support
* a more general interface for creating EP
2022-09-03 08:29:40 +08:00
Baiju Meswani
295bd26980
Remove orttraining-distributed CI pipeline ( #12738 )
2022-09-02 14:34:26 -07:00
ashbhandare
27dde0b51f
Csharp bindings for on-device training APIs ( #12404 )
2022-09-02 13:13:48 -07:00
Jian Chen
2fe919c3ad
Adding Split Fusion ( #12732 )
...
* Adding Split Fusion
* Make changes to comments
* Format files and change typo
* Format files and change typo
* Format files and change typo
* Format files and change typo
* Format file
* Format files
* Format files
* Format files
* Format files
2022-09-02 14:17:10 -04:00
Baiju Meswani
56bae3b196
Use InplaceClipGradNorm for offline processing for on-device training ( #12603 )
2022-09-02 07:47:17 -07:00
Cassie Breviu
98b2b7f5bb
Update csharp documentation ( #12830 )
2022-09-01 22:14:03 -07:00
sophies927
548938fb97
Update stale.yml ( #12813 )
...
* Update stale.yml
Change the number of days of inactivity before an issue becomes stale from 60 to 5 and the number of days of inactivity before a stale issue is closed from 7 to 5. Update the exempt labels based on the redefined set of GH labels.
* Implement stale.yml feedback.
2022-09-01 20:50:46 -07:00
Changming Sun
ca5af24765
Update Sdl.ruleset to remove C26812 from the rules ( #12695 )
2022-09-01 20:05:20 -07:00
Hariharan Seshadri
931c8b0147
Resolve GH issue 12706 ( #12815 )
2022-09-01 18:30:57 -07:00
Justin Chu
6fe712b587
Create codeql.yml to replace LGTM ( #12790 )
...
**Description**: Create codeql.yml to replace LGTM
**Motivation and Context**
LGTM.com is shutting down and moving to github code scanning. This PR enables github code scanning.
cpp and c# support will be added in a separate pr.
2022-09-01 16:37:43 -07:00
ashbhandare
349469c381
Enable way to extract all parameters to and from a contiguous buffer. ( #12674 )
...
* implementation
* review comments
* review comment
* lint error
2022-09-01 15:23:30 -07:00
Hariharan Seshadri
52ce6a90b4
Props file cleanup ( #12782 )
2022-09-01 11:05:46 -07:00
George Nash
0125e15281
Fix include order build failure training build ( #12425 )
...
Signed-off-by: George Nash <george.nash@intel.com>
2022-09-01 10:48:40 -07:00
Sheil Kumar
e3b501125d
DFT on DirectML ( #12710 )
...
* DFT on DirectML
* feedback
* fix misc build issues
* fixes
* fix constant cpu inputs and optional tensors for external operators
* disable dft tests on 'pure' dml
2022-09-01 08:31:14 -07:00
Cheng
5dd9afe75a
python lint ( #12825 )
2022-09-01 22:38:25 +08:00
PeixuanZuo
adbc0757ad
[UPDATE] update ROCm ci pipeline to ROCm5.2.3 ( #12799 )
...
* [Update] update to rocm5.2.3
* [Fix] cmake version
* [Fix] disbale ortmodule tests
* [revert] revert performance number
2022-09-01 10:32:24 +08:00
Vincent Wang
262a597e2a
[CUDA] BiasSoftmax and Dropout Fusion ( #12667 )
...
* bias softmax dropout fusion
* fix rocm build
* move some files
2022-09-01 10:01:44 +08:00
Tianlei Wu
7e1e0f5c9e
fix build error for generation debug code ( #12801 )
...
fix dumping code
2022-08-31 13:53:01 -07:00
Justin Chu
a48b115540
Remove reference to the deprecated variable in torch.onnx.symbolic_helper ( #12452 )
...
**Description**: Remove reference to the deprecated variable in `torch.onnx.symbolic_helper` pytorch/pytorch#81953
- Removed unused imports
- Changed BANNED_AUTOGRAD_FUNCTION_NAMES to a frozenset
**Motivation and Context**
The cast_pytorch_to_onnx variable is deprecated and removed in `torch.onnx.symbolic_helper`. Since there is still a need for converting scalar types to onnx type, I copied the mapping to `_CAST_PYTORCH_TO_ONNX` in the module.
2022-08-31 11:55:56 -07:00
Dmitri Smirnov
f59a44e24f
Build VS 2022 no Abseil adjustment ( #12780 )
...
Build VS 2022 no Abseil
2022-08-31 11:47:43 -07:00
Nat Kershaw (MSFT)
cc94ba7740
Update C# API docs workflow ( #12796 )
2022-08-31 09:41:01 -07:00
Yi-Hong Lyu
6b148e15e1
Disable QGEMM, s8 A, s8 B (Packed) bench for AMD64 ( #12765 )
...
Besides, rename QGEMM tests for clarification.
2022-08-30 16:47:36 -07:00
Baiju Meswani
a52543ecd8
Generate windows training package ( #12789 )
2022-08-30 16:35:50 -07:00
Yulong Wang
82a28cc2c3
upgrade emsdk to 3.1.19 ( #12690 )
...
* upgrade emsdk to 3.1.19
* fix build break
* ignore '-Wunused-but-set-variable' in eigen
* add malloc and free in exported functions
* EXPORTED_FUNCTIONS
2022-08-30 13:42:45 -07:00
Yulong Wang
1a402a3f25
replace 'master' branch ref to 'main' for onnx repo ( #12678 )
2022-08-30 13:41:42 -07:00
Ye Wang
9aefcc251f
fix some prefast warnings ( #12730 )
...
fix warnings
2022-08-30 12:52:59 -07:00
cloudhan
9680ffd842
Fix rocm build caused by #12699 ( #12787 )
2022-08-30 20:26:16 +08:00
Yi Zhang
b4f6dad7c9
increase timeout limit of mac silicon package workflow ( #12784 )
...
increase timeout
2022-08-30 13:57:01 +08:00
cloudhan
9907b59a1e
Change cuda and rocm error checking helpers to return Status ( #12699 )
...
* CudaCall returns Status in non-throw and void in throw
* RocmCall returns Status in non-throw and void in throw
2022-08-30 13:18:47 +08:00
pengwa
a0c25e5c2f
Fix segment fault for alltoall ( #12701 )
...
* fix segment fault
* formatting
2022-08-30 11:27:14 +08:00
PeixuanZuo
19ca2a0089
[ADD] python package pipeline for ROCm5.2.3 ( #12770 )
...
* [TEST] test rocm5.2.3
[TEST] rm torchversion
[Update]sort
Co-authored-by: Ubuntu <peixuanzuo@peixuanzuomi200vm.zvflicr54joexhdgnhvmxrxygg.phxx.internal.cloudapp.net>
2022-08-30 11:05:59 +08:00