onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-04 04:07:22 +00:00

Author	SHA1	Message	Date
Wei-Sheng Chin	28f2e57de5	Use CUDA callback to release deferred-release buffers (#12883 ) * Use CUDA callback to release deferred-release buffers Polishment * Minor improvements. 1. Reorder an if-else so that frequent cases are checked first. 2. More documents. * Fix tests. Previously, in CUDAExecutionProvider::OnRunStart, we call GetPerThreadContext in auto& current_deferred_release_event = GetPerThreadContext().GetCurrentDeferredReleaseEvent(); so that a CUDAExecutionProvider always owns an active PerThreadContext and the ReleasePerThreadContext in CUDAExecutionProvider::OnRunEnd is always valid. However, this isn't true after we drop event-based deferred-release code, so we need to check if CUDAExecutionProvider really owns PerThreadContext, then call ReleasePerThreadContext if yes. * Follow up for AMD GPU and improve CUDA part's return value.	2022-09-08 14:23:48 -07:00
Thiago Crepaldi	55c745eefd	Add support for ORTModule Torch cpp CUDA extension build within docker (#12868 ) Currently, CUDA hardware is not available to the build during `docker build`. Because of that, CUDA-capable hardware would not have CUDA support. This PR adds an env var, ONNXRUNTIME_FORCE_CUDA, which allows CUDA extensions to be compiled even when CUDA support is not detected.	2022-09-08 15:30:44 -04:00
pallavides	6ebb7b91eb	Re-apply fix for mkl issue for eager mode (#12881 ) * reapply fix for mkl issue for eager mode * add comment, update link libs	2022-09-08 12:29:24 -07:00
Changming Sun	ff52d6a6bf	Delete Dockerfile.ubuntu (#12888 ) The file was solely for Nuphar.	2022-09-08 10:26:40 -07:00
Changming Sun	a811c7629f	Remove "Build Python Documentation" from py-packaging-stage.yml (#12890 ) Remove "Build Python Documentation" from py-packaging-stage.yml because the task has been moved to Github actions by @natke in PR #10116 .	2022-09-08 09:56:54 -07:00
sophies927	b1984278d9	Enable blank issues (#12885 )	2022-09-07 23:28:17 -07:00
guyang3532	4765e5c382	Using ORTModule to wrap an evaluation model should not change the mode (#12747 ) Using ORTModule to wrap an evaluation model should not change the mode of the model	2022-09-08 10:54:59 +08:00
RandySheriffH	d3b684cd9e	Drop nuphar (#11555 ) * drop nuphar code and configs * refactor test case * format python * remove nuphar from training test * remove commented nuphar logics * restore llvm setting * drop nuphar ci * fix compile err * fix compile err Co-authored-by: Randy Shuai <rashuai@microsoft.com>	2022-09-07 15:11:18 -07:00
Jian Chen	acc8bdc6c5	Splitting quantize_tensor and quantize_input (#12873 ) * Splitting quantize_tensor and quantize_input * Reformat code * Reformat code * Update is_input_a_weight to is_input_a_initializer	2022-09-07 18:05:42 -04:00
Sheil Kumar	535b0835f2	User/sheilk/dft fixes (#12862 ) * DirectML DFT Tests and Fixes * Dynamically allocate temporaries using the allocator... * Allocate during compute * wrong dims * CR feedback	2022-09-07 13:21:56 -07:00
sophies927	f63bd0765d	New GitHub templates (#12777 ) * Create 01-build.yml * Create 02-documentation.yml * Create 03-mobile.yml * Create 04-web.yml * Create 05-performance.yml * Create 06-training.yml * Create 07-feature_request.yml * Create 08-general.yml * Create config.yml * Delete bug-performance-issue.md * Delete feature_request.md * Create labeler.yml * Create labeler.yml * Update Performance template to make model info optional. * Update feature request description placeholder	2022-09-07 11:59:56 -07:00
Hariharan Seshadri	ad69aac491	Introduce ordered quantization ops for the CUDA EP [1/n] (#12582 ) Initial core small set for the ordered quantization ops for cuda EP.	2022-09-07 11:58:15 -07:00
petermcaughan	69f7cc6494	Add pybind support for all memory config options in OrtArenaCfg (#12658 ) * Add support for initial_growth_chunk_size_bytes setting in OrtArenaCfg pybind * Add overloaded constructor for KVP, UT still in progress * Fix class member access in pybind, fix unit test * Resolve linter warnings * Improve formatting * Simplify UT * Fix linter formatting Co-authored-by: Peter Mcaughan <petermca@microsoft.com>	2022-09-07 11:15:00 -07:00
Chen Fu	8004db4bf1	fix python import sequence warning (#12864 ) fix python import sequence warning	2022-09-07 09:53:39 -07:00
Xavier Dupré	400195a10a	raise an exception when TreeEnsemble requests a feature out of boundaries (#12859 ) * Catch a potential error when the number of features is lower than the features referenced in TreeEnsemble * add unit test * remove extra spaces	2022-09-07 10:05:32 +02:00
Guenther Schmuelling	f856be162e	fix xnnpack wasm build (#12845 )	2022-09-06 19:20:07 -07:00
Jan Tilly	437409c343	Add DONT_VECTORIZE flag to cmake (#12169 ) Add DONT_VECTORIZE flag.	2022-09-07 12:14:14 +10:00
Scott McKay	706e03c63d	Add azp run helper (#12832 ) * Add helper to add azp run comments to a PR.	2022-09-07 11:48:31 +10:00
Yi Zhang	c571b99336	Refactor setup_test_data (#12818 ) * refactor setup_test_data * mv setup test data to test stage * model link for C# test * add comment	2022-09-07 08:33:27 +08:00
Yulong Wang	726251609a	increase max memory to 4G for wasm (#12798 )	2022-09-06 17:07:13 -07:00
Tianlei Wu	d19955fd89	fix transformers script issues (#12802 ) Fix a few obvious issues: (1) bert_perf_test.py create session without provider in line 65. (2) compare_bert_results.py miss a parameter in create_session in line 37 (3) onnx_exporter.py returns value mismatch in lines 667, 690. (4) remove some imports not used in the scripts. (5) fusion_utils need not print "Removed 0 cast nodes" or "Removed 0 Identity nodes"... (6) update requirements for numpy version since gpt2 parity tool use equal_nan in numpy v1.19+	2022-09-06 16:15:16 -07:00
Xavier Dupré	54360c88d2	Disable two warnings raised by tensorboard on Visual Studio (#12773 )	2022-09-06 20:42:52 +02:00
Chen Fu	9ad5b95e4f	Fix math domain error with log10 (#12841 ) fix math domain error with log10	2022-09-06 08:54:41 -07:00
Cheng	8cedafe250	[xnnpack] Have `Initializer` in Mobile related EPs in Minimal_build and creating EP specific dynamic-schema (#12555 ) * Remove the dependence of Qlinearsoftmax schema * refactor initializerview && create shared schema * Dynamic Create EP specific schema * Have Initializer in minimal_build * address comments * remove CancelFuseSubGraph	2022-09-06 14:32:15 +08:00
Scott McKay	ac4f1bf960	Update max opset for NNAPI and CoreML. (#12831 ) Update max opset for NNAPI and CoreML. Changes in opsets 16 and 17 don't require any updates.	2022-09-05 09:37:14 +10:00
Baiju Meswani	9e47eb68e0	Remove unused orttraining amd dockerfiles and scripts (#12707 )	2022-09-02 18:43:21 -07:00
Cheng	76d17b0f48	Add java API for xnnpack (#12788 ) * Add java API for xnnpack * provider option support * a more general interface for creating EP	2022-09-03 08:29:40 +08:00
Baiju Meswani	295bd26980	Remove orttraining-distributed CI pipeline (#12738 )	2022-09-02 14:34:26 -07:00
ashbhandare	27dde0b51f	Csharp bindings for on-device training APIs (#12404 )	2022-09-02 13:13:48 -07:00
Jian Chen	2fe919c3ad	Adding Split Fusion (#12732 ) * Adding Split Fusion * Make changes to comments * Format files and change typo * Format files and change typo * Format files and change typo * Format files and change typo * Format file * Format files * Format files * Format files * Format files	2022-09-02 14:17:10 -04:00
Baiju Meswani	56bae3b196	Use InplaceClipGradNorm for offline processing for on-device training (#12603 )	2022-09-02 07:47:17 -07:00
Cassie Breviu	98b2b7f5bb	Update csharp documentation (#12830 )	2022-09-01 22:14:03 -07:00
sophies927	548938fb97	Update stale.yml (#12813 ) * Update stale.yml Change the number of days of inactivity before an issue becomes stale from 60 to 5 and the number of days of inactivity before a stale issue is closed from 7 to 5. Update the exempt labels based on the redefined set of GH labels. * Implement stale.yml feedback.	2022-09-01 20:50:46 -07:00
Changming Sun	ca5af24765	Update Sdl.ruleset to remove C26812 from the rules (#12695 )	2022-09-01 20:05:20 -07:00
Hariharan Seshadri	931c8b0147	Resolve GH issue 12706 (#12815 )	2022-09-01 18:30:57 -07:00
Justin Chu	6fe712b587	Create codeql.yml to replace LGTM (#12790 ) Description: Create codeql.yml to replace LGTM Motivation and Context LGTM.com is shutting down and moving to github code scanning. This PR enables github code scanning. cpp and c# support will be added in a separate pr.	2022-09-01 16:37:43 -07:00
ashbhandare	349469c381	Enable way to extract all parameters to and from a contiguous buffer. (#12674 ) * implementation * review comments * review comment * lint error	2022-09-01 15:23:30 -07:00
Hariharan Seshadri	52ce6a90b4	Props file cleanup (#12782 )	2022-09-01 11:05:46 -07:00
George Nash	0125e15281	Fix include order build failure training build (#12425 ) Signed-off-by: George Nash <george.nash@intel.com>	2022-09-01 10:48:40 -07:00
Sheil Kumar	e3b501125d	DFT on DirectML (#12710 ) * DFT on DirectML * feedback * fix misc build issues * fixes * fix constant cpu inputs and optional tensors for external operators * disable dft tests on 'pure' dml	2022-09-01 08:31:14 -07:00
Cheng	5dd9afe75a	python lint (#12825 )	2022-09-01 22:38:25 +08:00
PeixuanZuo	adbc0757ad	[UPDATE] update ROCm ci pipeline to ROCm5.2.3 (#12799 ) * [Update] update to rocm5.2.3 * [Fix] cmake version * [Fix] disable ortmodule tests * [revert] revert performance number	2022-09-01 10:32:24 +08:00
Vincent Wang	262a597e2a	[CUDA] BiasSoftmax and Dropout Fusion (#12667 ) * bias softmax dropout fusion * fix rocm build * move some files	2022-09-01 10:01:44 +08:00
Tianlei Wu	7e1e0f5c9e	fix build error for generation debug code (#12801 ) fix dumping code	2022-08-31 13:53:01 -07:00
Justin Chu	a48b115540	Remove reference to the deprecated variable in `torch.onnx.symbolic_helper` (#12452 ) Description: Remove reference to the deprecated variable in `torch.onnx.symbolic_helper` pytorch/pytorch#81953 - Removed unused imports - Changed BANNED_AUTOGRAD_FUNCTION_NAMES to a frozenset Motivation and Context The cast_pytorch_to_onnx variable is deprecated and removed in `torch.onnx.symbolic_helper`. Since there is still a need for converting scalar types to onnx type, I copied the mapping to `_CAST_PYTORCH_TO_ONNX` in the module.	2022-08-31 11:55:56 -07:00
Dmitri Smirnov	f59a44e24f	Build VS 2022 no Abseil adjustment (#12780 ) Build VS 2022 no Abseil	2022-08-31 11:47:43 -07:00
Nat Kershaw (MSFT)	cc94ba7740	Update C# API docs workflow (#12796 )	2022-08-31 09:41:01 -07:00
Yi-Hong Lyu	6b148e15e1	Disable QGEMM, s8 A, s8 B (Packed) bench for AMD64 (#12765 ) Besides, rename QGEMM tests for clarification.	2022-08-30 16:47:36 -07:00
Baiju Meswani	a52543ecd8	Generate windows training package (#12789 )	2022-08-30 16:35:50 -07:00
Yulong Wang	82a28cc2c3	upgrade emsdk to 3.1.19 (#12690 ) * upgrade emsdk to 3.1.19 * fix build break * ignore '-Wunused-but-set-variable' in eigen * add malloc and free in exported functions * EXPORTED_FUNCTIONS	2022-08-30 13:42:45 -07:00


7326 commits