onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-05-14 20:48:00 +00:00

Author	SHA1	Message	Date
Jake Mathern	c0b68e77af	Fix warnings (#21809 ) ### Description Minor changes to resolve some warnings in ORT ### Motivation and Context Binskim for WindowsAI (which consumes ORT) treats warnings as errors, and has hit these warnings. As a security requirement, warnings like "signed/unsigned mismatch" must be resolved.	2024-08-21 14:23:37 -07:00
mindest	5b9369e93c	Fix typos according to reviewdog report. (#21335 ) ### Description Fix typos based on reviewdog report but with some exceptions/corrections.	2024-07-22 13:37:32 -07:00
Patrice Vignola	4d98f06f93	[DML EP] Add GroupQueryAttention (#20327 )	2024-04-19 10:25:29 -07:00
Pranav Sharma	668c70ee11	Add support for specifying a custom logging function per session. (#17727 ) ### Description Add support for specifying a custom logging function per session. Bindings for other languages will be added after this PR is merged. ### Motivation and Context Users want a way to override the logging provided by the environment.	2023-09-29 19:46:55 -07:00
Justin Chu	2575b9aaa1	Improve comments in winml/ (#17163 ) Follow up of #17144. Manually fixed indentation in block comments and replaced all tabs with spaces.	2023-08-15 23:30:56 -04:00
Justin Chu	416dc2e84d	Fix clang-format comment indents on Windows for winml/ (#17144 ) On Windows, clang-format has a bug when AlignTrailingComments.Kind is set to `Leave` (https://clang.llvm.org/docs/ClangFormatStyleOptions.html#aligntrailingcomments), where it will keep adding indentation to comments after each formatting runs. This PR changes to always align comments so we do not hit the bug. As a consequence of the options change we need to reformat some of the files. Note that this option is aligned with the rest of the repository.	2023-08-14 23:50:14 -04:00
Jeff Bloomfield	0180c0429f	Fix DML regression from allocator refactor and enable unrounded weight allocation in ORT API (#17030 ) This addresses a DML performance regression from the following PR resulting in allocations not being rounded and pooled in the DML execution provider. https://github.com/microsoft/onnxruntime/pull/15833 This also fixes a pre-existing limitation that allocations during session initialization (primarily large weights and persistent resources) only bypassed rounding and pooling while using the Winml API. The allocator now also respects a caller's rounding mode parameter when provided.	2023-08-10 17:02:24 -07:00
Justin Chu	eeef157888	Format c++ code under `winml/` (#16660 ) winml/ was previously excluded from lintrunner config. This change includes the directory and adds the clang-format config file specific to winml/ that fits existing style. --------- Signed-off-by: Justin Chu <justinchu@microsoft.com>	2023-07-25 21:56:50 -07:00
cao lei	329e8156d4	clean unused parameter in ORT_UNUSED_PARAMETER (#16538 ) ### Description clean unused parameter in ORT_UNUSED_PARAMETER ### Motivation and Context clean unused parameters in ORT_UNUSED_PARAMETER which are introduced from #15833	2023-07-07 13:20:36 -07:00
cao lei	dd72192cf4	ExecutionProvider API refactor - move allocator from EP level to SessionState level and indexed by OrtDevice (#15833 ) ### Description This PR is to refactor ExecutionProvider API for memory management, which is to move allocators from EP level to SessionState level and indexed by OrtDevice ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> This PR is to refactor ExecutionProvider API for memory management, which is to move allocators from EP level to SessionState level and indexed by OrtDevice. By this change, EP level will shift the burden of maintaining allocators, which will be user friendly for EP developers --------- Co-authored-by: Lei Cao <leca@microsoft.com@orttrainingdev8.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>	2023-06-19 17:44:45 -07:00
Dmitri Smirnov	ce3b4eabd3	Implement Optional Metadata support and C# test support (#15314 ) ### Description Implement Optional Type metadata support in the library. Implement optional support in C# API along with metadata. Implement Sequence, Map, Optional test data support and test execution. Prune tests and provide more details for failing tests in C# code. Note, this PR does not enable running onnx test models in C++. ### Motivation and Context Opset18 optional type support.	2023-04-11 09:41:59 -07:00
cao lei	50fa151298	remove device_id parameter out of ExecutionProvider::GetAllocator() (#14580 ) ### Description Remove the parameter device_id out of ExecutionProvider::GetAllocator() function ### Motivation and Context The parameter device_id is not necessary. We can fully rely on the second parameter OrtMemType mem_type to determine the device_id when getting allocator from executionProvider.	2023-02-13 10:01:07 -08:00
RandySheriffH	75584c5fa8	Enabling thread pool to be numa-aware (#13778 ) The PR enables ort thread pool to be numa-aware, so that threads could be evenly created and distributed among numa nodes. In addition, to facilitate performance tuning, the PR opens a new API allowing customers to attach threads to certain logical processors. Please check the API [definition](https://github.com/microsoft/onnxruntime/pull/13778/files#diff-5845a5c76fb64abdc8f0cffe21b37f8da1712674eb3abc4cd87190891be1bd48) for details. Co-authored-by: Randy Shuai <rashuai@microsoft.com>	2022-12-12 10:33:55 -08:00
Abhishek Udupa	83c59d2594	Session-aware and thread-safe CUDA profiler (#13706 ) ### Description The existing CUDA profiler is neither session-aware, nor thread-safe. This PR ensures both. ### Motivation and Context [PR 13549](https://github.com/microsoft/onnxruntime/pull/13549) brought thread-safety and session-awareness to the ROCm profiler. This PR brings the same goodness to the CUDA profiler as well. Sample outputs of a profiling run from the StableDiffusion model (this model was chosen because it requires orchestration of multiple sessions, and verifies that the profilers are now indeed session-aware) on both CUDA and ROCm EPs are attached, along with a script that checks that the trace files generated by the profile are well-formed. Update 11/29: Updated the profile outputs. The older profile outputs exhibited an issue where some timestamps were wildly out of range, leading to problems visualizing the traces. The bug has been fixed and the profile outputs have been updated, along with an update to the check script to ensure that timestamps are monotonically increasing. [sd_profile_outputs_cuda.tar.gz](https://github.com/microsoft/onnxruntime/files/10118088/sd_profile_outputs_cuda.tar.gz) [sd_profile_outputs_rocm.tar.gz](https://github.com/microsoft/onnxruntime/files/10118089/sd_profile_outputs_rocm.tar.gz) [check_profile_output_well_formedness.zip](https://github.com/microsoft/onnxruntime/files/10118090/check_profile_output_well_formedness.zip) Co-authored-by: Abhishek Udupa <abhishek.udupa@microsoft.com>	2022-12-09 13:22:12 -08:00
Numfor Tiapo	56387c3c31	Fix SDL Unmatched Annotation Errors (#13162 ) Fixes 3 SDL unmatched annotation errors. Co-authored-by: Numfor Mbiziwo-Tiapo <numform@microsoft.com>	2022-09-30 15:36:30 -07:00
Brian Martin	c20abcab87	User/brianma/eo (#13152 ) fixing SDL issues. One was a SAL mismatch, the other was handling an optional null pointer.	2022-09-30 09:43:56 -07:00
Edward Chen	3efd9a73bb	Refactor InferenceSession Load member functions. (#12430 ) Fix comparison of path characters when checking for ".ort" suffix. Some clean up of InferenceSession Load functions. - Reduce duplication between std::string/std::wstring versions. - Renaming for clarity.	2022-08-03 16:28:26 -07:00
Dmitri Smirnov	267a424e52	Retry Rework execution frame to reduce memory allocations (#11897 ) * Revert "Revert "Refactor ExecutionFrame and SessionState to reduce memory all… (#11888)" This reverts commit `d2cbae3a04`. * Revert prepacked_weights to avoid indirect inclusion in CUDA and TRT code that breaks the build.	2022-06-20 10:29:43 -07:00
Yi Zhang	d2cbae3a04	Revert "Refactor ExecutionFrame and SessionState to reduce memory all… (#11888 ) Revert "Refactor ExecutionFrame and SessionState to reduce memory allocations and improve data locality (#11804)" This reverts commit `2ecba6fd25`.	2022-06-17 17:07:21 +08:00
Dmitri Smirnov	2ecba6fd25	Refactor ExecutionFrame and SessionState to reduce memory allocations and improve data locality (#11804 ) Refactor ExecutionFrame and SessionState for better data locality and less memory allocations.	2022-06-16 16:50:48 -07:00
Sheil Kumar	6255194659	All LearningModelSessions created from a common LearningModelDevice should share the same thread pool (#11457 ) * Share thread pools between devices * make tests reuse device * Change cpu thread pool options for dml sessions to use 1 thread with no spinning * fix test failure * Update missing type constraints for dft * Add comment and rename inference session parameter * default missing causing inconsistent test behavior Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2022-05-13 11:12:43 -07:00
Numfor Tiapo	5fbfca3d58	Add Experimental API for setting model name (#10518 ) * Add experimental API for editing model name * Change EditModelName to 'SetName' * Change API to pass c_string * Update SetName to edit the proto * Test that the model proto gets changed * Remove comments * Skip inbox tests * Use filehelper path Co-authored-by: Numfor Mbiziwo-Tiapo <numform@microsoft.com>	2022-02-25 14:23:49 -08:00
Dwayne Robinson	0f5e82c294	DirectML EP remove stale code for int64 via int32 double strides (#9959 )	2022-01-10 02:07:22 -08:00
nums11	533b20c6ca	Merge remote-tracking branch 'upstream/master' into dmldev_temp	2021-11-18 14:21:34 -08:00
Sheil Kumar	3d0bd2596f	Enable creating OrtValues from ID3D12Resources from the onnxruntime C-API (#9686 ) * Add onnxruntime-windows api. * minor fixes * add to package headers * Build ort_dml_api for provider extensions. * Cleanup * misc comment * remove winml specific comments * use dml check in onnxruntime * Update include/onnxruntime/core/providers/dml/dml_provider_factory.h Co-authored-by: Dwayne Robinson <dwayner@microsoft.com> * Update include/onnxruntime/core/session/onnxruntime_c_api.h Co-authored-by: Dwayne Robinson <dwayner@microsoft.com> * Update include/onnxruntime/core/providers/dml/dml_provider_factory.h Co-authored-by: Dwayne Robinson <dwayner@microsoft.com> * Update include/onnxruntime/core/providers/dml/dml_provider_factory.h Co-authored-by: Dwayne Robinson <dwayner@microsoft.com> * Update onnxruntime/core/session/onnxruntime_c_api.cc Co-authored-by: Dwayne Robinson <dwayner@microsoft.com> * Update onnxruntime/core/session/ort_apis.h Co-authored-by: Dwayne Robinson <dwayner@microsoft.com> * Update winml/test/adapter/AdapterSessionTest.cpp Co-authored-by: Dwayne Robinson <dwayner@microsoft.com> * Update onnxruntime/core/session/onnxruntime_c_api.cc Co-authored-by: Dwayne Robinson <dwayner@microsoft.com> * Update winml/adapter/winml_adapter_c_api.cpp Co-authored-by: Dwayne Robinson <dwayner@microsoft.com> * Update include/onnxruntime/core/session/onnxruntime_c_api.h Co-authored-by: Pranav Sharma <prs@microsoft.com> * Update onnxruntime/core/session/onnxruntime_c_api.cc Co-authored-by: Pranav Sharma <prs@microsoft.com> * Update winml/adapter/winml_adapter_c_api.cpp * PR feedback * Update include/onnxruntime/core/providers/dml/dml_provider_factory.h Co-authored-by: Dwayne Robinson <dwayner@microsoft.com> * Update include/onnxruntime/core/providers/dml/dml_provider_factory.h Co-authored-by: Dwayne Robinson <dwayner@microsoft.com> * Update include/onnxruntime/core/providers/dml/dml_provider_factory.h Co-authored-by: Dwayne Robinson <dwayner@microsoft.com> * PR feedback * merge resolution and unreference param * (naming) Remove Dml prefix * maybe unused version * move DML code into DML path. CIs failing because DML is not available when --use_dml is not on * fix warning causing local build failures after merging * Change getvaluememoryinfo to gettensormemoryinfo * minor breaks * fix comment paste * fix comment Co-authored-by: Sheil Kumar <sheilk@microsoft.com> Co-authored-by: Dwayne Robinson <dwayner@microsoft.com> Co-authored-by: Pranav Sharma <prs@microsoft.com>	2021-11-13 03:34:54 -08:00
Sheil Kumar	a17bdaf725	Enable JoinModels API in WinML+RT Experimental API (#9746 ) * Dynamic onnx model fusion * empty node names shoudl remain empty * comments and cleanup * logic reversed for promoting_unlined_outputs * PR feedback * type * typo * fix model outputs with promote unlinked output * remove disembodied model Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2021-11-12 16:56:31 -08:00
Jeff Bloomfield	3a1b4045c9	Merge remote-tracking branch 'upstream/master' into DmlDev	2021-11-02 17:56:53 -07:00
sumitsays	88f61a1b2d	[DmlEp] DmlEp acknowledges ORT_NO_EXCEPTIONS (#9622 ) * Make DmlEp Clang compatible for EPIC * Fix build issues occurred when engine/lotus points to ORT Github latest * Fix more build errors * Fixed one build issue and removed temporary changes for Clang * Addressed comments on the PR. * [WIP] - DmlEp ORT NO Exception * Made DmlEp compatible with ORT_NO_EXCEPTION * Fixed typo * Addressed comments on the PR, mostly nit styling and using approriate HR error code * Added dependency of ErrorHandling.h * Addressed comment on the PR Co-authored-by: Sumit Agarwal <sumitagarwal@microsoft.com>	2021-11-01 17:32:43 -07:00
Ryan Lai	7dad90b200	Merge remote-tracking branch 'upstream/master' into HEAD	2021-10-26 10:38:41 -07:00
Edward Chen	79e736ed25	Make onnxruntime::Status nodiscard (#9279 ) Mark onnxruntime::Status class with [[nodiscard]] attribute. Fix existing warnings.	2021-10-08 17:10:31 -07:00
Tiago Koji Castro Shibata	20b9390d1d	Merged PR 6524907: Fix merge conflicts from public ORT to WindowsAI ORT	2021-10-01 22:47:52 +00:00
Sheil Kumar	c6cb49c5a1	DirectML.dll load fails when executable path contains Non-English characters (#9229 ) * enable unicode dml * add wide string L prefix * Add Fail Fast back Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2021-09-30 15:16:57 -07:00
Tiago Koji Castro Shibata	62c0d24340	Fix Windows Store build (#8753 ) * Remove APIs unavailable in Store in #8349, #8178, #8065 * Add UWP stubs of C runtime functions * Remove UWP incompatible tests from UWP build * Remove incompatible tests from Store * Use UWP stubs in store only * Skip partition check outside of Windows * Remove unused WRL include * Workaround Windows header not including what it uses * Fix precompiled header name clash * Workaround SDK bugs * DXCore workaround in Win7 * Fix warning * Fix more warnings * Bump WinML to target Windows 8 * Fix more warnings * Remove unnecessary workarounds * Remove Desktop only APIs from DML adapter	2021-08-23 11:19:03 -07:00
ytaous	0725f80d2d	Revert "Fix Windows Store build (#8481 )" (#8679 ) This reverts commit `53e7831b53`.	2021-08-11 00:37:36 -07:00
Tiago Koji Castro Shibata	53e7831b53	Fix Windows Store build (#8481 ) * Remove APIs unavailable in Store in #8349, #8178, #8065 * Add UWP stubs of C runtime functions * Remove UWP incompatible tests from UWP build * Remove incompatible tests from Store * Use UWP stubs in store only * Skip partition check outside of Windows * Remove unused WRL include * Workaround Windows header not including what it uses * Fix precompiled header name clash * Workaround SDK bugs * DXCore workaround in Win7 * Fix warning * Fix more warnings * Bump WinML to target Windows 8 * Fix more warnings * Remove unnecessary workarounds	2021-08-10 15:19:30 -07:00
Ryan Lai	b0c0b087a4	FIx merge conflict	2021-07-21 14:02:18 -07:00
Sheil Kumar	c3129306e5	Enable string attributes for experimental model building (#8428 ) * string attributes * Update error message Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2021-07-19 11:48:41 -07:00
sumitsays	43c45ddd66	Update DirectML EP changes from DmlDev as of 2021-06-07 (#7987 ) * Merged PR 6093117: Fix test_DynamicQuantizedLinear_max_adjusted_expanded by allowing Identity operator to run on non-float inputs Motivation: As part of the OnnxConformance Backend tests, DynamicQuantizedLinear_max_adjusted_expanded is failing. Root Cause: - The test model has `Identity` operator as one of the node. The input of this node is of non-float data type. - In DML, `Identity` operator is registered as operator which requires floating input. - As per `DirectMLSchema.h`, support for non-float input has been added for `Identity` operator in DML but the same has not been reflected in the `OperatorRegistration.cpp`. Changes: - Removed all traces of the requiresFloatFormatsForGraph flag from it's definition and usage. This flag was only used for Identity and it's related operator. - Added null check for the graphOutput nodeArg in GraphDescBuilder.cpp to stop the crash of the test. Related work items: #33076298 * Merged PR 6103324: Remove usage of non-generic error code (FWP_E_NULL_POINTER) Motivation: Addressing Dwayne comment on the previous PR. [Ref: [6093117](https://dev.azure.com/microsoft/WindowsAI/_git/onnxruntime/pullrequest/6093117?discussionId=44292162&path=%2Fonnxruntime%2Fcore%2Fproviders%2Fdml%2FDmlExecutionProvider%2Fsrc%2FGraphPartitioner.cpp)] Changes: Inside the DML EP, we should not use some other platform specific error codes. Instead we should a appropriate generic error code. Related work items: #33076298 Co-authored-by: Sumit Agarwal <sumitagarwal@microsoft.com>	2021-06-11 11:09:48 -07:00
Dwayne Robinson	8b0c2e1f3d	Merged PR 6101363: Int64 prototype work for ONNX runtime DML EP Add `support64BitTensorsViaEmulation` to the internal registration info, that informs the graph partitioner that int64 is supported via emulation, even if the device doesn't support it natively. See further description in the corresponding WindowsAI DML PR: https://dev.azure.com/microsoft/WindowsAI/_git/WindowsAI/pullrequest/6101182 Note a later PR will most likely delete this newly added flag and simplify much of the existing logic, even deleting the strides hack completely ^__^... Related work items: #28761231	2021-06-11 07:54:30 +00:00
Sumit Agarwal	3f43a84e10	Merged PR 6093117: Fix test_DynamicQuantizedLinear_max_adjusted_expanded by allowing Identity operator to run on non-float inputs Motivation: As part of the OnnxConformance Backend tests, DynamicQuantizedLinear_max_adjusted_expanded is failing. Root Cause: - The test model has `Identity` operator as one of the node. The input of this node is of non-float data type. - In DML, `Identity` operator is registered as operator which requires floating input. - As per `DirectMLSchema.h`, support for non-float input has been added for `Identity` operator in DML but the same has not been reflected in the `OperatorRegistration.cpp`. Changes: - Removed all traces of the requiresFloatFormatsForGraph flag from it's definition and usage. This flag was only used for Identity and it's related operator. - Added null check for the graphOutput nodeArg in GraphDescBuilder.cpp to stop the crash of the test. Related work items: #33076298	2021-05-28 17:44:37 +00:00
Hariharan Seshadri	4b691a5c0d	Add ability for memory arenas to "shrink" periodically (#7284 )	2021-05-08 07:53:21 -07:00
Ori Levari	dfca1a09d5	Add Thread Spinning Session Option in WinML (#7498 ) Co-authored-by: Ori Levari <orlevari@microsoft.com>	2021-04-30 11:44:58 -07:00
Adrian Tsai	70e67ddd2b	Update DirectML version to 1.5.1 and enable ARM/ARM64 builds with DML (#7511 ) * Update DirectML to version 1.5.1 * Enable --use_dml with ARM and ARM64 * Add ARM/ARM64 binaries to nuget packages	2021-04-30 00:49:30 -07:00
Changming Sun	1012535dab	Change onnxruntime::make_unique to std::make_unique (#7502 ) 1. Change onnxruntime::make_unique to std::make_unique 2. Add "-std=c++14" to ROCM EP's build flags.	2021-04-29 17:04:53 -07:00
Sheil Kumar	87cb6fd495	Add LearningModelBuilder to WinML Experimental Namespace along with various Audio operators (#6623 ) * model building * fix build * winml adapter model building api * model building * make build * make build again * add model building with audio op * inplace and inorder fft * add ifft * works! * cleanup * add comments * switch to iterative rather than recursive and use parallelization * batched parallelization * fft->dft * cleanup * window functions * add melweightmatrix op * updates to make spectrogram test work * push latest * add onesided * cleanup * Clean up building apis and fix mel * cleanup * cleanup * naive stft * fix test output * middle c complete * 3 tones * cleanup * signal def new line * Add save functionality * Perf improvements, 10x improvement * cleanup * use bitreverse lookup table for performance * implement constant initializers for tensors * small changes * add matmul tests * merge issues * support add attribute * add tests for double data type windowfunctions and minor cleanup * stft onesided/and not tests * cleanup * cleanup * clean up * cleanup * remove threading attribute * forward declare orttypeinfo * warnings * fwd declare * fix warnings * 1 more warning * remove saving to e drive... * cleanup and fix stft test * add opset picker * small additions * add onnxruntime tests * add signed/unsigned * fix warning * fix warning * finish onnxruntime tests * make windows namespace build succeed * add experimental flag * add experimental api into nuget package * add experimental api build flag and add to windows ai nuget package * turn experimental for tests * add minimum opset version to new experimental domain * api cleanup * disable ms experimental ops test when --ms_experimental is not enabled * add macro behind flag * remove unused x * pr feedback Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2021-02-12 14:17:10 -08:00
Ori Levari	531eb064ab	fix sdl bugs for uninitialized variables and returns (#6450 ) Co-authored-by: Ori Levari <orlevari@microsoft.com>	2021-01-29 15:00:44 -08:00
Ori Levari	3b1227c5ce	SDL annotation fixes (#6448 ) Co-authored-by: Ori Levari <orlevari@microsoft.com>	2021-01-28 22:34:10 -08:00
Sheil Kumar	ea2b560055	Fix test breaks in Windows ingestion pipeline (#6476 ) * fix various build breaks with Windows build * fix runtime errors loading libraries from system32 * add build_inbox check to winml_test_common * use raw string * cleanup * fix dll load Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2021-01-28 14:37:15 -08:00
Sheil Kumar	d5f51c4033	Bug 31463811: Servicing: Redist (Nuget) conflicts with Microsoft.AI.MachineLearning starting 21H1+ (#6460 ) * update load library code to have the fullly qualified path * make it work for syswow32 * git Revert "make it work for syswow32" This reverts commit b9f594341b7cf07241b18d0c376af905edcabae3. Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2021-01-27 12:25:03 -08:00
Ori Levari	6507b4f818	Reintroduce experimental api changes and fix remote build break (#6385 ) Co-authored-by: Ori Levari <orlevari@microsoft.com>	2021-01-22 15:15:53 -08:00

1 2

75 commits