winml/ was previously excluded from the lintrunner config. This change
includes the directory and adds a clang-format config file specific to
winml/ that matches the existing style.
---------
Signed-off-by: Justin Chu <justinchu@microsoft.com>
### Description
Clean up unused parameters wrapped in ORT_UNUSED_PARAMETER.
### Motivation and Context
Clean up unused parameters wrapped in ORT_UNUSED_PARAMETER that were
introduced in #15833.
### Description
Remove AllocatorManager class
### Motivation and Context
After refactor PR #15833 was merged, the AllocatorManager class is no
longer referenced.
### Description
This PR refactors the ExecutionProvider API for memory management by
moving allocators from the EP level to the SessionState level, indexed
by OrtDevice.
### Motivation and Context
This PR refactors the ExecutionProvider API for memory management by
moving allocators from the EP level to the SessionState level, indexed
by OrtDevice. This change relieves EPs of the burden of maintaining
allocators, which makes things easier for EP developers.
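The design described above can be sketched as follows. This is an illustrative Python model of the idea, not ORT's actual C++ code; the class and method names here are hypothetical.

```python
# Sketch: a session-level allocator registry keyed by device, so execution
# providers no longer own their allocators. All names are hypothetical.

class OrtDevice:
    def __init__(self, device_type, mem_type, device_id):
        # The tuple acts as the lookup key for the allocator registry.
        self.key = (device_type, mem_type, device_id)

class SessionState:
    def __init__(self):
        self._allocators = {}  # OrtDevice key -> allocator

    def register_allocator(self, device, allocator):
        # Allocators for the same device are shared, not duplicated per EP.
        self._allocators.setdefault(device.key, allocator)

    def get_allocator(self, device):
        return self._allocators.get(device.key)

state = SessionState()
cpu = OrtDevice("CPU", "DEFAULT", 0)
state.register_allocator(cpu, object())
# Two lookups for the same device resolve to the same shared allocator.
assert state.get_allocator(cpu) is state.get_allocator(OrtDevice("CPU", "DEFAULT", 0))
```

The point of the sketch is that allocator ownership lives in one place (the session state), and any EP asking for a given device gets the same shared allocator.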
---------
Co-authored-by: Lei Cao <leca@microsoft.com@orttrainingdev8.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>
### Description
Disable a test with random failure in Windows GPU CI Pipeline like the
following:
```
11: [ OK ] BatchTest/BatchTest.BatchSupport/163 (0 ms)
11: [ RUN ] BatchTest/BatchTest.BatchSupport/164
11: D:\a\_work\1\s\winml\test\image\imagetests.cpp(186): error: Expected: m_model_binding.Bind(output_data_binding_name, output_video_frames) doesn't throw an exception.
11: Actual: it throws.
11: D:\a\_work\1\s\winml\test\image\imagetests.cpp(211): error: Expected: m_result = m_session.Evaluate(m_model_binding, L"") doesn't throw an exception.
11: Actual: it throws.
11: total errors is 0/2073600, errors rate is 0total errors is 0/2073600, errors rate is 0total errors is 0/2073600, errors rate is 0[ FAILED ] BatchTest/BatchTest.BatchSupport/164, where GetParam() = ((L"fns-candy_Bgr8_Batch3.onnx", 0, { L"1080.jpg", L"fish_720_Gray.png", L"fish_720.png" }, 3, false), 0, 1, 1, 1, 4-byte object <02-00 00-00>) (3203 ms)
```
Since https://github.com/microsoft/onnxruntime/pull/15468 merged to
main, roughly 10-15% of build jobs have failed in this test.
### Description
1. Disable XNNPack EP's tests in Windows CI pipeline
The EP code has a known memory alignment problem, but it does not
impact the usages we ship the code for. Currently we only use the
XNNPack EP in mobile apps and web scenarios, and we already have
pipelines covering those usages. We need to prioritize fixing the bugs
found in those pipelines, and there are no resources to spend on this
Windows one. We can re-enable the tests once we reach an agreement on
how to fix the memory alignment bug.
2. Delete anybuild.yml, which was for an already deleted pipeline.
3. Move Windows CPU pipelines to AMD CPU machine pools, which are
cheaper.
4. Disable some qdq/int8 model tests that will fail if the CPU doesn't
have Intel AVX512 8-bit instructions.
This change moves the DML CI pipeline to the A10 machines and fixes or
disables tests that were failing from this change.
- Max error rate threshold was increased for Image Tests
- Some failing batch tests were disabled
---------
Co-authored-by: Changming Sun <chasun@microsoft.com>
### Description
Implement Optional Type metadata support in the library.
Implement optional support in C# API along with metadata.
Implement Sequence, Map, Optional test data support
and test execution.
Prune tests and provide more details for failing tests in C# code.
Note, this PR does not enable running onnx test models in C++.
### Motivation and Context
Opset18 optional type support.
Ensure that Loop operators run on CPU.
Fix memcpy for Sequence Tensors, so that empty sequences (like when
SequenceEmpty runs on DirectML) can be copied back to CPU.
### Description
Remove the device_id parameter from the
ExecutionProvider::GetAllocator() function.
### Motivation and Context
The device_id parameter is not necessary. We can rely entirely on the
second parameter, OrtMemType mem_type, to determine the device_id when
getting an allocator from the ExecutionProvider.
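A minimal sketch of the simplified lookup, assuming allocators are keyed by memory type so the device id is implied by the EP's own configuration. This is illustrative Python, not ORT's real API; all names here are hypothetical.

```python
# Sketch: with allocators keyed by memory type, callers no longer pass a
# device_id; the EP already knows which device it was configured for.

OrtMemTypeDefault = 0     # device memory for this EP's device
OrtMemTypeCPUOutput = 1   # host-accessible memory, always on CPU

class ExecutionProvider:
    def __init__(self, device_id):
        self._device_id = device_id
        self._allocators = {
            OrtMemTypeDefault: f"device_{device_id}_allocator",
            OrtMemTypeCPUOutput: "cpu_allocator",
        }

    def get_allocator(self, mem_type):
        # No device_id parameter: mem_type alone selects the allocator.
        return self._allocators[mem_type]

ep = ExecutionProvider(device_id=2)
assert ep.get_allocator(OrtMemTypeDefault) == "device_2_allocator"
```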
### Description
The existing CUDA profiler is neither session-aware, nor thread-safe.
This PR ensures both.
### Motivation and Context
[PR 13549](https://github.com/microsoft/onnxruntime/pull/13549) brought
thread-safety and session-awareness to the ROCm profiler. This PR brings
the same goodness to the CUDA profiler as well.
Sample outputs of a profiling run from the StableDiffusion model (this
model was chosen because it requires orchestration of multiple sessions,
and verifies that the profilers are now indeed session-aware) on both
CUDA and ROCm EPs are attached, along with a script that checks that the
trace files generated by the profile are well-formed.
Update 11/29: Updated the profile outputs. The older profile outputs
exhibited an issue where some timestamps were wildly out of range,
leading to problems visualizing the traces. The bug has been fixed and
the profile outputs have been updated, along with an update to the check
script to ensure that timestamps are monotonically increasing.
[sd_profile_outputs_cuda.tar.gz](https://github.com/microsoft/onnxruntime/files/10118088/sd_profile_outputs_cuda.tar.gz)
[sd_profile_outputs_rocm.tar.gz](https://github.com/microsoft/onnxruntime/files/10118089/sd_profile_outputs_rocm.tar.gz)
[check_profile_output_well_formedness.zip](https://github.com/microsoft/onnxruntime/files/10118090/check_profile_output_well_formedness.zip)
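The attached check script is not reproduced here, but a well-formedness check of this kind can be sketched as follows: parse the trace JSON and verify that event timestamps are monotonically non-decreasing. This is a minimal sketch under that assumption; the real script may differ in detail.

```python
# Sketch: validate that a profile trace is well-formed JSON and that its
# event timestamps ("ts") never go backwards.
import json

def check_trace(trace_text):
    events = json.loads(trace_text)
    timestamps = [e["ts"] for e in events if "ts" in e]
    # Monotonically non-decreasing: each timestamp >= the previous one.
    return all(a <= b for a, b in zip(timestamps, timestamps[1:]))

good = '[{"name": "kernel_a", "ts": 10}, {"name": "kernel_b", "ts": 15}]'
bad = '[{"name": "kernel_a", "ts": 20}, {"name": "kernel_b", "ts": 5}]'
assert check_trace(good)
assert not check_trace(bad)
```

The `bad` input models the bug mentioned above, where some timestamps were wildly out of range.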
Co-authored-by: Abhishek Udupa <abhishek.udupa@microsoft.com>
### Description
- Use the same data type as the input for the mask_index tensor, which
is used as the DML GEMM API's C parameter.
- Remove the gsl header include, as it is already included transitively.
### Motivation and Context
A bug found in internal conformance testing. No open issue linked
(N/A).
### Description
1. Update the model name structure in model_tests.cpp to include the
source name, to avoid:
`Condition test_param_names.count(param_name) == 0 failed. Duplicate
parameterized test name 'BERT_Squad_opset10_CPU'`
2. Skip some failing models (https://github.com/onnx/models/issues/568).
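The naming fix can be illustrated with a small sketch: include the model's source in the generated parameterized test name so that two copies of the same model no longer produce colliding names. The helper name here is hypothetical.

```python
# Sketch: build parameterized test names with a source prefix so that the
# same model from two different sources yields distinct names.
def make_test_name(source, model, opset, provider):
    return f"{source}_{model}_opset{opset}_{provider}"

names = [
    make_test_name("zoo", "BERT_Squad", 10, "CPU"),
    make_test_name("internal", "BERT_Squad", 10, "CPU"),
]
# Without the source prefix both would be "BERT_Squad_opset10_CPU";
# with it, the names are unique.
assert len(set(names)) == len(names)
```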
### Motivation and Context
# Motivation
Currently, ORT minimal builds use kernel def hashes to map from nodes to
kernels to execute when loading the model. As the kernel def hashes must
be known ahead of time, this works for statically registered kernels.
This works well for the CPU EP.
For this approach to work, the kernel def hashes must also be known at
ORT format model conversion time, which means the EP with statically
registered kernels must also be enabled then. This is not an issue for
the always-available CPU EP. However, we do not want to require that any
EP which statically registers kernels is always available too.
Consequently, we explore another approach to match nodes to kernels that
does not rely on kernel def hashes. An added benefit of this is the
possibility of moving away from kernel def hashes completely, which
would eliminate the maintenance burden of keeping the hashes stable.
# Approach
In a full build, ORT uses some information from the ONNX op schema to
match a node to a kernel. We want to avoid including the ONNX op schema
in a minimal build to reduce binary size. Essentially, we take the
necessary information from the ONNX op schema and make it available in a
minimal build.
We decouple the ONNX op schema from the kernel matching logic. The
kernel matching logic instead relies on per-op information which can
either be obtained from the ONNX op schema or another source.
This per-op information must be available in a minimal build when there
are no ONNX op schemas. We put it in the ORT format model.
Existing uses of kernel def hashes to look up kernels are replaced
with the updated kernel matching logic. We no longer store
kernel def hashes in the ORT format model’s session state and runtime
optimization representations. We no longer keep the logic to
generate and ensure stability of kernel def hashes.
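The matching approach described above can be sketched conceptually (in Python, not ORT's actual C++): a node is matched against registered kernel defs using per-op information such as domain, op type, and a since-version range, with no precomputed hash involved. The registry layout here is hypothetical.

```python
# Sketch: hash-free kernel matching. The per-op information (domain,
# op type, supported version range) can come from either the ONNX op
# schema in a full build or the ORT format model in a minimal build.
KERNEL_REGISTRY = [
    # (domain, op_type, start_version, end_version, kernel_name)
    ("", "Add", 7, 12, "Add_7"),
    ("", "Add", 13, 999, "Add_13"),
]

def match_kernel(domain, op_type, since_version):
    for d, op, start, end, kernel in KERNEL_REGISTRY:
        if d == domain and op == op_type and start <= since_version <= end:
            return kernel
    return None  # no matching kernel registered

assert match_kernel("", "Add", 13) == "Add_13"
assert match_kernel("", "Add", 7) == "Add_7"
```

Because matching is recomputed from this per-op information at load time, nothing about the kernel defs needs to stay hash-stable across releases.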
* Contrib Op: FusedMatMul for DML EP
* Added relevant comments and extra validation
* Polish
* More polish
* Last polish
* Addressed comments on the PR
* Removed unnecessary comments
* Used C++ standard library algorithm functions
* Removed unused code
Co-authored-by: Sumit Agarwal <sumitagarwal@microsoft.com>
Co-authored-by: Dwayne Robinson <fdwr@hotmail.com>
* Fix SAME_UPPER/SAME_LOWER (auto_pad attribute) in ConvTranspose
* Bump ONNX 1.10.2 globally
* load ONNX_VERSION from VERSION_NUMBER
* revert deprecate warning in ORT 1.12
* add a comment about why removing cntk_simple_seg
* correct the implementation in DML as well
Fix comparison of path characters when checking for ".ort" suffix.
Some clean up of InferenceSession Load functions.
- Reduce duplication between std::string/std::wstring versions.
- Renaming for clarity.
* Revert "Revert "Refactor ExecutionFrame and SessionState to reduce memory all… (#11888)"
This reverts commit d2cbae3a04.
* Revert prepacked_weights to avoid indirect inclusion in CUDA and TRT code that breaks the build.
Prior to this change, every test shared the same tolerances. This meant
that if an ONNX test failed due to a small but acceptable difference in
output, the only alternative was to disable the test entirely.
In op set 17, the DFT operator is being added. Without this change, the
tests for that operator fail because the output is off by about 5e-5.
It's better to keep test coverage for this new op rather than disable
the test entirely.
Also prior to this change, the global tolerances were not shared between
C++, JavaScript, and Python tests. Now they are.
Also fix various minor issues raised by linters.
Unblocks https://github.com/microsoft/onnxruntime/issues/11640.
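Per-test tolerance overrides of the kind described above can be sketched as follows (names and values are illustrative): each test falls back to a shared default unless an override is registered, so a test like opset 17's DFT can loosen its tolerance slightly instead of being disabled.

```python
# Sketch: shared default tolerances with optional per-test overrides.
DEFAULT_TOL = {"rtol": 1e-3, "atol": 1e-5}
OVERRIDES = {
    # DFT output differs by about 5e-5, so it gets a looser atol
    # rather than being disabled entirely.
    "test_dft": {"rtol": 1e-3, "atol": 1e-4},
}

def tolerances_for(test_name):
    return OVERRIDES.get(test_name, DEFAULT_TOL)

assert tolerances_for("test_dft")["atol"] == 1e-4
assert tolerances_for("test_abs") is DEFAULT_TOL
```

Keeping the overrides in one table also makes it straightforward to share the same values across the C++, JavaScript, and Python test runners.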
* Share thread pools between devices
* make tests reuse device
* Change CPU thread pool options for DML sessions to use 1 thread with no spinning
* fix test failure
* Update missing type constraints for dft
* Add comment and rename inference session parameter
* Fix missing default causing inconsistent test behavior
Co-authored-by: Sheil Kumar <sheilk@microsoft.com>
* Add experimental API for editing model name
* Change EditModelName to 'SetName'
* Change API to pass c_string
* Update SetName to edit the proto
* Test that the model proto gets changed
* Remove comments
* Skip inbox tests
* Use filehelper path
Co-authored-by: Numfor Mbiziwo-Tiapo <numform@microsoft.com>