onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-08 17:17:15 +00:00

Author	SHA1	Message	Date
Martin Hořeňovský	cd552e1bda	Add build.py option for disabling memleak checker The memleak checker used by default in Debug configuration does not play nice with embedding static lib of ONNXRuntime into binaries, because other code will not be using the same debug heap, leading to trouble. This makes it easier for outside builders to disable it for their build.	2021-12-09 01:39:09 +00:00
Nick Kreeger	051d005926	Fix build warnings with VS 2022 (#9967 )	2021-12-08 16:43:46 -06:00
Yufeng Li	0c72f1cd5a	add copyright (#9943 ) (#9970 )	2021-12-08 14:34:53 -08:00
Sunghoon	35cf8b8725	[js/react_native] npm audit fix (#9876 ) * add p50 in test * Audit fix for react native Co-authored-by: Yulong Wang <yulongw@microsoft.com>	2021-12-08 13:27:31 -08:00
Tang, Cheng	0adeb86bfd	Fix ortmodule for the pytorch model with ort device (#9927 ) * add ortmodule and eager mode test * add ortmodule dependency * convert between aten ort tensor and ortvalue * register the EP to ortmodule using ort device information * remove duplicated test * remove useless dependency * handle half precision type for ortmodule outputs * adjust the tensor conversion python code Co-authored-by: Cheng Tang <chenta@microsoft.com@orttrainingdev9.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>	2021-12-08 13:04:29 -08:00
Patrik Vavercak	fb30e9fdae	Remove /safeseh link option from non-msvc builds (#9744 ) (#9935 )	2021-12-08 11:44:00 -08:00
Ye Wang	9e7d52a801	Update parity_check_helper.py (#9884 )	2021-12-08 10:51:46 -08:00
Yi-Hong Lyu	f60a287a64	Add __x86.get_pc_thunk.bx to avoid dependency (#9955 )	2021-12-08 04:50:41 -08:00
Dmitri Smirnov	a7f649db7c	Enable proper override using MIMalloc (#9944 ) Redirect memory allocations to MiMalloc and advance its version to v2.0.3 Refactor for a universal ifdef	2021-12-07 17:56:58 -08:00
Guoyu Wang	b34b991aea	Improve reduced ops and types build (#9908 ) * Improve reduceops and types build * minor update * fix test error * fix minimal build break * minor update and add comments * Address CR comments	2021-12-07 13:02:05 -08:00
Weixing Zhang	840212e115	Enable OneHot kernel for ROCm EP and add Dockerfile for ROCm 4.3.1 (#9656 ) * enable OneHot for ROCm EP * add dockerfile for ROCm 4.3.1 Co-authored-by: Weixing Zhang <wezhan@microsoft.com>	2021-12-07 12:47:00 -08:00
Justin Stoecker	63c8889944	Restore arm64x onnxruntime binaries (#9950 )	2021-12-07 12:39:46 -08:00
sfatimar	5e4d58a50a	Openvino ep nuget (#9909 ) * Changes Fixed merge conflicts Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> C# Nuget fix for windows -> OpenVINO Libs included in Nuget package -> Updated nuget.exe path for openvino ep build in Windows -> Include mvcmd file along with openvino dlls * Fixing PEP Style comments * Comment Removed Co-authored-by: MaajidKhan <n.maajidkhan@gmail.com> Co-authored-by: saharfraza <sfatima.3001@gmail.com>	2021-12-07 07:48:31 -08:00
Scott McKay	a23cd5b697	Update Xamarin sample code (#9925 ) * Update InferenceSample so the mobile app is more user friendly. Remove fasterrcnn xamarin sample. Adding a more complete xamarin sample targeted at end users to the inference examples repo. See https://github.com/microsoft/onnxruntime-inference-examples/pull/58	2021-12-07 16:18:58 +10:00
MengX1991	58728f95a2	Fix conv quant reduce range option (#9922 ) Co-authored-by: menj <menj@microsoft.com>	2021-12-06 16:43:54 -08:00
Yufeng Li	5871ca1cd1	Comment out unused parameter (#9914 )	2021-12-06 15:39:58 -08:00
Weixing Zhang	67a30ef716	Address some code scan issues (#9873 ) Potential comparison of a constant with another constant. at D:\a\_work\1\s\orttraining\orttraining\training_ops\cuda\reduction\\reduction_all.cu@97,42 Co-authored-by: Weixing Zhang <wezhan@microsoft.com>	2021-12-06 13:50:52 -08:00
Yufeng Li	e613019174	add s8s8 support for quantized conv and gemm (#9902 ) * add s8s8 support for quantized conv and gemm	2021-12-03 14:55:18 -08:00
Ye Wang	d8c71304c1	Update Dev_Guide.md (#9921 )	2021-12-03 10:30:16 -08:00
Chi Lo	02aa16e3ea	QDQ tool modification part3 (#9904 ) * refine per channel quantization for qdq * remove old option * add comment * add import itertools	2021-12-03 10:09:34 -08:00
Dwayne Robinson	4ff78aae45	Merge pull request #9917 from microsoft/user/dwayner/FnsCandyTolerance30696168 Update WinML model tests for FNS candy and Inception float16	2021-12-02 22:45:45 -08:00
Sheil Kumar	5edaa75ef6	Fix LoadFromStream to not use wss::Buffer internally (#9918 ) Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2021-12-02 21:29:06 -08:00
Tang, Cheng	8db49e3d0f	add ortmodule and eager mode test (#9888 ) * add ortmodule and eager mode test * add ortmodule dependency * fix eager pipeline * skip the ortmodule test for windows due to win ci issue * remove useless win ci change * add torch Co-authored-by: Abhishek Jindal <abjindal@microsoft.com>	2021-12-02 19:49:18 -08:00
Dwayne Robinson	6e4c534ce2	Relax tolerance slightly more for Intel after autopilot run	2021-12-02 19:42:31 -08:00
George Nash	1c38ceda49	Add fusion support for Dnnl execution provider (#9897 ) * Op fusion support added In addition the following op fusions are detected - ConvRelu - MatMulAdd This change includes - Change abstraction of Subgraph + node + tensor to support delete insert modify - add nodearg class to establish connection from tensor to node - add graphtransformer class to support fusion - add topological sort to ensure proper node ordering after fusion - add convrelu + matmuladd primitive to support execution of fused nodes - Fix FusionResolution with missing tensors when fusing, if the target node contains fewer tensors than original patterns (Gelu and FastGelu ignores many initializers), potentially delete them also from inputs and initializers Also check tensor has no producer and consumer before deleting Signed-off-by: Wang <zhaoyang.wang@intel.com> * Gelu and FastGelu Fusion for DNNL EP The basics of the Gelu/FastGelu code are modeled after: - core/optimizer/fast_gelu_fusion.cc and - core/optimizer/gelu_fusion.cc OneDNN does not have support for 'Erf' unless it is part of 'Gelu'. This results in detecting 'Gelu' fusion twice. Once when detecting if the 'Erf' Operator is supported and again in the subgraph transformer code. The capability code is finding the Gelu using onnxruntime::GraphViewer and onnxruntime::Node. While the transformer code is using DnnlSubgraph and DnnlNode. This results in two parts of code looking for the same pattern but unfortunately having little code reuse. This also adds support for Biased versions of Gelu and FastGelu if they already exist in a model. Signed-off-by: George Nash <george.nash@intel.com> * Code Clean Up Signed-off-by: Wang <zhaoyang.wang@intel.com> Co-authored-by: Wang <zhaoyang.wang@intel.com>	2021-12-02 18:11:19 -08:00
Abhishek Jindal	06e63218be	changing commit for windows build for eager mode (#9912 )	2021-12-02 14:07:46 -08:00
Dwayne Robinson	77e67a6de7	Add one more example line	2021-12-02 13:34:01 -08:00
Dwayne Robinson	ef7671b938	Comment out old lines	2021-12-02 13:30:34 -08:00
Scott McKay	912e50f61c	Add CI minimal build with all options disabled. Fix python binding code if sparse tensors are disabled. (#9898 ) * Add 2 builds to validate the cmake defines for excluding optional components work in both full and minimal builds. * Create empty config for no-ops build * Create empty config for no-ops build - attempt #2 * Create empty config for no-ops build - attempt #3 * Update python binding code to work when sparse tensors are disabled.	2021-12-03 06:56:51 +10:00
Dwayne Robinson	5ba4079546	Merge branch 'DmlDev' into user/dwayner/FnsCandyTolerance30696168	2021-12-02 01:04:49 -08:00
Dwayne Robinson	7a3abd863f	Update WinML model test tolerances for tiny_yolov2 and FNS_Candy	2021-12-02 00:48:54 -08:00
Scott McKay	3f5c1e1c58	Update to include the Xamarin targets for internal ORT builds so the managed nuget package is consistent as both CPU and GPU builds produce a package called Microsoft.ML.OnnxRuntime.Managed. (#9906 )	2021-12-02 18:43:34 +10:00
Jeff Daily	8d88a6ac7f	add --amdgpu-target=gfx90a (#9820 )	2021-12-01 22:28:52 -08:00
Scott McKay	00c979db4d	Update doc for operators/opsets supported by mobile package (#9899 )	2021-12-02 13:51:22 +10:00
RandySheriffH	078782ea3c	exclude test case from reduced-op build (#9895 ) Co-authored-by: Randy Shuai <rashuai@microsoft.com>	2021-12-01 15:50:36 -08:00
Ryan Lai	3f82b2d5b1	Merged PR 6723735: Need to unsquash the last RI to maintain history Related work items: #37220320	2021-12-01 22:13:25 +00:00
Ryan Lai	73d106e33a	Merge commit '9345894c823de67f913d907da91dbc07ac322041' into HEAD	2021-12-01 14:11:29 -08:00
Sherlock	175acf08f4	ScatterND supports negative indices (#9739 ) * ScatterND supports negative indices	2021-11-30 21:17:32 -08:00
Hariharan Seshadri	c161813217	Misc InstanceNorm CUDA kernel changes (#9879 )	2021-11-30 19:15:45 -08:00
Sherlock	6de79d82c8	Fix Training Packaging pipeline (#9885 ) * Fix Training Packaging pipeline	2021-11-30 15:26:10 -08:00
Ryan Lai	d8a7e1d159	Merged PR 6718335: RI 11/30 from github Pipeline green https://microsoft.visualstudio.com/WindowsAI/_build/results?buildId=42142807&view=results ![image.png](https://microsoft.visualstudio.com/274e76ac-6b29-4f77-a85d-7914c77cabd5/_apis/git/repositories/853d2ddc-663c-4fe8-8036-dbf0d50db2d9/pullRequests/6718335/attachments/image.png) Related work items: #37220320	2021-11-30 21:29:25 +00:00
Abhishek Jindal	740679d329	Abjindal/fix windows ci pipeline (#9883 ) * switching to /wd4800 for eager mode * fixing compile flags ignore warnings, previously it was only using the last one	2021-11-30 10:33:13 -08:00
RandySheriffH	9345894c82	Add build option to enable cuda profiling (#9875 )	2021-11-29 22:44:50 -08:00
RandySheriffH	16bfd3c771	Cancel transpose optimizer for resize (#9870 ) * cancel transpose optimizer for resize * add UT * addressing comments * fix build err	2021-11-29 22:30:16 -08:00
Chi Lo	0baf687f2d	QDQ tool modification part2 (#9720 ) * Add finetuned qdq options * Add description * Add unit tests * Modify for channel axis * Remove too specific feature. Move this implementation to e2e example * Add OpTypesSupportPerChannelQuantization * fix bug for unit test * Keep flags OpTypesSupportPerChannelQuantization and QDQChannelAxis for internal use Will have a follow-up PR to fine tune the code * remove unnecessary warning Co-authored-by: stevenlix <38092805+stevenlix@users.noreply.github.com> Co-authored-by: Yufeng Li <liyufeng1987@gmail.com>	2021-11-29 21:45:41 -08:00
Ryan Lai	57a6f7c205	Various fixes to fix WindowsAI RI build. (#9877 ) * WAI RI fixes * span changes * Spaces * Additional warnings to fix * Fix redundant comment	2021-11-29 21:33:15 -08:00
Yufeng Li	a0afd7303d	add int8_t support for pool operators (#9852 ) * add int8_t support for pool operators	2021-11-29 18:43:43 -08:00
George Wu	ec9b0ed800	[python manylinux package] emit warning if missing CUDA/TensorRT dependency causes ld_preload to fail and user tries to register either CUDA/TensorRT EP (#9872 ) * add warning if ld_preload fails for CUDA or TRT when trying to register either provider * refactor * change wording from register to create	2021-11-29 16:04:23 -08:00
Yulong Wang	27e337ed7a	[js/node] npm audit fix (#9861 )	2021-11-29 13:11:19 -08:00
Ye Wang	bf716e667c	layernorm throw error if input has no data (#9837 )	2021-11-29 12:43:17 -08:00

6089 commits