onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-05-18 21:21:17 +00:00

Author	SHA1	Message	Date
Ashwini Khade	ceb76429db	Merge pull request #12056 from microsoft/bmeswani/merge-training_dev/on_device_poc Merge On-Device-Training Offline Tooling and C/C++ APIs	2022-07-21 15:09:48 -07:00
Xinya Zhang	03dfcb0e87	[ROCm] Enable int8 for MatMulInteger Op (#11776 )	2022-07-21 11:20:48 -07:00
Rachel Guo	496618594f	Update supported ops md for NNAPI/CoreML EP (#12245 ) * update supported ops md * address pr comments * address pr comments * wording	2022-07-21 10:23:08 -07:00
Yi Zhang	007ef42749	Fix: Test coverage is undercounting and profiling errors (#12260 ) add data relocation for onnx_test_runner	2022-07-21 16:19:24 +08:00
Jian Chen	43e1e89453	Update aarch64 building pool to aiinfra-linux-ARM64-CPU-2019 (#12243 ) * Setting new pool for arm64 * Setting defualt pool name * adding DockerInstaller stage * try to install docker from apt-get * change to specific * adding chmod to docker.sock * install dotnet sdk * specic dotnet 3.1.x * add manuall step to install dotnet * typo bass * remove inputs * change dotnet installation dir * skipComponentGovernanceDetection on arm64 linux * variables typo * variables: - name: skipComponentGovernanceDetection value: true * update variables * skipComponentGovernanceDetection set to true * moving varliables * moving the variables again * setting condition on cgd * indentation * indentation again * conditional variable * if * remove cgd * conditionl on cgd * condition * parameters * clean up	2022-07-20 12:08:02 -04:00
mindest	add631410a	[ROCm] Re-enable ReduceL1, L2 and related tests (#12209 ) Re-enable ReduceL1,L2 and related tests	2022-07-20 13:13:02 +08:00
Changming Sun	2cb642927b	Simplify get_docker_image.py (#12166 ) Simplify get_docker_image.py by leverage docker itself remote cache functionality.	2022-07-19 09:53:01 -07:00
Alexey Gladyshev	66978c7ef5	[TVM EP][CI] Added TVMso EP testing into CI (#12188 ) * refactor test for model with undefined shapes * add test for TVMso EP * update build script for TVM EP tests * fix pylint * disable test for Windows * fix black * fix python format * fix pylint * fix python format * replace Path.resolve with os.path.join * fix python path issue Co-authored-by: Valery Chernov <valery.chernov@deelvin.com>	2022-07-19 16:05:28 +02:00
Sean Murray	93229949d4	Fix bug where onnxruntime_USE_NCCL flag would default to ON (#12195 ) Fix bug where onnxruntime_USE_NCCL flag would default to ON, causing ORT to not build properly. New functionality: flag is ON when training is enabled and NCCL is not disabled. Flag is OFF otherwise	2022-07-18 12:13:08 -07:00
leqiao-1	09af4a7fdd	remove wrong placed libs (#12201 )	2022-07-18 09:22:22 -07:00
PeixuanZuo	7b53b223b8	[UPDATE] update AMD CI pipeline to Rocm5.2 with torch1.11 (#12162 ) * [UPDATE] update ci to rocm5.2 + torch1.11 * [Revert] disable ort module test * [DELETE] delete Rocm5.1.1 ci test result * [UPDATE] update the comments	2022-07-14 16:38:16 +08:00
Valery Chernov	3b0aaa9e0e	[TVM EP] support build on Windows (#11851 ) * add description of build ORT+TVM EP on Windows * fix cmake error related to symlink creation on Windows * add llvm config path to build flags for correct build on Windows * update TVM_EP.md for llvm_config build arg * fix warnings skipping during build on Windows * fix using string or wstring for model path to correct build on Windows (MSVC error) * fix error in custom logger for correct build on Windows * implement glob algorithm for Windows * additional build fixes * update TVM with export of VM symbols for dll * description of nasm issue and workaround * update TVM with export of Executable from VM symbols for dll * description of installation of ipp-crypto dependencies on Windows * cmake key for ipp-crypto build * fix wstring for TVMso EP * fix ipp-crypto build * cmake key onnxruntime_TVM_USE_HASH switch off not specific methods, but full hash functionality * fix absolute path to compiled lib * update TVM_EP.md, fix lint warnings * update TVM_EP.md * small fixes after review * switch on handshake functionality for Linux workflow Co-authored-by: Valery Chernov <valery.chernov@deelvin.com> Co-authored-by: KJlaccHoeUM9l <wotpricol@mail.ru>	2022-07-13 10:48:42 +02:00
Edward Chen	6e051016c1	Add Python package to perf test pipeline. (#12135 )	2022-07-12 10:50:24 -07:00
LironKesem	9647a3be40	Add tests for all unary aten ops supported in eager mode (#12087 ) * Add tests for all uniary aten ops supported in eager mode * fixing the PR draft * fixing the merge * changing eval to be at compile time * adding requirements for eager * 1.adding function to {ops}_out 2.cleaning the code and adding comments * editing the code according to code review Co-authored-by: root <root@AHA-LIRONKESE-1>	2022-07-12 08:53:19 -04:00
Carson Swope	c675c4750a	include coreml_provider_factory.h in macos build instead of coreml_ex… (#12138 ) include coreml_provider_factory.h in macos build instead of coreml_execution_provider.h	2022-07-11 18:27:01 -07:00
PeixuanZuo	1c39d22f4e	[ADD] Rocm5.2 for Rocm python packaging pipeline (#12129 ) [ADD] rocm5.2	2022-07-11 11:10:45 +08:00
PeixuanZuo	b50239251d	[FIX] Add required variable for Rocm packaging ci pileine (#12118 ) [fix] packaging ci compiler error [FIX] pipeline variable [Frevert] fix compiler	2022-07-07 11:36:26 -07:00
zhangyaobit	a9b9c7f69f	Add autotuning support to FastGelu (#12093 ) * Add autotuning for FastGelu (Draft). * Clean up. * delete unused header file * Fix lint errors. * Add missing template parameter. * Improvements. * Fix type. * Fix namespace issue.	2022-07-06 23:17:48 -07:00
Hubert Lu	dbcf54aa41	Add hipified SkipLayerNorm code for ROCmEP (#12107 ) * First attempt for half2 vectorized memory access in SkipLayerNorm * Add some functions for debugging * Clean up the code * Clean up the code * Generalize the vectorized kernels with aligned_vector and remove cudaDeviceProp * Add a unit test for a larger input size * Fix some Lint C++ warnings * Use ILP = 4 for the vectorized kernels * Rewrite the vectorized kernel and templatize ComputeSkipLayerNorm * Use conditional operator for input_v * Refactor LaunchSkipLayerNormKernel and replace the original SkipLayerNormKernelSmall with the vectorized kernel * Clean some comments and rename the layernorm function * Use ComputeSkipLayerNorm to replace LaunchSkipLayerNormKernel * Resolve a Lint C++ warning * Fix SkipLayerNormBatch1_Float16_vec output data * Add hipified code of bert SkipLayerNorm for ROCmEP * Resolve some Lint C++ warnings * Resolve some Lint C++ warnings * Resolve some Lint C++ warnings * Resolve Python formatting issue	2022-07-06 22:13:11 -07:00
ytaous	446f899fed	[ROCm] Temp disable AMD UT (#12105 ) temp disable UT Co-authored-by: Ethan Tao <ettao@microsoft.com@orttrainingdev7.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>	2022-07-06 11:08:26 -07:00
Edward Chen	bd76e21fb3	Add pipeline for building perf test binaries. (#12067 ) Add initial pipeline for building perf test binaries. It only builds Android binaries now but can be expanded later.	2022-07-06 09:42:49 -07:00
ytaous	7b8f45dd60	[ROCm] Enable build option for autograd (#11945 ) * add autograd build option * disable UTs * disable UTs * UT-step1 * UT-step1 * UT-step2 * UT-step2 * UT-step2 * UT-step2 * UT-step2 * UT-step2 * Fix UTs * increase shm * code clean up Co-authored-by: Ethan Tao <ettao@microsoft.com@orttrainingdev7.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>	2022-07-05 18:11:29 -07:00
Dwayne Robinson	32a8751dc4	DML EP Update to DML 1.9 (#12090 ) * Update to DML 1.9 * Appease obnoxious Python formatting tool	2022-07-05 16:30:54 -07:00
Scott McKay	bfe1eca10c	Add targets files for new .net6 frameworks (#12016 ) * Add net6 targets. Remove maccatalyst as we don't have a native build targetting that. * Set platform in macos targets * Add targetFramework entries * Move NativeLib.DllName definition and set using preprocessor values for simplicity. Couldn't get it to build with the preprocessor based setup when it was in a separate file. Update the nuspec generation to set platform version for .net6 targets. TODO: Validate versions. I copied them from the managed nuget package the packaging pipeline generated prior to adding targets. Possibly w could/should lower some of the versions. Hopefully the need to specify a version goes away when the release version of VS2022 supports .net6. * Try android 31.1 as https://github.com/actions/virtual-environments/blob/main/images/win/Windows2022-Readme.md suggests that should be available on the CI machines * Fix patch version mismatch Add some extra debug info in case it helps * Debug nuget location in CI * Add workspace entry back in * Add steps * One more attempt with hardcoded nuget.exe path and original android31.0 version * Better fix - found explicit nuget download and updated version there. * flake8 fixes * Fix black complaints. * Exit Microsoft_ML_OnnxRuntime_CheckPrerequisites for net6 iOS. * Removed outdated comment	2022-07-01 09:13:55 -07:00
Baiju Meswani	a457ddc41d	Merge branch 'master' of https://github.com/microsoft/onnxruntime into bmeswani/merge_pr	2022-06-30 21:53:07 +00:00
Wil Brady	fdf12a5c35	Fix windows eager build break by pinning to torch version 1.11.0 (#12033 ) Fix windows and linux eager build to torch 1.11.0.	2022-06-30 07:01:13 -04:00
Yulong Wang	bd973bcf1e	[js/rn] upgrade dependencies for e2e test (#11863 ) * [js/rn] upgrade dependencies for e2e test * use JDK11 only for gradle * expand variable	2022-06-27 14:56:49 -07:00
Scott McKay	f72288b453	Fix a couple of typos (#11943 ) Fix couple of typos	2022-06-27 10:32:14 +10:00
Baiju Meswani	d25cf4df26	Merge branch 'master' into training_dev/on_device_poc	2022-06-24 20:18:19 +00:00
Hubert Lu	f4ba199bad	Optimize FastGelu with float2 and float4 vectorized kernels on ROCm (#11491 ) * Using vectorized loads (float2) for fp16 to improve performance * Fix a few warnings from cpplint * Fix a few warnings from cpplint * Use __float2half2_rn and fix some cpplint warnings * Move some computaions to LaunchFastGeluKernel * Fix some Lint C++ warning * Using vectorized loads (float4) for fp16 to improve performance * Switch whether to optimize FastGelu with float4 vectorization * Switch to float4 memory access based on input_length in FastGelu * Comment how to set the threshold of float2 and float4 vectorized kernels * Add FastGelu fp16 unit tests for bias_length = 2 and 8 * Make vectorized kernels generic with aligned_vector * Unify the vectorized kernels with/without bias * Refactor the code to suppress cpplint warnings * Solve formatting issues * Remove cudaDeviceProp from FastGeluKernel and LaunchFastGeluKernel * Move fast_gelu_impl.h to rocm/bert * Fix some Lint C++ warnings and code alignment	2022-06-24 12:46:17 -07:00
pengwa	c398ad513f	Fix orttraining-linux-ci-pipeline - Symbolic shape infer (#11965 ) fix symbolic shape error due to upgraded numpy + legacy sympy	2022-06-23 08:23:36 -07:00
Baiju Meswani	fac8dae9df	Add support for gradient clipping, AdamWOptimizer and tensorseq as inputs (#11697 )	2022-06-22 10:27:58 -07:00
Gary Miguel	4bf22e2a40	Update ONNX to 1.12 (#11924 ) Follow-ups that need to happen after this and before the next ORT release: * Support SequenceMap with https://github.com/microsoft/onnxruntime/pull/11731 * Support signal ops with https://github.com/microsoft/onnxruntime/pull/11778 Follow-ups that need to happen after this but don't necessarily need to happen before the release: * Implement LayerNormalization kernel for opset version 17: https://github.com/microsoft/onnxruntime/issues/11916 Fixes #11640	2022-06-21 17:19:52 -07:00
Dwayne Robinson	64f95d400a	Update DML 1.9 Nuget package to fix WindowsAI nuget pipeline build issue (#11934 )	2022-06-21 15:55:51 -07:00
Scott McKay	3b1224dc08	Add .net6 support to the C# nuget package. (#11908 ) * Add .net6 support to the C# nuget package. Currently requires jumping through a lot of hoops due to .net 6 only being supported in the preview release of VS 2022. Build existing targets using msbuild. Add .net6 targets and build using dotnet. Create nuget package with combined targets. A few misc automated changes from VS to spacing and adding a couple of properties.	2022-06-22 08:08:24 +10:00
Adrian Lizarraga	b20daeda81	Update Linux Multi GPU TensorRT pipeline to TensorRT 8.4 (#11923 ) * Try manually installing trt8.4 in multi-gpu pipeline * Remove stmts that clean up cmake, ctest. Update tensorrt repository name passed to get_docker_image.py * Update trt and cudnn home * Don't install trtexec cli tool. * Increase job timeout * Revert timeout change and use trt placeholder builder build option	2022-06-21 07:59:11 -07:00
Yi Zhang	7f1e9e8c67	Bash: there should be a whitespace after not operator. (#11910 ) add whitespace after not	2022-06-21 05:14:32 +08:00
sfatimar	f97bd38c4f	UEP 4.1 release (#11834 ) * Add pypi build changes to latest Master * Add ORT training part of OV build * Disabling SqueezeOpTest.BadAxes * Add ONNXruntime branch ARG to Docker build * Changes to include file details versions * Commit File Version Updates * Change naming for linux build * Add fix for pylint format errors * Fix pylint warnings. * Fix pylint errors - stage 2 Signed-off-by: Preetha Veeramalai <preetha.veeramalai@intel.com> * Fix pylint errors - stage 3 * Fix pylint format - stage4 Signed-off-by: Preetha Veeramalai <preetha.veeramalai@intel.com> * Commit for Wheel Release >0.35.1 Co-authored-by: Preetha Veeramalai <preetha.veeramalai@intel.com> Co-authored-by: mayavijx <mayax.vijayan@intel.com> Co-authored-by: Sahar Fatima <sfatima.3001@gmail.com> Co-authored-by: nmaajidk <n.maajid.khan@intel.com>	2022-06-17 14:49:04 -07:00
Yi Zhang	f70201c801	Make sure the command works in both centos and ubuntu. (#11894 ) make one bash condition compatible with POSIX	2022-06-17 12:19:22 -07:00
Adrian Lizarraga	ad4abbd75e	[EP-Perf-Dashboard] Add support for TensorRT 8.4 to EP Perf Dashboard (#11876 ) Co-authored-by: George Wu <jywu@microsoft.com>	2022-06-17 09:16:51 -07:00
Yi Zhang	8bb0062873	add manylinux_2_27 CPU wheel (#11886 ) * add manylinux_2_27 * minor refactory * change base image * minor refactor * add tests * fix condition	2022-06-17 19:38:38 +08:00
Changming Sun	10478a09ca	Revert "add manylinux_2_27 wheel (#11832 )" This reverts commit `bbace23d0c`.	2022-06-16 18:28:12 -07:00
Dwayne Robinson	3d99f16e98	Merge pull request #11827 from microsoft/user/dwayner/DmlEp1.9 Integrate WindowsAI feature branch with DML EP features and DML 1.9	2022-06-16 13:04:00 -07:00
George Wu	df5ee6aa4e	[TensorRT EP] support TensorRT 8.4 (#11866 ) * update trt 8.4ga * trt 8.4 linux ci pipeline * fix cmake * placeholder_builder * trt 8.4 windows pipeline * gpu package pipeline * trt 8.4.1.5 , packaging pipeline updates * python packaging * ctest timeout * python packaging test * bump timeout * python format * format * revert * newline * enable trt python tests * typo * python format * disable on windows	2022-06-16 07:46:40 -07:00
Dwayne Robinson	babd6e3fcd	Update DirectML preview package with unmangled names	2022-06-15 18:16:58 -07:00
Scott McKay	d64f23fec0	EP factory creation cleanup and enhancements. (#11798 ) * Rework the EP factory creation setup so we're not cut-and-pasting function declarations in multiple places. Convert append EP for SNPE to be generic, and also use for XNNPACK. Add XNNPACK to C# API * Don't need stub for MIGraphX as it's using provider bridge. * Remove old 'create' functions that aren't applicable now that the EPs are built as separate libraries. * Only use EPs that require the layout transform if the opset is supported by the layout transformer. * Update wasm registration of xnnpack.	2022-06-16 07:01:41 +10:00
Yi Zhang	bbace23d0c	add manylinux_2_27 wheel (#11832 ) * add manylinux_2_27	2022-06-15 10:26:51 +08:00
Changming Sun	51ed27cf22	Delete win-gpu-cuda-10-2-pipeline.yml (#11847 )	2022-06-14 18:34:56 -07:00
Adrian Lizarraga	aef53e2b0d	Support uploading EP perf data to a configurable database. (#11819 )	2022-06-13 14:06:50 -07:00
Changming Sun	a93ebd2503	Move tvm pipeline to Github Actions (#11721 )	2022-06-13 11:38:44 -07:00

1 2 3 4 5 ...

1538 commits