onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-05-16 21:00:14 +00:00

Author	SHA1	Message	Date
Yulong Wang	bf4c3fa3d6	[node.js binding] aggregate binaries for multiple platforms in single NPM package (#9501 )	2021-10-25 20:16:10 -07:00
marcusfreisleben	651955d3c9	CUDA: Enable parallel compilation (#8974 ) * Pass on parallel option to nvcc * Fixed build.py * Added missing string conversion * Adressed review points	2021-10-25 16:42:58 -07:00
Changming Sun	d83adaaf9f	Remove optional-lite (#9424 )	2021-10-22 16:45:45 -07:00
Jeff Daily	ca7116ca3e	CUDA EP's ResizeImpl now uses functors, hipify for ROCm EP (#9466 ) Support for device function pointers is not yet available for ROCm. Instead, the device function pointers were converted to device functors. Case statements, lambdas, and macros are used for dispatch; as a result, all combinations of kernels are compiled with inlined functors. The basis of this approach can be found in PyTorch. Lastly, hipify and register Resize and Upsample for ROCm EP.	2021-10-21 15:02:41 -07:00
Jeff Daily	66ceb6926d	rehipify ROCm EP files under orttraining (#9443 ) * rehipify rocm ep files under orttraining committed to source control * fix flake8 error	2021-10-21 13:36:21 -07:00
Jeff Daily	89a22fb641	Add TopK to ROCm EP (#9391 ) * Add TopK to ROCm EP * flake8 fix	2021-10-20 10:39:44 -07:00
Jeff Daily	f8acc6d0e8	Add NonMaxSuppression and RoiAlign to ROCm EP (#9394 )	2021-10-20 10:38:45 -07:00
Jeff Daily	c33391329a	Add QuantizeLinear and DequantizeLinear to ROCm EP (#9401 )	2021-10-20 10:37:58 -07:00
Changming Sun	406f1629c1	Remove Featurizers code (#9300 )	2021-10-20 10:20:35 -07:00
Jeff Daily	52c53e396d	hipify tensor/gather_nd_impl.cu (#9392 )	2021-10-19 14:15:49 -07:00
Jeff Daily	a2ba923ac7	hipify fast_divmod.h (#9400 )	2021-10-19 12:34:46 -07:00
Jeff Daily	a8e2e8d76a	hipify tensor/transpose.cc and tensor/transpose.h (#9397 )	2021-10-19 12:27:36 -07:00
Yulong Wang	5b65f1cb44	fixes SDL Native Rules warning in Node.js binding CI (#9402 )	2021-10-18 13:05:46 -07:00
Changming Sun	6ecb990fae	Update win-ci-pipeline.yml	2021-10-18 10:43:19 -07:00
Jeff Daily	c8789d3047	[ROCm] static re-hipify of CUDA EP to ROCm EP, now a shared provider (#8877 ) * re-hipify all rocm EP sources * fix all other files affected by re-hipify * add cuda_provider_factory.h to amd_hipify.py * do not use cudnn_conv_algo_search in ROCm EP, missing reduce min registration * Fix ReduceConsts template specialization introduced in #9101. Fixes the error when building for ROCm 4.3.1: error: too many template headers for onnxruntime::rocm::ReduceConsts<__half>::One (should be 0) * fix flake8 error in amd_hipify.py * speed up hipify with concurrent.futures * flake8 fix in amd_hipify.py	2021-10-14 15:15:51 -07:00
Abhishek Jindal	87e726d1a0	Abjindal/merge eager with external custom ops (#8986 ) * switching to pytorch nightly build * adding eager mode * enable pybind and remove install step * removing auditwheel repair process * installing package * adding auditwheel back * disabling auditwheel repair for eager mode * typo correction	2021-10-14 13:19:45 -07:00
Abhishek Jindal	23700a15a0	Abjindal/eager windows build (#9326 ) * removing warnings which are causing errors from torch and changing flags for Windows * adding MKL library resolution and comments * cleaning up the code * fixing onnxruntime_python file for windows build * fix the include order to aovid the python_d.lib issue on win debug build * changes for warnings, typos and other comments * merge conflict * adding fix for mkl library error * Revert "adding fix for mkl library error" This reverts commit `73b87c73c2`. * fix for dll path for windows * typo for dll path Co-authored-by: Cheng Tang <chenta@microsoft.com>	2021-10-14 12:54:49 -07:00
Jeff Daily	3e879aab6b	work around ucx in rocm ci Dockerfile (#9360 )	2021-10-14 09:49:31 -07:00
Moshe David	510b747821	w (#9319 ) Co-authored-by: modav <modav@microsoft.com>	2021-10-12 16:02:40 -07:00
Sunghoon	2f1204a5d5	[js/web] Enable wasm profiling and preserve function names in profiling (#9314 ) * add p50 in test * allow WebAssembly profiling and preserve function names Co-authored-by: Yulong Wang <yulongw@microsoft.com>	2021-10-11 22:04:50 -07:00
Suffian Khan	70cf61fa84	disable bart-l for now (#9305 )	2021-10-07 16:55:54 -07:00
Maajid khan	72c4cea9e6	[OpenVINO-EP] V3.2 Release (#9232 ) * model caching changes for 2021.4 Signed-off-by: Your Name <you@example.com> * changed the ov version check * Minor changes added Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Added support for external data format Starting from OpenVINO 2021.4 version, OpenVINO-EP will support onnx models with Weights saved in external file location. Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Introduced Hetero/Multi options for perf_test Enabled to use HETERO/MULTI device feature from OpenVINO-EP using the onnxruntime_perf_test tool. Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * cleaned up CMake code for older OV version support OV 2020.3 is now longer supported by OpenVINO-EP. This check is not required now. Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Add option to disable graph partitioning Added a option to diable graph partitioning during build time for OpenVINO-EP. with this option, when the model is not fully supported on OpenVINO-EP, the model fully fall backs to default CPU EP (MLAS). Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Changed the flag for diabling graph partitioning Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Fixes the flake8 check error Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Added changes for disable graph partition option Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Fixed flake8 indentation error Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> Co-authored-by: Your Name <you@example.com>	2021-10-07 16:02:19 -07:00
Changming Sun	4f4875b0e8	Add "workspace: clean: all" to anybuild build yaml file	2021-10-06 22:49:37 -07:00
Suffian Khan	510b58c877	Increase AMD CI pipeline timeout to 120 min (#9280 ) * increase timeout * add timeout * add timeout * rename	2021-10-06 10:43:09 -07:00
Changming Sun	334980e016	Delete nocontribops pipelines	2021-10-06 10:30:32 -07:00
Sheil Kumar	8f6fd014e4	Force Windows AI NuGet pipeline to use Windows SDK 19041 (#9255 ) * Force Windows AI Nuget pipeline to use 19041 Windows SDK as 22000 casues a downlevel regression by importing LoadLibraryW * move into quotes Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2021-10-01 21:46:14 -07:00
Yulong Wang	448325b254	[js/web] name ort web for consistency (#9240 )	2021-09-30 22:53:26 -07:00
Yulong Wang	8c57d51928	support WebAssembly SIMD for qgemm (#9191 ) * support WebAssembly SIMD for qgemm * remove '--experimental-wasm-bulk-memory' for test	2021-09-30 12:40:56 -07:00
Changming Sun	e1b84eefcc	Revert "Revert "linux trt package pipeline (#7537 )"" This reverts commit `b606005858`.	2021-09-30 11:39:23 -07:00
Edward Chen	5326397a6a	[iOS] Facilitate usage of pods with custom builds (#9216 ) Refactor iOS framework build/pod package creation into a separate script that can be used with custom builds. Add documentation.	2021-09-30 08:44:00 -07:00
Changming Sun	b606005858	Revert "linux trt package pipeline (#7537 )" This reverts commit `faea7a222d`.	2021-09-28 19:09:04 -07:00
RandySheriffH	058108bef9	Execution Provider Profiler (#8406 ) * implement cuda provider * define profiler common * call start after register * add memcpy event * add cuda correlation * format code * add cupti to test path * switch to CUpti_ActivityKernel3 * reset cupti path * fix test case * fix trt pipeline * add namespace * format code * exclude training from testing * remove mutex	2021-09-28 13:59:52 -07:00
Suffian Khan	6f580f07de	Switch AMD CI pipeline to use environment image from onnxruntimecibuildenvironment (#9206 ) * shift docker image reference for amd ci pipeline * fix service endpoint * reduce perf tolerance	2021-09-28 13:06:16 -07:00
Changming Sun	1104e8d3e5	Linux Anybuild build pipeline (#9091 )	2021-09-28 11:22:27 -07:00
Sheil Kumar	f16bb37fae	make uwp store apps link to statically-linked crt desktop builds (#9182 ) Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2021-09-27 15:37:15 -07:00
baijumeswani	c30cc9190a	Change the agent pool for orttraining-distributed pipeline (#9179 )	2021-09-26 21:26:44 -07:00
Guoyu Wang	ddafe50199	Fix Android build break after Virtual Environment update to 20210919 (#9163 )	2021-09-23 10:07:18 -07:00
Suffian Khan	47888392ab	Fix nightly CI pipeline to generate ROCm 4.2 wheels and add ROCm 4.3.1 wheels (#9101 ) * make work for both rocm 4.2 and rocm 4.3.1 * fix rocm 4.3.1 docker image reference * fix CUDA_VERSION to ROCM_VERSION * fix ReduceConsts conflict def * add ifdef to miopen_common.h as well * trailing ws	2021-09-19 23:36:03 -07:00
Yulong Wang	0183492931	simplify Web CI pipeline (#9094 )	2021-09-17 13:00:07 -07:00
Guoyu Wang	438175cb34	Build shared host protoc in iOS CI pipelines (#9087 ) * iOS build, share host protoc build * Change android ci * ios packaging ci * checkout submodule * revert * update package pipeline * minor update	2021-09-16 17:16:50 -07:00
Maajid khan	7fc28cd539	[OpenVINO-EP] UEP v3.1 Release with OpenVINO 2021.4.1 (#9081 ) * 2021.4.1 Docker and ci changes * OV version change * Removing Imagescaler op from the op's list Reverting this change which was added in last PR. Imagescaler is now deprecated. so removing it from the supported list. Also this op is causing regression in the performance of the FP16 models. Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Re-writing the help message for num_of_threads Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> Co-authored-by: Aravind Gunda <aravindx.gunda@intel.com>	2021-09-16 17:09:07 -07:00
Yulong Wang	498461485f	optimize WebAssembly CI pipeline (#9058 )	2021-09-16 01:07:36 -07:00
Guoyu Wang	bee5c26580	Add CPU_ONLY runtime option to NNAPI EP (#9066 ) * Add NNAPI cpu only option * update java * Update comments	2021-09-15 15:50:18 -07:00
Suffian Khan	e758870b18	Upgrade ROCm CI pipeline for ROCm 4.3.1 and permit run inside container (#9070 ) * try to run inside 4.3.1 container * no \ in container run command * remove networking options * try with adding video render groups * add job to build docker image * try without 1st stage * change alpha, beta to float * try adding service connection * retain huggingface directory * static video and render gid * use runtime expression for variables * install torch-ort * pin sacrebleu==1.5.1 * update curves for rocm 4.3.1 * try again * disable determinism and only check tail of loss curve and with a much larger threshold of 0.05 * disable RoBERTa due to high run variablity on ROCm 4.3.1 * put reduction unit tests back in	2021-09-15 12:32:02 -07:00
Changming Sun	4930320647	Delete linux-pytorch-custom-ops-ci-pipeline.yml (#9023 )	2021-09-14 21:51:21 +00:00
Suffian Khan	4322f7e647	Fix ROCm wheels CI pipeline break by installing latest protobuf from source (#9047 ) * install protobuf from source * fix rm command in Dockerfile * fix options on rm command * fix cd into protobuf source directory * try again * remove strip step * debug list the files * ls on /usr * more debug * more debug * adjust LD_LIBRARY_PATH * try remove protobuf before ORT build	2021-09-14 12:07:00 -07:00
Guoyu Wang	cf70635d2a	Add Android executable drop in the Package pipeline (#9050 ) * add copy executable for android job * minor fix * Variable fix * Move to use tgz because zip is not part of the docker image * update compression	2021-09-14 11:45:33 -07:00
Yulong Wang	be80698698	[js/web] a bugfix and add tests for wasm proxy worker (#9048 ) * [js/web] add tests for wasm proxy worker * fix script src override	2021-09-14 10:38:58 -07:00
Guoyu Wang	c709380c52	Add full iOS job in package pipeline (#9036 ) * Add full ios xcframework job * create zip file of the xcframework	2021-09-13 15:54:11 -07:00
baijumeswani	1422a9ba6b	Remove previous temporary fixes and address TODOs (#9020 )	2021-09-13 10:10:07 -07:00

1 2 3 4 5 ...

1250 commits