onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-06-29 03:30:52 +00:00

Author	SHA1	Message	Date
Tiago Koji Castro Shibata	13ff79d1e4	Merged PR 6839685: Merge public onnxruntime Related work items: #37239556	2022-01-14 18:52:23 +00:00
Vincent Wang	44e2db9397	CUDA BFloat16 Refactor (#10085 )	2022-01-14 19:38:56 +08:00
Xavier Dupré	e38e51ea8e	Improve iobinding, faster name search (#10005 ) * Improve iobinding, faster name search	2022-01-14 12:18:18 +01:00
Vincent Wang	3ea7fb0f9f	fix mem leak (#10272 )	2022-01-14 14:54:19 +08:00
dependabot[bot]	2a55bc2c21	Bump engine.io from 4.1.1 to 4.1.2 in /js/web Bumps [engine.io](https://github.com/socketio/engine.io) from 4.1.1 to 4.1.2. - [Release notes](https://github.com/socketio/engine.io/releases) - [Changelog](https://github.com/socketio/engine.io/blob/4.1.2/CHANGELOG.md) - [Commits](https://github.com/socketio/engine.io/compare/4.1.1...4.1.2) --- updated-dependencies: - dependency-name: engine.io dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com>	2022-01-13 18:26:02 -08:00
Baiju Meswani	2affd6e71e	orttraining packaging and ci pipelines to use cuda 11.3 (#10252 )	2022-01-13 13:36:33 -08:00
Tiago Koji Castro Shibata	6cb18fc686	Merge remote-tracking branch 'upstream/master' into dmldev_temp	2022-01-13 10:36:07 -08:00
dependabot[bot]	4b205eb2b3	Bump follow-redirects from 1.13.3 to 1.14.7 in /js/web (#10266 ) Bumps [follow-redirects](https://github.com/follow-redirects/follow-redirects) from 1.13.3 to 1.14.7. - [Release notes](https://github.com/follow-redirects/follow-redirects/releases) - [Commits](https://github.com/follow-redirects/follow-redirects/compare/v1.13.3...v1.14.7) --- updated-dependencies: - dependency-name: follow-redirects dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2022-01-13 09:05:22 -08:00
dependabot[bot]	943a1aa2d6	Bump follow-redirects from 1.14.5 to 1.14.7 in /js/node (#10265 ) Bumps [follow-redirects](https://github.com/follow-redirects/follow-redirects) from 1.14.5 to 1.14.7. - [Release notes](https://github.com/follow-redirects/follow-redirects/releases) - [Commits](https://github.com/follow-redirects/follow-redirects/compare/v1.14.5...v1.14.7) --- updated-dependencies: - dependency-name: follow-redirects dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2022-01-13 09:03:41 -08:00
Edward Chen	d43ef67d2b	Move binary size check to separate pipeline (#10254 ) Move binary size check(s) to a separate pipeline. In the future, other binary size-related builds can go here. Add publishing of build artifacts for easier analysis. Add optional build with debug info.	2022-01-12 19:21:20 -08:00
dependabot[bot]	3d9d8e20cc	Bump numpy from 1.19.2 to 1.21.0 in /tools/ci_build Bumps [numpy](https://github.com/numpy/numpy) from 1.19.2 to 1.21.0. - [Release notes](https://github.com/numpy/numpy/releases) - [Changelog](https://github.com/numpy/numpy/blob/main/doc/HOWTO_RELEASE.rst.txt) - [Commits](https://github.com/numpy/numpy/compare/v1.19.2...v1.21.0) --- updated-dependencies: - dependency-name: numpy dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com>	2022-01-12 17:45:35 -08:00
Ryan Lai	8aad46c09b	Merged PR 6836425: Merge RI without squash Related work items: #37712737	2022-01-13 01:24:21 +00:00
Ryan Lai	a44d24d8b5	Merge remote-tracking branch 'origin/dmldev' into user/rylai/ri_01_12_22	2022-01-12 16:26:27 -08:00
Ryan Lai	c07e251cec	Merged PR 6835169: RI 12/9/21 - 01/12/22 Build is green https://microsoft.visualstudio.com/WindowsAI/_build/results?buildId=43713985&view=results ![image.png](https://microsoft.visualstudio.com/274e76ac-6b29-4f77-a85d-7914c77cabd5/_apis/git/repositories/853d2ddc-663c-4fe8-8036-dbf0d50db2d9/pullRequests/6835169/attachments/image.png) Related work items: #37712737	2022-01-13 00:25:51 +00:00
Yi-Hong Lyu	499f1d5fd7	Quantization of Argmax (#10213 ) This patch includes: * int8/uint8 support for Argmax * Quantization tool support for Argmax	2022-01-12 14:12:56 -08:00
Ryan Lai	6f7389bc03	Merge commit '4af232df0ce89f0f0227ca8bd10ed8808d4398e5' into HEAD	2022-01-12 11:48:43 -08:00
Tiago Koji Castro Shibata	98f85ae05b	Bump winrt version (#10243 )	2022-01-12 10:52:27 -08:00
ashari4	aff96ce081	remove hardcoded type (#10251 )	2022-01-12 10:00:34 -08:00
CarlPoirier	4af232df0c	Fix props file overwriting AdditionalIncludeDirectories (#10124 ) Co-authored-by: Carl Poirier <carl.poirier@vab-solutions.com>	2022-01-11 23:30:40 -08:00
Rachel Guo	a099bd454b	[QDQ] Add shared qdq selectors (#10178 ) * wip * wip * wip * wip * wip * save * minor changes * update test graph name * address pr comments * update * address pr comments * address pr comments * fix warning * minor include fix * update to nodegroupselectors * delete unnecessary includes Co-authored-by: rachguo <rachguo@rachguos-Mini.attlocal.net>	2022-01-11 19:41:45 -08:00
RandySheriffH	79d2a0d185	Dynamic cost model to mitigate high E2E perf variance (#9833 ) * commit dyamic block size * summarize granularity * add configure * add test case * call std stoi * add comments * fix typo * rename var * update comment * reset default * better comments * extend LoopCounter for dynamic blocking * fix comments and add more UT * update comments * swtich type to std::ptrdiff_t * format code with better indention * cast ptrdiff_t * fix typo	2022-01-11 17:26:41 -08:00
Chen Fu	fb4dea39e2	Tolerate cpuinfo init failure (#10199 ) Tolerate pytorch cpuinfo library init failure.	2022-01-11 16:31:56 -08:00
Olivia Jain	4048ed326c	Update EP Perf Pipeline (#10149 ) * migrate to 1ES Hosted Pool * migrate to Kusto database * refactor and organize ep names with ORT prefix * standardize TRT benchmarking with save/load engine, input binding, and workspace * Add TRT 8.2 to ep perf pipeline * update model_list.json with full onnx zoo * add anubis credentials * add anubis credentials * clarify trt variables * get system info from docker image * remove unwanted commenting	2022-01-11 16:12:32 -08:00
Yufeng Li	cb9b0275b6	Set default quantization weight type to int8 (#10209 ) * Set default quantization weight type to int8	2022-01-11 14:24:57 -08:00
Shucai Xiao	ce103ace93	Amdmigraphx fix build error (#9272 ) * fix build error * rename a missing api for the MIGraphX EP	2022-01-10 15:18:43 -08:00
Chi Lo	be9cc40aa5	Make some parameters configurable for calibration (#10204 )	2022-01-10 13:00:58 -08:00
Jeff Bloomfield	32ee379f50	GraphPartitioner.cpp: fixed Merge function to handle duplicated partitions. (#9929 ) Co-authored-by: Achira <achiras@microsoft.com>	2022-01-10 12:01:08 -08:00
Jingqiao Fu	5cd57bb726	add load from buffer (#10162 ) * Add LoadFromBuffer API	2022-01-10 10:51:48 -08:00
Edward Chen	edd1a2cf61	Add more Java test logging. (#10221 )	2022-01-10 09:42:46 -08:00
Dwayne Robinson	0f5e82c294	DirectML EP remove stale code for int64 via int32 double strides (#9959 )	2022-01-10 02:07:22 -08:00
Dwayne Robinson	1f5b073508	Minor DirectML EP provider factory comments (#9965 )	2022-01-10 02:06:31 -08:00
PeixuanZuo	7d93498e0e	[FIX] register softmaxgrad_13/logsoftmaxgrad_13 for rocm (#10177 ) * [FIX] register softmaxgrad_13/logsoftmaxgrad_13 for rocm * [FIX] update softmaxgrad_13/logsoftmaxgrad_13 implementation for rocm	2022-01-10 11:33:46 +08:00
Scott McKay	6e88c11cae	Refactor QDQ node group selection infrastructure (#10195 ) * Separate out the QDQ node group selection from the SAT specific NodeSelector to make re-use in NNAPI etc. cleaner. * Make MatMulIntegerToFloat matching optional. Add move ctor to BaseSelector. Required now that it has a unique_ptr member. * Avoid Guardian warning by using rvalue unique_ptr created with make_unique	2022-01-10 10:57:50 +10:00
Nat Kershaw (MSFT)	d52d3c0052	Update C/C++ API docs automation to create a PR (instead of push to publish branch) (#10093 )	2022-01-07 16:16:47 -08:00
Ye Wang	5ebb857501	Update onnxruntime_unittests.cmake (#10215 )	2022-01-07 16:14:15 -08:00
vade	bacae967a2	Update Cuda to 11.4.2, update architectures, support Ubuntu 20.04 (#10169 )	2022-01-07 13:00:44 -08:00
Zhang Lei	2bbf1ac1e0	Using better words. (#10210 )	2022-01-07 09:17:23 -08:00
Jeff Daily	e7efcc93fe	[ROCm] update hipify-perl location (#10102 ) * [ROCm] update hipify-perl location Depending on the ROCm version installed, hipify-perl might not always live in the hard-coded path of /opt/rocm/bin. Use python 3.3's shutil.which to locate the script. * provide alternative locations for hipify-perl if not in PATH * implement hipify-perl search as a function This avoids running the logic during module import since all builds import the amd_hipify module. * fix flake8 errors	2022-01-06 17:21:02 -08:00
Abhishek Jindal	4ac3277743	adding definition of concat operator for mapping it to onnx (#10062 ) * adding definition of concat operator for mapping it to onnx * adding the opgen generator file to include tensorlist type for eager mode	2022-01-06 14:56:35 -08:00
Chris Hua	cab4579b83	remove six references (#9941 ) Python 2 compatibility is no longer necessary and helps unblock upgrades to mypy and others.	2022-01-06 13:52:20 -08:00
Hariharan Seshadri	0552a47ec2	Enable CUDA provider option configuration for C# (#10188 )	2022-01-06 11:03:14 -08:00
Ye Wang	08f512b25e	Fix a Win GPU reduced ops pipeline (#10202 )	2022-01-06 09:46:34 -08:00
ashari4	4ab891999a	fix hardcoded type (#10205 )	2022-01-06 09:28:22 -08:00
ashari4	7b5464ed7b	aten add_ op supports bf16 (#10084 ) * hand implemented add_	2022-01-05 09:33:28 -08:00
Edward Chen	34c025109c	Exclude graph_runtime_optimization_test.cc from reduced ops build. (#10191 )	2022-01-05 09:22:38 -08:00
Ye Wang	2803a9465d	Add example of registering custom cuda op as shared lib (#10025 )	2022-01-05 09:22:15 -08:00
yz	2078210a1c	Improve logging for symbolic shape inference	2022-01-04 13:17:07 -08:00
Edward Chen	792db33f01	Enable loading of ORT format model graph runtime optimizations (#9901 ) Initial implementation of load/replay of runtime optimizations in an ORT format model.	2022-01-04 12:09:07 -08:00
Tang, Cheng	97659495d9	fix aten view op (#10050 ) * fix aten view op * add test case * fix signature * fix the build Co-authored-by: Cheng Tang <chenta@microsoft.com@orttrainingdev9.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>	2022-01-04 08:29:30 -08:00
George Wu	91f85dfdad	update Dockerfile.manylinux2014_cuda11_4_tensorrt8_2 to TensorRT 8.2.2.1 (#10167 )	2022-01-03 20:38:37 -08:00

7863 commits