onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-14 18:12:05 +00:00

Author	SHA1	Message	Date
Changming Sun	6a45f9f059	Pin protobuf version to 3.18.1 (#11645 )	2022-05-26 21:14:56 -07:00
microsoft-github-policy-service[bot]	006597b9b8	Microsoft mandatory file (#11619 ) Co-authored-by: microsoft-github-policy-service[bot] <77245923+microsoft-github-policy-service[bot]@users.noreply.github.com>	2022-05-25 13:56:10 -07:00
Yulong Wang	f0dff6bb74	[js/rn] add expo config plugin support (#11556 ) * [js/rn] add expo config plugin support * resolve comments	2022-05-25 11:55:35 -07:00
Ryan Hill	d03d7afef8	Fix build errors when building with enable_memory_profile (#11617 )	2022-05-25 10:08:33 -07:00
Hariharan Seshadri	6e65bac5c2	Memory usage optimization in LongFormer Attention (#11611 )	2022-05-25 10:07:41 -07:00
Adrian Lizarraga	883e4bc341	Update the 'Linux-GPU-EP-Perf' pipeline to build ORT from source by default. (#11610 )	2022-05-25 09:29:49 -07:00
Thiago Crepaldi	427230431a	Fix torch cpp ext build when CPU wheel is installed but GPU card is present (#11608 ) * Fix torch cpp ext build when CPU wheel is installed but GPU card is present Also there is a minor improvement for ATen operator that allows both "::op" and "aten::op" name for operators * Fix flake8 false positive	2022-05-25 09:44:26 -04:00
George Nash	147a1737f9	MatMul postop fusion for dnnl ep (#11565 ) This includes a series of unit tests that exercise the MatMul fusion. This is not an exhaustive list of tests. The tests focus on patterns seen in models, with additional tests to cover at least one instance of each operator type that can be part of the fusion. Signed-off-by: George Nash <george.nash@intel.com>	2022-05-24 22:19:38 -07:00
Yulong Wang	4e9ad7b6ae	Update .flake8 to exclude .git directory (#11615 )	2022-05-24 19:43:02 -07:00
Gary Miguel	e3a2d5cca8	Add additional python requirements (#11522 ) These are used by some of the python code in the package, e.g., `0292356bd7/onnxruntime/python/tools/transformers/optimizer.py (L25)` `c8270c2940/onnxruntime/python/tools/symbolic_shape_infer.py (L10)` `0292356bd7/onnxruntime/python/tools/transformers/torch_onnx_export_helper.py (L9)`	2022-05-20 16:16:18 -07:00
Yulong Wang	69aaf03345	allow catch all exceptions (#11498 )	2022-05-20 03:35:47 -07:00
PeixuanZuo	a67994316a	Update rocm ci to ROCm5.1.1 + torch1.10.0 * [UPDATE] update amd ci pipeline 2 rocm5.1.1 * [FIX] json format error * [ERROR] disable unit tests * [FIX] ucx error * [FIX] cmake version * [FIX] units test	2022-05-20 11:07:21 +08:00
Tang, Cheng	abecb56832	fix build break (#11492 ) Co-authored-by: Cheng Tang <chenta@microsoft.com@orttrainingdev9.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>	2022-05-19 16:10:45 -07:00
Vincent Wang	436c4f9b79	Add BFloat16 (bf16) support for ATen (#11546 ) Co-authored-by: Vincent Wang <weicwang@microsoft.com>	2022-05-19 10:04:08 -04:00
Adrian Lizarraga	e45197fa8c	[trt-ep-perf] Fix upload time of EP perf data (#11531 ) Fix the post.py script to use the actual "upload time" in ISO format instead of the day/month/year of the commit date.	2022-05-18 15:36:21 -07:00
Valery Chernov	8092d9f9a2	[TVM EP] Support inference by shared library created by TVM (#11389 ) * add so_folder option to TVM EP options. add TvmSoEP class and update TVM EP factory * compilation from so_folder was implemented * update TVMCompiler for default pipeline and compilation from shared lib * filter excess so-file in so_folder * clean Compile method and vm conditions * implementation of TVMSoCompile on native side instead of python API * cpplint fixes * some fixes after review * more cpplint fixes * more fixes after review * align TVMso EP with new API for compilation from #10632 * small fixes for cpplint Co-authored-by: Valery Chernov <valery.chernov@deelvin.com>	2022-05-18 14:50:54 +02:00
Adrian Lizarraga	48efeca66c	[trt-ep-perf] Fix bug that suppresses latency gain reporting (#11321 ) Fix bug that prevents EP perf script from reporting latency gain for TensorRT/CUDA	2022-05-17 14:00:52 -07:00
Edward Chen	782f9e394d	[CoreML EP] Fix condition in PRelu op supported check. (#11543 )	2022-05-17 09:03:24 -07:00
Ryan Hill	deef214772	Update gather to use multiple threads (#11524 )	2022-05-16 19:31:14 -07:00
Edward Chen	5eaa893936	[CoreML EP] Add support for PRelu (#11474 )	2022-05-16 16:30:09 -07:00
Justin Chu	d9c9adb78b	Add python static type checking in CI checks (#11518 ) - Enable pyright and pylint (https://github.com/microsoft/pyright) in CI - Enable pyright, pylint and bandit by default in VS code Pylint has some good style checks. pyright is Microsoft's static type checker.	2022-05-16 13:26:56 -07:00
PeixuanZuo	c556f5f22f	Add AMD python package ROCm5.1.1+torch1.11 (#11516 ) * [FIX] fix name error * [ADD] add rocm5.1.1 python package * [ADD] torch1.10.0 rocm requirements * [UPDATE] update docker Repository name	2022-05-16 08:14:11 +08:00
Sheil Kumar	6255194659	All LearningModelSessions created from a common LearningModelDevice should share the same thread pool (#11457 ) * Share thread pools between devices * make tests reuse device * Change cpu thread pool options for dml sessions to use 1 thread with no spinning * fix test failure * Update missing type constraints for dft * Add comment and rename inference session parameter * default missing causing inconsistent test behavior Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2022-05-13 11:12:43 -07:00
Yi Zhang	5709ed2e16	Fix shellcheck warning (#11489 ) * fix shellcheck warning * Update java_linux_final_test.sh	2022-05-13 15:36:59 +08:00
RajalakshmiSR	b14c1fd479	POWER: Optimize MlasQLinearAddKernelHelper() (#11454 ) This patch uses vector intrinsics to optimize MlasQLinearAddKernelHelper function for POWER processor. Co-authored-by: Rajalakshmi Srinivasaraghavan <rajis@linux.ibm.com>	2022-05-12 18:38:45 -07:00
George Wu	09590f013a	fix windows ci debug build break (#11495 ) * update msc version check * update comment * typo * whitespace	2022-05-12 16:54:00 -07:00
Rachel Guo	4aef7e3aab	[CoreML EP] Add DepthToSpace op support (#11468 ) * initial impl of depthtospace coreml support * fix build * address pr comments * minor update * minor pr comments Co-authored-by: rachguo <rachguo@rachguos-Mini.attlocal.net> Co-authored-by: rachguo <rachguo@rachguos-Mac-mini.local>	2022-05-12 13:48:51 -07:00
Yi Zhang	a3f05da338	Revert "[TVM EP] update set input to remove excess copying inside TVM (#11247 )" (#11504 ) This reverts commit `5ae461ec0a`.	2022-05-13 02:27:36 +08:00
Tianlei Wu	ece1274ffa	revert safeint version (#11500 )	2022-05-12 11:24:43 -07:00
Justin Chu	f94b25933a	ci(cpplint): Ignore runtime/references warnings (#11499 ) Allow non-const references `6f85d3e5c8/docs/Coding_Conventions_and_Standards.md (L11-L12)`	2022-05-12 07:51:45 -07:00
Justin Chu	6f85d3e5c8	fix(onnx_export): Extract arg value from torch Value (#11471 ) Description: Extract arg value from torch Value Motivation and Context Input to gelu is `torch._C.Value` type values. This caused the `if approximate == "none"` check to always fail, preventing the optimized `com.microsoft::Gelu` op from being used.	2022-05-11 11:36:43 -07:00
Tianlei Wu	f5473596fa	Change longformer default kernel (#11470 ) * change default to compact memory kernel * Remove a cuda stream synchronize that is not needed * Update longformer benchmark tool	2022-05-11 10:54:59 -07:00
Changming Sun	48ae27d578	Update protobuf-java to 3.20.1 (#10420 )	2022-05-11 07:52:12 -07:00
Changming Sun	207ad7eef9	Remove spdlog from cgmanifest.json	2022-05-10 22:02:21 -07:00
Changming Sun	027fc1d391	Completely delete ORT server	2022-05-10 22:02:21 -07:00
Changming Sun	903743e823	Delete unused TRT docker files (#11486 ) * Delete unused TRT docker files * revert tools/ci_build/github/linux/docker/Dockerfile.manylinux2014_cuda11_4_tensorrt8_0	2022-05-10 22:00:53 -07:00
Changming Sun	0ac2e6e546	Update install-entrypoint.sh: add version lock for NCCL (#11475 )	2022-05-10 15:37:55 -07:00
Yulong Wang	3437967e63	[js/rn] fix CI packaging for react native E2E test (#11463 ) * [js/rn] fix ORTRN packaging in CI * fix env var setting	2022-05-09 18:09:52 -07:00
Edward Chen	738d9b153c	Consolidate several types into onnxruntime::ArgType. (#11430 )	2022-05-09 14:44:28 -07:00
Rachel Guo	288892335e	[NNAPI EP] Add support for DepthToSpace Op (#11354 ) * initial implementation for support nnapi depthtospace * modify depthtospace output tensor shape and enable test pass * minor update * minor update * modify input output layout order and hack nnapi instance to use nchw flag for optest * address pr comments * add depthtospace to layout logic * format length and revert UT log level * add nchw and android feature level check in opsupportchecker * minor fix * update * update * fix * minor update	2022-05-09 11:38:12 -07:00
Changming Sun	3b16fb2000	Delete java-test-final-jar-step.yml (#8894 )	2022-05-09 11:25:03 -07:00
Justin Chu	c541063245	Format coding conventions documentation (#11405 ) Add proper formatting to code blocks to make the doc more readable. - Wrap code blocks with ` - Fix typos	2022-05-09 10:19:15 -07:00
symphonylyh	c2de603c10	Contrib ops for TRT plugin: Disentangled Attention Plugin (#11287 ) * Add disentangled attention TRT plugin as contrib op * update plugin name & remove null character * update onnx-tensorrt submodule with my beta version * use suggested plugin name & simpler shape propagation * update onnx-tensorrt gitsubmodule to temporary fork * update onnx-tensorrt to temporary commit * redirect submodule back to latest 8.2-GA release of onnx-tensorrt repo Co-authored-by: HHH-ComputeLab <haohangh@nvidia.com>	2022-05-08 15:25:25 -07:00
George Wu	70e501866b	Revert "[TensorRT EP] reduce CI pipelines test execution time (#11440 )" (#11460 ) This reverts commit `8d6ade9e08`.	2022-05-07 11:41:11 -07:00
Dwayne Robinson	69b2fab810	Update DirectML from 1.8.0 to 1.8.2 (#11459 )	2022-05-06 17:52:52 -07:00
RandySheriffH	8467af832f	Fix reduced pipeline by excluding test case standalone op (#11458 ) * exclude reduce build from standalone op test * exclude test from reduced op build	2022-05-06 16:19:49 -07:00
Brian Popow	3624f7c5a5	Update samples (#11420 )	2022-05-06 13:32:16 -07:00
Hubert Lu	2a90922f01	Using vectorized loads (float2) for fp16 to improve performance (#11390 )	2022-05-05 14:19:21 -07:00
Changming Sun	d2ae0f49b2	Make Graph::InlineFunction be able to process initializers (#11443 )	2022-05-05 12:30:29 -07:00
George Wu	8d6ade9e08	[TensorRT EP] reduce CI pipelines test execution time (#11440 ) * add global builder placeholder to improve CI test time for TRT EP * fix build error * rename var, put in unnamed namespace * fix build error * fix	2022-05-05 09:25:54 -07:00

6779 commits