onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-08 17:17:15 +00:00

Author	SHA1	Message	Date
ytaous	e0334f177c	Opset12 upgrade for existing models used by perf/e2e pipelines (#4238 ) * opset12 support * opset12 support * on comments Co-authored-by: Ethan Tao <ettao@microsoft.com>	2020-06-15 14:26:53 -07:00
Ashwini Khade	4486c66ed4	enable conv transpose 3D (#4218 ) * enable convtranspose 3D * test fix	2020-06-15 13:38:32 -07:00
Bowen Bao	b08771f00e	Add ONNX Training Post-Passes to Front-End - Cont (#4041 ) * Add ONNX postpasses * add flag + add bert test from onnx file * address PR comments * fix typo * fix rebase * address comments * Fix test failures * add new pass for expand for new pt version, add comments * fix rebase Co-authored-by: lahaidar <lahaidar@OrtDevTest2v100.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>	2020-06-15 10:33:26 -07:00
Cecilia Liu	0b5bbb16b8	Benchmark With IO Binding (#4206 ) * add io binding to benchmark.py	2020-06-15 10:06:33 -07:00
Weixing Zhang	b4b1c6440a	Enable ORT with CUDA 11 toolkit (#4168 ) * ORT on CUDA 11 1. Seperate HOROVOD and MPI 2. Seperate NCCL from HOROVOD in CMakeLists.txt 2. Remove dependency on external cub 3. cudnnSetRNNDescriptor is changed in cuDNN 8.0 * polish the code about MPI/NCCL in CMakeLists.txt and build.py * check CUDA version * ${MPI_INCLUDE_DIRS} should be PUBLIC * sm30, sm50 are deprecated in CUDA 11 Toolkit * update change based on code review feedback. * add sm_52 * improve MPI/NCCL build path Co-authored-by: Weixing Zhang <wezhan@microsoft.com>	2020-06-15 08:47:03 -07:00
Emad El-Haraty	88a9cceb41	fix relative links in CONTRIBUTING.md (#4212 ) * fix a links to Engineering Design and API in CONTRIBUTING.md * fix additional links in CONTRIBUTING.md * correct the link to the public API in CONTRIBUTING.md Co-authored-by: Emad El-Haraty <emad.elharaty@limebike.com>	2020-06-15 06:48:09 -07:00
Guoliang Hua	d0d31efd86	fix transformer doc format (#4003 ) fix transformer doc format	2020-06-15 01:30:47 -07:00
Wei-Sheng Chin	ecc901717e	Use subset to release gradient tensors earlier (#4222 )	2020-06-14 22:52:54 -07:00
Andrews548	886befaba1	Add BatchNorm and Concat to ACL EP (#4190 ) * Fix acl padding * Add BatchNormalization operator to ACL Execution Provider * Add Concat operator to ACL Execution Provider Co-authored-by: Andrei-Alexandru <andrei-alexandru.avram@nxp.com>	2020-06-14 21:48:22 -07:00
Hariharan Seshadri	877862184e	Fix subgraph based reshape fusion (#4185 )	2020-06-14 21:10:08 -07:00
Tracy Sharpe	bf3c32166d	fix optional input/outputs (#4229 )	2020-06-15 08:10:22 +10:00
Hariharan Seshadri	5708c4feaf	Handle corner case in Resize op (#4183 ) * Handle corner case in Resize op * Nit * Fix build * PR feedback	2020-06-13 18:05:25 -07:00
Tracy Sharpe	7a96cfc8f5	operator code cleanup (#4228 ) Search/replace of the pattern "const auto foo = tensor.Shape()" to "const auto& foo = tensor.Shape()" to avoid unneeded copies at runtime and reduce code size (8KB drop for onnxruntime.dll). Remove some unnecessary header includes.	2020-06-13 14:47:44 -07:00
jornt-xilinx	c55f6d76be	[Vitis-AI EP] Fix to enable multi-output subgraphs inside Vitis-AI EP + edit docs (#4171 )	2020-06-13 04:56:07 -07:00
Wei-Sheng Chin	de9da123cf	Enable static memory planning for pipeline. (#4204 ) * Enable static memory planning for pipeline. 1. We fix a bug when resolving symbolic shape for scalars. 2. We pass the original inputs to all pipeline stages so that the symbolic shapes can be resolved. * Further Improvements 1. Address comments. 2. Further reduce activation size by ~50% when pipeline is on. This is done by removing all but one gradient tensor from the last RecordEvent in the backward pass. * Address a comment * Fix Windows build	2020-06-12 21:43:50 -07:00
Hariharan Seshadri	b377266eb3	Fix Mac build linker warnings (#4155 )	2020-06-12 21:10:12 -07:00
Hariharan Seshadri	91a41298cc	Fix ORT build when onnxruntime_PYBIND_EXPORT_OPSCHEMA is enabled (#3954 )	2020-06-12 19:32:57 -07:00
Tracy Sharpe	155e22d1ab	MLAS: fuse float output into quantized GEMM (#4215 ) Add more variants of MlasGemm that do a u8x8 GEMM with the output type as float. This fuses the common sequence of MatMulInteger + Cast + Mul(OutputScale) + optional Add(BiasVector).	2020-06-12 17:50:40 -07:00
Tiago Koji Castro Shibata	2e3607c7cd	Remove hardcoded desktop lib (#4193 )	2020-06-12 16:51:54 -07:00
Edward Chen	f74861841e	Fix dangling pointer to local string variable in onnxruntime_pybind_state.cc.	2020-06-12 14:28:39 -07:00
Edward Chen	6b4f652017	Clean up status checks in gradient_graph_builder_test.cc.	2020-06-12 14:28:39 -07:00
Edward Chen	7096e6f5ef	Reduce severity of GraphAugmenter logging statement.	2020-06-12 14:28:39 -07:00
Changming Sun	6f4320fb85	Fix the python package name issue (#4207 ) Fix the package package name issue. In my last change(#4197) about enabling code sign. I forgot to pass the additional flags to setup.py,	2020-06-12 08:32:59 -07:00
Yufeng Li	87d68d8531	matmul integer fusion (#4195 ) * Introduce DynamicQuantizeMatMul It fuses DynamicQuantizeLinear, MatMul and following cast, multiplier. It gets float in and float out for quantized matmul. We have a MLAS kernel in implementation for this op.	2020-06-11 21:42:09 -07:00
Tianlei Wu	2605faef88	Add past state support in Attention Op for GPT-2 (#4107 ) Update Attention op to allow past state input and output. Add fusion script and tests	2020-06-11 14:19:55 -07:00
pengwa	e6ccb1ac28	GatherNDGrad for CPU (#4123 ) * GatherNDGrad on CPU * Remove __CUDA_ARCH__ check in .cc files	2020-06-12 02:43:49 +08:00
Xueyun Zhu	65a682354b	enable pipeline to run with mixed precision (#4113 ) * enable pipeline to run with mixed precision * address feedback * address feedback * test log * pipe infomation if test fails * ci failure	2020-06-10 22:16:24 -07:00
Changming Sun	8f8d899bf2	Enable code sign in c api pipeline and python pipeline	2020-06-10 19:31:22 -07:00
Yulong Wang	73bc6be5d1	build: split nodejs binding build and test to avoid timeout issue (#4188 ) * split nodejs binding build and test * enable nodejs tests	2020-06-10 19:16:32 -07:00
Matthew Hill	117b2e7743	Fix GPU memory leak on TensorRT (#4172 )	2020-06-10 16:56:51 -07:00
Dmitri Smirnov	af0750ba1b	Java GPu artifact naming (#4179 ) Modify gradle build so artifactID has _gpu for GPU builds. Pass USE_CUDA flag on CUDA build Adjust publishing pipelines to extract POM from a correct path. Co-Authored-By: @Craigacp	2020-06-10 11:15:48 -07:00
George Wu	e8ed14bcb3	disable MEMLEAK CHECKER for openvino	2020-06-10 11:12:17 -07:00
stevenlix	c296884fc3	bump up ORT version to 1.3.1 (#4181 )	2020-06-10 08:44:03 -07:00
Changming Sun	c0bdbc0b39	Enable telemetry for the C API and python pipeline (#4174 )	2020-06-10 00:07:46 -07:00
Tracy Sharpe	35d9f396c4	MLAS: refactor quantized GEMM loops (#4182 )	2020-06-09 23:28:55 -07:00
George Wu	9d65ce53bc	move back to toolset 14.16 to possibly work around nvcc bug (#4180 )	2020-06-09 19:36:30 -07:00
Changming Sun	a7366d82af	Disable nuphar large model test (#4173 ) Disable nuphar large model test, because it takes too long(40+ minutes), while the default cpu provider takes about 5 minutes. After this change, we still keep a lot of other nuphar model tests, I think that should be enough.	2020-06-09 17:45:17 -07:00
Ashwini Khade	9eba9fba7c	Fix for BiasGelu fusion optimizer (#4160 ) * Fix for BiasGelu fusion optimizer * changes per review comments	2020-06-09 14:33:34 -07:00
Yulong Wang	2b3ce1b090	add script to support update nodejs binding version (#4164 )	2020-06-09 13:12:55 -07:00
Sheil Kumar	4377ff4a1a	Enable .NET Core 2.0 and .NET Framework 4.6.1 in Microsoft.AI.MachineLearning NuGet package (#4125 ) * add project to download cswinrt and build winrt c# interop dll * Add to nuget package * reverse if check * run generation before core compile * add generated files to compile * update .net package to binplace native libs * add props to .netstandard2.0 folder * auto binplace ml native binaries * force 'Any CPU' platform build * Fix anycpu and platform targets * fix flake errors * fix variable order * fix flake pep8 errors, semicolon Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2020-06-09 09:08:19 -07:00
Scott McKay	28d12dc4f0	Try to avoid std::move in return whilst keeping CentOS build happy. (#4163 )	2020-06-09 21:41:49 +10:00
oak-tree	541eafb41a	Fixed the link to model test documenation (#4011 )	2020-06-08 17:27:55 -07:00
Changming Sun	2ab3a19728	Enlarge the read buffer size in C#/Java test code (#4150 ) 1. Enlarge the read buffer size further, so that our code can run even faster. TODO: need apply the similar changes to python some other language bindings. 2. Add coreml_VGG16_ImageNet to the test exclusion set of x86_32. It is not a new model but previously we didn't run the test against x86_32.	2020-06-08 16:13:11 -07:00
Tiago Koji Castro Shibata	8eb6a539bd	Hardcode WinML tests umbrella lib (#4161 )	2020-06-08 15:24:08 -07:00
suffiank	7f5339505e	Discover trainable parameters using reverse DFS from loss node (#4116 ) Discover trainable parameters using reverse DFS from loss node, omitting recursion along untrainable inputs. Co-authored-by: suffian khan <sukha@OrtTrainingDev4.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net> Co-authored-by: suffian khan <sukha@microsoft.com>	2020-06-08 14:16:10 -07:00
Yulong Wang	842be1535d	[Node.js binding] add linux and mac package (#4157 ) * try mac pipeline * fix path separator * copy prebuilds folder * split esrp yaml for win/mac * disable mac signing temporarily * add linux * fix indent * add nodetool in linux * add nodetool in win-ci-2019 * replace linux build by custom docker scripts * use manylinux as node 12.16 not working on centos6 * try ubuntu * loosen timeout for test case - multiple runs calls	2020-06-08 14:12:05 -07:00
Sergii Dymchenko	653417ae4b	Fix scaler->scalar typo. (#4142 )	2020-06-08 13:02:12 -07:00
Tiago Koji Castro Shibata	6bbd18efd0	Hardcode WinML umbrella lib to windowsapp.lib (#4133 )	2020-06-08 11:04:44 -07:00
Wenbing Li	ee35320974	The fixings for python scripts in ONNXRuntime (#4135 ) * The fixings for python scripts in ONNXRuntime * update according the comments	2020-06-08 10:27:32 -07:00
Faith Xu	3390431d80	Update MCR image table (#4137 )	2020-06-08 10:13:13 -07:00

1 2 3 4 5 ...

2707 commits