onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-04 04:07:22 +00:00

Author	SHA1	Message	Date
CarlPoirier	4af232df0c	Fix props file overwriting AdditionalIncludeDirectories (#10124 ) Co-authored-by: Carl Poirier <carl.poirier@vab-solutions.com>	2022-01-11 23:30:40 -08:00
Rachel Guo	a099bd454b	[QDQ] Add shared qdq selectors (#10178 ) * wip * wip * wip * wip * wip * save * minor changes * update test graph name * address pr comments * update * address pr comments * address pr comments * fix warning * minor include fix * update to nodegroupselectors * delete unnecessary includes Co-authored-by: rachguo <rachguo@rachguos-Mini.attlocal.net>	2022-01-11 19:41:45 -08:00
RandySheriffH	79d2a0d185	Dynamic cost model to mitigate high E2E perf variance (#9833 ) * commit dynamic block size * summarize granularity * add configure * add test case * call std stoi * add comments * fix typo * rename var * update comment * reset default * better comments * extend LoopCounter for dynamic blocking * fix comments and add more UT * update comments * switch type to std::ptrdiff_t * format code with better indentation * cast ptrdiff_t * fix typo	2022-01-11 17:26:41 -08:00
Chen Fu	fb4dea39e2	Tolerate cpuinfo init failure (#10199 ) Tolerate pytorch cpuinfo library init failure.	2022-01-11 16:31:56 -08:00
Olivia Jain	4048ed326c	Update EP Perf Pipeline (#10149 ) * migrate to 1ES Hosted Pool * migrate to Kusto database * refactor and organize ep names with ORT prefix * standardize TRT benchmarking with save/load engine, input binding, and workspace * Add TRT 8.2 to ep perf pipeline * update model_list.json with full onnx zoo * add anubis credentials * add anubis credentials * clarify trt variables * get system info from docker image * remove unwanted commenting	2022-01-11 16:12:32 -08:00
Yufeng Li	cb9b0275b6	Set default quantization weight type to int8 (#10209 ) * Set default quantization weight type to int8	2022-01-11 14:24:57 -08:00
Shucai Xiao	ce103ace93	Amdmigraphx fix build error (#9272 ) * fix build error * rename a missing api for the MIGraphX EP	2022-01-10 15:18:43 -08:00
Chi Lo	be9cc40aa5	Make some parameters configurable for calibration (#10204 )	2022-01-10 13:00:58 -08:00
Jeff Bloomfield	32ee379f50	GraphPartitioner.cpp: fixed Merge function to handle duplicated partitions. (#9929 ) Co-authored-by: Achira <achiras@microsoft.com>	2022-01-10 12:01:08 -08:00
Jingqiao Fu	5cd57bb726	add load from buffer (#10162 ) * Add LoadFromBuffer API	2022-01-10 10:51:48 -08:00
Edward Chen	edd1a2cf61	Add more Java test logging. (#10221 )	2022-01-10 09:42:46 -08:00
Dwayne Robinson	0f5e82c294	DirectML EP remove stale code for int64 via int32 double strides (#9959 )	2022-01-10 02:07:22 -08:00
Dwayne Robinson	1f5b073508	Minor DirectML EP provider factory comments (#9965 )	2022-01-10 02:06:31 -08:00
PeixuanZuo	7d93498e0e	[FIX] register softmaxgrad_13/logsoftmaxgrad_13 for rocm (#10177 ) * [FIX] register softmaxgrad_13/logsoftmaxgrad_13 for rocm * [FIX] update softmaxgrad_13/logsoftmaxgrad_13 implementation for rocm	2022-01-10 11:33:46 +08:00
Scott McKay	6e88c11cae	Refactor QDQ node group selection infrastructure (#10195 ) * Separate out the QDQ node group selection from the SAT specific NodeSelector to make re-use in NNAPI etc. cleaner. * Make MatMulIntegerToFloat matching optional. Add move ctor to BaseSelector. Required now that it has a unique_ptr member. * Avoid Guardian warning by using rvalue unique_ptr created with make_unique	2022-01-10 10:57:50 +10:00
Nat Kershaw (MSFT)	d52d3c0052	Update C/C++ API docs automation to create a PR (instead of push to publish branch) (#10093 )	2022-01-07 16:16:47 -08:00
Ye Wang	5ebb857501	Update onnxruntime_unittests.cmake (#10215 )	2022-01-07 16:14:15 -08:00
vade	bacae967a2	Update Cuda to 11.4.2, update architectures, support Ubuntu 20.04 (#10169 )	2022-01-07 13:00:44 -08:00
Zhang Lei	2bbf1ac1e0	Using better words. (#10210 )	2022-01-07 09:17:23 -08:00
Jeff Daily	e7efcc93fe	[ROCm] update hipify-perl location (#10102 ) * [ROCm] update hipify-perl location Depending on the ROCm version installed, hipify-perl might not always live in the hard-coded path of /opt/rocm/bin. Use python 3.3's shutil.which to locate the script. * provide alternative locations for hipify-perl if not in PATH * implement hipify-perl search as a function This avoids running the logic during module import since all builds import the amd_hipify module. * fix flake8 errors	2022-01-06 17:21:02 -08:00
Abhishek Jindal	4ac3277743	adding definition of concat operator for mapping it to onnx (#10062 ) * adding definition of concat operator for mapping it to onnx * adding the opgen generator file to include tensorlist type for eager mode	2022-01-06 14:56:35 -08:00
Chris Hua	cab4579b83	remove six references (#9941 ) Python 2 compatibility is no longer necessary and helps unblock upgrades to mypy and others.	2022-01-06 13:52:20 -08:00
Hariharan Seshadri	0552a47ec2	Enable CUDA provider option configuration for C# (#10188 )	2022-01-06 11:03:14 -08:00
Ye Wang	08f512b25e	Fix a Win GPU reduced ops pipeline (#10202 )	2022-01-06 09:46:34 -08:00
ashari4	4ab891999a	fix hardcoded type (#10205 )	2022-01-06 09:28:22 -08:00
ashari4	7b5464ed7b	aten add_ op supports bf16 (#10084 ) * hand implemented add_	2022-01-05 09:33:28 -08:00
Edward Chen	34c025109c	Exclude graph_runtime_optimization_test.cc from reduced ops build. (#10191 )	2022-01-05 09:22:38 -08:00
Ye Wang	2803a9465d	Add example of registering custom cuda op as shared lib (#10025 )	2022-01-05 09:22:15 -08:00
yz	2078210a1c	Improve logging for symbolic shape inference	2022-01-04 13:17:07 -08:00
Edward Chen	792db33f01	Enable loading of ORT format model graph runtime optimizations (#9901 ) Initial implementation of load/replay of runtime optimizations in an ORT format model.	2022-01-04 12:09:07 -08:00
Tang, Cheng	97659495d9	fix aten view op (#10050 ) * fix aten view op * add test case * fix signature * fix the build Co-authored-by: Cheng Tang <chenta@microsoft.com@orttrainingdev9.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>	2022-01-04 08:29:30 -08:00
George Wu	91f85dfdad	update Dockerfile.manylinux2014_cuda11_4_tensorrt8_2 to TensorRT 8.2.2.1 (#10167 )	2022-01-03 20:38:37 -08:00
Chi Lo	c29397ad4f	Modify the code to get correct range for symmetric quantization (#10170 )	2022-01-03 19:13:37 -08:00
Nat Kershaw (MSFT)	0c517112c4	Automate Python API docs generation (#10116 )	2022-01-03 18:22:22 -08:00
Yufeng Li	230f323600	add qdq support for LeakyRelu (#10077 ) * add qdq support for LeakyRelu	2022-01-03 14:48:49 -08:00
Tongliang Liao	1d3b34cc92	Add `.git` suffix to github URL. Although github works with both, this is more precise. Having an extension also makes it easy to match with regex, when we want to inject code to reroute traffic to our own git mirror.	2022-01-03 14:38:35 -08:00
Yufeng Li	7208fcbe1c	use wasmscalar as default kernel (#9988 ) * use wasmscalar as default kernel	2022-01-03 10:55:08 -08:00
Dmitri Smirnov	28ce2a5a78	Re-work hierarchy, fix virtual method overload/hiding (#10160 ) Re-work hierarchy, fix virtual method overload/hiding Use std::optional with a clear comment on the member thread-safety.	2022-01-03 10:24:49 -08:00
Abhishek Jindal	d5742f3a43	moving from torch nightly build to stable build (#10150 ) * moving from torch nightly build to stable build * using torch cpu version * using torch cpu version from link	2021-12-29 19:35:10 -08:00
Edward Chen	3bc91c2151	Move reduced ops files into build directory (#10030 ) In a reduced ops build, some source files get updated. This change moves the updated files into the build directory. This way, it is easier to simultaneously manage different build directories (with possibly different reduced ops configurations) based on a single source directory.	2021-12-28 19:04:20 -08:00
Scott McKay	a367f0664d	From Python 3.8 and on you need to explicitly add the current directory for libraries to be loaded from it. Update onnxruntime_test_python.py with that handling. (#10129 )	2021-12-28 16:10:26 +10:00
George Wu	3d6786c92e	update tensorrt multi gpu pipeline to tensorrt 8.2 (#10141 )	2021-12-27 15:43:27 -08:00
Vincent Wang	ceb17f82ff	Use FusedMatMul When Transpose is Between First Dim and Contiguous Batch Dims (#9734 ) * fusedmatmul support transpose batches * fix win build * fix contrib op md * more comments	2021-12-27 10:49:46 +08:00
Vincent Wang	f780f06240	ConcatGrad for OpSet13 (#10109 )	2021-12-24 10:02:52 +08:00
stevenlix	05d20343ee	Remove duplicated constant initializer copies for TensorRT nodes (#10105 ) * add new field constant_initializers in metadef and remove constant initializers from trt node inputs * remove redundancy * use GetConstantInitializer() to get constant initializers * add ORT_ENFORCE check Co-authored-by: Ubuntu <azureuser@orteplinuxdev.bxgbzpva45kedp3rhbsbit4phb.jx.internal.cloudapp.net>	2021-12-22 12:19:56 -08:00
Sheil Kumar	ce1a9ca618	Fix Microsoft.AI.MachineLearning NuGet App failure with multiple binaries copied to same destination (#10076 ) * Include onnxruntime binary when not using package reference or uap app. * Remove the lib\uap10.0 build from the nuget package - causing conflicts * Add UWP test * remove build files * remove local change * reset mimalloc and onnx-tensorrt * change username to Microsoft Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2021-12-21 12:34:03 -08:00
Ye Wang	7a1bdc2052	Don't check cache shape when using dynamic axis (#10090 ) Co-authored-by: Ubuntu <wy@linux-v100.aidmrjtolptuzevavgwhrapqcd.jx.internal.cloudapp.net>	2021-12-20 21:19:29 -08:00
Changming Sun	4e9e01cb3c	Fix SDL warnings in CPU EP (#9975 )	2021-12-19 20:54:29 -08:00
satyajandhyala	bd4fb4c5da	Coding style fix. (#10080 )	2021-12-18 12:05:48 -08:00
ashari4	cdbd678192	Check kMSDomain already exists before registering it (#10078 ) * Check domain before registration	2021-12-17 17:55:15 -08:00

6191 commits