onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-06-29 03:30:52 +00:00

Author	SHA1	Message	Date
Thiago Crepaldi	f38f2d5b54	Port #4920 into the new pytorch frontend (#4965 )	2020-09-01 19:00:49 -07:00
Hariharan Seshadri	d30dd41c0e	Remove public default ctor in PyInferenceSession and replace it with a protected ctor (#4990 )	2020-09-01 17:10:36 -07:00
Ryan Lai	c6a3620ba8	Remove evaluate telemetry due to redundancy (#4996 ) * Remove evaluate start / stop from telemetry * Remove eval telemetry * remove check for evaluate time delay * add comment * remove const Co-authored-by: Ryan Lai <ryalai96@gamil.com>	2020-09-01 17:02:00 -07:00
Tianlei Wu	a47cae031f	Use raw attention mask in BERT related fusions (#4889 ) * Use raw attention mask in fusion * update python scripts to use raw attention mask by default	2020-09-01 13:22:20 -07:00
liqunfu	d79af260bb	Liqun/new api orttraining test transformers (#4982 ) * matching transformer model test with Lamb * increase epochs * use atol 1e-6 to pass full precision test	2020-09-01 13:11:06 -07:00
gwang-msft	64237d999c	Add Cmake config for onnxruntime_NO_EXCEPTIONS (#4975 ) * additional noexception setting, added compile options * more no exception changes * addressed PR comments * Fix build issue when MSVC static library is used. * Clarify comment * add fatal message for onnxruntime_NO_EXCEPTIONS enabled without onnxruntime_MINIMAL_BUILD Co-authored-by: Scott McKay <skottmckay@gmail.com>	2020-09-01 10:17:50 -07:00
Pranav Sharma	ad1701dfb1	Rename DeviceAllocatorRegistrationInfo to a more generic name; Use OrtArenaCfg for arena members; Remove unused OrtMemType; Simplify CreateAllocator interface. (#4970 ) * Rename DeviceAllocatorRegistrationInfo to a more generic name; Remove OrtMemType; Simplify CreateAllocator interface. * - fix builds - fixed mixed aggregation + constructor calls (which were coded before this PR) - changed default value of max_mem in API header - added some validation of values for for arena_extend_strategy * fix tensorrt and cuda tests	2020-09-01 09:25:32 -07:00
Yufeng Li	ffc2b25a3a	Quantization tool improvement (#4933 ) Improve quantization tools: 1. Support QAT 2. Make quantization tool to register Operators. 3. Make the API clear to use Co-authored-by: t-yguo <t-yguo@microsoft.com>	2020-09-01 09:07:46 -07:00
Zhang Lei	464bbd27a9	Zhalei/optimize nms (#4875 ) * double the speed of non_max_suppression for cpu. * handle edge case in test case.	2020-08-31 23:33:54 -07:00
Zhang Lei	cf1b74396a	Fix build break for microbench. (#4960 )	2020-08-31 23:29:07 -07:00
RandySheriffH	14b51d6502	CiPipeline@ReducedOpsBuild (#4917 ) * cancel night build on pyop * setup ci pipeline for build of reduced ops * add back c# test * remove debugging print * add testing model * add more arg in pipeline script * disable pipeline trigger temporarily * fix yaml format * fix yaml format * fix pipeline error * rid c# test * add ops for test cases * add Conv from domain com.microsoft.nchwc * remove --reduce_ops * fix typo * remove --build_java * add test case for excluded op * update doc with --skip_test * formatting code, renaming files and simplify yaml * remove debug build from yaml * remove surplus ops from included_ops.txt * add MinSizeRel build to yaml * rename test cases and models * exclude ir test from minimum build * restrict ir test to be only applied to reduced ops build	2020-08-31 21:21:18 -07:00
gwang-msft	7ca8388dc9	[ORT Mobile] file format schema and file I/O code (#4973 ) * ort mobile file format schema and [de]serializing code	2020-09-01 11:51:31 +10:00
George Wu	bca9ccb1b3	add install sec updates (#4957 )	2020-08-31 18:13:02 -07:00
Xueyun Zhu	1e1f5a9c79	support data parallel + pipeline parallel (#4648 ) * enable data + pipeline parallel * distributed group calculation * fix typo * fix test and minor changes	2020-08-31 17:32:03 -07:00
Thiago Crepaldi	9817b8c8a7	Fix state_dict/checkpoint issue introduced by #4639 (#4984 ) https://github.com/microsoft/onnxruntime/pull/4639 changed the default behavior by removing optimizer state from state_dict/checkpoint APIs. The reason for the previous change was to allow models trained on ORT to be used for inference on PyTorch, which is an important feature. Due to the change aforementioned, when resuming training from a checkpoint, the optimizer would start with random weights, leading to a bad performance. This behavior would also cause reproducibility issues, as the optimizer wouldnt be able to resume from its previous state. This PR adds a boolean flag to state_dict/save_xheckpoint API that when True (default) it saves both model and optimizer state. When False, only the model state is kept.	2020-08-31 17:00:14 -07:00
Ashwini Khade	8679a7244e	Enable rejecting models based on onnx opset (#4912 ) * enable rejecting models based on onnx opset * enable unreleased opsets in linux and mac CI * test fixes and more updates * enable unreleased opsets in CI builds * enable released opsets in linux cis * try fix windows ci yml * yml fixes * update yml * yml updates post master merge * review comments * bug fix	2020-08-31 13:35:36 -07:00
Sherlock	50c610e70a	Stop Gradient at Shape op (#4983 )	2020-08-31 13:13:17 -07:00
Faith Xu	7af052fd62	Add CI status badges for Training builds (#4951 ) * Add CI status badges for Training builds * Fix links	2020-08-31 12:10:38 -07:00
M. Zeeshan Siddiqui	6d9d252bc3	Disable NegativeLogLikelihoodLoss_LargeSizeTensor test (#4979 ) Disabling this test until it's intermittent failure is root caused, this is a function and does not have a dedicated op by itself. However, this op is not used in known model to the best of my knowledge to disabling this test for the sanity of CI until the investigation is over is probably reasonable.	2020-08-31 11:02:07 -07:00
edgchen1	b41e5e88fb	Add more node debug dump functionality. (#4921 ) Add ability to dump node inputs/outputs to files, filter nodes, configure behavior with environment variables.	2020-08-31 10:17:23 -07:00
Sherlock	98f7fdd7da	Handle MatmulGradient with 2D Weight at B (#4977 )	2020-08-30 22:56:33 -07:00
Changming Sun	bac41969be	update (#4948 )	2020-08-29 19:05:07 -07:00
Hariharan Seshadri	64d52ae47d	Support creating sessions using DML EP via C# (#4955 )	2020-08-29 15:18:50 -07:00
Hariharan Seshadri	7080e485a3	hHandle upper-cased subscript labels in Einsum (#4964 )	2020-08-29 15:18:21 -07:00
Dwayne Robinson	f4b057b098	Fix DML License in nuget package (#4969 )	2020-08-29 00:02:01 -07:00
gwang-msft	ea5732319e	Add option ORT_NO_EXCEPTIONS to disable most exception/throw in /onnxruntime/ (#4894 ) * init no exception changes * initial test * disable exceptions * more throw handling * minor update * fix linux build break * fix windows/nuphar build break * address cr comments, move #ifdef to ORT_CATCH * address cr comments, move #ifdef to ORT_CATCH * handle return statement in ORT_CATCH * linux build break fix * addressed cr comments, remove ort_catch_end * addressed cr comments, remove ort_catch_end * move mlas to a separated ifdef flag * merge master, move some new code in master to no_exc Co-authored-by: gwang0000 <62914304+gwang0000@users.noreply.github.com>	2020-08-28 23:03:51 -07:00
Brian Martin	655ffd5d5b	make (de)tensorization events measure level events (#4958 ) * make tensorizer events measures * throttle the events and add a new one SoftwareBitmapToGPUTensorTelemetryEvent * factor out timing code into a class * typo * typo * move eventimer class into its own header file * add throttling to detensorization and remove variable timing * make detensorization events measures as well * add ConvertGPUTensorToSoftwareBitmapTelemetryEvent event * de-duplicate event names * fix comment * PR feedback	2020-08-28 16:49:32 -07:00
Thiago Crepaldi	cd0f2fb48c	Add code oweners for pytorch frontend (#4963 )	2020-08-28 15:57:52 -07:00
Hariharan Seshadri	7045910d10	Support RegisterCustomOpsLibrary via the Python API (#4764 )	2020-08-28 13:24:29 -07:00
Dwayne Robinson	040c5fa3e0	Merge pull request #4925 from microsoft/user/dwayner/Iron ORT DirectML EP for Iron release, ONNX 1.5	2020-08-28 12:28:30 -07:00
Wei-Sheng Chin	1281ff6462	Put operators in-between Wait and Record (#4916 )	2020-08-28 11:44:54 -07:00
Hariharan Seshadri	b945225de3	Include DirectML pdb in x86 bin folder (#4953 )	2020-08-28 11:29:26 -07:00
Changming Sun	c37fa7c278	Delete Dockerfile.centos6_gpu (#4851 )	2020-08-28 09:56:52 -07:00
Brian Martin	39382dc6c3	Update winrt_api.md to address the 1.4 release (#4946 )	2020-08-28 08:05:22 -07:00
Dwayne Robinson	79429c934b	Update	2020-08-27 21:01:19 -07:00
Ori Levari	a7ce5b2be1	fix comment and casing of telemetry fields for named dimension overrides (#4943 ) Co-authored-by: Ori Levari <orlevari@microsoft.com>	2020-08-27 17:30:56 -07:00
Ye Wang	dfb9d97ddf	Support DistilBert's Attention fusion in Optimizer (#4748 ) * checkin * attention fusion * attention work under layernorm, still need refine * embedlayernorm(have problems with graph.Resolve()) * some fix * update: attention works but onnx results in protobuf parsing failed * tested by optimizer * add embedlayer fusion test * add attention fusion test * clean code, need refactor later * clean code * added reshape fusion for distilbert, modified attention, added tests * refactor * small fix * remove uncessary lines * fix reshape and modify attention * resolving conflicts * restore * refactor and review partial comments * refactor attention * small fix * fix inf compare * match new pattern for attention fusion * formatting * attention does not depend on transposescalematmul * fix * review coments * revert changes * review comments * small fix	2020-08-27 17:00:30 -07:00
George Wu	e6b6736e48	update cuda capabilities (#4936 )	2020-08-27 16:38:18 -07:00
Tang, Cheng	efdd96595f	bfloat16 and opset13 related fix (#4913 ) * regsiter part of opset13 cpu kernels; fix a bug in func impl; adjust reshapefusion order * remove useless function Co-authored-by: Cheng Tang <chenta@microsoft.com>	2020-08-27 16:10:53 -07:00
Dwayne Robinson	f68d5263b7	Merged PR 5100436: EinSum ONNX 1.7 (opset 12) ORT DML EP kernel Adds EinSum operator (purely an EP kernel, not a dedicated DML operator), which takes an equation string and depending on the specifics is capable of representing: identity, diag, trace, transpose, reduce sum, dot product, matmul, elementwise multiplication, inner product, outer product. The DML EP recognizes many of them (identity, transpose, reduce sum, 1D dot product, matmul, elementwise multiplication), but defers to CPU when not supported (extended inner product, outer product, diag, trace, arbitrary batch ellipsis). https://github.com/onnx/onnx/blob/master/docs/Operators.md#Einsum WindowsAI PR: https://microsoft.visualstudio.com/DefaultCollection/WindowsAI/_git/WindowsAI/pullrequest/5100608 Related work items: #27469790	2020-08-27 22:10:14 +00:00
Nick Feeney	b5c765c76b	Merged PR 5103319: 8d Update Required changes for 8D scatter and gather Related work items: #27678554	2020-08-27 21:28:02 +00:00
Brian Martin	970ddd56a7	Fix typo in contributing.md (#4939 ) committments -> commitments	2020-08-27 14:01:36 -07:00
Sherlock	9f5d4918dc	MatMul Gradient optimization for dB when B's is 2D tensor (#4899 ) * Optimized MatMulGrad for dB when B's shape is 2D * Refactor for ConstantScalarNode Co-authored-by: Sherlock Huang <bahuang@OrtTrainingDev3.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>	2020-08-27 11:33:20 -07:00
Sheil Kumar	6dc85b5f14	wstring_convert std::codecvt_utf8 add ~200KB to inbox windows.ai.machinelearning.dll binary size (#4932 ) * switch to UTF8FromHString * remove extra c_str Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2020-08-27 10:07:10 -07:00
Dmitri Smirnov	2b460eaeca	Revise IDisposable implementation in C# interfaces (#4915 ) Revise IDisposable implementation in C# interfaces	2020-08-27 09:17:42 -07:00
Scott McKay	08eb15068c	Exclude the Map types from the build if ML ops are disabled. (#4908 ) * Exclude the Map types from the build if ML ops are disabled. They're the only ops that use Map.	2020-08-27 17:48:12 +10:00
Ye Wang	792ed44537	Support EmbedLayerNorm fusion for DistilBert (#4928 ) * checkin embedlayernorm fusion for distilbert * move function from optimizer_utils * review comments	2020-08-26 21:46:31 -07:00
harshithapv	00fe718264	Fix divide-by-zero for SSCE kernel when normalize factor is zero. (#4911 ) * Changes in SSCE for all tokens ignored case.	2020-08-26 17:12:17 -07:00
Thiago Crepaldi	cac25751bd	Fix mnist example (#4926 )	2020-08-26 15:28:39 -07:00
Scott McKay	438babd966	Fix some Android build issues when ORT_MINIMAL_BUILD is defined. (#4924 )	2020-08-27 07:37:51 +10:00

... 174 175 176 177 178 ...

11997 commits