* Add Attention fusion for GPT2
* Support distilgpt2 in benchmark_gpt2.py
* Add options to disable Attention/SkipLayerNormalization/EmbedLayerNormalization/BiasGelu fusions (usage sketch after this list)
* Add logging at the beginning of each fusion
* Update notebooks: Add Gpt2OnnxModel.py to list of script files.
* Add test for gpt2 model optimization
* Add optional parameters (--input_ids --segment_ids --input_mask) for graph inputs
* Fuse BiasGelu
* Handle models that do not have a segment_ids input.
* Allow embed layer fusion without a mask
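The fusion toggles and optional input names above surface through the transformer optimization tooling. Below is a minimal sketch using the onnxruntime.transformers Python API; the module path, option attribute names, and model dimensions are assumptions and may differ from the script-based workflow these commits describe.

```python
# Hypothetical usage sketch, not the shipped script: optimize a GPT-2 ONNX
# model with Attention fusion enabled and the BiasGelu fusion turned off.
from onnxruntime.transformers import optimizer
from onnxruntime.transformers.fusion_options import FusionOptions

options = FusionOptions("gpt2")
options.enable_bias_gelu = False          # assumed toggle for the BiasGelu fusion

opt_model = optimizer.optimize_model(
    "gpt2.onnx",                          # exported GPT-2 (or distilgpt2) model
    model_type="gpt2",
    num_heads=12,                         # illustrative values for the small model
    hidden_size=768,
    optimization_options=options,
)
opt_model.save_model_to_file("gpt2_optimized.onnx")
```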
* Make QuantizeLinear support half
* Remove unnecessary type constraint
* Refine kernel definition
* Add fp16 support for DequantizeLinear
* Disable QuantizeLinear_per_tensor_half_int8 for TensorRT
* Refine unit tests and fix a saturation issue for the MSDomain QuantizeLinear (reference semantics sketched after this list)
* Fix build break
* Include TensorRT in the half_uint8 test
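For reference, the sketch below spells out the QuantizeLinear semantics that the fp16 and saturation changes above must preserve: y = saturate(round(x / scale) + zero_point), clamped to the signed 8-bit range. The helper is illustrative and is not the kernel implementation.

```python
import numpy as np

def quantize_linear_fp16_to_int8(x_fp16, scale, zero_point):
    # Compute in float32, round half to even, add the zero point,
    # then saturate to [-128, 127] before casting to int8.
    x = np.asarray(x_fp16, dtype=np.float16).astype(np.float32)
    q = np.rint(x / np.float32(scale)) + np.int32(zero_point)
    return np.clip(q, -128, 127).astype(np.int8)

# Values past the representable range must saturate rather than wrap:
# quantize_linear_fp16_to_int8([10.0, -10.0], scale=0.05, zero_point=0) -> [127, -128]
```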
* Migrate winml to Microsoft Namespace (packaging changes are pending)
* add ns_prefix toggle
* fix packaging
* Add missing raw header (#3484)
* add dualapipartition
* Fix wrong variable for repo root
Co-authored-by: Sheil Kumar <sheilk@microsoft.com>
* remove existence check to force failures
* Remove extra paren
* dualapipartition needs to be referenced from the source
* add microsoft.ai.machinelearning.dll to the output dir
* rename the idl file so that assembly info is correctly added into the winmd
* fix namespaces
* update namespaces
* Default to the Microsoft namespace, and add a namespace override as a build argument
* Update CMakeSettings.json as well
* Remove from CMakeLists.txt
Co-authored-by: Sheil Kumar <sheilk@microsoft.com>
Co-authored-by: Changming Sun <chasun@microsoft.com>
* Fixed corner cases in the ACL EP GEMM implementation by setting fully connected as the main layer
* Introduced a versioned build for the ACL EP. Supported ACL versions are 1902, 1905 and 1908
* Added a convolution-activation fusion optimization for the ACL EP. We see improvements of 12% for MobileNetV2 and 4% for ResNet50
Co-authored-by: Andrei-Alexandru <andrei-alexandru.avram@nxp.com>
1. Fix static analysis warnings found by VC++
2. Add a new pipeline for static analysis
3. Merge all the Windows CI builds into a single YAML file (easier to queue them all).
4. Make the DNNL build faster by disabling its tests and examples.
5. Enable the custom op unit test.
* Add int64 input type
* Fix for cuda
* Fix linking
* Cuda
* Fixed missing registration
* Fix registration for opsets 1-11
* Adding reduce_matrix_rows for int64
* Update reduction_functions.cu
* Revert cuda
Warn when initializers appear in graph input
Provide a tool to move initializers out of graph input
Motivation and Context
From IR_VERSION 4 onward, an ONNX model treats only the initializers that also appear in graph input as non-constant (overridable at runtime). This can prevent graph optimizations such as constant folding and operator fusion, so this change warns about the case and provides a tool.
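A minimal sketch of such a tool, using the onnx Python helpers (function and file names are illustrative, not the shipped script):

```python
import onnx

def move_initializers_out_of_graph_input(in_path, out_path):
    model = onnx.load(in_path)
    graph = model.graph
    initializer_names = {init.name for init in graph.initializer}
    # Keep only true runtime inputs; an initializer that no longer appears in
    # graph input is constant, so it can be folded or fused by the optimizer.
    kept = [inp for inp in graph.input if inp.name not in initializer_names]
    del graph.input[:]
    graph.input.extend(kept)
    onnx.save(model, out_path)
```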
* Add a flag to enable automatic generation of inputs for models with tensor inputs (see the sketch after this list)
* change wording of variable
* Naming convention changes to variables
* Handle free dimensions
* Comment with default allocator
* variable rename
* Remove input_count
* Cast to size_t to avoid warning
Co-authored-by: Ryan Lai <ryalai96@gamil.com>
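A rough sketch of the input-generation idea above, using the onnxruntime Python API (the shipped feature lives in the native test tooling, so the helper below is illustrative): free dimensions are bound to 1 and tensors are zero-filled with the declared element type.

```python
import numpy as np
import onnxruntime as ort

_DTYPES = {"tensor(float)": np.float32,
           "tensor(int64)": np.int64,
           "tensor(int32)": np.int32}

def generate_dummy_feeds(session):
    feeds = {}
    for arg in session.get_inputs():
        # Free (symbolic) dimensions are reported as None or strings; bind them to 1.
        shape = [d if isinstance(d, int) else 1 for d in arg.shape]
        feeds[arg.name] = np.zeros(shape, dtype=_DTYPES.get(arg.type, np.float32))
    return feeds

# session = ort.InferenceSession("model.onnx")
# outputs = session.run(None, generate_dummy_feeds(session))
```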