onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-03 03:58:54 +00:00

Author	SHA1	Message	Date
Changming Sun	911d125323	Remove openmp from gpu build	2020-04-20 17:13:54 -07:00
Sheil Kumar	31b6629e99	Fork WinML IDL Guids (#3591 ) Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2020-04-20 09:17:07 -07:00
Prabhat	381fee47ab	Added support to build onnxruntime with ACL (#3586 ) * Added support to build onnxruntime with ACL * Added ACL build instructions	2020-04-20 13:35:28 +05:30
Changming Sun	75426a3091	Fix build break	2020-04-19 18:32:46 -07:00
Zhang Lei	422266c445	Support conv transpose 1D in cuda provider. (#3300) * Support conv transpose 1D in cuda provider. * Clear some old comments. Enable conv_transpose_1d onnx test for cuda.	2020-04-19 22:07:34 +08:00
Scott McKay	7d5348f87e	Add ability to batch device copy for graph inputs and outputs. (#3580 ) * Add ability to batch device copy for graph inputs and outputs.	2020-04-19 17:51:07 +10:00
Prabhat	ea62b3435a	Clean up build.py code (#3466 )	2020-04-18 20:48:30 -07:00
Maxim Kalinin	fcf0f6ee9f	Generalize reshape fusion (#3554 ) * Generalize reshape fusion * Allow arbitrary number of Concat arguments * Apply fusion even when an output of an internal node is used elsewhere * Fix a bug when an internal node's output is the subgraph output * Simplify code	2020-04-18 20:47:23 -07:00
Tiago Koji Castro Shibata	14e387aa1a	Fix WinML namespace build break (#3583 ) * Add missing winrt namespace * Conditional compilation of dxcore code * Fix TAEF macros	2020-04-18 20:46:01 -07:00
Sherlock	56b223bc60	Implement OneHot CUDA Kernels (#3390 ) * Implement OneHot CUDA Kernels * Support fp16 * Use HandleNegativeAxis * Make MLFloat16 test GPU only	2020-04-18 17:41:39 -07:00
Hariharan Seshadri	1599562016	Fix BatchNorm CUDA kernel definition	2020-04-18 17:21:29 -07:00
Zhang Lei	c365822808	Refactor calibrate.py. Add QLinearAdd and QLinearMul support. Fix bugs loading JPGs that are not strictly RGB, and typos in the load_batch call. (#3542)	2020-04-18 17:10:55 -07:00
Dmitri Smirnov	db9566f70d	Implement Inverse(12) for CPU and CUDA (#3485 )	2020-04-18 17:10:21 -07:00
Dmitri Smirnov	38a18023c7	Fix some too popular warnings. (#3578 ) Some pointless and noisy warnings either fixed or disabled.	2020-04-18 17:05:05 -07:00
Changming Sun	d68245853e	Disable downloading test data on Linux (#3581 )	2020-04-18 15:54:58 -07:00
Sergii Dymchenko	3e884b4b6b	Fix some typos. (#3582 ) * Fix some typos. * Fix a typo.	2020-04-18 14:18:05 -07:00
suryasidd	6fe688c732	Disabled failed maxpool test on GPU (#3549 )	2020-04-18 13:49:42 -07:00
Tianlei Wu	7f46f347db	Add GPT2 Attention Fusion in optimization script (#3488) * Add Attention fusion for GPT2 * Support distilgpt2 in benchmark_gpt2.py * Add options to disable Attention/SkipLayerNormalization/EmbedLayerNormalization/BiasGelu fusions * Add logging at the beginning of each fusion * Update notebooks: Add Gpt2OnnxModel.py to list of script files. * Add test for gpt2 model optimization * Add optional parameters (--input_ids --segment_ids --input_mask) for graph inputs * Fuse BiasGelu * Handle model that does not have segment_ids input. * Allow fuse embed layer without mask	2020-04-17 16:23:53 -07:00
Tianlei Wu	5d3b217039	Update Attention operator for GPT2 (#3474) Add unidirectional mask for Attention operator. Update mask_index to mask broadcast from B->BxS->BxNxSxS to B->BxSxS->BxNxSxS.	2020-04-17 16:20:40 -07:00
Hariharan Seshadri	b4457ecb7a	Fix `gen_doc` build option and refresh documentation (#3545 ) * Support listing keys in custom metadata map via C/C++ API * nit * PR feedback * Nit * Initial commit * More changes * Support listing keys in custom metadata map via C/C++ API * nit * PR feedback * Nit * Initial commit * More changes * Add md files * Doc changes * Update * revert cmake changes * Update * Doc change * Update * Update	2020-04-17 14:41:04 -07:00
Hector Li	5acd8dbe7d	remove option --enable_lto (#3515 )	2020-04-17 14:18:56 -07:00
Yufeng Li	f822a54860	Make De/QuantizeLinear support half (#3531) * Make QuantizeLinear support half * remove unnecessary type constraint * refine kernel definition * add fp16 support for dequantizelinear * disable QuantizeLinear_per_tensor_half_int8 for tensorrt * refine unit test and fix saturate issue for MSDomain QuantizeLinear * fix build break * include tensorrt for half_uint8 test	2020-04-17 12:17:48 -07:00
Tracy Sharpe	c7b6fab29d	Fix build break in mlas\lib\quantize.cpp: missing nearbyintf (#3572 )	2020-04-17 11:50:25 -07:00
Xiang Zhang	43c3a5edba	update onnxruntime version string for telemetry (#3526 ) * update onnxruntime version string for telemetry * use ORT_VERSION * deleted version.h	2020-04-17 10:46:58 -07:00
Changming Sun	209b41a67d	Update dependencies graph	2020-04-17 07:38:45 -07:00
Sheil Kumar	2717c178cc	Fork the WinML APIs into the Microsoft namespace (#3503 ) * Migrate winml to Microsoft Namespace (packaging changes are pending) * add ns_prefix toggle * fix packaging * Users/sheilk/add missing raw header (#3484) * add dualapipartition * wrong variable for repo root Co-authored-by: Sheil Kumar <sheilk@microsoft.com> * remove existence check to force failures * extra paren * dualapipartition needs to be referenced from the source * add microsoft.ai.machinelearning.dll to the output dir * rename the idl file so that assembly info is correctly added into the winmd * fix namespaces * update namespaces * default to microsoft, and add namespace override as build argument * update cmakesetings.json as well * remove from cmakelists.txt Co-authored-by: Sheil Kumar <sheilk@microsoft.com> Co-authored-by: Changming Sun <chasun@microsoft.com>	2020-04-17 06:18:54 -07:00
ytaous	fcb27c4e8b	hotfix for skiplayernorm (#3543 ) Co-authored-by: Ethan Tao <ettao@microsoft.com> Co-authored-by: Changming Sun <chasun@microsoft.com>	2020-04-17 01:22:08 -07:00
liuziyue	92269ae409	perf tuning docs update (#3520 )	2020-04-17 00:23:15 -07:00
Sheil Kumar	951484ba53	Dualapipartitionattibute.h header is missing in nuget package (#3350 ) * add dualapipartition * wrong variable for repo root Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2020-04-16 22:21:57 -07:00
Changming Sun	1a222b3f6e	Disable downloading test data on Windows (#3551 ) * Disable downloading test data on Windows	2020-04-16 22:15:20 -07:00
Andrews548	93b957a55a	Acl improvements (#3463) * Fixed corner cases for acl ep gemm implementation by setting fully connected as the main layer * Introduced versioned build for the acl ep. ACL versions supported are 1902, 1905 and 1908 * Added convolution-activation fusion optimization for acl ep. We see improvements of 12% for mobilenetv2 and 4% for resnet50 Co-authored-by: Andrei-Alexandru <andrei-alexandru.avram@nxp.com>	2020-04-16 03:14:37 -07:00
Adam Pocock	c91527235a	[Java] Add support for map and sequence information on output nodes (#3468 )	2020-04-16 02:29:23 -07:00
Changming Sun	7c89f38a34	Fix static analysis warnings found by VC++ (#3530) 1. Fix static analysis warnings found by VC++ 2. Add a new pipeline for static analysis 3. Merge all the Windows CI builds into one single yaml file (easier to queue them all). 4. Make DNNL build faster by disabling building the tests and examples. 5. Enable custom op unit test.	2020-04-16 01:46:47 -07:00
Ye Wang	ec4f6c099b	Resolve comments and make minor changes to Featurizer transformers (#3535 )	2020-04-15 13:29:24 -07:00
Hariharan Seshadri	abfb275ac0	Support listing keys in custom metadata map via C/C++ API (#3477 ) * Support listing keys in custom metadata map via C/C++ API * nit * PR feedback * Nit	2020-04-15 12:14:03 -07:00
David Brownell	72cd61baae	Removed use of parameters in python wheel build scripts (#3524 )	2020-04-15 10:31:14 -07:00
Yulong Wang	cf2fddf760	fix nuget build (#3532 )	2020-04-15 10:30:11 -07:00
Changming Sun	b63349c8d6	Fix custom op test failure (#3525 )	2020-04-14 20:36:42 -07:00
Adam Pocock	bc9a199b16	Renaming deviceNum to deviceId.	2020-04-14 20:35:03 -07:00
Adam Pocock	e9dc8954ac	Adding support for ACL and DML to the Java API.	2020-04-14 20:35:03 -07:00
Changming Sun	a2feb29b0d	Fix build break (#3528 ) Ignore some known test failures Install ONNX package before running Windows CI builds	2020-04-14 18:07:56 -07:00
Negin Raoof	e303f458e4	Add int64 input type for ReduceProd (#3507) * Add int64 input type * Fix for cuda * Fix linking * Cuda * Fixed missing registration * Fix registration for opsets 1-11 * Adding reduce_matrix_rows for int64 * Update reduction_functions.cu * Revert cuda	2020-04-14 15:09:28 -07:00
Ori Levari	f564569a80	Adapter Model and Environment tests (#3469 ) Adapter Model and Environment tests winml test macro clean up and extension	2020-04-14 13:36:31 -07:00
Tiago Koji Castro Shibata	560f4c5b16	Make GPUTEST macro consistent among TAEF/googletest (#3518 )	2020-04-14 10:55:16 -07:00
Du Li	621b3ac03a	FFT contrib ops (#3381 ) * add custom op skeleton * Adding Rfft, Irfft kernels. * Fix a few errors: 1. make kernel stateless to avoid race condition 2. reclaim cufft plan * Adding MLFloat16 support * Adding fp16 support for fft ops. * Adding cufft plan cache. * adding a util func * adding copyright info. * Accommodating PR comments.	2020-04-14 10:12:04 -07:00
Yufeng Li	baa86f181f	Handle the case that initializers are in graph input (#3449) Warn when initializers appear in the graph inputs, and provide a tool to move initializers out of the graph inputs. Motivation and context: starting with IR_VERSION 4, ONNX treats only the initializers that also appear in the graph inputs as non-constant. This can block some graph optimizations, such as constant folding and operator fusion.	2020-04-14 09:06:04 -07:00
David Brownell	006c5be1b1	Optionally produce a python wheel that includes featurizers (#3491 )	2020-04-14 09:00:13 -07:00
Changming Sun	040c28ff39	Remove dead code from HandleNegativeAxis	2020-04-14 01:01:15 -07:00
Colin Jermain	06db89cf13	Using logic for finding README.rst to find requirements.txt	2020-04-13 18:59:44 -07:00
Colin Jermain	43d9f9190e	Removing unused six package	2020-04-13 18:59:44 -07:00

2112 commits