onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-06-04 23:59:56 +00:00

Author	SHA1	Message	Date
Hector Li	5acd8dbe7d	remove option --enable_lto (#3515 )	2020-04-17 14:18:56 -07:00
Yufeng Li	f822a54860	Make De/QuantizeLinear support half (#3531 ) * Make QuantizeLinear support half * remove unnessary type constraint * refine kernel definition * add fp16 support for dequantizelinear * diable QuantizeLinear_per_tensor_half_int8 for tensorrt * refine unit test and fix saturate issue for MSDomain QuantizeLinear * fix build break * include tensorrt for half_uint8 test	2020-04-17 12:17:48 -07:00
Tracy Sharpe	c7b6fab29d	Fix build break in mlas\lib\quantize.cpp: missing nearbyintf (#3572 )	2020-04-17 11:50:25 -07:00
Xiang Zhang	43c3a5edba	update onnxruntime version string for telemetry (#3526 ) * update onnxruntime version string for telemetry * use ORT_VERSION * deleted version.h	2020-04-17 10:46:58 -07:00
Changming Sun	209b41a67d	Update dependencies graph	2020-04-17 07:38:45 -07:00
Sheil Kumar	2717c178cc	Fork the WinML APIs into the Microsoft namespace (#3503 ) * Migrate winml to Microsoft Namespace (packaging changes are pending) * add ns_prefix toggle * fix packaging * Users/sheilk/add missing raw header (#3484) * add dualapipartition * wrong variable for repo root Co-authored-by: Sheil Kumar <sheilk@microsoft.com> * remove existence check to force failures * extra paren * dualapipartition needs to be referenced from the source * add microsoft.ai.machinelearning.dll to the output dir * rename the idl file so that assembly info is correctly added into the winmd * fix namespaces * update namespaces * default to microsoft, and add namespace override as build argument * update cmakesetings.json as well * remove from cmakelists.txt Co-authored-by: Sheil Kumar <sheilk@microsoft.com> Co-authored-by: Changming Sun <chasun@microsoft.com>	2020-04-17 06:18:54 -07:00
ytaous	fcb27c4e8b	hotfix for skiplayernorm (#3543 ) Co-authored-by: Ethan Tao <ettao@microsoft.com> Co-authored-by: Changming Sun <chasun@microsoft.com>	2020-04-17 01:22:08 -07:00
liuziyue	92269ae409	perf tuning docs update (#3520 )	2020-04-17 00:23:15 -07:00
Sheil Kumar	951484ba53	Dualapipartitionattibute.h header is missing in nuget package (#3350 ) * add dualapipartition * wrong variable for repo root Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2020-04-16 22:21:57 -07:00
Changming Sun	1a222b3f6e	Disable downloading test data on Windows (#3551 ) * Disable downloading test data on Windows	2020-04-16 22:15:20 -07:00
Andrews548	93b957a55a	Acl improvements (#3463 ) * Fixed cornercases for acl ep gemm implementation by setting fully connected as the main layer * Introduced versioned build for the acl ep. ACL versions supported are 1902, 1905 and 1908 * Added convolution-activation fusion optimization for acl ep. We see improvements of 12% for mobilenetv2 and 4% for resnet50 Co-authored-by: Andrei-Alexandru <andrei-alexandru.avram@nxp.com>	2020-04-16 03:14:37 -07:00
Adam Pocock	c91527235a	[Java] Add support for map and sequence information on output nodes (#3468 )	2020-04-16 02:29:23 -07:00
Changming Sun	7c89f38a34	Fix static analysis warnings found by VC++ (#3530 ) 1. Fix static analysis warnings found by VC++ 2. Add a new pipeline for static analysis 3. Merge all the windows CI build into one single yaml file.(Easier to queue them all). 4. Make DNNL build faster by disabling building the tests and examples. 5. Enable custom op unitest.	2020-04-16 01:46:47 -07:00
Ye Wang	ec4f6c099b	Resolve comments and make minor changes to Featurizer transformers (#3535 )	2020-04-15 13:29:24 -07:00
Hariharan Seshadri	abfb275ac0	Support listing keys in custom metadata map via C/C++ API (#3477 ) * Support listing keys in custom metadata map via C/C++ API * nit * PR feedback * Nit	2020-04-15 12:14:03 -07:00
David Brownell	72cd61baae	Removed use of parameters in python wheel build scripts (#3524 )	2020-04-15 10:31:14 -07:00
Yulong Wang	cf2fddf760	fix nuget build (#3532 )	2020-04-15 10:30:11 -07:00
Changming Sun	b63349c8d6	Fix custom op test failure (#3525 )	2020-04-14 20:36:42 -07:00
Adam Pocock	bc9a199b16	Renaming deviceNum to deviceId.	2020-04-14 20:35:03 -07:00
Adam Pocock	e9dc8954ac	Adding support for ACL and DML to the Java API.	2020-04-14 20:35:03 -07:00
Changming Sun	a2feb29b0d	Fix build break (#3528 ) Ignore some known test failures Install ONNX package before running Windows CI builds	2020-04-14 18:07:56 -07:00
Negin Raoof	e303f458e4	Add int64 input type for ReduceProd (#3507 ) * Add int64 input type * Fix for cuda * Fix linking * Cuda * Fixed missing registration * Fix registeration for opsets 1-11 * Adding reduce_matrix_rows for int64 * Update reduction_functions.cu * Revert cuda	2020-04-14 15:09:28 -07:00
Ori Levari	f564569a80	Adapter Model and Environment tests (#3469 ) Adapter Model and Environment tests winml test macro clean up and extension	2020-04-14 13:36:31 -07:00
Tiago Koji Castro Shibata	560f4c5b16	Make GPUTEST macro consistent among TAEF/googletest (#3518 )	2020-04-14 10:55:16 -07:00
Du Li	621b3ac03a	FFT contrib ops (#3381 ) * add custom op skeleton * Adding Rfft, Irfft kernels. * Fix a few errors: 1. make kernel stateless to avoid race condition 2. reclaim cufft plan * Adding MLFloat16 support * Adding fp16 support for fft ops. * Adding cufft plan cache. * adding a util func * adding copyright info. * Accommodating PR comments.	2020-04-14 10:12:04 -07:00
Yufeng Li	baa86f181f	Handle the case that initializers are in graph input (#3449 ) warn that initializers are in graph input provide a tool to move initializer out of graph input Motivation and Context ONNX model from IR_VERSION 4 only treats initializers that appear in graph input as non-constant. This may fail some of the graph optimizations, like const folding, operator fusion and etc. Warn the case and provide a tool.	2020-04-14 09:06:04 -07:00
David Brownell	006c5be1b1	Optionally produce a python wheel that includes featurizers (#3491 )	2020-04-14 09:00:13 -07:00
Changming Sun	040c28ff39	Remove dead code from HandleNegativeAxis	2020-04-14 01:01:15 -07:00
Colin Jermain	06db89cf13	Using logic for finding README.rst to find requirements.txt	2020-04-13 18:59:44 -07:00
Colin Jermain	43d9f9190e	Removing unused six package	2020-04-13 18:59:44 -07:00
Colin Jermain	c2c3102aba	Tying install_requires to requirements.txt	2020-04-13 18:59:44 -07:00
Ye Wang	66a79d2c9f	fix (#3512 )	2020-04-13 18:30:58 -07:00
Dmitri Smirnov	efd9b92482	Handle Scalars in TernaryOps and Where. (#3509 ) Handle Scalars in TernaryOps and Where.	2020-04-13 16:24:35 -07:00
Ye Wang	cbe30f3e19	update FeaturizersLibrary (#3511 )	2020-04-13 15:47:51 -07:00
Tracy Sharpe	5aab2671f8	Fix crash in DequantizeLinear with scalar tensor (#3508 )	2020-04-13 14:52:52 -07:00
Ye Wang	438353abcd	Fix TruncatedSVDFeaturizer's test failure and re-enable it's kernel test (#3458 ) * checkin * fix linux & macos build * fix test * revert the changes for a single-aimed PR * fix	2020-04-13 13:59:38 -07:00
Tianlei Wu	54bbbb78ae	Change mask_index input of Attention op to be optional (#3459 ) Change Mask Index to optional	2020-04-12 22:55:37 -07:00
George Wu	7f6e407e09	fix python packaging manylinux1 build break. (#3482 )	2020-04-11 06:58:22 +08:00
Ryan Lai	4223591043	Add automatic generation of tensors for Onnxruntime Perf Runner (#3448 ) * Add flag to enable automatic generation of input for models with tensor inputs * change wording of variable * Naming convention changes to variables * Handle free dimensions * Comment with default allocator * variable rename * Remove input_count * Cast to size_t to avoid warning Co-authored-by: Ryan Lai <ryalai96@gamil.com>	2020-04-10 11:54:17 -07:00
stevenlix	56e85484ba	Handle optional inputs and remove more empty shape nodes in TensorRT EP (#3455 ) * check optional inputs and remove more empty shape affected nodes * fix some minor issues * update code according to feedback	2020-04-10 11:13:38 -07:00
Tiago Koji Castro Shibata	d09d4a6b0d	Fix OS build (#3481 )	2020-04-09 21:46:01 -07:00
Pranav Prakash	95ade8f47b	Add check to prevent storing nullptr in value_info_ when proto has unused value info (#3461 ) * Add unit test for serialization of unused value_info * Do not add non-existent (nullptr) value_info_ when loading a model. Fixes #3430	2020-04-09 19:25:10 -07:00
Pranav Sharma	2ccedb7b4d	Improve error logging when a kernel cannot be found. (#3473 ) * Improve error logging when a kernel cannot be found. * Fix mac build	2020-04-09 19:24:46 -07:00
KeDengMS	739c9d4875	Always call cudaSetDevice at the beginning of session::Run (#3475 ) This is required for running multithreaded with multi-GPUs. Without it, when running in a work thread it would default to GPU 0, while CUDAExecutionProvider is assigned on other GPUs. That might cause CUDA crash when some CUDA resources is from GPU 0, while being used in GPU N>0.	2020-04-09 18:54:58 -07:00
Yufeng Li	a443b1b6b9	Revert "Use IMMA for int8 matmul to leverage Turing Tensor Core (#3413 )" (#3472 ) This reverts commit `4d71958ccf`. Revert the PR. Looks like it triggers a bug in nvcc and failes the GPU pipeline.	2020-04-09 15:59:52 -07:00
Scott McKay	40d80cde8f	Rework CDist (#3393 ) * Make CDist faster via Eigen squaredNorma and GEMM. * Add call to abs() as the GEMM output may differ slightly due to floating point accuracy and result in a negative distance which returns NaN if sqrt() is applied to it. * Update math::Gemm to use the type for alpha and beta instead of hardcoding to float. Matches the GemmEx definition. * Provide Eigen based replication of the GEMM call on x86 if T=double. * Make test model data deterministic. * Do the GEMM first so we can avoid potentially subtracting two numbers that are very close to each other.	2020-04-09 14:05:25 +10:00
Yulong Wang	718068f020	update C# API to optimize inference latency (#3171 ) * update C# API to optimize inference latency * rename PinnedOnnxValue to fixedBufferOnnxValue and fix build break * add more test cases * add conditions on string tensors for pre-allocated outputs * change to random inputs * fix word spell * resolve comments * resolve comments * remove FixedBufferOnnxValueTests.cs * fix trivial typos in doc	2020-04-08 11:57:40 -07:00
Pranav Sharma	cdac74b3c3	Use Eigen threadpool for ReduceSum and ReduceMean. (#3441 ) * Use Eigen threadpool for ReduceSum and ReduceMean. * Fix mac build	2020-04-08 11:50:22 -07:00
Ye Wang	f8fa1dde55	Add a list of Featurizers kernels (#3435 ) * wangye/pivot (#3432) * check in * work version * add ForecastingPivot kernel * fix mac os and linux build error * update FeaturizerLibrary Version * resolve comments * remove changes * Add Kernel for LagLeadOperator & RollingWindowFeaturizer (#3434) * update * update todo * resolve comments * relax eps for TruncatedSVD transformer * mute TruncatedSVD_transformer due to undeterministic test result * resolve comments * update * test * update * fix	2020-04-07 17:00:45 -07:00
Yufeng Li	4d71958ccf	Use IMMA for int8 matmul to leverage Turing Tensor Core (#3413 ) Use IMMA for int8 matmul to leverage Turing Tensor Core Format files under onnxruntime/core/providers/cude	2020-04-07 15:22:04 -07:00

1 2 3 4 5 ...

2092 commits