onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-06-25 02:50:42 +00:00

Author	SHA1	Message	Date
Tomasz Socha	f93be8af90	Update nGraph to version 0.26 (#1965 ) * Adjust ngraph cmake files to onnx 1.5.0 * Enable LSTM reverse direction mode in nGraph EP * Enable full support for the Split op in nGraph EP * Revert "Disable the unsigned input Shrink op tests for nGraph until the next update" This reverts commit 257b42a55bdd98f804d4846868542b8e3aeb4b4e. * Enable Gather and remove unused subgraph attribute * Remove the unused param from AppendClusterToSubGraph * Fix for the incorrect onnx opset version * Use the r0.26 release branch before the tag is created * Enable the quantizelinear and dequantizelinear for NGEP * Use the v0.26.0-rc.2 tag in ngraph.cmake * Add skip for modes others than default in Pad operator * Reenable negative axis tests for ngraph * Use temporary ngraph version * Use branch name instead of SHA for temporary ngraph branch * Use ngraph v0.26.0-rc.4 * Remove patch for missing symbol in MKLDNN * Use MKLDNN 1.0 in ngraph * Exclude the Pad op for opsets greater than 10 * Disable quantizelinear and dequantizelinear tests for ONNX 1.5.0 * Fix the onnx-headers related compilation errors * ONNX libs linking fix * Use a tag for ngraph and support more Pad modes * Use the v0.26.0 release tag for nGraph * Update ngraph to RC8 - bigobj flag for Windows builds * Fix the MKLDNN constexpr error on Windows	2019-10-14 10:37:48 -07:00
Pranav Sharma	91db840b6b	Introduce execution mode enum for clarity and extensibility; Change Python, C and C# APIs accordingly; Removed EnableSequentialExecution, DisableSequentialExecution in favor of the more general SetExecutionModeAPI. (#2098 ) * Introduce execution mode for clarity and extensibility; Change Python APIs accordingly; Replace DisableSequentialExecution API with EnableParallelExecution for clarity. * Fix cuda build * Modify the test slightly * Make C and C# APIs consistent with Python.	2019-10-14 09:48:19 -07:00
Changming Sun	5558b80774	clean up ubuntu docker scripts (#2103 )	2019-10-14 07:20:20 -07:00
KeDengMS	9363c14d23	Fix a bug in nuphar settings where existing options cannot be override (#2113 ) Fix a bug in nuphar settings where existing options cannot be override	2019-10-13 23:02:37 -07:00
Scott McKay	b829d55320	Fix invalid logic that ran past end of nodes and double increment. (#2117 )	2019-10-13 11:37:57 +10:00
Scott McKay	eb24617d2e	Add ability to get symbolic dimension info for graph inputs and outputs. (#2051 ) * Add ability to get symbolic dimension info for graph inputs and outputs. WIP to get initial feedback. * Fix linxu build error. Update C# API and add unit test * Clarify the two different ways Tensor shape and type info is created. One is from concrete values and one is from a type proto where symbolic dimensions may exist. Doing so allows a change to default to empty strings for the symbolic dimensions if not provided.	2019-10-12 15:46:28 -07:00
jignparm	20515363e5	Add int32 and int64 types for Equal(11) (#2112 )	2019-10-12 11:16:19 -07:00
Scott McKay	50faab308b	Remove 'Ort' prefix from OrtAddFreeDimensionOverride for consistency. (#2099 )	2019-10-12 06:11:29 +10:00
Ryan Hill	e8e33977da	Ryanunderhill/customop dll (#2002 ) * Add OrtApiBase * Add RegisterCustomOpsLibrary API	2019-10-11 11:12:51 -07:00
Changming Sun	c24d7a8a0a	Update eigen to the latest version (#1910 )	2019-10-11 10:44:19 -07:00
Scott McKay	bdfff800ea	Move access to intra-op threadpool into OpKernelContext. (#2091 )	2019-10-11 10:36:20 -07:00
Changming Sun	368bdfd936	Update README.md (#2070 ) Update the vcredist package link Note: Visual C++ 2015, 2017 and 2019 all share the same redistributable files.	2019-10-11 10:06:50 -07:00
Hector Li	3b335c933f	Fix issue that TRT not work for device other than device id 0 Fix issue that TRT not work for device other than device id 0. Because the allocation planner need to get the default allocator to allocate memory for graph input data. (#2094)	2019-10-11 09:22:25 -07:00
Scott McKay	ffb94fd170	Fix bug with delayed allocation of If and Scan outputs. (#2024 ) * Fix bug with delayed allocation of If and Scan outputs. If the subgraph is producing output on a non-CPU device the delayed allocation was incorrectly providing a CPU allocated tensor. Check for the required location, and update 'fetches' instead if a device copy is needed. The utils::ExecuteGraph logic will handle the device copy in this case.	2019-10-11 19:49:21 +10:00
Yang Chen	ca1b88c069	Added support to infer Pad11 (#2085 ) * Added support to infer Pad11 * address CR	2019-10-10 23:18:49 -07:00
shahasad	8803f6fff4	C# end to end test fix, and make end to end tests mandatory (#2079 )	2019-10-10 19:23:43 -07:00
Changming Sun	a314402097	Downgrade python gpu package to CUDA 10.0 (#2086 )	2019-10-10 18:31:24 -07:00
Dmitri Smirnov	af9dbb70f2	Introduce a separate check and conditional for AVX512BW build (#2083 ) Separate checks for AVX512f and AVX512BW Make AVX512BW cmake instructions nested within AVX512F support.	2019-10-10 16:14:00 -07:00
Hariharan Seshadri	2ba705ed99	Handle nodes with subgraphs in ORT function handling implementation (#2053 ) * Initial commit * Update * Update * Nits * More updates * to be reverted * Update * Update * More changes * Updates * Update Function * Nits * Fix build break * Comment	2019-10-10 16:07:42 -07:00
Pranav Sharma	2d4d0abd36	Add support for output seq(tensor) in python binding and test framework. Implement SequenceConstruct, SequenceEmpty, SequenceInsert and SequenceErase ops. (#2040 ) Add support for output seq(tensor) in python binding and test framework. Implement SequenceConstruct, SequenceEmpty, SequenceInsert and SequenceErase ops. (#2040)	2019-10-10 15:58:49 -07:00
Scott McKay	ddbc2086e4	Add support for opset 11 Clip in optimizers. (#2059 )	2019-10-10 10:47:29 -07:00
Yulong Wang	a41c71cbf2	check and fix CUDA kernel launch errors in several OPs (#2047 )	2019-10-10 23:47:00 +08:00
baowenlei	b4a98aab78	change MatMulInteger/MatMulInteger16 fallback option (#2064 ) * change MatMulInteger/MatMulInteger16 fallback option when no initializer exist * add AVX option * fix condition for old machines	2019-10-09 22:03:21 -07:00
Hariharan Seshadri	d186c19c45	Add opset-11 TopK CPU kernel (#1912 ) * initial commit * Update * Update top_k.cc * PR comments * Add more tests * Update * Add another test case * Update * Resolve conflicts * Update * Nits * Nits * Nits * Pick sorted content using 2 different approaches * Update to logic * PR comments * PR feedback * Update * Fix build * Fix build * Update	2019-10-09 19:09:30 -07:00
Colin Versteeg	8fda6593fe	Update failing tests (#2038 ) * Fix failing tests from when they were not enabled * split into two * fix failing test	2019-10-09 15:17:21 -07:00
Tracy Sharpe	57e0099425	MLAS: Implement U8S8 GEMV kernels (#2069 ) This implements an optimization for U8S8 MlasGemm when M=1, aka GEMV.	2019-10-09 11:54:16 -07:00
Changming Sun	eee9c55030	C++11 fix for memcpy_transformer_test.cc (#2061 )	2019-10-09 10:52:10 -07:00
Changming Sun	cefae93305	Add a test case for linearregressor (#1962 )	2019-10-09 10:17:08 -07:00
Changming Sun	ccaf692ff2	Run auditwheel for manylinux1 (#2063 )	2019-10-09 09:23:00 -07:00
Dmitri Smirnov	cae571c713	Add a test for AVX512 compilation before compiling 512 asm (#2055 )	2019-10-08 21:18:04 -07:00
Changming Sun	af8fe0f980	Replace make_unique in cuda_utils.cu (#2052 )	2019-10-08 18:32:08 -07:00
Scott McKay	db0dd09ded	Cleanup some aspects of the Initializer class used by optimizers (#2005 ) * Move check on data type outside of the Initializer class as it's specific to Conv processing. Use references for arguments that can't be null.	2019-10-09 10:37:44 +10:00
Changming Sun	a00ca56ae1	Remove gcc from manylinux1 docker image (#2048 )	2019-10-08 13:49:15 -07:00
baowenlei	b82de794d5	Weba/update nuphar doc (#2026 ) * update nuphar xp doc * address comments * address CR * update doc	2019-10-08 12:41:25 -07:00
RandySheriffH	f501b6e234	pack pyop in nightly build (#2018 ) * pack pyop in nightly build * correct logic * add comment * exclude debug build * add dependency * reset postbuild rule * remove dep	2019-10-08 12:02:45 -07:00
Changming Sun	e9bed8b23b	Change python packaging pipeline to use manylinux1 (#2035 ) 1. Change the python packaing pipeline to use manylinux1 2. Temporarily disable model test in the python pipeline.	2019-10-08 10:03:54 -07:00
Changming Sun	3053af812c	Fix a crash in deep_cpu_gru_op_test.cc (#2028 )	2019-10-08 10:03:07 -07:00
Zhang Lei	71b389322e	Implement cuda scatter op. (#1991 ) * Implement cuda scatter op. Disable Invalid Index of Scatter op only for cuda provider. * Fix some pipeline's type narrow warning as error.	2019-10-08 09:53:33 -07:00
Yang Chen	a94c9bd88d	throw exception using dmlc::LogMessageFatal (#2033 ) * throw exception using dmlc::LogMessageFatal On windows, ORT_THROW couldn't be caught if the exception was thrown from a jitted functions. Let's call dmlc::LogMessageFatal instead. * address CR use LOG(FATAL)	2019-10-08 09:31:35 -07:00
Yang Chen	19b0d0af87	Enabled bool input type for Equal for op_ver 11 (#2034 ) This change enabled bool type for Equal-11's inputs	2019-10-08 01:50:37 -07:00
Yang Chen	203c2f5b59	updated reduce_ops for op_ver 11 (#2039 ) After enabling op_ver 11 for reduce ops, we need to check axes to make sure it's not empty.	2019-10-08 01:05:05 -07:00
Pranav Sharma	f13b66768a	Fix build for gcc 4.8.5. (#2036 )	2019-10-08 00:50:53 -07:00
shahasad	b70fc34fae	Fix C# end to end tests in NuGet pipeline, failing for missing test data file	2019-10-07 20:14:20 -07:00
shahasad	b0feaef9de	Update the C# pretrained model test to include opset9 and 10 models (#2003 )	2019-10-07 19:14:34 -07:00
George Wu	0bd807f3b3	trt provider status return cleanup (#2032 ) * status and code cleanup. * revert change. seems like a bug in TRT causes intermittent failure return?	2019-10-07 18:34:48 -07:00
Tianlei Wu	b2c1937523	Add EmbedLayerNormalization and SkipLayerNormalization ops for bert optimization (#2012 ) * Add Embed Layer Normalization and Skip Layer Normalization ops for bert optimization. * add float16 test for skiplayernorm * Add test for EmbedLayerNormalization op * fix cpu build error * fix build warning * update HasCudaEnvironment function * handle cuda error	2019-10-07 17:29:43 -07:00
Changming Sun	8f7657fa32	Ignore some gcc warnings (#1996 )	2019-10-07 16:32:34 -07:00
Pranav Sharma	ea60469af5	Support seq(tensor), implement 2 sequence ops that use the new type. (#1983 ) * Mention OrtCreateSessionFromArray in C API doc * fix seq of tensors * changes on 9/30 * All tests passing * Add SequenceAt op * Fix shared_lib non_tensor_types test * Address some PR comments * Address PR comments * Add support in python bindings to accept seq(tensor) * Change data type from vector<Tensor> to TensorSeq * Change data type from vector<Tensor> to TensorSeq * Added some documentation * Added missing test model * Fix Linux build * Fix Mac build * Fix Mac build	2019-10-07 15:35:09 -07:00
Hector Li	00e24ae4fe	refactor Cuda Ops Sum, Max, Min, remove dup code (#1946 ) refactor Cuda Ops Sum, Max, Min, remove dup code	2019-10-07 13:17:49 -07:00
Tianlei Wu	7b39f5090c	Add Attention op for multi-head self attention in BERT (#1984 ) * Add Attention op for multi head self attention in BERT * Add test cases * Move op from kOnnxDomain to kMSDomain. Limit test to run by CUDA provider only. * fix test * Add float16 test * fix cpu build error * handle cuda error * get last cuda error when failed	2019-10-07 12:22:54 -07:00

... 211 212 213 214 215 ...

11997 commits