onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-05-28 22:56:32 +00:00

Author	SHA1	Message	Date
Edward Chen	daa14b64e3	Merge remote-tracking branch 'origin/master' into edgchen1/merge_from_master	2020-04-21 03:31:32 +00:00
Prabhat	381fee47ab	Added support to build onnxruntime with ACL (#3586 ) * Added support to build onnxruntime with ACL * Added ACL build instructions	2020-04-20 13:35:28 +05:30
Dmitri Smirnov	38a18023c7	Fix some too popular warnings. (#3578 ) Some pointless and noisy warnings either fixed or disabled.	2020-04-18 17:05:05 -07:00
edgchen1	52cfc98ec4	Merge pull request #3557 from microsoft/havenka/master-merge Merge from master	2020-04-18 09:40:32 -07:00
edgchen1	de543c0308	Add SafeInt include to WinML targets (#3558 ) Fixing Windows builds on the ort_training branch in preparation for the merge to master. SafeInt (included via onnxruntime/core/common/safeint.h) was recently made a dependency of onnxruntime/core/framework/bfc_arena.h. That requires consumers of bfc_arena to compile with the SafeInt include directory.	2020-04-17 09:54:01 -07:00
Changming Sun	209b41a67d	Update dependencies graph	2020-04-17 07:38:45 -07:00
Sheil Kumar	2717c178cc	Fork the WinML APIs into the Microsoft namespace (#3503 ) * Migrate winml to Microsoft Namespace (packaging changes are pending) * add ns_prefix toggle * fix packaging * Users/sheilk/add missing raw header (#3484) * add dualapipartition * wrong variable for repo root Co-authored-by: Sheil Kumar <sheilk@microsoft.com> * remove existence check to force failures * extra paren * dualapipartition needs to be referenced from the source * add microsoft.ai.machinelearning.dll to the output dir * rename the idl file so that assembly info is correctly added into the winmd * fix namespaces * update namespaces * default to microsoft, and add namespace override as build argument * update cmakesetings.json as well * remove from cmakelists.txt Co-authored-by: Sheil Kumar <sheilk@microsoft.com> Co-authored-by: Changming Sun <chasun@microsoft.com>	2020-04-17 06:18:54 -07:00
edgchen1	0ec90f7019	Put safeint_interface include directory into onnxruntime_common interface include directories to simplify usage by other targets. (#3546 )	2020-04-16 10:34:32 -07:00
harshitha	80e0c64e2e	merged with master	2020-04-16 17:13:36 +00:00
Andrews548	93b957a55a	Acl improvements (#3463 ) * Fixed cornercases for acl ep gemm implementation by setting fully connected as the main layer * Introduced versioned build for the acl ep. ACL versions supported are 1902, 1905 and 1908 * Added convolution-activation fusion optimization for acl ep. We see improvements of 12% for mobilenetv2 and 4% for resnet50 Co-authored-by: Andrei-Alexandru <andrei-alexandru.avram@nxp.com>	2020-04-16 03:14:37 -07:00
Changming Sun	7c89f38a34	Fix static analysis warnings found by VC++ (#3530 ) 1. Fix static analysis warnings found by VC++ 2. Add a new pipeline for static analysis 3. Merge all the windows CI build into one single yaml file.(Easier to queue them all). 4. Make DNNL build faster by disabling building the tests and examples. 5. Enable custom op unitest.	2020-04-16 01:46:47 -07:00
edgchen1	2f16172e69	Address PR comments and clean up. (#3536 ) Address PR comments and clean up. - https://github.com/microsoft/onnxruntime/pull/3174#discussion_r408549886 - https://github.com/microsoft/onnxruntime/pull/3174#discussion_r408551151	2020-04-15 15:51:52 -07:00
Adam Pocock	e9dc8954ac	Adding support for ACL and DML to the Java API.	2020-04-14 20:35:03 -07:00
pengwa	2c7c45076b	MaxBatchSize E2E Test (#3454 ) * max batch size e2e test *update test data snapshot	2020-04-15 09:50:44 +08:00
Ori Levari	f564569a80	Adapter Model and Environment tests (#3469 ) Adapter Model and Environment tests winml test macro clean up and extension	2020-04-14 13:36:31 -07:00
Du Li	621b3ac03a	FFT contrib ops (#3381 ) * add custom op skeleton * Adding Rfft, Irfft kernels. * Fix a few errors: 1. make kernel stateless to avoid race condition 2. reclaim cufft plan * Adding MLFloat16 support * Adding fp16 support for fft ops. * Adding cufft plan cache. * adding a util func * adding copyright info. * Accommodating PR comments.	2020-04-14 10:12:04 -07:00
Ye Wang	66a79d2c9f	fix (#3512 )	2020-04-13 18:30:58 -07:00
Ye Wang	cbe30f3e19	update FeaturizersLibrary (#3511 )	2020-04-13 15:47:51 -07:00
Ye Wang	438353abcd	Fix TruncatedSVDFeaturizer's test failure and re-enable it's kernel test (#3458 ) * checkin * fix linux & macos build * fix test * revert the changes for a single-aimed PR * fix	2020-04-13 13:59:38 -07:00
M. Zeeshan Siddiqui	5d99f179b9	Merge pull request #3486 from microsoft/sedymche/merge_master_ort_training Merge from master into ort_training	2020-04-13 10:55:36 -07:00
Tiago Koji Castro Shibata	d09d4a6b0d	Fix OS build (#3481 )	2020-04-09 21:46:01 -07:00
Yufeng Li	a443b1b6b9	Revert "Use IMMA for int8 matmul to leverage Turing Tensor Core (#3413 )" (#3472 ) This reverts commit `4d71958ccf`. Revert the PR. Looks like it triggers a bug in nvcc and failes the GPU pipeline.	2020-04-09 15:59:52 -07:00
Sergii Dymchenko	8ea0e596ec	Fix onnxruntime_unittests.cmake after merge.	2020-04-09 13:14:15 -07:00
Sergii Dymchenko	6ba7c99e50	Merge branch 'master' into ort_training	2020-04-09 12:42:04 -07:00
ytaous	f73008483a	safeint for region bytes in bfc arena and code clean up (#3447 ) * PR comments * remove build issue workaround * SafeInt for region bytes * fix build * fix build Co-authored-by: Ethan Tao <ettao@microsoft.com>	2020-04-08 13:54:42 -07:00
Yufeng Li	4d71958ccf	Use IMMA for int8 matmul to leverage Turing Tensor Core (#3413 ) Use IMMA for int8 matmul to leverage Turing Tensor Core Format files under onnxruntime/core/providers/cude	2020-04-07 15:22:04 -07:00
Thiago Crepaldi	15e32b44fd	Merge pull request #3383 Merge from master into ort_training	2020-04-06 19:05:01 -07:00
Ye Wang	4ebad8805b	change (#3431 )	2020-04-06 11:30:21 -07:00
Changming Sun	0dcc6035b1	Disable strong inline (#3399 ) To bypass a MSVC bug. Without this change, people can't use VS2017 to build onnxruntime in Release or RelWithDebInfo mode.	2020-04-06 11:19:09 -07:00
Changming Sun	33006f48c0	Update onnx submodule to 1.7.0 release candidate (#3405 ) Update onnx submodule to 1.7.0 release candidate. This isn't a release tag, but it will be released soon, in 1-2 weeks.	2020-04-04 16:23:42 -07:00
Pranav Sharma	14f4c3e25f	Fix issue in construction of DummyArena. (#3416 )	2020-04-03 08:28:05 -07:00
Thiago Crepaldi	d89e5d91a6	Disable GradientCheckerTest tests for GPU/Debug build (#3407 )	2020-04-03 01:01:58 +00:00
Thiago Crepaldi	675035b1a8	Disable GradientCheckerTest tests for GPU/Debug build (#3407 )	2020-04-02 18:00:54 -07:00
Tiago Koji Castro Shibata	1671072b6b	[WIP] Port image tests from WAI (#3365 ) * Copy image tests from ADO * wip * Port tests to googletest * Add FNS-Candy license * Add missing collaterals * Remove brand images * Fix typos * Use PrepareModelSessionBinding in MnistImageTest * Fix typos	2020-04-01 15:38:44 -07:00
ytaous	2ce90cff4c	PR comments (#3374 ) * PR comments * PR comments * PR comments * PR comments * PR comments * PR comments * PR comments Co-authored-by: Ethan Tao <ettao@microsoft.com>	2020-04-01 10:36:16 -07:00
Changming Sun	accffded5d	Build options for enabling AVX/AVX2/AVX512 (#3373 ) 1. Add build options for enabling AVX/AVX2/AVX512 2. Update eigen to a newer version, because the current one doesn't work with VC and AVX512.	2020-04-01 10:07:22 -07:00
Dmitri Smirnov	a4fe60c4d3	OpSet 12 ops (#3341 ) Advance ONNX commit to pickup the latest ArgMax, ArgMin, ReduceMax/ReduceMin, MaxPool Declare new versions for CPU/CUDA. Implement infrastructure support for int8/uint8. Adust GatherOp test for a new error. Adjust Scan9.BadShape test. Add exclusions for index out of bounds checks. Rework result verification for SVDTransformer.	2020-03-31 15:31:06 -07:00
Thiago Crepaldi	759818f2c1	Merge remote-tracking branch 'origin/master' into thiagofc/ort_training_merge_from_master	2020-03-31 10:53:22 -07:00
stevenlix	2332a93db0	Update onnx-tensorrt parser (#3369 ) * sync onnx-tensorrt parser and update TensorRT doc * remove --msvc_toolset 14.16 in tensorrt ci pipeline	2020-03-30 20:31:59 -07:00
Jan Scholz	ce9acf0c21	iOS crosscompilation under linux (#3298 ) * added support for ios crosscompilation under linux * reverted cmake generator change * if --ios is added protoc can be compiled for host system * accidently reverted change to compile protoc for host system for ios if protoc exe is not set * wdata is now used * accidentally pasted CMAKE_OSX_ARCHITECTURES into CmakeLists.txt, also made bad merge on build.py previously * removed print * fixed typeo, deleted commented statements for earlier debugging * reverted accidental delete * added asmmacro.h for aarch64 asm now MlasSgemmKernel**** gets underscore added if needed no need anymote to differentiate between iOS arm64 and normal amr64 build onnxruntime.cmake: added check if iOSCross is set to properly set RPATH * removed 2 spaces * fix: logcial error fixed, now protoc gets compiled if not supplied with --path_to_protoc_exe * removed unecessarily added spaces * removed some more spaces	2020-03-30 19:39:17 -07:00
edgchen1	fb2f97a002	Address master merge PR comments (#3348 ) Address some comments from https://github.com/microsoft/onnxruntime/pull/3174. - https://github.com/microsoft/onnxruntime/pull/3174#discussion_r396855459 - https://github.com/microsoft/onnxruntime/pull/3174#discussion_r396855630 - https://github.com/microsoft/onnxruntime/pull/3174#discussion_r396857140 - https://github.com/microsoft/onnxruntime/pull/3174#discussion_r398094858 - https://github.com/microsoft/onnxruntime/pull/3174#issuecomment-599024924	2020-03-30 18:52:48 -07:00
Changming Sun	06fc9506fd	Thread pool changes (#3153 ) 1. Copy tensorflow's thread pool class to ORT, so that we can get a better implementation of thread pool based parallelfor 2. Copy Eigen's thread pool class to ORT 3. Support thread affinity 4. Remove RNN kernel’s private thread pool 5. Modify pool kernels to use the thread pool when openmp is disabled.	2020-03-30 12:18:40 -07:00
George Wu	355f39ddee	fix cuda build for cmake >= 3.17.0 (#3362 )	2020-03-30 00:38:57 -07:00
Tiago Koji Castro Shibata	c3cea486d0	Port ConcurrencyTests from TAEF (#3086 ) * Add ConcurrencyTests * Make ConcurrencyTests compatible with TAEF * Use test PCH in concurrency tests * Fix include header * Ignore unused code warnings on WINML_SKIP_TEST * Remove BOM * Remove conflicting namespace in older SDK * Refactor duplicate code * Fix unused DELAYLOAD * Fix unused DELAYLOAD * Remove link to internal bug * Address code style fixes * Add new concurrency tests	2020-03-27 17:39:22 -07:00
Sheil Kumar	b72fe13941	Update WinML Projection to accept sequence of tensors (#3287 ) * Enable sequence of tensor * add tests * small updates * There should only be 2 elements returned * CR feedback, and another 6->2 check update in the test. * missing semicolon... * Add explicit to constructor taking pointer paramter Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2020-03-23 15:55:20 -07:00
Tracy Sharpe	57468c651c	QLinearMatMul speed up (#3283 ) The equivalent of PR#3196 but done for QLinearMatMul. Use MLAS to do a u8u8=s32 GEMM and then requantize this intermediate buffer.	2020-03-21 15:37:25 -07:00
Pranav Sharma	84015d9491	Fix post merge test. This doesn't get triggered as part of gated PR checks. (#3277 )	2020-03-20 13:23:09 -07:00
Xueyun Zhu	ccc3535e72	resolve conflict	2020-03-20 20:20:35 +00:00
Ye Wang	c5149e89d9	Wangye/shortgraindropper (#3273 ) (#3274 ) * Featurizer Library update * update Featurizer Library * add short_grain_dropper_transformer * resolve comments * resolve comments * resolve comments	2020-03-20 11:48:31 -07:00
liqunfu	d521efd904	refactor frontend (#3235 ) * refactor frontend * remove training python files from inferencing build * update according to reviewer's comments * merge pybind_state.cc * refactor pybind_state.cc * code clean up * missed a forward declaration in ort_pybind_state.cc * passed pytest * move training_session.py into a subfolder per reviewer's comment * add copyright Co-authored-by: liqun <liqun@OrtTrainingDev4.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>	2020-03-19 20:59:41 -07:00

1 2 3 4 5 ...

380 commits