onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-06-26 03:00:54 +00:00

Author	SHA1	Message	Date
Wenbing Li	de6e3fb61d	Reduce IOS shared library size by symbol file. (#5171 )	2020-09-14 23:59:41 -07:00
Ryan Hill	8fa427b264	Ryanunderhill/backout 5014 (#5167 ) * Revert 5014	2020-09-14 22:48:00 -07:00
Wenbing Li	2a456d16c0	Enable onnxruntime iOS shared library build. (#5148 )	2020-09-14 10:32:39 -07:00
sfatimar	0c7e9fb52a	changes to ensure compilation issues in windows is fixed by disabling the level 3 warning 4267 (#5147 ) while a more permanent fix is found Co-authored-by: sfatimar <sahar.fatima@intel/com>	2020-09-14 08:59:41 -07:00
Scott McKay	323a1ba8a4	Add option to exclude support for loading ORT format models in full build. (#5129 ) * Add ability to exclude support for loading ORT format models. Disable support for ORT format models in packages	2020-09-12 12:21:30 +10:00
Guoyu Wang	698eccf15e	Add iOS build instruction (#5125 ) * ios build instruction * fix logger issue in onnx_model_info * Revert "fix logger issue in onnx_model_info" This reverts commit 72f2b88256ccf29c75fefbcd1daf6b4dcf7e0c61. * Address comments and fix small issue in iOS build	2020-09-11 16:10:36 -07:00
stevenlix	c794c88ae0	Solve name conflict in TensorRT engine caching (#5128 ) * fix hash conflict * Add verbose for engine deserialization and destroy old engine memory if new engine is generated * update parser * Update tensorrt_execution_provider.cc * use a better hash algorithm * Update tensorrt_execution_provider.cc	2020-09-11 09:12:56 -07:00
Wei-Sheng Chin	5618b9dddc	Use CMake built-in function to compare NCCL version (#5118 ) * Use CMake built-in function to compare version * Address comment	2020-09-10 15:59:47 -07:00
Tianlei Wu	c5d4ae0401	Add transformers tools to python package (#5090 ) * Add transformers to onnxruntime python package	2020-09-10 15:42:15 -07:00
Scott McKay	fae5915d76	CMake fixes/tweaks for minimal builds and MinSizeRel builds (#5112 ) * Fix places where MinSizeRel wasn't having relevant flags added in the same way as Release and RelWithDebInfo Enable LTO for minimal build. Cleanups onnx_minimal.cmake to remove some things handled when LTO is enabled in CMakeLists.txt * Only enable LTO for MSVC in a minimal build	2020-09-11 06:50:28 +10:00
Guoyu Wang	433061531e	Enable onnx_test_runner for ort format (#5100 ) * Enable onnx_test_runner using ort format, for ort minimal build only Co-authored-by: gwang0000 <62914304+gwang0000@users.noreply.github.com>	2020-09-10 17:15:19 +10:00
Wei-Sheng Chin	9ba56dcfed	Support Send and Recv for old NCCL versions (#5097 ) If NCCL version < 2.7, MPI is sued. Otherwise, we use NCCL Send and Recv.	2020-09-09 20:58:05 -07:00
RandySheriffH	5e10cde006	PipelinesForCuda11Cudnn8 (#4938 ) * cancel night build on pyop * setup win cuda11 pipeline * add debug build * test base gpu settings * setup pipelines to test cuda 10.2 and 11 * rename linux docker images * rename docker image tag and add clean up job * fix typo in cuda 11 config * set cuda11 env * update linux cuda 11 pipeline * reset docker image name * disable uninitialized warning from linux build * change the way to silence uninitialized warning * add flags to linux gpu pipeline * switch docker image for linux cuda 10.2 * switch linuc cuda 10.2 image * test cuda11 with devtool8 * try latest built images Co-authored-by: Randy Shuai <rashuai@microsoft.com>	2020-09-09 16:13:58 -07:00
Tiago Koji Castro Shibata	f7c3e4fa99	Store/containerized apps support (#4651 ) * Initial containerized/Store build * Remove unsupported APIs * Remove usage of STL ifstream * Revert CMake changes * Link to app runtime * WCOS/Store cmake * Update CMakeSettings.json * Fix winapi family support * Fix downlevel * Downlevel build * Remove downlevel workaround * pep8 compliance * Workaround WinRT headers bug https://github.com/microsoft/cppwinrt/issues/584 in older SDK * Always cross compile to avoid warnings as errors * PR feedback * More CI fixes * PR feedback * aiinfra build fix * Win8 store	2020-09-09 14:36:35 -07:00
Thiago Crepaldi	6594d6672f	Move onnxruntime.experiment to onnxruntime.training namespace (#5045 )	2020-09-09 09:46:06 -07:00
Wei-Sheng Chin	4ccca20def	Replace MPI Send and Recv with NCCL Send and Recv (#5054 ) * Prototype NCCL P2P * Clean code * Fix NCCL path and some minor bugs * Add path * Fix path * Try fix path * Add missed files * Address some comments * Clean code * Rename files * Add MPI path back and fix a path * Put MPI path under USE_NCCL flag * not to build Send and Recv when MPI is not installed	2020-09-09 09:39:56 -07:00
Scott McKay	80ada0291f	Improve the minimal build size on android and linux (#5086 ) Fix bug where linux build fails when python is enabled and rtti is disabled Update doco for new build settings	2020-09-09 21:38:34 +10:00
Guoyu Wang	5019b2f3b9	fix for x86 android build break (#5088 )	2020-09-09 21:38:22 +10:00
gwang-msft	a1a81470e3	Add minimal build binary size verification (arm64) to Android CI (#5087 ) * Add minimal build binary size verification (arm64) to Android CI * Add comments in the CI ymal	2020-09-09 19:06:20 +10:00
Cameron Maske	4553b2eecd	Expose DirectML provider to python (conflicts resolved from #3359 ) (#4630 )	2020-09-08 14:34:09 -07:00
gwang-msft	6081c1cfa2	Update ONNX to latest (#5069 ) * Update ONNX to latest * update onnxml.cs * revert changes in proto and cs files * add broken test * update broken tests * update broken tests Co-authored-by: gwang0000 <62914304+gwang0000@users.noreply.github.com>	2020-09-05 00:49:09 -07:00
Zhang Lei	ec88f14a7a	Implement QLinearMul in mlas (#4593 ) * Implement QLinearMul	2020-09-04 15:02:19 -07:00
Scott McKay	b5c2932ae8	Last major set of ORT format model changes (#5056 ) * Add minimal build option to build.py Group some of the build settings so binary size reduction options are all together Make some cmake variable naming more consistent Replace usage of std::hash with murmurhash3 for kernel. std::hash is implementation dependent so can't be used. Add initial doco and ONNX to ORT model conversion script Misc cleanups of minimal build breaks.	2020-09-05 07:59:01 +10:00
Du Li	6134994db9	Parallelizing elementwise kernels (#4577 ) * Parallelizing unary elementarywise ops. * Parallelizing binary elementwise ops. * Accommodating PR comments.	2020-09-04 14:45:43 -07:00
Ryan Hill	d792af776d	Remove Cuda dependency from TensorRT shared provider (#5014 )	2020-09-04 11:35:02 -07:00
Andrews548	bd215b79a2	ACL v20.02 (#4981 ) * Add ACL version 20.02 * fix loging typo * check depthwise operation based on group param * Generate ArmNN runtime inside class constructor * Update to the latest ONNX operation set * Update BUILD.md Co-authored-by: Andrei-Alexandru <andrei-alexandru.avram@nxp.com>	2020-09-03 20:44:27 -07:00
xkszltl	4b9b5b6146	Imported protoc cannot have compile options. (#5030 )	2020-09-03 15:20:00 -07:00
gwang-msft	fde7a2c848	Temporarily switch SafeInt to a fork for an option to disable exceptions (#5041 ) * Removed submodule * Add safeint fork	2020-09-02 23:21:39 -07:00
Changming Sun	d5d5e37e76	Build system enhancements (#5012 ) 1. Add a docker file for CUDA11 2. Support setting CUDA_ARCHITECTURES from command line.	2020-09-02 10:13:26 -07:00
gwang-msft	64237d999c	Add Cmake config for onnxruntime_NO_EXCEPTIONS (#4975 ) * additional noexception setting, added compile options * more no exception changes * addressed PR comments * Fix build issue when MSVC static library is used. * Clarify comment * add fatal message for onnxruntime_NO_EXCEPTIONS enabled without onnxruntime_MINIMAL_BUILD Co-authored-by: Scott McKay <skottmckay@gmail.com>	2020-09-01 10:17:50 -07:00
Yufeng Li	ffc2b25a3a	Quantization tool improvement (#4933 ) Improve quantization tools: 1. Support QAT 2. Make quantization tool to register Operators. 3. Make the API clear to use Co-authored-by: t-yguo <t-yguo@microsoft.com>	2020-09-01 09:07:46 -07:00
RandySheriffH	14b51d6502	CiPipeline@ReducedOpsBuild (#4917 ) * cancel night build on pyop * setup ci pipeline for build of reduced ops * add back c# test * remove debugging print * add testing model * add more arg in pipeline script * disable pipeline trigger temporarily * fix yaml format * fix yaml format * fix pipeline error * rid c# test * add ops for test cases * add Conv from domain com.microsoft.nchwc * remove --reduce_ops * fix typo * remove --build_java * add test case for excluded op * update doc with --skip_test * formatting code, renaming files and simplify yaml * remove debug build from yaml * remove surplus ops from included_ops.txt * add MinSizeRel build to yaml * rename test cases and models * exclude ir test from minimum build * restrict ir test to be only applied to reduced ops build	2020-08-31 21:21:18 -07:00
gwang-msft	7ca8388dc9	[ORT Mobile] file format schema and file I/O code (#4973 ) * ort mobile file format schema and [de]serializing code	2020-09-01 11:51:31 +10:00
edgchen1	b41e5e88fb	Add more node debug dump functionality. (#4921 ) Add ability to dump node inputs/outputs to files, filter nodes, configure behavior with environment variables.	2020-08-31 10:17:23 -07:00
Brian Martin	655ffd5d5b	make (de)tensorization events measure level events (#4958 ) * make tensorizer events measures * throttle the events and add a new one SoftwareBitmapToGPUTensorTelemetryEvent * factor out timing code into a class * typo * typo * move eventimer class into its own header file * add throttling to detensorization and remove variable timing * make detensorization events measures as well * add ConvertGPUTensorToSoftwareBitmapTelemetryEvent event * de-duplicate event names * fix comment * PR feedback	2020-08-28 16:49:32 -07:00
Dwayne Robinson	040c5fa3e0	Merge pull request #4925 from microsoft/user/dwayner/Iron ORT DirectML EP for Iron release, ONNX 1.5	2020-08-28 12:28:30 -07:00
Dwayne Robinson	79429c934b	Update	2020-08-27 21:01:19 -07:00
George Wu	e6b6736e48	update cuda capabilities (#4936 )	2020-08-27 16:38:18 -07:00
Scott McKay	438babd966	Fix some Android build issues when ORT_MINIMAL_BUILD is defined. (#4924 )	2020-08-27 07:37:51 +10:00
Scott McKay	1161c4d75f	Exclude MLAS AVX512 in minimal build (#4905 )	2020-08-26 08:03:37 +10:00
Bowen Bao	db6a821869	Enable example transformer test with dynamic size inputs (#4888 ) Co-authored-by: Thiago Crepaldi <thiago.crepaldi@microsoft.com>	2020-08-24 14:31:08 -07:00
Xiang Zhang	824fcbfd9d	support Normalized_0_1 and Normalized_1_1 (#4800 ) * support Normalized_0_1 and Normalized_1_1 * add tests for Normalized_1_1 * fix build error * fix imagetests failure * support denterization and add more tests * fix build * remove added models * disable gpu tests for CPU pipeline * refactor based on comments and moved two added models * merge normalizer and Denomalizer into NominalRangeConverter * add comments * little change	2020-08-24 13:13:50 -07:00
Changming Sun	26546f81fe	Remove the private ONNX protobuf definition file (#4878 )	2020-08-24 12:40:33 -07:00
Scott McKay	47c4144bd1	Add gcc/clang flags to make binary smaller (https://interrupt.memfault.com/blog/best-and-worst-gcc-clang-compiler-flags#-ffunction-sections--fdata-sections----gc-sections ) (#4895 ) Add gcc/clang flags to make binary smaller. ~10% reduction for Android baseline build (minimal build with no ops, no exceptions, no rtti).	2020-08-24 19:24:13 +10:00
Scott McKay	db7669b225	Reduce ONNX dependency in minimal build (#4890 ) * Next round of changes. Remove inclusion of ONNX schema header Exclude custom registry related things Move IsConstantInitializer from graph_utils to Graph as it's needed in a minimal build and graph_utils is excluded.	2020-08-23 07:02:13 +10:00
Thiago Crepaldi	dce2ce7a4f	Fix checkpoint API and copy samples into build dir (#4887 ) * Fix state_dict APIs * Copy samples to build folder and fix CI	2020-08-22 00:09:48 -07:00
gwang-msft	82bc21e35e	Namespace change on ort flatbuffers schema (#4886 ) * correct some errors in the flatbuffers schema, move flatbuffers submodule to cmake/external * update the ort flatbuffers schema to use less namespace * minor update Co-authored-by: gwang0000 <62914304+gwang0000@users.noreply.github.com>	2020-08-21 17:43:11 -07:00
Scott McKay	e00ad83f2b	Initial changes to disable code in a minimal build (#4872 ) * Initial set of changes to start disabling code in the minimal build. Breaking changes into multiple PRs so they're more easily reviewed. Focus on InferenceSession, Model and Graph here. SessionState will be next. Needs to be integrated with de/serialization code before being testable so changes are all off by default. Changes are limited to - #ifdef'ing out code - moving some things around so there are fewer #ifdef statements - moving definition of some one-line methods into the header so we don't need to #ifdef out in a .cc as well - exclude some things in the cmake setup * Update session state and a few other places. The core code builds if ORT_MINIMAL_BUILD is specified.	2020-08-22 07:14:53 +10:00
Yufeng Li	fb43aa0de0	implement per-channel for quantizelinear and dequantizelinear (#4759 ) * update onnx to latest master * implement per-channel for quantizelinear and dequantizelinear * refine the unit test * exclude sequence_insert tests * refine onnx cmake * add failure tests to broken_tests * move qdq common code to a seperate function * refine code	2020-08-21 12:08:50 -07:00
paradigm	c5342b5417	fixed compilation issue for Jetson Xavier (#4873 ) The string concatenation of the cuda flags makes compiling impossible due to the missing space (Error: " nvcc fatal: redefinition of keyword 'code' ") .	2020-08-21 06:27:38 +08:00

1 2 3 4 5 ...

548 commits