onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-06-29 03:30:52 +00:00

Author	SHA1	Message	Date
Tianlei Wu	7511021e0e	Save Gpt2 test data (#5132 ) (1) Save gpt2 test data during test generation. (2) Use torch fp32 model as baseline when onnx model is fp16. (3) Refine logic to compose onnx model path	2020-09-11 14:31:49 -07:00
RandySheriffH	120e3cda74	fix path (#5131 ) Co-authored-by: Randy Shuai <rashuai@microsoft.com>	2020-09-11 12:18:07 -07:00
Rayan-Krishnan	92a8c650ad	[Debuggability] Add feature to ORTTrainer Frontend (#5124 ) * add option, feature to orttrainer and test * address comments * minor fixes * further address comments * minor changes Co-authored-by: Rayan Krishnan <t-rakr@OrtDevTest2v100.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>	2020-09-11 12:16:07 -07:00
Ye Wang	89509f256a	Not fuse SkipLayerNorm when add has initializer input (#5123 )	2020-09-11 11:46:31 -07:00
Ashwini Khade	cd56ab197c	csharp build documentation (#5121 )	2020-09-11 11:46:10 -07:00
dependabot[bot]	15d431f39b	Bump node-fetch from 2.6.0 to 2.6.1 in /nodejs Bumps [node-fetch](https://github.com/bitinn/node-fetch) from 2.6.0 to 2.6.1. - [Release notes](https://github.com/bitinn/node-fetch/releases) - [Changelog](https://github.com/node-fetch/node-fetch/blob/master/docs/CHANGELOG.md) - [Commits](https://github.com/bitinn/node-fetch/compare/v2.6.0...v2.6.1) Signed-off-by: dependabot[bot] <support@github.com>	2020-09-11 11:45:37 -07:00
Tianlei Wu	ccfbc56388	Handle dummy mask in Attention operators (#5108 ) * Handle dummy mask with shape (1, 1) or (batch_size, 1).	2020-09-11 09:31:03 -07:00
stevenlix	c794c88ae0	Solve name conflict in TensorRT engine caching (#5128 ) * fix hash conflict * Add verbose for engine deserialization and destroy old engine memory if new engine is generated * update parser * Update tensorrt_execution_provider.cc * use a better hash algorithm * Update tensorrt_execution_provider.cc	2020-09-11 09:12:56 -07:00
Guoyu Wang	51f3d3af72	Enable onnxruntime_perf_test for ORT minimal build (#5126 ) * Enable onnxruntime_perf_test for ort minimal build * Add error message Co-authored-by: gwang0000 <62914304+gwang0000@users.noreply.github.com>	2020-09-11 01:58:11 -07:00
Scott McKay	59ee8ffb17	Remove SparseTensor support from minimal build. (#5114 ) * Remove SparseTensor support from minimal build. Currently the only valid usage of a SparseTensor is as an attribute of a Constant node. That would have been lifted to a dense tensor initializer when loading the onnx model, so would not exist when saving the ORT format model. Due to that there can be no SparseTensors in an ORT format model. Co-authored-by: gwang <wanggy@outlook.com>	2020-09-11 17:56:54 +10:00
Ye Wang	879751f3b7	Support Tensorflow benchmarking and onnx export in transformers tool (#5068 ) * init checkin for tf export and tf benchmark * small fix on argparse * refactor * review comments * review comments	2020-09-11 00:47:37 -07:00
Changming Sun	c5efb0085d	Update Linux GPU build pipelines to CUDA 10.2 (#5120 ) * Update Linux GPU build pipelines to CUDA 10.2	2020-09-10 17:40:51 -07:00
Ashwini Khade	a8557b3f0f	skip tests when model opset > released opset (#5096 ) * skip tests when model opset > released opset * remove multiple model load * nit fixes * plus some comments	2020-09-10 17:25:32 -07:00
Hariharan Seshadri	782ccff207	Add dll probe path so that the right DirectML.dll is loaded while running C# tests (#5104 )	2020-09-10 16:19:21 -07:00
Wei-Sheng Chin	5618b9dddc	Use CMake built-in function to compare NCCL version (#5118 ) * Use CMake built-in function to compare version * Address comment	2020-09-10 15:59:47 -07:00
Tianlei Wu	c5d4ae0401	Add transformers tools to python package (#5090 ) * Add transformers to onnxruntime python package	2020-09-10 15:42:15 -07:00
Moshe David	61051396e8	[TensorRT] Align naming convention and remove redundant code (#5094 )	2020-09-10 15:03:34 -07:00
Scott McKay	fae5915d76	CMake fixes/tweaks for minimal builds and MinSizeRel builds (#5112 ) * Fix places where MinSizeRel wasn't having relevant flags added in the same way as Release and RelWithDebInfo Enable LTO for minimal build. Cleanups onnx_minimal.cmake to remove some things handled when LTO is enabled in CMakeLists.txt * Only enable LTO for MSVC in a minimal build	2020-09-11 06:50:28 +10:00
Changming Sun	a5530358c9	Fix a path problem in Dockerfile.manylinux2014_cuda10_2 (#5106 )	2020-09-10 10:30:13 -07:00
Changming Sun	47554a0422	Disable some tests (#5103 )	2020-09-10 08:15:18 -07:00
Ryan Hill	3207de276c	Remove IDeviceAllocator class as it doesn't extend IAllocator in any way. (#5067 )	2020-09-10 00:46:35 -07:00
Guoyu Wang	5b6643cefb	Move ort flatbuffers header to use enum class instead of enum (#5105 ) * change fbs to scoped enum * modify ort code to use new fbs header Co-authored-by: gwang0000 <62914304+gwang0000@users.noreply.github.com>	2020-09-10 17:17:49 +10:00
Guoyu Wang	433061531e	Enable onnx_test_runner for ort format (#5100 ) * Enable onnx_test_runner using ort format, for ort minimal build only Co-authored-by: gwang0000 <62914304+gwang0000@users.noreply.github.com>	2020-09-10 17:15:19 +10:00
Tiago Koji Castro Shibata	62848c4de5	Add store builds to nuget packaging (#5040 ) * Nuget store packaging * Move DNNL workaround to EP * Fix warning as error * Disable store tests * Skip store tests * msbuild target * Cross compile protoc in Store * Disable DML in store * Move store builds to CPU queue * Copy uap10 to final nuget * Fix pip8 error * Remove extra dml copies * Fix argparse * pep8 * Forward IsStoreBuild * Apply is_store_build to duplicate generate_nuspec * runtimes * Refactor uap10 * Store .NET * uap * PR feedback	2020-09-09 21:38:14 -07:00
Wei-Sheng Chin	9ba56dcfed	Support Send and Recv for old NCCL versions (#5097 ) If NCCL version < 2.7, MPI is used. Otherwise, we use NCCL Send and Recv.	2020-09-09 20:58:05 -07:00
Changming Sun	09a6ce6bc0	Add re2 to memory leak checker whitelist (#5101 ) * Add re2 to memory leak checker whitelist	2020-09-09 20:08:37 -07:00
Wei-Sheng Chin	934f30fc38	Not to call NVTX when not available (#5095 ) * Not to call NVTX when not available * fix syntax * Fix a syntax error	2020-09-09 20:01:45 -07:00
Scott McKay	4b7aa16ed2	Fix a few more signed/unsigned warnings. (#5098 )	2020-09-10 10:39:56 +10:00
RandySheriffH	5e10cde006	PipelinesForCuda11Cudnn8 (#4938 ) * cancel night build on pyop * setup win cuda11 pipeline * add debug build * test base gpu settings * setup pipelines to test cuda 10.2 and 11 * rename linux docker images * rename docker image tag and add clean up job * fix typo in cuda 11 config * set cuda11 env * update linux cuda 11 pipeline * reset docker image name * disable uninitialized warning from linux build * change the way to silence uninitialized warning * add flags to linux gpu pipeline * switch docker image for linux cuda 10.2 * switch linuc cuda 10.2 image * test cuda11 with devtool8 * try latest built images Co-authored-by: Randy Shuai <rashuai@microsoft.com>	2020-09-09 16:13:58 -07:00
Xueyun Zhu	a90fae8c71	unify error handling in pipeline transformer (#5039 )	2020-09-09 14:52:04 -07:00
Hariharan Seshadri	61151af321	Fix typo in DML native method call from the C# API (#5083 )	2020-09-09 14:47:50 -07:00
Tiago Koji Castro Shibata	f7c3e4fa99	Store/containerized apps support (#4651 ) * Initial containerized/Store build * Remove unsupported APIs * Remove usage of STL ifstream * Revert CMake changes * Link to app runtime * WCOS/Store cmake * Update CMakeSettings.json * Fix winapi family support * Fix downlevel * Downlevel build * Remove downlevel workaround * pep8 compliance * Workaround WinRT headers bug https://github.com/microsoft/cppwinrt/issues/584 in older SDK * Always cross compile to avoid warnings as errors * PR feedback * More CI fixes * PR feedback * aiinfra build fix * Win8 store	2020-09-09 14:36:35 -07:00
Changming Sun	924ecb0623	Use manylinux2014 for Linux CPU build (#5091 )	2020-09-09 10:09:52 -07:00
Thiago Crepaldi	6594d6672f	Move onnxruntime.experiment to onnxruntime.training namespace (#5045 )	2020-09-09 09:46:06 -07:00
Wei-Sheng Chin	4ccca20def	Replace MPI Send and Recv with NCCL Send and Recv (#5054 ) * Prototype NCCL P2P * Clean code * Fix NCCL path and some minor bugs * Add path * Fix path * Try fix path * Add missed files * Address some comments * Clean code * Rename files * Add MPI path back and fix a path * Put MPI path under USE_NCCL flag * not to build Send and Recv when MPI is not installed	2020-09-09 09:39:56 -07:00
Scott McKay	dbf4e7019d	Add ability to generate configuration file with required operators. (#5089 ) * Add ability to generate configuration file with required operators.	2020-09-09 21:39:17 +10:00
Scott McKay	80ada0291f	Improve the minimal build size on android and linux (#5086 ) Fix bug where linux build fails when python is enabled and rtti is disabled Update doco for new build settings	2020-09-09 21:38:34 +10:00
Guoyu Wang	5019b2f3b9	fix for x86 android build break (#5088 )	2020-09-09 21:38:22 +10:00
gwang-msft	a1a81470e3	Add minimal build binary size verification (arm64) to Android CI (#5087 ) * Add minimal build binary size verification (arm64) to Android CI * Add comments in the CI yaml	2020-09-09 19:06:20 +10:00
dependabot[bot]	b8d63f31c3	Bump bl from 4.0.2 to 4.0.3 in /nodejs Bumps [bl](https://github.com/rvagg/bl) from 4.0.2 to 4.0.3. - [Release notes](https://github.com/rvagg/bl/releases) - [Commits](https://github.com/rvagg/bl/compare/v4.0.2...v4.0.3) Signed-off-by: dependabot[bot] <support@github.com>	2020-09-09 00:33:28 -07:00
Vincent Wang	07bf8b968e	Register BiasGelu and BiasDropout for CUDA only. (#5060 ) Co-authored-by: Vincent Wang <weicwang@microsoft.com>	2020-09-09 11:46:55 +08:00
Brian Martin	f41614a875	User/brianma/telemetry (#5084 ) * add runtime session id to (de)tensorization events * append start or stop to the event names and remove opcodes * add appsessionguid to telemetry events	2020-09-08 19:02:46 -07:00
Moshe David	1b46573bb7	Update BUILD.md (#5085 ) * Update BUILD.md - No need to format the word 'parameter' as code * Update BUILD.md	2020-09-08 18:20:46 -07:00
gwang-msft	a40d34386a	Add Linux CPU CI for ORT minimal build (#5074 ) * initial test version * update yml * minor updates * minor updates * Test minimal build * update with include ops for minimal build ut only * error case to see build failure * test no_exceptio * Remove error cases * address pr comments Co-authored-by: gwang0000 <62914304+gwang0000@users.noreply.github.com>	2020-09-08 17:09:33 -07:00
Ye Wang	b23e08b85c	Add AutoModel selector in transformers tool (#5051 ) * Add AutoModel selector in transformers tool * change distilbert--squad's pipeline to AutoModelForQuestionAnswering rule base selector and add model_class as parameter * Update huggingface_models.py * review comments	2020-09-08 15:06:04 -07:00
Cameron Maske	4553b2eecd	Expose DirectML provider to python (conflicts resolved from #3359 ) (#4630 )	2020-09-08 14:34:09 -07:00
Ye Wang	c239ff0750	Modify embedlayernorm fusion due to shape node merging (#4967 ) * modify embedlayernorm fusion due to shape integration * update * update comments * review comments * review comments * fix test	2020-09-08 14:17:29 -07:00
Sherlock	38453acae3	Further populate Stop Gradient list (#5021 ) * Add to Stop Gradient list * Improve Stop gradient	2020-09-08 12:49:09 -07:00
Hariharan Seshadri	e1ed0fde2b	Prevent registering both DML and CUDA EPs in an ML op test (#5078 )	2020-09-08 11:13:50 -07:00
Olivia Jain	8d91d4ff36	Build docker image instruction fix (CUDA) (#5070 )	2020-09-08 09:59:16 -07:00

11997 commits