onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-07 17:15:29 +00:00

Author	SHA1	Message	Date
Yufeng Li	3cae28699b	work around of the build break in mac (#6069 ) * Fix the build break in macos release * revert android change	2020-12-07 20:39:36 -08:00
Ye Wang	fa06be2133	Support export >2G model when using optimizer.py only (#6014 ) * checkin * add warning if user specify same inut and output path	2020-12-07 17:18:49 -08:00
Edward Chen	b348538c8a	Update build docker image cache cleanup (#6048 ) The current image cache cleanup is not removing many images. Upon examining the cache container registry logs, it appears there are some infrequent pulls of old images which may be made by something other than CI builds (perhaps some automated scan of the registry). This change adds a minimum access count for images in the cache so that infrequently but periodically accessed images can be removed. The idea is that images used by CI builds that are worth caching will have a higher volume of accesses.	2020-12-07 13:07:19 -08:00
Tianlei Wu	51fbe87b9b	Update profiler tool to support gpt2 and longformer models (#6011 ) * support gpt2 and longformer in profiler tool * rename bert_profiler to profiler * Add --basic_optimization to allow user to use basic level of graph optimization * Add --kernel_time_only to filter kernel time and exclude fence time * Add --threshold to filter nodes that with low run time percentage.	2020-12-07 10:33:41 -08:00
Changming Sun	925879a8b0	Remove python 3.8 Windows GPU build from python packaging pipeline (#6054 ) Revert the last a few changes to get the pipeline back to a normal state.	2020-12-07 10:23:07 -08:00
George Wu	020efc9002	fix windows cuda support for python 3.8 + (#6046 ) * fix * noqa * fix. * remove unused import	2020-12-07 10:09:22 -08:00
ashbhandare	7cebf76a46	Improve checkpointing for Zero stage 1 (#5478 ) * Initial running changes * Checkpointing aggregation changes * compare with older version * initial cleanup * Add zero test, minor fix * Fix zero test, transform, formatting * Review comments * add more unit tests * review comments * Try fix CI * Add additional check on just aggregation code * Try fix ckpt gen * Add pregenerated ckpt for CI, enable zero test in e2e * Moving test to nightly, removing ckpt files * Add tests to dist GPU CI * Fix dist test * Review comments * Fix test	2020-12-07 09:16:01 -08:00
Hariharan Seshadri	a046ef133a	Update api_summary.rst (#6038 )	2020-12-04 17:59:56 -08:00
dependabot[bot]	d5e8c48e54	Bump highlight.js from 10.2.1 to 10.4.1 in /nodejs Bumps [highlight.js](https://github.com/highlightjs/highlight.js) from 10.2.1 to 10.4.1. - [Release notes](https://github.com/highlightjs/highlight.js/releases) - [Changelog](https://github.com/highlightjs/highlight.js/blob/master/CHANGES.md) - [Commits](https://github.com/highlightjs/highlight.js/compare/10.2.1...10.4.1) Signed-off-by: dependabot[bot] <support@github.com>	2020-12-04 16:45:07 -08:00
Edward Chen	d8139814fd	Clean up builds (#6015 ) Update training Python packaging build to use get_docker_image.py. Remove BUILD_EXTR_PAR docker build argument. Update get_docker_image.py to check again for the image in the cache after building and before pushing to reduce the chance of a redundant push.	2020-12-04 15:13:17 -08:00
Sheil Kumar	00f43a3a68	add missing iclosable interface (#6036 ) Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2020-12-04 13:21:03 -08:00
Jesse Benson	14f6eb14b1	Use __launch_bounds__ workaround, rather than limiting threads to 256 on AMD.	2020-12-03 13:06:34 -08:00
Jesse Benson	98ea7372d3	Re-enable Lamb unit tests for AMD	2020-12-03 13:06:34 -08:00
Jesse Benson	245d43615d	Fix AMD multi-tensor implementation.	2020-12-03 13:06:34 -08:00
Edward Chen	6572a4d306	Disable Python 3.9 for training Python packaging build. (#6012 ) Disable Python 3.9 for training Python packaging build. Python 3.9 is not supported by the PyTorch dependency.	2020-12-03 11:42:28 -08:00
Tianlei Wu	cdb91208a3	longformer onnx conversion and benchmark tools (#6007 ) * initial implementation of longformer tools for onnx conversion and benchmark * Support ONNX conversion for transformers 4.0 Add an option to optimize onnx model, and export fp16 model	2020-12-03 11:37:30 -08:00
Cecilia Liu	3b198c9614	Support Fusion for 1 and 2 Inputs Bert Models Converted From tf (#5993 ) Support fusion for 1 and 2 inputs Bert models converted from tf	2020-12-03 10:52:33 -08:00
Sherlock	c86a1e5c13	Fix Flaky orttraining tests (#5977 ) * Fix Flacky orttraining tests	2020-12-03 10:24:25 -08:00
Ryan Lai	2878e8eb2e	Fix nuget build error (#6009 )	2020-12-03 09:28:39 -08:00
baijumeswani	2b35f7d4f6	Fix build.py bug which prevents running some unit tests (#5990 ) Also ignore an exception occurred for execution providers which generate compiled nodes	2020-12-03 08:57:55 -08:00
Xavier Dupré	0acc3837ee	Make operator TreeEnsemble 5x faster for batches of size 100.000 (#5965 ) * improves processing time by 10 * extend coverage unit test coverage * better implementation for the multi regression case * better comment, keep parallelization by trees when not enough trees	2020-12-03 14:36:42 +01:00
Xavier Dupré	524b9fa899	Initialize a structure in operator ReduceSum (#6005 ) * fix initialisation issue	2020-12-03 12:41:26 +01:00
Zhang Lei	648c9c7789	Fix bugs for 1: Calibrator should check model inputs; 2: (#6017 ) quantize_inupts forgot to use parameter initializer_use_weight_qtyp.	2020-12-03 00:00:16 -08:00
Xavier Dupré	bdd06f6310	Fix PR #5550 reverted in #5911 (performance improvment for operator Transpose) (#5916 ) * Improves implementation of transpose operator * Fix issue mentioned in #5911 * adding unit test for function DoTransposeImpl	2020-12-03 00:38:18 +01:00
Yufeng Li	f2dcba7afe	Fuse MatMulIntegerToFloat only when scales are scalar (#6008 ) MatMulIntegerToFloat fusion fuses per-row and per-column MatMulInteger, which is not supported by the MatMulIntegerToFloat kernel now. Limit the fusion to per-matrix only before we supporting the per-channel fully.	2020-12-02 14:40:17 -08:00
Yufeng Li	4fdfbfd4b4	Add int32_t support for DeQuantizeLinear (#5994 ) * Add int32_t support for DeQuantizeLinear * DequantizeLinear with int32 should have not zero point	2020-12-02 12:35:41 -08:00
Olivia Jain	c727a28735	include gemm_helper.h (#5988 )	2020-12-02 11:28:28 -08:00
Xiang Zhang	b4e6cc59c7	skip the check for A channel (#5989 )	2020-12-02 11:23:54 -08:00
Guoyu Wang	cdacee6696	[NNAPI] Support non-1d tensor for C of Gemm op (#5982 ) * Add support for non-1d tensor for C of Gemm * check android api level before add squeeze * Minor update * Fix to accept c only in format of {1,1,...,1,n}	2020-12-02 00:22:38 -08:00
Guoyu Wang	6846c665ff	Use loose version in build.py (#5998 )	2020-12-01 20:57:44 -08:00
Ryan Lai	897310f6fb	Add suspend handler with new telemetry event for UWP scenarios (#5907 ) * Add suspend handler with new telemetry event * Fix build warning * Use cppwinrt from nuget * Restore nuget packages * add dependencies * Add nuget_helpers * Cleaned up * Clean up * Comment * Add dependencies for the rest * Remove unused line * Update activation string * PR comment to remove ALL	2020-12-01 20:26:18 -08:00
Edward Chen	6d642a3dba	Replace direct pulls from image cache container registry with get_docker_image.py, build definition clean up. (#5906 )	2020-12-01 19:10:23 -08:00
Chun-Wei Chen	c63e8cf7d7	Remove chronological starttime assertion in InferenceTest.cs because it is not determined (#5976 ) * remove chronological starttime assertion because it is not determined * use different vars	2020-12-01 15:58:12 -08:00
Adam Pocock	396074d2a8	Fixing OrtEnvironment.getEnvironment() so it doesn't print a warning if the environment already exists with a non-default name. (#5973 )	2020-12-01 15:21:06 -08:00
Scott McKay	30c7fffbab	Expand the documentation on using compiling EPs with a minimal build (#5893 ) * Expand the documentation on using compiling EPs with a minimal build to call out a 'simple' option that is easier to use. Provide more background on what happens to help users choose the best option for them. Tweak conversion script to be noisier about attempted usage of 'all' optimization level. Co-authored-by: manashgoswami <magoswam@microsoft.com>	2020-12-02 09:12:36 +10:00
Nat Kershaw (MSFT)	46277bfd09	Remove survey link from README (#5979 )	2020-12-01 11:36:00 -08:00
Ye Wang	5f516899bf	optimize a bert model converted using tf2onnx (#5492 ) * optimize a bert model converted using tf2onnx * add test data * update * remove comments * format * Revert "format" This reverts commit f8ae88cb564bce5caf4780e56561403f3ba3d524. * Revert "remove comments" This reverts commit 59d8a693581a731fd0291b70fe2c9cec6c4950fe. * add a squeeze node to convert a 3-d mask to 2-d * update * update * verify and add comments	2020-12-01 11:19:16 -08:00
Changming Sun	3323fb6082	Update docker files to put 'unattended-upgrades' in a right place(#5983 )	2020-12-01 10:45:03 -08:00
Wenbing Li	2ec211ea7b	Support the cross compiling for Apple Silicon (#5974 ) * support macos_arm64 cross compiling * update the build docs * update as commented. * Update BUILD.md	2020-12-01 10:00:06 -08:00
Alberto Magni	fb310fba0c	Avoid adding non-existent inputs to new Event nodes (#5915 ) During graph resolve non-existent nodes cause shape-inference failures.	2020-12-01 08:21:05 -08:00
Changming Sun	2d9dcc4576	Add python 3.9 support (#5874 ) 1. Add python 3.9 support(except Linux ARM) 2. Add Windows GPU python 3.8 to our packaging pipeline.	2020-11-30 12:02:48 -08:00
Wenbing Li	1852ade75d	Enable the xcode build for Apple Silicon (arm64 MacOS) (#5924 ) * fix the build script for macos/xcode * add the version check * correct the osx-arch configuration * typo	2020-11-30 11:22:08 -08:00
Jesse Benson	45966d878a	Code review feedback	2020-11-30 09:24:22 -08:00
Jesse Benson	86e30a2db6	Update CUDA IsAllFinite kernel	2020-11-30 09:24:22 -08:00
Jesse Benson	bd96f60888	Use CUDA's IsAllFinite kernel for ROCm	2020-11-30 09:24:22 -08:00
Moshe David	06ad516a5d	w (#5947 ) Co-authored-by: modav <modav@microsoft.com>	2020-11-30 10:35:44 +10:00
Moshe David	5f5d4a10bd	[OpenVINO]Fix memory leak upon exception throwing (#5954 ) 1. Make sure to free the output_shape vector even if Output names mismatch between OpenVINO and ONNX exception is thrown 2. Piggy back this PR to remove un-needed call to fstream close method Authored-by: modav <modav@microsoft.com>	2020-11-26 19:44:51 -08:00
Ivan Stojiljkovic	015fbb3dbb	Add support for Python 3.8+ on Windows when CUDA is enabled (#5956 )	2020-11-26 15:52:30 -08:00
Moshe David	e207589631	[OpenVINO]Fix memory leak in `IsDebugEnabled()` under Windows (#5948 ) * w * w Co-authored-by: modav <modav@microsoft.com>	2020-11-26 15:10:37 -08:00
Guoyu Wang	4afdced775	[NNAPI EP] Update squeeze ops (#5946 ) * [NNAPI EP] Update squeeze ops	2020-11-26 21:00:54 +10:00

1 2 3 4 5 ...

3893 commits