onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-05-16 21:00:14 +00:00

Author	SHA1	Message	Date
baijumeswani	93bf7c4d52	Documentation for distributed CI tests pipeline (#6140 )	2021-01-04 10:09:39 -08:00
Suffian Khan	46e0e4e69f	Tune BiasGeluGradDx kernel in approximation mode to avoid tanh(...) on Rocm (#6239 ) * bias gelu grad use exp(...) instead * update cuda to rocm * missing semicolon * comment * remove dockerfile * missing factor of two	2021-01-02 08:54:16 -08:00
Changming Sun	1685167e46	Update manylinux docker image to the latest (#6242 )	2020-12-31 19:57:04 -08:00
Xavier Dupré	cd14c1af29	Support double for operator ArgMin (#6222 ) * Support double for operator ArgMin * add test specifically for double * add new test on pai-excluded-tests.txt	2020-12-31 11:25:46 +01:00
Xavier Dupré	84addcd2cf	Support double for operator ReduceMean, ReduceLogSumExp (#6217 ) * Support double for operators ReduceMean, ReduceLogSumExp	2020-12-31 11:24:54 +01:00
William Tambellini	39a988ce1c	Upgrade build.py to assert for python 3.6+ Upgrade build.py to assert for python 3.6+ as python 3.5 cannot build anymore todays master.	2020-12-30 20:17:09 -08:00
Changming Sun	3911105f09	Remove python 3.5	2020-12-30 20:16:45 -08:00
Changming Sun	1b23b28706	Remove MKLML/openblas/jemalloc build config (#6212 )	2020-12-30 17:18:19 -08:00
Michael Goin	bbb6b416f0	Fix ImportError in build.py (#6231 ) There is a possible ImportError where build.py can import the wrong 'util' package if there are others present in `sys.path` already	2020-12-30 14:22:55 -08:00
Jesse Benson	7ccdfed1a6	Remove most ROCm-specific element-wise code and reuse CUDA element-wise code.	2020-12-27 10:30:29 -08:00
sfatimar	7347996942	Openvino ep 2021.2 (#6196 ) * Enabling fasterrcnn variant and vehicle detector * changes for 2021_2 branch * yolov3_pytorch commit * fixed braces in basic_backend.cc * ci information added * faster rcnn variant and vehicle detector changes were made in 2021.1 and not in 2021.2 * some changes to support unit tests * disable some tests which are failing * fix myriad tests for vehicle detector * Did some cleanup cleaned up comments Disabled Add_Broadcast_0x1 and Add_Broadcast_1x0 tests on MYRIAD_FP16 backend due to a bug cleaned up capability_2021_2.cc file Removed extra conditions which were added for some validation in backend_utils Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * yolov3 pytorch workaround to ensure that the output names are matched * gemmoptest fixed on myriad * Fixed MYRIADX CPP Test Failures Expand,GatherND,Range,Round op's are only supported in model where op with float input data types are not supported and fixed Scatter and ScatterElements op's with negative axis are fixed Reshape op with 0 dim value are not supported and fixed Disabled InstanceNorm_2 test on MYRIADX Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> make changes to yolov3 pytorch * Fixed python unit tests Fixed failing python tests on vpu, GPU and CPU Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> Fixes POW op failures on GPU_FP16 Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Clean up capability_2021_2.cc Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Updated docx for MultiThreading option Added extra info on setting the num_of_threads option using the API and it's actual usage Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> fixed slice and removed extra prints * Disabled failing python tests Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Minor changes added in capabilty_2021_2 Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * made changes to slice to avoid failures * Disabling FP16 support for GPU_FP32 ->Inferencing an FP16 model on GPU_FP32 leads to accuracy mismatches. so, we would rather use GPU_FP16 to infer an FP16 model on GPU Device Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Updated docx for Inferencing a FP16 Model Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * fix for mask rcnn * Script for installing openvino from source * Updated with openvino 2021.2 online installation * code comment fixes fixed accuracy mismatch for div * Update OpenvinoEP-ExecutionProvider.md updated for 2021.2 branch * Update README.md updated dockerfile documentation * Update BUILD.md build.md update documentation * permissiong change of install_openvino.sh * made changes to align with microsoft onnxruntime changes * Updated with ov 2021.2.200 Co-authored-by: suryasidd <surya.siddharth.pemmaraju@intel.com> Co-authored-by: sfatimar <sahar.fatima@intel/com> Co-authored-by: MaajidKhan <n.maajidkhan@gmail.com> Co-authored-by: mohdansx <mohdx.ansari@intel.com>	2020-12-23 08:47:22 -08:00
Weixing Zhang	53307a5f2e	improve perf for softmax (#6128 ) * improve perf for both gathergrad and softmax * revert the change in gathergrad and will be done in another PR. * address comments from code review.	2020-12-21 14:15:54 -08:00
satyajandhyala	201d0dbb1a	Android coverage dashboard (#6163 ) * Write the report to a file. * Post code coverage to the Dashboard database.	2020-12-21 10:34:01 -08:00
Edward Chen	cd3a5acca0	Update get_docker_image.py to enable use without image cache container registry. (#6177 ) Update get_docker_image.py to enable use without image cache container registry.	2020-12-18 19:01:02 -08:00
Tixxx	32c67c2944	Deprecating Horovod and refactored Adasum computations (#5468 ) deprecated horovod submodule refactored adasum logic to be ort-native added tests for native kernel and e2e tests	2020-12-17 16:21:33 -08:00
Edward Chen	0fa04bdc50	Fix clean_docker_image_cache.py detection of image pushes. (#6151 ) Fix clean_docker_image_cache.py detection of image pushes. They were being ignored because the expected HTTP status code was wrong. For pushes, it's 201 instead of 200.	2020-12-16 17:25:22 -08:00
Changming Sun	344a2a8ee5	Revert "work around of the build break in mac (#6069 )" (#6150 ) This reverts commit `3cae28699b`.	2020-12-16 14:41:18 -08:00
Edward Chen	64709b1335	Deprecate Python global configuration functions [Part 1] (#5923 ) Enable options to be set via execution provider (EP)-specific options and log deprecation warning from current global configuration functions.	2020-12-15 11:32:43 -08:00
Jesse Benson	a8d549e181	Minor changes to AMD element-wise kernels to converge with CUDA element-wise kernels.	2020-12-15 08:46:36 -08:00
Edward Chen	9810b9e02b	Reduce amount of compiled CUDA device code (#6118 ) Move CudaKernel from cuda_common.h to a new separate header, cuda_kernel.h. Update include sites to use cuda_kernel.h instead if they need CudaKernel. Inclusions of cuda_common.h are now more lightweight. Make corresponding changes for ROCM execution provider code. Other minor cleanup.	2020-12-14 15:27:40 -08:00
Sheil Kumar	a6a23db130	Enable C# .NET5 for WinML (#6120 ) * build for .net5 * only reference cswinrt for .net5 * remove netstandard2.0 references * upgrade language version * net5 * remove extra comment closure * add targetframework * set target framework * remove net* * pep8 errors * make test project build with .net windows SDK projection * disable c# builds for non-x64 builds * fix pep8 errors * disable for store build * fix tests * remove cswinrt and sdk references from package * bump cswinrt down to 1.0.1 * fix bin path Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2020-12-14 15:05:15 -08:00
liqunfu	cde723a136	Liqun/move nightly pl to linux multi gpu v100 (#6024 ) * move e2e nightly pipeline to azure devop Co-authored-by: liqun <liqun@OrtTrainingDev4.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>	2020-12-14 12:43:41 -08:00
baijumeswani	dd2e5a1a05	state_dict and load_state_dict for ORTTrainer (#6095 ) * add functions state_dict and load_state_dict to ORTTrainer * unit tests for state_dict and load_state_dict for ORTTrainer	2020-12-14 11:55:52 -08:00
Vincent Wang	7ddeafdfcc	Add ReduceL2Grad and ClipGrad (#5970 ) * ReduceL2Grad and ClipGrad. * fix win build and amd ci pipeline * resolve comments. Co-authored-by: Vincent Wang <weicwang@AiFramework2080ti2.corp.microsoft.com>	2020-12-10 11:03:26 +08:00
Jesse Benson	cc47cfcb31	Update AMD transpose to match CUDA transpose.	2020-12-09 11:00:18 -08:00
Edward Chen	e357486707	Fix build definition template typo, add logging (#6065 ) Fix a typo in tools/ci_build/github/azure-pipelines/templates/get-docker-image-steps.yml. Add logging to tools/ci_build/get_docker_image.py for easier debugging.	2020-12-08 15:16:50 -08:00
baijumeswani	523d187193	save data to and load data from an hdf5 file for checkpointing (#5975 ) * save python dictionary to hdf5 representation and load an hdf5 file into a python dictionary * unit tests for saving data to and loading data from hdf5 file	2020-12-08 11:40:57 -08:00
satyajandhyala	f68a256140	Android code coverage (#6061 ) * Added Onnxruntime_GCOV_COVERAGE flag for Android. * Set CMAKE_SYSTEM_NAME explicityly for Android. * Added GCOV_PREFIX option to collect code coverage data. Added a new python script to generate code coverage info. Modified build pipeline to geneate Android code coverage info * Added build command line option --android_coverage * Added a comment describing the GCOV environment variables * Fixed PEP8 issues. * Added --android_coverage option to the build command. * Increased Android emulator memory from 3K to 8K. * Increased Android partition-size from 2GB to 4GB to overcome no-space-left-on-device error * Removed source_dir from command line args. * Use cwd absolute path to run tests. * Added commands to output the contents of /data/local/tmp on the emulator. * Added run_adb_shell function. * Format changes. * Removed keywd argument cwd. * Removed Android in the --build_dir path. * Removed commands added for debugging. * Removed exxtra new-lines. * Fix MacOs build pipeline failures by uninstalling openssl before running build script. * Revert "Fix MacOs build pipeline failures by uninstalling openssl before running build script." This reverts commit 90d0568fe533e9456c20d061a2d435c8fea48266. * Change dir to the build directory where the tar file is copied. * Changed the option from --android_coverage to --code_coverage * Moved steps to generate Android code coverage to run_nnap_code_coverage.sh * Require --android option if --code_coverage is specified. * No code coverage needed for onnx_test_runner. * Expect that the emulator is running when the script is executed. * Fixed the title in the buildpipeline step. * Fixed the formatting issue. * Added a command line argument, ORT_ROOT, to run_nnapi_code_coverage.sh script Co-authored-by: Satya Jandhyala <satyajandhyala@Satyas-Mac-mini.local>	2020-12-08 10:55:02 -08:00
Suffian Khan	e35211c0ff	Fix AMD GPU pipeline by adjusting reference /opt/rocm-3.9.0 => /opt/rocm (#6063 ) * use /opt/rocm instead * fix indent	2020-12-08 08:53:20 -08:00
Yufeng Li	3cae28699b	work around of the build break in mac (#6069 ) * Fix the build break in macos release * revert android change	2020-12-07 20:39:36 -08:00
Edward Chen	b348538c8a	Update build docker image cache cleanup (#6048 ) The current image cache cleanup is not removing many images. Upon examining the cache container registry logs, it appears there are some infrequent pulls of old images which may be made by something other than CI builds (perhaps some automated scan of the registry). This change adds a minimum access count for images in the cache so that infrequently but periodically accessed images can be removed. The idea is that images used by CI builds that are worth caching will have a higher volume of accesses.	2020-12-07 13:07:19 -08:00
Changming Sun	925879a8b0	Remove python 3.8 Windows GPU build from python packaging pipeline (#6054 ) Revert the last a few changes to get the pipeline back to a normal state.	2020-12-07 10:23:07 -08:00
Edward Chen	d8139814fd	Clean up builds (#6015 ) Update training Python packaging build to use get_docker_image.py. Remove BUILD_EXTR_PAR docker build argument. Update get_docker_image.py to check again for the image in the cache after building and before pushing to reduce the chance of a redundant push.	2020-12-04 15:13:17 -08:00
Jesse Benson	14f6eb14b1	Use __launch_bounds__ workaround, rather than limiting threads to 256 on AMD.	2020-12-03 13:06:34 -08:00
Jesse Benson	98ea7372d3	Re-enable Lamb unit tests for AMD	2020-12-03 13:06:34 -08:00
Jesse Benson	245d43615d	Fix AMD multi-tensor implementation.	2020-12-03 13:06:34 -08:00
Edward Chen	6572a4d306	Disable Python 3.9 for training Python packaging build. (#6012 ) Disable Python 3.9 for training Python packaging build. Python 3.9 is not supported by the PyTorch dependency.	2020-12-03 11:42:28 -08:00
baijumeswani	2b35f7d4f6	Fix build.py bug which prevents running some unit tests (#5990 ) Also ignore an exception occurred for execution providers which generate compiled nodes	2020-12-03 08:57:55 -08:00
Guoyu Wang	6846c665ff	Use loose version in build.py (#5998 )	2020-12-01 20:57:44 -08:00
Edward Chen	6d642a3dba	Replace direct pulls from image cache container registry with get_docker_image.py, build definition clean up. (#5906 )	2020-12-01 19:10:23 -08:00
Scott McKay	30c7fffbab	Expand the documentation on using compiling EPs with a minimal build (#5893 ) * Expand the documentation on using compiling EPs with a minimal build to call out a 'simple' option that is easier to use. Provide more background on what happens to help users choose the best option for them. Tweak conversion script to be noisier about attempted usage of 'all' optimization level. Co-authored-by: manashgoswami <magoswam@microsoft.com>	2020-12-02 09:12:36 +10:00
Wenbing Li	2ec211ea7b	Support the cross compiling for Apple Silicon (#5974 ) * support macos_arm64 cross compiling * update the build docs * update as commented. * Update BUILD.md	2020-12-01 10:00:06 -08:00
Changming Sun	2d9dcc4576	Add python 3.9 support (#5874 ) 1. Add python 3.9 support(except Linux ARM) 2. Add Windows GPU python 3.8 to our packaging pipeline.	2020-11-30 12:02:48 -08:00
Wenbing Li	1852ade75d	Enable the xcode build for Apple Silicon (arm64 MacOS) (#5924 ) * fix the build script for macos/xcode * add the version check * correct the osx-arch configuration * typo	2020-11-30 11:22:08 -08:00
Jesse Benson	bd96f60888	Use CUDA's IsAllFinite kernel for ROCm	2020-11-30 09:24:22 -08:00
Changming Sun	5fdd9f0fd2	Fix Python Linux GPU package name (#5943 ) Fix Python Linux GPU package name. I accidentally added "noopenmp" to it.	2020-11-25 17:46:11 -08:00
Edward Chen	7546d251e0	Expose parameters in clean build Docker image cache build. (#5941 ) Expose some parameters in the clean build Docker image cache build. In particular, whether to do a dry-run and the lifetime of unused cache images.	2020-11-25 14:15:54 -08:00
Tianlei Wu	31a6be3d67	Add Longformer Attention Cuda Op(#5932 ) Limitation: Global tokens must be at the beginning of sequence.	2020-11-25 13:52:10 -08:00
Suffian Khan	4d603e83d7	Remove attention_past.cu and attention_transpose.cu from hipify to fix AMD build (#5921 ) * remove attention_transpose.cu and attention_past.cu from hipify * remove print line * remove trailing ws for flake test * fix ws onre mor etime	2020-11-24 20:49:06 -05:00
Ashwini Khade	705d093167	Update onnx (#5720 ) * update onnx * update docker image for testing	2020-11-24 11:20:15 -08:00

1 2 3 4 5 ...

778 commits