onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-02 03:55:34 +00:00

Author	SHA1	Message	Date
Edward Chen	d761571afc	Deprecate Python global configuration functions [Part 2] (#6171 ) Update Python API to allow more flexibility for setting providers and provider options. The providers argument (InferenceSession/TrainingSession constructors, InferenceSession.set_providers()) now also accepts a tuple of (name, options dict). Fix get_available_providers() API (and the corresponding function in the C API) to return the providers in default priority order. Now it can be used as a starting point for the providers argument and maintain the default priority order. Convert some usages of the deprecated global configuration functions to use EP-specific options instead. Update some EP-specific option parsing to fail on unknown options. Other clean up.	2021-01-07 10:10:55 -08:00
Hariharan Seshadri	bbc9ed908a	Fix VS 2017 build break (#6276 )	2021-01-07 02:09:35 -08:00
Tang, Cheng	431604ef89	add bfloat16 to gathergrad type constrains (#6267 ) Co-authored-by: Cheng Tang <chenta@microsoft.com>	2021-01-06 15:04:14 -08:00
Hariharan Seshadri	2347de4a9e	Fix Linux/Mac error message on input type mismatch (#6256 )	2021-01-05 22:21:24 -08:00
Hariharan Seshadri	d42399e1b0	Allow querying a GraphProto's doc_string as part of ModelMetadata (#6248 )	2021-01-05 22:18:03 -08:00
pengwa	eea3806db1	model parallel refinement (#6244 ) * Megatron Transformation as a seperate step * remove useless header * clang formating * Re-Structure megatron transformer for subsquent changes * fix comments	2021-01-06 10:30:22 +08:00
liqunfu	addb4b8c2b	Liqun/speech model loop to scan (#6070 ) Provide a tool to convert Loop to Scan for Nuphar performance Fix Nuphar CI pipeline failures. Co-authored-by: liqun <liqun@OrtTrainingDev4.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>	2021-01-05 15:15:23 -08:00
Edward Chen	ce6161cf67	Add MakeStringLite which uses current locale, update some MakeString call sites to use it instead. (#6252 ) * Add MakeStringLite which uses current locale, update macros to use that to generate messages. * Convert calls to MakeStringLite().	2021-01-04 19:27:24 -08:00
ashbhandare	493bf931c5	Add the Concat Slice Elimination transform, fix constant_folding transform (#5457 ) * Add concat slice transform + test * Cosmetic improvements in concat slice transform * Remove unrelated file, fix comment, fix constant folding bug * Add test onnx graph * fix windows build * Review comments * review comment Co-authored-by: Aishwarya <aibhanda@OrtTrainingDev4.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>	2021-01-04 16:18:33 -08:00
Changming Sun	6fd9d34bb0	Remove a debug log in provider_test_utils.cc (#6200 )	2021-01-04 13:58:11 -08:00
baijumeswani	93bf7c4d52	Documentation for distributed CI tests pipeline (#6140 )	2021-01-04 10:09:39 -08:00
Olivia Jain	c8de3f355a	Refactor EP Perf Tool (#6202 ) * merge master, keep postprocess status commit * download float16.py everytime * using variables to reference eps * adding ACL EP to ep perf tool * accuracy with absolute tolerance configurable * add acl to dict + remove commented line	2021-01-04 08:50:41 -08:00
Suffian Khan	46e0e4e69f	Tune BiasGeluGradDx kernel in approximation mode to avoid tanh(...) on Rocm (#6239 ) * bias gelu grad use exp(...) instead * update cuda to rocm * missing semicolon * comment * remove dockerfile * missing factor of two	2021-01-02 08:54:16 -08:00
Hector Li	ffb4b62826	Fix allocator issue for TensorRT IOBinding (#6240 ) * Fix issue: https://github.com/microsoft/onnxruntime/issues/6094 Root cause: we didn't expose the OrtMemoryInfo for TRT, so it will cause issue if user want use IObinding for Tensorrt. Short term fix, add the OrtMemoryInfo for TRT. Long term should unify the allocator for CUDA and TRT	2020-12-31 20:15:43 -08:00
Changming Sun	1685167e46	Update manylinux docker image to the latest (#6242 )	2020-12-31 19:57:04 -08:00
Changming Sun	d5cb17c679	Update BUILD.md	2020-12-31 17:20:00 -08:00
Xavier Dupré	cd14c1af29	Support double for operator ArgMin (#6222 ) * Support double for operator ArgMin * add test specifically for double * add new test on pai-excluded-tests.txt	2020-12-31 11:25:46 +01:00
Xavier Dupré	84addcd2cf	Support double for operator ReduceMean, ReduceLogSumExp (#6217 ) * Support double for operators ReduceMean, ReduceLogSumExp	2020-12-31 11:24:54 +01:00
Xavier Dupré	5968a91ea6	Support double for operator Gemm + fix bug in gemm implementation for cuda, rocm when sizeof(type) != sizeof(float) (#6223 ) * Support double for operator Gemm * fix type size while copying data in gemm operator for GPU * fix type in gemm implementation for rocm	2020-12-31 11:24:16 +01:00
Xavier Dupré	70e2f96ef4	Support double for operator TopK + fix one bug in TopK implementation for GPU for double (#6220 ) * Support double for operator TopK * add static classes for topk/double * fix cast issue in topk	2020-12-31 11:23:19 +01:00
Tracy Sharpe	ecb2e119e4	MLAS: handle MlasGemm(M/N/K==0) cases (#6238 )	2020-12-30 23:25:10 -08:00
Hariharan Seshadri	4cc2ffef21	Support MLFloat16 type in Pow opset-12 CUDA kernel (#6233 )	2020-12-30 20:41:59 -08:00
William Tambellini	39a988ce1c	Upgrade build.py to assert for python 3.6+ Upgrade build.py to assert for python 3.6+ as python 3.5 cannot build anymore todays master.	2020-12-30 20:17:09 -08:00
Changming Sun	c15a858745	Update the readme file	2020-12-30 20:16:45 -08:00
Changming Sun	3911105f09	Remove python 3.5	2020-12-30 20:16:45 -08:00
Changming Sun	1b23b28706	Remove MKLML/openblas/jemalloc build config (#6212 )	2020-12-30 17:18:19 -08:00
Michael Giba	5c584b2636	Removed executor todo that looks dead. (#6234 )	2020-12-30 17:17:37 -08:00
Michael Goin	bbb6b416f0	Fix ImportError in build.py (#6231 ) There is a possible ImportError where build.py can import the wrong 'util' package if there are others present in `sys.path` already	2020-12-30 14:22:55 -08:00
Xavier Dupré	df7e2f3c1e	Support double for operators Relu, Tanh, Sigmoid (#6221 )	2020-12-29 18:25:23 +01:00
Xavier Dupré	111ac299cc	Support double for operators Where, LpNormalisation (#6034 )	2020-12-28 12:53:44 +01:00
Xavier Dupré	2d09db67b4	Support double for operators Log, Reciprocal, Sum (CPU) (#6032 ) * Support double for operators Log, Reciprocal, Sum * remove tesdt erf_double	2020-12-28 12:53:18 +01:00
Xavier Dupré	8a0f5c50ab	Minor change to improve performance for operator Pad. (#5537 ) * small improvment for pad	2020-12-28 12:52:41 +01:00
Jesse Benson	7ccdfed1a6	Remove most ROCm-specific element-wise code and reuse CUDA element-wise code.	2020-12-27 10:30:29 -08:00
Jesse Benson	52228a703c	Use TArray in AMD element-wise kernels, rather than manually copying memory to device.	2020-12-27 10:30:29 -08:00
Changming Sun	1fc7f92f25	Fix a memory leak in test_inference.cc (#6201 ) * Fix a memory leak in test_inference.cc	2020-12-25 13:02:21 -08:00
sfatimar	7347996942	Openvino ep 2021.2 (#6196 ) * Enabling fasterrcnn variant and vehicle detector * changes for 2021_2 branch * yolov3_pytorch commit * fixed braces in basic_backend.cc * ci information added * faster rcnn variant and vehicle detector changes were made in 2021.1 and not in 2021.2 * some changes to support unit tests * disable some tests which are failing * fix myriad tests for vehicle detector * Did some cleanup cleaned up comments Disabled Add_Broadcast_0x1 and Add_Broadcast_1x0 tests on MYRIAD_FP16 backend due to a bug cleaned up capability_2021_2.cc file Removed extra conditions which were added for some validation in backend_utils Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * yolov3 pytorch workaround to ensure that the output names are matched * gemmoptest fixed on myriad * Fixed MYRIADX CPP Test Failures Expand,GatherND,Range,Round op's are only supported in model where op with float input data types are not supported and fixed Scatter and ScatterElements op's with negative axis are fixed Reshape op with 0 dim value are not supported and fixed Disabled InstanceNorm_2 test on MYRIADX Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> make changes to yolov3 pytorch * Fixed python unit tests Fixed failing python tests on vpu, GPU and CPU Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> Fixes POW op failures on GPU_FP16 Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Clean up capability_2021_2.cc Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Updated docx for MultiThreading option Added extra info on setting the num_of_threads option using the API and it's actual usage Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> fixed slice and removed extra prints * Disabled failing python tests Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Minor changes added in capabilty_2021_2 Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * made changes to slice to avoid failures * Disabling FP16 support for GPU_FP32 ->Inferencing an FP16 model on GPU_FP32 leads to accuracy mismatches. so, we would rather use GPU_FP16 to infer an FP16 model on GPU Device Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Updated docx for Inferencing a FP16 Model Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * fix for mask rcnn * Script for installing openvino from source * Updated with openvino 2021.2 online installation * code comment fixes fixed accuracy mismatch for div * Update OpenvinoEP-ExecutionProvider.md updated for 2021.2 branch * Update README.md updated dockerfile documentation * Update BUILD.md build.md update documentation * permissiong change of install_openvino.sh * made changes to align with microsoft onnxruntime changes * Updated with ov 2021.2.200 Co-authored-by: suryasidd <surya.siddharth.pemmaraju@intel.com> Co-authored-by: sfatimar <sahar.fatima@intel/com> Co-authored-by: MaajidKhan <n.maajidkhan@gmail.com> Co-authored-by: mohdansx <mohdx.ansari@intel.com>	2020-12-23 08:47:22 -08:00
Ryan Lai	0494a0f95f	Add ability to skip GPU tests based on GPU adapter name (#6198 ) * Implement conversion from ortvalue to Itensor for string tensors and comparing sequence of maps of strings to floats * PR comments * Add ability to skip gpu tests according to adapter description * spacing * spacing * spacing	2020-12-22 15:20:23 -08:00
Jesse Benson	c562952750	Dockerfile to build onnxruntime with ROCm 4.0	2020-12-22 10:21:12 -08:00
Ryan Lai	21395f8e24	Implement comparing outputs that are sequence of maps of strings to floats (#6180 ) * Implement conversion from ortvalue to Itensor for string tensors and comparing sequence of maps of strings to floats * PR comments	2020-12-22 09:52:29 -08:00
baijumeswani	a8b482681a	Clean up checkpoint tests to use the new checkpoint functions (#6188 ) * add deprecation warning for old checkpoint functions * update all the distributed checkpoint tests to use new checkpoint functions	2020-12-22 09:15:57 -08:00
Hariharan Seshadri	04b3e0ef5e	Condition fix in Resize operator (#6193 )	2020-12-22 00:05:31 -08:00
Hariharan Seshadri	fc27074bae	Implement ScatterND for CUDA EP (#6184 )	2020-12-22 00:04:20 -08:00
Chi Lo	945fae8f56	Lochi/quantization tool for trt (#6103 ) * Initial implementation of generating calibration dynamic range table * Initialize validation support for Quantization * Initialize validation support for Quantization (cont.) * Improve validation support for Quantization * Improve validation support for Quantization * Rewrite/Refine for calibration and validation * Rewrite/Refine for calibration and validation (cont.) * Refine code * Refine code * Add data reader for BERT * Add flatbuffers to serialize calibration table * Refine code and add BERT evaluation * Refine the code * minor modification * Add preprocess/postprocess of vision team yolov3 and refine the code * Update annotation * Make bbox cooridates more accurate * Fix bug * Add support of batch processing * Batch processing for model zoo yolov3 * Add batch inference for evaluation * Refine the code * Add README * Add comments * Refine the code for PR * Remove batch support checking in data_reader and refine the code * Refine the code for PR * Refine the code for PR review Co-authored-by: Olivia Jain <oljain@microsoft.com>	2020-12-21 20:59:08 -08:00
Olivia Jain	234e94b4e1	Add Status.csv to EP Perf Tool (#6167 ) * merge master, keep postprocess status commit * download float16.py everytime * removing hardcoded values	2020-12-21 20:23:19 -08:00
Suffian Khan	67ac6ae4e0	Tune fast Gelu to use exp(x) instead of tanh(x) on Rocm platform (#6174 ) * tune fast gelu to use exp(x) instead of tanh(x) on rocm * update to use expression 2/(1+exp(-2x))-1 for stability	2020-12-21 16:25:21 -08:00
Weixing Zhang	53307a5f2e	improve perf for softmax (#6128 ) * improve perf for both gathergrad and softmax * revert the change in gathergrad and will be done in another PR. * address comments from code review.	2020-12-21 14:15:54 -08:00
S. Manohar Karlapalem	ea9cfa554a	Add usage details of unified MCR container image (#6182 ) Going forward, a single unifed docker image will be published in MCR. The hardware accelerator target choice will have to be made in the application using OpenVINO EP's runtime config options.	2020-12-21 11:48:54 -08:00
satyajandhyala	201d0dbb1a	Android coverage dashboard (#6163 ) * Write the report to a file. * Post code coverage to the Dashboard database.	2020-12-21 10:34:01 -08:00
jingyanwangms	f874260b9e	Backend APIs for checkpointing (#5803 ) * Add backend API GetOptimizerState and GetModelState * add GetPartitionInfoMap	2020-12-21 08:21:29 -08:00
Scott McKay	2da8060f34	Helper for compiling EP to generate deterministic unique ids for use in MetaDef names (#6156 ) * Create a helper for generating unique ids that can be used by an EP that creates compiled nodes and needs ids to be deterministic for a model when used in multiple sessions. Added to IExecutionProvider as this can potentially be used by all compiling EPs and is more robust than a simplistic counter (although EP implementer is free to choose either approach). * Restructure the helper so it can be called across the EP bridge. Add ability to call id generation helper from EP bridge - convert DNNL EP to use helper to validate Address issue where a new Model may be loaded into the same address as a previous one. - hash the bytes in the Graph instance (1728 bytes currently) to use as the key to the full hash for the model Add lock around id generation to ensure no issues if multiple sessions partitions graphs at exactly the same time. - Extremely unlikely but would be hard to debug and the locking cost is not an issue as it's only incurred during graph partitioning and not execution.	2020-12-21 12:17:58 +10:00

1 2 3 4 5 ...

4003 commits