onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-02 03:55:34 +00:00

Author	SHA1	Message	Date
ytaous	ac4d615553	Enable priority-based execution order as default to support inputs with symbolic/dynamic shape (#6892 ) * priority-based exec order * disable 1 failing test * fix UT * more comments Co-authored-by: Ethan Tao <ettao@OrtTrainingDev4.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>	2021-03-04 22:36:25 -08:00
Funtowicz Morgan	9126faa35b	Ability to fuse non-square (pruned) attention weights for BERT-like models (#6850 )	2021-03-04 17:08:08 -08:00
RandySheriffH	f986ffcb5f	move pipeline file and change relative path (#6882 ) Co-authored-by: Randy Shuai <rashuai@microsoft.com>	2021-03-04 15:31:42 -08:00
Reuben Zotz-Wilson	107c9672fd	No such file or directory with --use_external_data_form and int8 (#6867 ) Implemented following change to avoid the error when using both --use_external_data_form and --precision int8 with GPT2LMHeadModel, which results in line 161, in save_external_data; open(external_data_file_path, 'ab').close() FileNotFoundError: [Errno 2] No such file or directory: This may also be related to the identified bug #6047.	2021-03-04 15:14:23 -08:00
RandySheriffH	679718b12f	Configure session thread pool spinning preference (#6895 ) * add config allow_spinning * add config allow_spinning * set true as default * split configures for inter and intra ops Co-authored-by: Randy Shuai <rashuai@microsoft.com>	2021-03-04 14:54:58 -08:00
Tianlei Wu	8f1786d5d2	Save output tensors in bert_test_data tool (#6872 )	2021-03-04 13:09:05 -08:00
Tiago Koji Castro Shibata	fa8d1b44b8	Fix app packaging in UWP (#6804 ) * Change msbuild condition for UAP * update .netcore target as well * create nuget packages with _native path * validate path under _native directory for windowsai package * pep8 * add diagnostic error message * pep8 * use baseame * lib\uap10.0 * uap10 * build\\uap10.0 * Manually binplace winmds into appx when PackageReference is used. * always binplace winmd regardless of packagereference since c# should work with packages.config also * resolve all paths to full paths to avoid some reference warnings * move winmds out of lib folder to prevent automatic component registration Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2021-03-04 11:16:25 -08:00
Suffian Khan	7915b6709a	Revert Gather Grad optimization in PR 6381 targeted for Rocm (#6880 ) * revert gather_grad_impl.cu * put stream changes back in * restrict changes to commenting launch of optimized version	2021-03-04 10:21:49 -08:00
Scott McKay	54cdb6af71	Add check that the first 2 Loop subgraph inputs have an shape (could be explicit or inferred) as we need to know the rank the subgraph expects. Other inputs to the subgraph are more opaque so we can just pass them through. (#6891 )	2021-03-04 20:42:40 +10:00
Sherlock	b429edcd45	Merge pull request #6890 from microsoft/bmeswani/merge_master_onto_ortmodule Merge master onto ortmodule dev branch	2021-03-03 23:42:50 -08:00
Baiju Meswani	aa93f2e236	move SetOutputMLValue from op_kernel.h to op_kernel_context.h	2021-03-03 20:39:34 -08:00
Baiju Meswani	d5667554e6	Merge branch 'master' of github.com:microsoft/onnxruntime into bmeswani/merge_master_onto_ortmodule	2021-03-03 20:37:29 -08:00
RandySheriffH	d01006fc22	Move constants from heap to stack to avoid randomness on cudnn function (#6869 ) * move const from heap to stack * add namespace * add base prefix * define local type	2021-03-03 20:18:21 -08:00
Sherlock	749e6a08a6	Add more asserts for ORTModule forward's correctness (#6887 ) * Add more asserts on forward outputs * Found one more failing case Co-authored-by: Sherlock Huang <bahuang@OrtTrainingDev3.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>	2021-03-03 19:57:42 -08:00
baijumeswani	ed1883a97c	Workaround for HTTP Error 403: Forbidden for MNIST dataset (#6885 )	2021-03-03 18:59:48 -08:00
Guoyu Wang	fedb68429c	[NNAPI EP] Add per-tensor u8s8 support for Qlinear[Conv/MatMul] (#6818 ) * NNAPI Add per-tensor u8s8 support * Update some comments * Address CR comments * Address CR comments	2021-03-03 15:44:49 -08:00
Guoyu Wang	3c5d811e77	[CoreML EP] Add [Average/Max]Pool support (#6870 )	2021-03-03 14:32:39 -08:00
Hariharan Seshadri	9a9e741a8c	Support optional inputs/outputs in custom op development (#6727 )	2021-03-03 05:59:23 -08:00
jingyanwangms	f22f04a109	Add comment (#6860 ) Co-authored-by: Jingyan Wang <jingywa@OrtTrainingDev3.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>	2021-03-02 18:54:25 -08:00
Faith Xu	6285ee2398	Reroute quantization tool readme to /docs page (#6854 )	2021-03-02 13:49:42 -08:00
Ye Wang	9073f7a5c3	support opset13 in embednorm (#6866 )	2021-03-02 12:33:40 -08:00
Ryan Hill	0d0eb2c85c	Change OpKernel class to be shared with shared providers (#6837 ) In the previous shared providers there aren't many OpKernel classes, and the existing Provider_OpKernel wrapper was fine. With the opposibility of making Cuda a shared provider, having this need to be changed per OpKernel adds a lot of complexity. It was fairly straightforward to make OpKernel work with shared providers with minimal changes. In this change, the ONNX_OPERATOR_* macros can also be shared with the shared providers.	2021-03-02 00:53:48 -08:00
Hariharan Seshadri	38796ad451	Refine force CPU fallback logic in the CUDA EP (#6849 )	2021-03-01 19:59:07 -08:00
Vincent Wang	4238ce341a	Add External Outputs Flag for YieldOp (#6789 ) * add external outputs flag for YieldOp * use kPreExisting * add ut for mem_pattern * fix ut after merge from master	2021-03-02 11:38:18 +08:00
Edward Chen	66df167a73	Add support for op kernel type control required types, require int64 for some ops (#6832 ) Adds support for required types to the op kernel type control infrastructure. Required types are always enabled. Added int64 as a required type for certain ops.	2021-03-01 19:04:29 -08:00
Guoyu Wang	36a44d55ed	Only report Android Baseline binary size for master branch (#6844 ) * Only report binary size from master * update script * Correct the typo	2021-03-01 15:57:18 -08:00
Guoyu Wang	5cf6606964	[CoreML EP] Add Concat support (#6834 ) * [CoreML EP] Add concat support * Update comments	2021-03-01 13:35:44 -08:00
Sherlock	12edf22f11	Merge pull request #6838 from microsoft/mzs/ortmodule-api-sync-from-master-210226 Sync from master	2021-02-27 12:32:36 -08:00
Tianlei Wu	2d6e10ba00	Update Attention and QAttention to support pruned model (#6819 ) * update Attention operator spec to support pruned model * update Attention and QAttention cpu & cuda kernel * Fix invalid embed layer norm fusion test models.	2021-02-27 09:50:16 -08:00
Thiago Crepaldi	f71d93ea2b	Enable PyTorch Lightning basic test on CI (#6809 )	2021-02-27 09:35:42 -08:00
M. Zeeshan Siddiqui	ca48310d6d	Merge branch 'master' of https://github.com/microsoft/onnxruntime into mzs/ortmodule-api-sync-from-master-210226	2021-02-27 04:25:23 +00:00
M. Zeeshan Siddiqui	cb8d8464bc	Do not create compute stream when external CUDA allocator is used. (#6833 )	2021-02-26 20:13:02 -08:00
Ye Wang	b4b87ac7a0	update (#6827 )	2021-02-26 13:58:41 -08:00
Sergii Dymchenko	059ed1c241	Copy forward signature from PyTorch model. (#6777 )	2021-02-26 13:02:13 -08:00
baijumeswani	c1b0cf6d0b	Add pipeline to clear the cache for huggingface transormers models (#6813 )	2021-02-26 10:39:22 -08:00
satyajandhyala	355057cf9c	Added RequiredGrad attribute to YieldOp (#6657 ) * Added required_grad attribute to YieldOp * Chagened YieldOp attribute to hold the indices of the required gradient outputs from the count, and removed the code reordering the outputs. * Changed backward_output_grad_names to a map from backward output gradient name to the corresponding output index.	2021-02-26 10:38:38 -08:00
Pranav Prakash	d5175795d2	Improvements to quantizer: Removed unused qType field, add reshape op (#6179 ) * Handle case where bias_name is already quantized If bias is shared between multiple nodes and we've already quantized it, just return the quantized name from the map * Remove qType attribute from QuantizedValue and QuantizedInitializer These are unused (and were incorrectly set in the case of int8 quantization) * Add Reshape op to quantizer * Add test for Reshape quant	2021-02-26 10:21:37 -08:00
Surya Siddharth Pemmaraju	3426108739	Fixed issue in python cmake to update wheel package (#6384 ) * Fixed issue in python cmake to update wheel package * Fixes python cmake issue for OV EP Added post build step for libonnxruntime_providers_openvino that copies the updated libonnxruntime_providers_openvino.so file to /onnxruntime/capi directory every time this target is rebuilt. Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Removed post_build step from onnxruntime_python.cmake Now that we have added the post build step to copy onnxruntime_providers_openvino.so and providers_shared.so to /onnxruntime/capi directory in onnxruntime_providers.cmake file. so removing the duplication of the same from here. Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Fixed python cmake issue for OpenVINO-EP ->Fixed issue for both Linux and windows Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> Co-authored-by: MaajidKhan <n.maajidkhan@gmail.com>	2021-02-26 06:34:43 -08:00
Sherlock	8a450d523f	Check gradient correctness in the UTs (#6803 ) * Check gradient correctness in the UTs Co-authored-by: Sherlock Huang <bahuang@OrtTrainingDev3.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>	2021-02-25 13:31:07 -08:00
baijumeswani	fa8a9015bd	Mount hf model cache and use cache for loading hf models (#6810 )	2021-02-25 13:30:14 -08:00
Sergii Dymchenko	99ffffbe6a	Remove backward workaround from test. (#6811 )	2021-02-25 13:23:46 -08:00
Nick Feeney	46e026e900	Merged PR 5727374: Github to DmlDev Update 2/25/2021 Github to DmlDev Update 2/25/2021	2021-02-25 21:16:20 +00:00
Nick Feeney	a134b1f808	Merge remote-tracking branch 'upstream/master' into dmldev_temp	2021-02-25 12:02:34 -08:00
ashbhandare	b05403d877	Clear iobinding outputs (#6774 )	2021-02-25 11:50:43 -08:00
Chi Lo	9b3171e95c	Make keepdims to its default value when adding ReduceMin/ReduceMax for quantization calibration (#6788 ) * Make keepdims to its default value when adding ReduceMin/ReduceMax * Fix bug for adding ReduceMin/ReduceMax with keepdims=1	2021-02-25 09:47:59 -08:00
Olivia Jain	db05d53b94	Setup perf in docker and add features (#6582 ) * setup scripts to run in docker * percent threshold for accuracy * branch testing	2021-02-25 09:31:03 -08:00
stevenlix	d5f292ab73	fix issues caused by quantize/calibrate changes (#6802 )	2021-02-25 05:41:21 -08:00
Maajid khan	7465673e33	[OpenVINO-EP] Find package changes (#6801 ) * Find package changes to cmake * Removing unwanted code from cmake Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> Co-authored-by: suryasidd <surya.siddharth.pemmaraju@intel.com>	2021-02-25 05:12:57 -08:00
Suffian Khan	8a148e44fb	make ci pipeline also run batch and convergence test (#6798 )	2021-02-24 20:18:03 -08:00
Hariharan Seshadri	ab1713f5cc	Fix regression in constant folding optimizer (#6795 )	2021-02-24 19:10:14 -08:00

... 67 68 69 70 71 ...

7863 commits