onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-16 18:31:27 +00:00

Author	SHA1	Message	Date
Sheil Kumar	efa393e596	WinML should dynamically link against onnxruntime.dll and only system32 for inbox builds (#4615) * Dynamically link onnxruntime.dll * fixes * add preceding backslash to onnxruntime.dll for inbox builds * remove /d * loadlibrary -> loadlibraryex * use loadlibrary system32 option Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2020-07-27 09:56:49 -07:00
Sheil Kumar	222fd08f20	DirectML.dll is loaded via LoadLibraryW but should use LoadLibraryExA (#4616 ) * create dml device via loadlibraryexa * add build_INBOX flag to adapter Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2020-07-25 21:29:46 -07:00
Alisha Sonawalla	1e67fff93c	Add GetStringTensorElement, GetStringTensorElementLength and FillStringTensorElement API (#4374 ) Add new string tensor APIs and unit tests	2020-07-24 21:35:46 -07:00
Sheil Kumar	c361a59cff	disable gpu timeouts in winml (#4604 ) Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2020-07-24 13:44:44 -07:00
Tiago Koji Castro Shibata	48d969f4bf	Constexpr CreateFeatureValueFromInspectable (#4460 )	2020-07-24 13:08:14 -07:00
Hariharan Seshadri	9510f26744	[Python] Support more APIs for the SessionOptions class (#4596 )	2020-07-24 12:56:54 -07:00
ytaous	9888c9e944	SplitTraining op to support split as input (#4597 ) * SplitTraining op to support split as input * on comments and minor refactor Co-authored-by: Ethan Tao <ettao@microsoft.com>	2020-07-24 12:49:19 -07:00
Sherlock	aa328c2c20	Update GatherGrad to accumulate in fp32 (#4601)	2020-07-24 10:54:31 -07:00
Yufeng Li	9c75c29403	refine opset version getter (#4602 )	2020-07-24 10:34:56 -07:00
ashbhandare	5189530b7b	Create the ConcatTraining op (#4595 ) * Working changes for ConcatTraining op * Refactor to move changes to orttraining * Fix segfault * Support -ve axis for shape inferencing * fix build Co-authored-by: Ethan Tao <ettao@microsoft.com>	2020-07-24 10:03:58 -07:00
ytaous	d5b98a13c2	Move ReduceSumTraining op under orttraining (#4588 ) * move training op under orttraining * code clean up * fix build * on comments Co-authored-by: Ethan Tao <ettao@microsoft.com>	2020-07-23 21:36:52 -07:00
Ryan Lai	f35ce4677f	add double and uint8_t datatypes (#4603 ) Co-authored-by: Ryan Lai <ryalai96@gamil.com>	2020-07-23 21:32:29 -07:00
Tianlei Wu	ace41b8064	Force return_tuple=True to handle transformers breaking change of output format. (#4599 )	2020-07-23 11:35:41 -07:00
Tianlei Wu	ea87c0d028	Update Transformer Optimizer documents (#4591 ) (1) Add bert-base-cased and gpt2 benchmark results on V100 (2) Update list of supported models. (3) Add comments to gpt2_helper. (4) Use IO Binding in test parity by default.	2020-07-23 08:38:39 -07:00
gwang-msft	03ebe33850	[Android NNAPI EP] Add support for LRN/Grouped Conv ops, fix issues where NNAPI will fall back to CPU (#4582) * add LRN/Grouped Conv Support, minor changes * better pool ops sdk version requirement * reduce string comparison for gemm/matmul ops * fix nnapi fall back to cpu for softmax * addressed review comments, correct a small error in the code	2020-07-23 00:05:39 -07:00
RRRachelllll555	c5df918744	improve calibration tool (#4561 ) * improve calibration tool * modify calibration interface name * modify calibration interface name * refine calibrate and calibrate_user * refine and add type info * refine and add type info * add e2e user example file * remove unnecessary files * remote test images no longer needed * update readme document Co-authored-by: t-yguo <t-yguo@microsoft.com>	2020-07-22 21:31:49 -07:00
Dmitri Smirnov	1b253d18ef	Updated tags for MCR images (#4574 ) * Updated pushed CPU and CUDA tags. * Add tensorRT, fix typo. * Add OpenVINO tags. Remove 2020.2 installation instructions for VAD-M. * Revert instruction changes for V-ADM and update 2020.2 to 2020.3	2020-07-22 19:39:42 -07:00
Xueyun Zhu	e2acb165e9	Add exception check in training_runner when worker runs into error, and misc check on nccl and mpi calls (#4380 ) * error check * fix build warning treated as error	2020-07-22 14:32:19 -07:00
gwang-msft	c2ec3b734b	[Android NNAPI EP] Remove dependency on external JD/DNNLibrary (#4576 ) * remove dependency of external jd-dnnlibrary * remove extra variables not used any more * update /cgmanifest.json	2020-07-22 14:08:12 -07:00
Pranav Sharma	f0edd074fb	Optimize CreateEnv by not creating the logging manager instance if env instance has already been created. (#4583 ) * Optimize CreateEnv by not creating the logging manager instance if env instance has already been created. * Move creation of logging mgr inside if block	2020-07-22 12:29:55 -07:00
Sheil Kumar	fa6d035090	Create WindowsAI zip files automatically as part of the pipeline (#4584 ) * copy rename nupkg to zip as part of build task * update both symbols and regular package Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2020-07-22 10:53:47 -07:00
George Wu	6b53a74867	replace invalid sample (#4567 )	2020-07-21 23:51:17 -07:00
S. Manohar Karlapalem	e00e2e84d6	[OpenVINO] Update MCR dockerfile with Ubuntu 18.04 package names (#4581 ) * Update Ubuntu version codename for install components * Specify OpenVINO dockerfile's name in documentation Also install NCS udev rules.	2020-07-21 23:48:24 -07:00
Andrews548	f20afc4991	Update ACL/ArmNN EP (#4571 ) * Add BN to ArmNN EP * Add Concat to ArmNN EP * ACL logging improvements * ArmNN logging improvements * Fallback to CPU for 9x9 convolution in ACL EP * Fallback to CPU for 9x9 convolution in ArmNN EP * Enable python support for ACL and ArmNN EPs when compiled with BSP toolchain * Removed the matmul operator * Fix conv infer shape function * Fix provider_names list for armnn Co-authored-by: Andrei-Alexandru <andrei-alexandru.avram@nxp.com>	2020-07-21 22:25:58 -07:00
Yufeng Li	822b23ff2f	Add support of EmbeddingLayerNorm (#4562 )	2020-07-21 21:43:02 -07:00
Hariharan Seshadri	bf78e4d18b	Handle cases which produce an empty output in the MatMul op family (#4573 )	2020-07-21 17:57:47 -07:00
Ryan Lai	0b4659c3fe	Populates TensorProto with tensor from protobuf file (#4535 ) * Expose load tensor proto from protobuf file function * Add comment * Remove use of fstream and use parsefromzerocopystream * Close file descriptor after finish parsing it * Close input stream too * Set Close on delete only, no need to close file descriptor * Revert "Set Close on delete only, no need to close file descriptor" This reverts commit `5ba6e3c31b`. * Revert "Close input stream too" This reverts commit `4564776733`. * Revert "Close file descriptor after finish parsing it" This reverts commit `846e550c4f`. * Revert "Remove use of fstream and use parsefromzerocopystream" This reverts commit `25a3117183`.	2020-07-21 17:50:15 -07:00
Changming Sun	603f2d1138	Exclude a few OpenVino flaky tests (#4572 )	2020-07-21 12:14:34 -07:00
Chi Lo	affdeb53c2	Add Python API for specifying device options. (#4205 ) * Add python API for specifying CUDA device id * Modification for providing session based python api for specifying device id * When include header file pybind11/stl.h, conversion between c++ containers and Python list, vector and dict data structure are automatically enabled. https://pybind11.readthedocs.io/en/stable/advanced/cast/stl.html# Therefore, refactor the code for better leverage this advantage. * Make struct CudaDeviceOptions as default cuda device options * Implement sess.set_providers(list_of_providers, list_of_provider_option_dicts) But still stay consistent with existing sess.set_providers(list_of_provider) * Add cuda provider option default setting * Add support for setting cuda cuda_mem_limit and arena_extend_strategy. Also resolved the merge conflict on session.py * Use python ctypes to call cuda library to help python unittest * Refine the code with reviewer's suggestions * Add the capability of getting execution provider's configuration - Once we introduced the capability to set execution provider's configuration, it makes sense to add capability of getting ep's configuration. * Modify the code with reviewer's suggestions. * Using stoull() and stoul() depends on 32/64-bits architecture. * Rewrite the testcases for testing setting CUDA device id Note: We need to make sure every ORT process be run on one CUDA device at a time. * Make sure old session object is destroyed by python gc before new session object is being created * Move testcases to original onnxruntime_test_python.py * Fix bugs to pass CI build * Make it pass CI build (cont.) * Make it pass CI build (cont.)	2020-07-21 07:28:13 -07:00
M. Zeeshan Siddiqui	e11629d9e4	Revert "Deprecate TrainableDropout (#4501 )" (#4564 ) This reverts commit `6eb5549cb9`. # Conflicts: # orttraining/orttraining/core/graph/training_op_defs.cc	2020-07-21 06:18:42 -07:00
ashbhandare	ab4be8355f	Add the BroadcastGradientArgs op (#4511) * Adding CPU implementation of BroadcastGradientArgs op * Modify to take shape as input instead of tensor * Cleanup * Correct schema * Corrected kernel, added tests, addressed review comments. * Added exception, test for invalid broadcast, addressed review comments. * Fix mac build error.	2020-07-20 23:59:10 -07:00
ytaous	0008e92b4e	Internal ReduceSum op that accepts axes as input (#4522 ) * Initial change, to add ReduceSumTraining cpu op * cpu support * cuda support + more UTs * on comments + UT * no op support for {} axes with new attr - noop_with_empty_axes * on comments * fix build * on comments Co-authored-by: aishwarya bhandare <aibhanda@microsoft.com> Co-authored-by: Ethan Tao <ettao@microsoft.com>	2020-07-20 21:05:00 -07:00
Yufeng Li	e92e0860c8	BERT quantization notebook (#4543 ) * BERT quantization notebook * update notebooks * more benchmark * add version info	2020-07-20 18:23:37 -07:00
gwang-msft	ca0dd8246c	NNAPI EP, add quantization support (#4530) * nnapi quantization work, 1. add SymmPerChannelQuantParams to operand types * add ways for operand_type to change dimension * remove per-channel quantization * Squashed commit of the following: commit 4857c3a732298c2f96efb61800b7621251d61c1b Author: gwang <wanggy@outlook.com> Date: Tue Jul 14 23:02:28 2020 -0700 remove per-channel quantization commit 775e4b2960f531496b8d11eef27d64e5b85c3c66 Author: gwang <wanggy@outlook.com> Date: Mon Jul 13 11:54:02 2020 -0700 add ways for operand_type to change dimension commit e56a494de67c66f8122d908270fbc2bb17e38423 Author: gwang <wanggy@outlook.com> Date: Wed Jul 8 15:18:55 2020 -0700 nnapi quantization work, 1. add SymmPerChannelQuantParams to operand types * add support for QuantizeLinear * add dequantizelinear support * minor style update * minor bug fix * add quantization support for qlinearmatmul, minor issue fix * add quantized input support, minor bug fix * fix issues in the qlinearmatmul * add verify scale and zeropoint for qlinearmatmul * add test for [de]quantizelinear ops * add qlinearconv support * fixed small issue causing test failure * fix test exception * fix for centos test failure * fix centos test failure * fix issue causing win-tensorRT ci failure * addressed comments	2020-07-20 16:43:31 -07:00
Tracy Sharpe	7f9d9557b1	Remove template<T> from RoiAlignBase (#4558 )	2020-07-20 14:28:46 -07:00
dependabot[bot]	bbdabc2c48	Bump lodash from 4.17.15 to 4.17.19 in /nodejs Bumps [lodash](https://github.com/lodash/lodash) from 4.17.15 to 4.17.19. - [Release notes](https://github.com/lodash/lodash/releases) - [Commits](https://github.com/lodash/lodash/compare/4.17.15...4.17.19) Signed-off-by: dependabot[bot] <support@github.com>	2020-07-20 14:24:21 -07:00
Changming Sun	c2c4e6760b	Fix code sign validation errors in nuget and nodejs pipeline (#4527 )	2020-07-20 14:18:47 -07:00
RandySheriffH	1c5733ea9d	Silence binscope warnings (#4542) * silence binscope warnings * revert change on rpath Co-authored-by: Randy <Randy@randysmac.attlocal.net>	2020-07-20 14:10:41 -07:00
Changming Sun	9318b3a47f	Fix a typo in C++ sample application (#4556 ) Resolve #3406	2020-07-20 09:31:44 -07:00
M. Zeeshan Siddiqui	9d80235607	Reshape inputs for SoftmaxCrossEntropyLoss instead of transposing them. (#4551 )	2020-07-20 06:33:40 -07:00
Changming Sun	bc1d197ddf	Re-enable dnnl in CI build (#4544 ) * Revert "Temporarily remove dnnl from Linux CI build to unblock the whole team (#4266)" Previously it fails because it used too much memory. Now we only run dnnl EP with opset12 models in unit tests, to reduce peak memory usage.	2020-07-19 23:20:03 -07:00
Yulong Wang	0e91e45049	[node.js binding] use official ORT C++ API (#4552 )	2020-07-19 22:46:41 -07:00
Tracy Sharpe	08235e1662	add Output() overloads (#4546 )	2020-07-19 15:21:12 -07:00
M. Zeeshan Siddiqui	6c950a1df0	Exclude weight related types/shapes from bert loss. (#4548 )	2020-07-18 08:27:02 -07:00
Tianlei Wu	b42fe49f56	Add a script to convert gpt2 to onnx (#4541 ) * add a script convert_to_onnx.py and refactor benchmark	2020-07-17 23:14:42 -07:00
Yulong Wang	0229a6a929	[C++ API] add SessionOptions::SetLogSeverityLevel() (#4545 )	2020-07-17 21:14:41 -07:00
M. Zeeshan Siddiqui	6eb5549cb9	Deprecate TrainableDropout (#4501 ) * Deprecate TrainableDropout. * Add Dropout(12) back into Megatron transformer. * Remove TrainableDropout from front-end test models. * Update baseline for front-end tests after converting test models to opset-12. * Update baseline for front-end tests after converting test models to opset-12.	2020-07-17 13:43:25 -07:00
Yulong Wang	fdc5c308c4	introduce macro ORT_API_MANUAL_INIT in C++ API (#4536 ) * introduce macro ORT_API_MANUAL_INIT in C++ API * resolve comments	2020-07-17 13:23:30 -07:00
Wei-Sheng Chin	21d2728974	Revise pipeline schedule to consider communication ops (#4524 ) * Revise pipeline schedule to consider communication ops * Add test * Fix warning * inline some short functions * Fix warnings * Rename a class * Add comment for test * op renamed to task * Fix NVTX wrapper's bug	2020-07-17 10:04:56 -07:00
Xueyun Zhu	183098e344	adding concat logic when initial path is empty (#4525) * concat * add path_utils * address feedback * use string in test * convert wstring to string in windows * address feedback * address feedback * fix comment	2020-07-16 23:46:12 -07:00


2929 commits