onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-19 19:00:47 +00:00

Author	SHA1	Message	Date
Yufeng Li	822b23ff2f	Add support of EmbeddingLayerNorm (#4562 )	2020-07-21 21:43:02 -07:00
Hariharan Seshadri	bf78e4d18b	Handle cases which produce an empty output in the MatMul op family (#4573 )	2020-07-21 17:57:47 -07:00
Ryan Lai	0b4659c3fe	Populates TensorProto with tensor from protobuf file (#4535 ) * Expose load tensor proto from protobuf file function * Add comment * Remove use of fstream and use parsefromzerocopystream * Close file descriptor after finish parsing it * Close input stream too * Set Close on delete only, no need to close file descriptor * Revert "Set Close on delete only, no need to close file descriptor" This reverts commit `5ba6e3c31b`. * Revert "Close input stream too" This reverts commit `4564776733`. * Revert "Close file descriptor after finish parsing it" This reverts commit `846e550c4f`. * Revert "Remove use of fstream and use parsefromzerocopystream" This reverts commit `25a3117183`.	2020-07-21 17:50:15 -07:00
Changming Sun	603f2d1138	Exclude a few OpenVino flaky tests (#4572 )	2020-07-21 12:14:34 -07:00
Chi Lo	affdeb53c2	Add Python API for specifying device options. (#4205 ) * Add python API for specifying CUDA device id * Modification for providing session based python api for specifying device id * When include header file pybind11/stl.h, conversion between c++ containers and Python list, vector and dict data structure are automatically enabled. https://pybind11.readthedocs.io/en/stable/advanced/cast/stl.html# Therefore, refactor the code for better leverage this advantage. * Make struct CudaDeviceOptions as default cuda device options * Implement sess.set_providers(list_of_providers, list_of_provider_option_dicts) But still stay consistent with existing sess.set_providers(list_of_provider) * Add cuda provider option default setting * Add support for setting cuda cuda_mem_limit and arena_extend_strategy. Also resolved the merge conflict on session.py * Use python ctypes to call cuda library to help python unittest * Refine the code with reviewer's suggestions * Add the capability of getting execution provider's configuration - Once we introduced the capability to set execution provider's configuration, it makes sense to add capability of getting ep's configuration. * Modify the code with reviewer's suggestions. * Using stoull() and stoul() depends on 32/64-bits architecture. * Rewrite the testcases for testing setting CUDA device id Note: We need to make sure every ORT process be run on one CUDA device at a time. * Make sure old session object is destroyed by python gc before new session object is being created * Move testcases to original onnxruntime_test_python.py * Fix bugs to pass CI build * Make it pass CI build (cont.) * Make it pass CI build (cont.)	2020-07-21 07:28:13 -07:00
M. Zeeshan Siddiqui	e11629d9e4	Revert "Deprecate TrainableDropout (#4501 )" (#4564 ) This reverts commit `6eb5549cb9`. # Conflicts: # orttraining/orttraining/core/graph/training_op_defs.cc	2020-07-21 06:18:42 -07:00
ashbhandare	ab4be8355f	Add the BroadcastGradientArgs op (#4511 ) * Adding CPU implementation of BroadcastGradientArgs op * Modify to take shape as input instead of tensor * Cleanup * Correct schema * Corrected kernel, added tests, addressed review comments. * Added exception,test for invalid broadcast,addresed review comments. * Fix mac build error.	2020-07-20 23:59:10 -07:00
ytaous	0008e92b4e	Internal ReduceSum op that accepts axes as input (#4522 ) * Initial change, to add ReduceSumTraining cpu op * cpu support * cuda support + more UTs * on comments + UT * no op support for {} axes with new attr - noop_with_empty_axes * on comments * fix build * on comments Co-authored-by: aishwarya bhandare <aibhanda@microsoft.com> Co-authored-by: Ethan Tao <ettao@microsoft.com>	2020-07-20 21:05:00 -07:00
Yufeng Li	e92e0860c8	BERT quantization notebook (#4543 ) * BERT quantization notebook * update notebooks * more benchmark * add version info	2020-07-20 18:23:37 -07:00
gwang-msft	ca0dd8246c	NNAPI EP, add quantization support (#4530 ) * nnapi quantization work, 1. add SymmPerChannelQuantParams to operand types * add ways for operand_type to change dimension * remove per-channel quantization * Squashed commit of the following: commit 4857c3a732298c2f96efb61800b7621251d61c1b Author: gwang <wanggy@outlook.com> Date: Tue Jul 14 23:02:28 2020 -0700 remove per-channel quantization commit 775e4b2960f531496b8d11eef27d64e5b85c3c66 Author: gwang <wanggy@outlook.com> Date: Mon Jul 13 11:54:02 2020 -0700 add ways for operand_type to change dimension commit e56a494de67c66f8122d908270fbc2bb17e38423 Author: gwang <wanggy@outlook.com> Date: Wed Jul 8 15:18:55 2020 -0700 nnapi quantization work, 1. add SymmPerChannelQuantParams to operand types * add support for QuantizeLinear * add dequantizelinear support * minor style update * minor bug fix * add quantization support for qlinearmatmul, minor issue fix * add quantized input support, minor bug fix * fix issues in the qlinearmatmul * add verify scale and zeropoint for qlinearmatmul * add test for [de]qunatizelinear ops * add qlinearconv support * fixed small issue causing test failure * fix test exception * fix for centos test failure * fix centos test failure * fix issue causing win-tensorRT ci failure * addressed comments	2020-07-20 16:43:31 -07:00
Tracy Sharpe	7f9d9557b1	Remove template<T> from RoiAlignBase (#4558 )	2020-07-20 14:28:46 -07:00
dependabot[bot]	bbdabc2c48	Bump lodash from 4.17.15 to 4.17.19 in /nodejs Bumps [lodash](https://github.com/lodash/lodash) from 4.17.15 to 4.17.19. - [Release notes](https://github.com/lodash/lodash/releases) - [Commits](https://github.com/lodash/lodash/compare/4.17.15...4.17.19) Signed-off-by: dependabot[bot] <support@github.com>	2020-07-20 14:24:21 -07:00
Changming Sun	c2c4e6760b	Fix code sign validation errors in nuget and nodejs pipeline (#4527 )	2020-07-20 14:18:47 -07:00
RandySheriffH	1c5733ea9d	Silence binscope warnings (#4542 ) * slience binscope warnings * revert change on rpath Co-authored-by: Randy <Randy@randysmac.attlocal.net>	2020-07-20 14:10:41 -07:00
Changming Sun	9318b3a47f	Fix a typo in C++ sample application (#4556 ) Resolve #3406	2020-07-20 09:31:44 -07:00
M. Zeeshan Siddiqui	9d80235607	Reshape inputs for SoftmaxCrossEntropyLoss instead of transposing them. (#4551 )	2020-07-20 06:33:40 -07:00
Changming Sun	bc1d197ddf	Re-enable dnnl in CI build (#4544 ) * Revert "Temporarily remove dnnl from Linux CI build to unblock the whole team (#4266)" Previously it fails because it used too much memory. Now we only run dnnl EP with opset12 models in unit tests, to reduce peak memory usage.	2020-07-19 23:20:03 -07:00
Yulong Wang	0e91e45049	[node.js binding] use official ORT C++ API (#4552 )	2020-07-19 22:46:41 -07:00
Tracy Sharpe	08235e1662	add Output() overloads (#4546 )	2020-07-19 15:21:12 -07:00
M. Zeeshan Siddiqui	6c950a1df0	Exclude weight related types/shapes from bert loss. (#4548 )	2020-07-18 08:27:02 -07:00
Tianlei Wu	b42fe49f56	Add a script to convert gpt2 to onnx (#4541 ) * add a script convert_to_onnx.py and refactor benchmark	2020-07-17 23:14:42 -07:00
Yulong Wang	0229a6a929	[C++ API] add SessionOptions::SetLogSeverityLevel() (#4545 )	2020-07-17 21:14:41 -07:00
M. Zeeshan Siddiqui	6eb5549cb9	Deprecate TrainableDropout (#4501 ) * Deprecate TrainableDropout. * Add Dropout(12) back into Megatron transformer. * Remove TrainableDropout from front-end test models. * Update baseline for front-end tests after converting test models to opset-12. * Update baseline for front-end tests after converting test models to opset-12.	2020-07-17 13:43:25 -07:00
Yulong Wang	fdc5c308c4	introduce macro ORT_API_MANUAL_INIT in C++ API (#4536 ) * introduce macro ORT_API_MANUAL_INIT in C++ API * resolve comments	2020-07-17 13:23:30 -07:00
Wei-Sheng Chin	21d2728974	Revise pipeline schedule to consider communication ops (#4524 ) * Revise pipeline schedule to consider communication ops * Add test * Fix warning * inline some short functions * Fix warnings * Rename a class * Add comment for test * op renamed to task * Fix NVTX wrapper's bug	2020-07-17 10:04:56 -07:00
Xueyun Zhu	183098e344	adding concat logic when initial path is empty (#4525 ) * concat * add path_utils * address feedback * use string in test * convert wstring to sting in windows * address feedback * address feedback * fix comment	2020-07-16 23:46:12 -07:00
Hariharan Seshadri	d1f45f9361	Handle nodes with dtype attribute causing diffs in inferred and actual types of NodeArgs due to InsertCastTransformer (#4523 )	2020-07-16 21:59:14 -07:00
Tracy Sharpe	8b86c5cdb5	Merge int32/uint32 and int64/uint64 MatMul kernels (#4531 )	2020-07-16 21:25:29 -07:00
Sheil Kumar	02aea5d2d4	rename telemetry provider back to Microsoft.Windows.AI.MachineLearning (#4533 ) Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2020-07-16 19:51:06 -07:00
Yulong Wang	5086e55a35	Fix condition of running tests in win CI (#4459 )	2020-07-16 16:33:30 -07:00
Tiago Koji Castro Shibata	2189c77e5b	static_typename (#4520 ) * Use static_typename * Disable RTTI outside of Release * Fix unused var * Add test types * PR feedback	2020-07-16 16:31:02 -07:00
M. Zeeshan Siddiqui	b43ce2d7ad	Replace loss function in BERT_LOSS with SoftmaxCrossEntropyLoss. (#4509 ) * Replace loss function in BERT_LOSS with SoftmaxCrossEntropyLoss. * Update BERT loss function with correct logit shapes for softmax cross entropy loss. * fix test and PR comments.	2020-07-16 15:28:24 -07:00
RandySheriffH	76b31d6ce2	fix xcode alerts (#4470 ) * fix xcode alerts * fix comment * fix comments * update text * fix comments * fix comments * remove checks on context Co-authored-by: Randy <Randy@randysmac.attlocal.net> Co-authored-by: Randy <Randy@randysmac.local> Co-authored-by: Tracy Sharpe <tracysh@microsoft.com>	2020-07-16 10:20:34 -07:00
Changming Sun	8ada440961	Move model tests to onnxruntime_test_all (#4521 ) 1. Move model tests to onnxruntime_test_all 2. Publish TestResults of Windows CI build.	2020-07-15 16:46:18 -07:00
Xueyun Zhu	5f188f4cf4	ci fix (#4519 )	2020-07-15 12:05:24 -07:00
stevenlix	0ebe2fab51	Refactor TensorRT EP code to better handle dynamic shape subgraphs (#4504 ) * build engine in runtime for dynamic shape subgraphs * Update TensorRT-ExecutionProvider.md * Update TensorRT-ExecutionProvider.md * fix build issue * Add more instructions on how to use engine caching * add precision to trt node name * Update tensorrt_execution_provider.cc * Update tensorrt_execution_provider.cc	2020-07-15 02:35:42 -07:00
gwang-msft	cf92497c16	Nnapi, add auto_pad support for Conv/GlobalAveragePool/AveragePool/GlobalMaxPool/MaxPool operators (#4499 ) * Split ComputePadAndOutputShape into ComputePad and ComputeOutputShape * update NNAPI conv ouput shape compute to use shared ComputeOutputShapec * move use ptr to use reference for ComputePadAndOutputShape * nnapi conv support auto_pad * add logging operator support bt target devices * update InferOutputShape/ComputePadAndOutputShape/ComputePad to use force_symmetric_auto_padding as param instead of template * make log op support for target devices optional * add auto_pad support to pool operators * ignore GetTargetDevices if using all devices * fix some typo in padding calculation * fix a bug of compute padding difference between conv and pool ops * addressed CR comments, removed NNAPI device logging and move nnapi ep autopad handling into a shared function * change helper functions to static	2020-07-15 00:21:42 -07:00
edgchen1	34f73fa1aa	Add sudo --preserve-env option to allow environment to go through to docker commands. (#4512 )	2020-07-14 18:12:31 -07:00
liqunfu	f721f5f1cd	Liqun/multiple choice (#4480 ) * multiple choice runner * add docker cleanup task to frontent pipeline	2020-07-14 17:57:58 -07:00
Xueyun Zhu	7d96960ec8	support pipeline partition with shared initializer (#4321 ) * support bert partition with shared initializer * address feedback * address feedback * address feedback * add more test * remove bert-tiny model * address feedback * address function comment * move CreateNodeArg to graph_utils * rename function name * rename function name * fix windows build * fix windows type conversion warning * add function comment	2020-07-14 17:21:40 -07:00
edgchen1	1ebe598286	Conditionally compile without std::is_trivially_copyable to satisfy old GCC versions. (#4510 )	2020-07-14 16:47:40 -07:00
Sheil Kumar	ee5ca27ae2	Split Microsoft.AI.MachineLearning.nupkg in a NuGet package and symbol NuGet package (#4503 ) * add threadpool interface * generate snupkgs * include_pdb check * fix snupkg generation * Add task to merge snupkgs * folder exists * check dir * revert thread pool stuff Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2020-07-14 14:52:39 -07:00
Tianlei Wu	25885cf7d0	Add option --torchscript in benchmark_gpt2.py (#4500 ) * support TorchScript * change onnx filename format * change output name prediction_scores to logits	2020-07-14 11:53:23 -07:00
Tim Harris	a95ae164f7	Create N-1 threads in intra-op pool, given main thread now active (#4493 ) Create N-1 threads in a thread pool when configured with intra-op parallelism of N. This ensures we have N active threads, given that the main thread also runs work. To avoid ambiguity on the value returned, rename ThreadPool::NumThreads method to ThreadPool::DegreeOfParallelism, and make corresponding updates in MLAS and operators.	2020-07-14 09:48:50 +01:00
liqunfu	0bff55512e	updated expected values for frontend test to pass frontend e2e pipeline. raise tolerance to reduce future risk of failure (#4497 ) * updated expected values for frontend test, raise tol	2020-07-13 19:25:54 -07:00
Dmitri Smirnov	e0eddf502c	Bump version to 1.4.0 (#4496 )	2020-07-13 17:09:18 -07:00
Yufeng Li	3d4ac85124	Add quantization benchmark for transformer based model (#4482 ) * add support of quantization benchmark	2020-07-13 15:46:23 -07:00
gwang-msft	a3c358fd29	Split the shared ComputePadAndOutputShape into 2 separated functions ComputePad and ComputeOutputShape (#4487 ) * Split ComputePadAndOutputShape into ComputePad and ComputeOutputShape * update NNAPI conv ouput shape compute to use shared ComputeOutputShapec * move use ptr to use reference for ComputePadAndOutputShape	2020-07-13 15:07:34 -07:00
Tiago Koji Castro Shibata	3441c687b7	Revert "Remove docstrigs if __ONNX_NO_DOC_STRINGS" (#4495 ) This reverts commit bb4d331fa7bf1fe8d68b1527dda56e4739c80800.	2020-07-13 14:55:37 -07:00
gwang-msft	5f8f443ac4	Android CI build, test copy, emulator boot improvement (#4481 ) * Enable onnxruntime_test_all for NNAPI EP * switch to use ninja for ANdroid CI * make android elumator boot faster in android ci * simplify adb push * more style change * more tweaking on android ci * build.py style update	2020-07-13 14:18:34 -07:00

1 2 3 4 5 ...

2905 commits