onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-03 03:58:54 +00:00

Author	SHA1	Message	Date
dependabot[bot]	bbdabc2c48	Bump lodash from 4.17.15 to 4.17.19 in /nodejs Bumps [lodash](https://github.com/lodash/lodash) from 4.17.15 to 4.17.19. - [Release notes](https://github.com/lodash/lodash/releases) - [Commits](https://github.com/lodash/lodash/compare/4.17.15...4.17.19) Signed-off-by: dependabot[bot] <support@github.com>	2020-07-20 14:24:21 -07:00
Changming Sun	c2c4e6760b	Fix code sign validation errors in nuget and nodejs pipeline (#4527 )	2020-07-20 14:18:47 -07:00
RandySheriffH	1c5733ea9d	Silence binscope warnings (#4542 ) * slience binscope warnings * revert change on rpath Co-authored-by: Randy <Randy@randysmac.attlocal.net>	2020-07-20 14:10:41 -07:00
Changming Sun	9318b3a47f	Fix a typo in C++ sample application (#4556 ) Resolve #3406	2020-07-20 09:31:44 -07:00
M. Zeeshan Siddiqui	9d80235607	Reshape inputs for SoftmaxCrossEntropyLoss instead of transposing them. (#4551 )	2020-07-20 06:33:40 -07:00
Changming Sun	bc1d197ddf	Re-enable dnnl in CI build (#4544 ) * Revert "Temporarily remove dnnl from Linux CI build to unblock the whole team (#4266)" Previously it fails because it used too much memory. Now we only run dnnl EP with opset12 models in unit tests, to reduce peak memory usage.	2020-07-19 23:20:03 -07:00
Yulong Wang	0e91e45049	[node.js binding] use official ORT C++ API (#4552 )	2020-07-19 22:46:41 -07:00
Tracy Sharpe	08235e1662	add Output() overloads (#4546 )	2020-07-19 15:21:12 -07:00
M. Zeeshan Siddiqui	6c950a1df0	Exclude weight related types/shapes from bert loss. (#4548 )	2020-07-18 08:27:02 -07:00
Tianlei Wu	b42fe49f56	Add a script to convert gpt2 to onnx (#4541 ) * add a script convert_to_onnx.py and refactor benchmark	2020-07-17 23:14:42 -07:00
Yulong Wang	0229a6a929	[C++ API] add SessionOptions::SetLogSeverityLevel() (#4545 )	2020-07-17 21:14:41 -07:00
M. Zeeshan Siddiqui	6eb5549cb9	Deprecate TrainableDropout (#4501 ) * Deprecate TrainableDropout. * Add Dropout(12) back into Megatron transformer. * Remove TrainableDropout from front-end test models. * Update baseline for front-end tests after converting test models to opset-12. * Update baseline for front-end tests after converting test models to opset-12.	2020-07-17 13:43:25 -07:00
Yulong Wang	fdc5c308c4	introduce macro ORT_API_MANUAL_INIT in C++ API (#4536 ) * introduce macro ORT_API_MANUAL_INIT in C++ API * resolve comments	2020-07-17 13:23:30 -07:00
Wei-Sheng Chin	21d2728974	Revise pipeline schedule to consider communication ops (#4524 ) * Revise pipeline schedule to consider communication ops * Add test * Fix warning * inline some short functions * Fix warnings * Rename a class * Add comment for test * op renamed to task * Fix NVTX wrapper's bug	2020-07-17 10:04:56 -07:00
Xueyun Zhu	183098e344	adding concat logic when initial path is empty (#4525 ) * concat * add path_utils * address feedback * use string in test * convert wstring to sting in windows * address feedback * address feedback * fix comment	2020-07-16 23:46:12 -07:00
Hariharan Seshadri	d1f45f9361	Handle nodes with dtype attribute causing diffs in inferred and actual types of NodeArgs due to InsertCastTransformer (#4523 )	2020-07-16 21:59:14 -07:00
Tracy Sharpe	8b86c5cdb5	Merge int32/uint32 and int64/uint64 MatMul kernels (#4531 )	2020-07-16 21:25:29 -07:00
Sheil Kumar	02aea5d2d4	rename telemetry provider back to Microsoft.Windows.AI.MachineLearning (#4533 ) Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2020-07-16 19:51:06 -07:00
Yulong Wang	5086e55a35	Fix condition of running tests in win CI (#4459 )	2020-07-16 16:33:30 -07:00
Tiago Koji Castro Shibata	2189c77e5b	static_typename (#4520 ) * Use static_typename * Disable RTTI outside of Release * Fix unused var * Add test types * PR feedback	2020-07-16 16:31:02 -07:00
M. Zeeshan Siddiqui	b43ce2d7ad	Replace loss function in BERT_LOSS with SoftmaxCrossEntropyLoss. (#4509 ) * Replace loss function in BERT_LOSS with SoftmaxCrossEntropyLoss. * Update BERT loss function with correct logit shapes for softmax cross entropy loss. * fix test and PR comments.	2020-07-16 15:28:24 -07:00
RandySheriffH	76b31d6ce2	fix xcode alerts (#4470 ) * fix xcode alerts * fix comment * fix comments * update text * fix comments * fix comments * remove checks on context Co-authored-by: Randy <Randy@randysmac.attlocal.net> Co-authored-by: Randy <Randy@randysmac.local> Co-authored-by: Tracy Sharpe <tracysh@microsoft.com>	2020-07-16 10:20:34 -07:00
Changming Sun	8ada440961	Move model tests to onnxruntime_test_all (#4521 ) 1. Move model tests to onnxruntime_test_all 2. Publish TestResults of Windows CI build.	2020-07-15 16:46:18 -07:00
Xueyun Zhu	5f188f4cf4	ci fix (#4519 )	2020-07-15 12:05:24 -07:00
stevenlix	0ebe2fab51	Refactor TensorRT EP code to better handle dynamic shape subgraphs (#4504 ) * build engine in runtime for dynamic shape subgraphs * Update TensorRT-ExecutionProvider.md * Update TensorRT-ExecutionProvider.md * fix build issue * Add more instructions on how to use engine caching * add precision to trt node name * Update tensorrt_execution_provider.cc * Update tensorrt_execution_provider.cc	2020-07-15 02:35:42 -07:00
gwang-msft	cf92497c16	Nnapi, add auto_pad support for Conv/GlobalAveragePool/AveragePool/GlobalMaxPool/MaxPool operators (#4499 ) * Split ComputePadAndOutputShape into ComputePad and ComputeOutputShape * update NNAPI conv ouput shape compute to use shared ComputeOutputShapec * move use ptr to use reference for ComputePadAndOutputShape * nnapi conv support auto_pad * add logging operator support bt target devices * update InferOutputShape/ComputePadAndOutputShape/ComputePad to use force_symmetric_auto_padding as param instead of template * make log op support for target devices optional * add auto_pad support to pool operators * ignore GetTargetDevices if using all devices * fix some typo in padding calculation * fix a bug of compute padding difference between conv and pool ops * addressed CR comments, removed NNAPI device logging and move nnapi ep autopad handling into a shared function * change helper functions to static	2020-07-15 00:21:42 -07:00
edgchen1	34f73fa1aa	Add sudo --preserve-env option to allow environment to go through to docker commands. (#4512 )	2020-07-14 18:12:31 -07:00
liqunfu	f721f5f1cd	Liqun/multiple choice (#4480 ) * multiple choice runner * add docker cleanup task to frontent pipeline	2020-07-14 17:57:58 -07:00
Xueyun Zhu	7d96960ec8	support pipeline partition with shared initializer (#4321 ) * support bert partition with shared initializer * address feedback * address feedback * address feedback * add more test * remove bert-tiny model * address feedback * address function comment * move CreateNodeArg to graph_utils * rename function name * rename function name * fix windows build * fix windows type conversion warning * add function comment	2020-07-14 17:21:40 -07:00
edgchen1	1ebe598286	Conditionally compile without std::is_trivially_copyable to satisfy old GCC versions. (#4510 )	2020-07-14 16:47:40 -07:00
Sheil Kumar	ee5ca27ae2	Split Microsoft.AI.MachineLearning.nupkg in a NuGet package and symbol NuGet package (#4503 ) * add threadpool interface * generate snupkgs * include_pdb check * fix snupkg generation * Add task to merge snupkgs * folder exists * check dir * revert thread pool stuff Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2020-07-14 14:52:39 -07:00
Tianlei Wu	25885cf7d0	Add option --torchscript in benchmark_gpt2.py (#4500 ) * support TorchScript * change onnx filename format * change output name prediction_scores to logits	2020-07-14 11:53:23 -07:00
Tim Harris	a95ae164f7	Create N-1 threads in intra-op pool, given main thread now active (#4493 ) Create N-1 threads in a thread pool when configured with intra-op parallelism of N. This ensures we have N active threads, given that the main thread also runs work. To avoid ambiguity on the value returned, rename ThreadPool::NumThreads method to ThreadPool::DegreeOfParallelism, and make corresponding updates in MLAS and operators.	2020-07-14 09:48:50 +01:00
liqunfu	0bff55512e	updated expected values for frontend test to pass frontend e2e pipeline. raise tolerance to reduce future risk of failure (#4497 ) * updated expected values for frontend test, raise tol	2020-07-13 19:25:54 -07:00
Dmitri Smirnov	e0eddf502c	Bump version to 1.4.0 (#4496 )	2020-07-13 17:09:18 -07:00
Yufeng Li	3d4ac85124	Add quantization benchmark for transformer based model (#4482 ) * add support of quantization benchmark	2020-07-13 15:46:23 -07:00
gwang-msft	a3c358fd29	Split the shared ComputePadAndOutputShape into 2 separated functions ComputePad and ComputeOutputShape (#4487 ) * Split ComputePadAndOutputShape into ComputePad and ComputeOutputShape * update NNAPI conv ouput shape compute to use shared ComputeOutputShapec * move use ptr to use reference for ComputePadAndOutputShape	2020-07-13 15:07:34 -07:00
Tiago Koji Castro Shibata	3441c687b7	Revert "Remove docstrigs if __ONNX_NO_DOC_STRINGS" (#4495 ) This reverts commit bb4d331fa7bf1fe8d68b1527dda56e4739c80800.	2020-07-13 14:55:37 -07:00
gwang-msft	5f8f443ac4	Android CI build, test copy, emulator boot improvement (#4481 ) * Enable onnxruntime_test_all for NNAPI EP * switch to use ninja for ANdroid CI * make android elumator boot faster in android ci * simplify adb push * more style change * more tweaking on android ci * build.py style update	2020-07-13 14:18:34 -07:00
Dmitri Smirnov	35ee00d888	Pin typing version. (#4490 )	2020-07-13 11:48:30 -07:00
Bowen Bao	07455cff28	Support double type for Greater CPU (#4373 ) * Add double for Greater * add double type for Greater * udpate test according to dtype	2020-07-13 11:25:14 -07:00
Tiago Koji Castro Shibata	f18dee84c2	Remove docstrigs if __ONNX_NO_DOC_STRINGS (#4494 )	2020-07-13 11:08:46 -07:00
edgchen1	c71c49aaa0	Make TArray safer to use and update method name for consistency. (#4483 ) - make size_ and data_ data members private - rename GetCapacity() to Capacity() to be consistent (e.g., with Size()) - add static_assert for trivially copyable T because it is copied with memcpy	2020-07-13 09:59:56 -07:00
Sheil Kumar	00706e1502	dont add deps for uwp apps (#4485 ) Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2020-07-10 22:11:32 -07:00
Derek Murray	3e48ffd21c	Move AutoPadType to common.h (#4474 ) Extracting some common code related to "AutoPadType" from the cpu execution provider into "common.h". Motivation and Context * Sharing code with authors of other execution providers that need the same functionality. * I didn't modify the code in shared_library or dnnl EP to avoid changing their dependency structure, so there is still a redundant copy of the AutoPadType code in there.	2020-07-10 16:40:32 -07:00
Tianlei Wu	e96a829e84	Handle multiple embed nodes in transformer optimizer (#4471 ) Handle model with multiple embed nodes: * update embed layer norm fusion in onnxruntime * Fix temp model path in optimizer * Add unit test for model with multiple embed nodes. * Add unit test for gpt2 fusion with past state and mask * Add unit test for change input to int32	2020-07-10 15:28:27 -07:00
Ashwini Khade	6a9a9a35be	fix crashes caused by test runner (#4475 ) * Fix crashes in test runner * plus some fixes * changes per review	2020-07-10 14:04:22 -07:00
Hariharan Seshadri	26ebcfab88	Fix Nuget GPU pipeline (#4462 )	2020-07-10 14:02:28 -07:00
gwang-msft	9b4c54bcef	Enable onnxruntime_test_all for NNAPI EP (#4476 )	2020-07-10 13:34:44 -07:00
edgchen1	6c7da5e9d3	Optimize CUDA Sum op kernel and refactor CUDA elementwise variadic input op kernels (#4418 ) For the special case where all variadic inputs of a kernel are the same shape (i.e. no broadcasting is required) and there are few enough of them, we perform the entire computation in a single kernel. The general implementation (which was previously used for this special case) handles broadcasting by repeatedly invoking a binary kernel on successive inputs.	2020-07-10 10:20:23 -07:00

2894 commits