onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-06-28 03:20:58 +00:00

Author	SHA1	Message	Date
Maxim Kalinin	ec36c793e8	Eliminate redundant subexpressions (#3047 ) * Eliminate redundant subexpressions Apply local value numbering to merge graph nodes that will always evaluate to the same value. * Rename cpp->cc * Handle optional arguments * Add test models * Add more tests with optional arguments * Fix processing of subgraphs Also, be resilient to possible mixture of optional and variadic parameters * Fix random operators * Address PR comments * Minor changes and a test * Move CSE before constant folding * Random* operators are always non-deterministic Even when seed is provided. * Fix a CSE test * Reuse the list of non-deterministic operators with constant folding pass * Address PR comments * Fix formatting * Address PR comment * Minor cleanup / comments * Fix build failure in Linux * Reuse existing optimizer/utils file. Also, check for graph outputs when removing a node. * Add a test * Fix compiler warnings * Fix build in older compilers * More compatibility with old STL versions	2020-08-14 01:13:05 -07:00
Marcus Turewicz	ce65275edf	C# samples: Faster R-CNN (#4733 ) * C# sample: Faster R-CNN * Add link to new sample in samples README * Remove duplicate image	2020-08-13 17:05:01 -07:00
Sergii Dymchenko	de2685261b	Install AzureML support and commonly used packages in the training image. (#4790 )	2020-08-13 16:48:48 -07:00
stevenlix	7acef875bb	Fix bugs in TensorRT (#4780 ) * fix bugs * Move -Wno-deprecated-declarations to target compile flag	2020-08-13 16:09:27 -07:00
Yulong Wang	aa993e95c9	enable build flag '--use_openmp' on MacOS (#4774 ) * enable build flag '--use_openmp' on MacOS * cmake 3.16.1 to enable find_package(OpenMP) on mac	2020-08-13 15:56:42 -07:00
George Wu	f12e9de111	build fixes for https://github.com/microsoft/onnxruntime/pull/4721 (#4784 ) * test * test * add missing CUDA header include * debug * fix * fix python package for dnnl and tensorrt. * fix * fix windows build. * revert * target_link_directories for tensorrt shared lib.	2020-08-14 06:24:44 +08:00
ISS Build Account	f01579b8fb	Merge remote-tracking branch 'upstream/master' into DmlDev	2020-08-13 19:38:44 +00:00
James Yuzawa	aca34352a5	Java API: Documentation cleanup (#4395 ) * update java API docs * fix link * rearrange * update platforms, use table * use javadoc.io * craigacp tested it in java 14 * update link * fix broken link * fix testdata link	2020-08-13 12:06:42 -07:00
Sheil Kumar	722602f32d	replace namespace reference with alias (#4786 ) Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2020-08-13 11:14:55 -07:00
ashbhandare	5e7a6e78e3	Changes for BART dynamic shapes in reduction (#4730 ) * Modify to hit row reduction over cudnn * kernel overflow fix * Cleanup * fix for mainz/zcode model * revert * Review comments * Review comments	2020-08-13 11:14:01 -07:00
edgchen1	74b3b8448c	Fix MatmulTransposeFusion::ApplyImpl() setting of modified flag (#4775 ) Update MatmulTransposeFusion::ApplyImpl() to set modified flag whenever a fusion is performed.	2020-08-13 09:51:52 -07:00
Scott McKay	8fb743f767	Refactor Cast to reduce binary size. (#4765 ) * Refactor Cast to reduce binary size. 82.5 -> 60.8KB on Windows * Address PR comments. Fix build issue.	2020-08-13 20:43:22 +10:00
Tim Harris	9cec98ec1b	Honor allow_spinning at barrier at end of parallel sections (#4767 ) This commit means that when the thread pool is configured to spin, then we spin at the barrier at the end of parallel sections in the main thread, in addition to having workers spin waiting for work. The change updates Barrier.h to take an additional boolean to select spin/block, and passes this in based on the thread pool configuration. It adds an additional test case for barriers, although no problems were identified by the test case.	2020-08-13 09:40:40 +01:00
Faith Xu	61b2a663a3	Update Python version support (#4778 )	2020-08-12 23:48:23 -07:00
Changming Sun	cddddc4d55	Add missing header file to MNIST.cpp (#4773 ) Resolve #4766	2020-08-12 21:46:11 -07:00
Tianlei Wu	a69ca63895	add --no_attention_mask option (#4750 ) output producer name and version in optimized model. avoid removing initializer that existed in graph output	2020-08-12 15:56:25 -07:00
jingyanwangms	adda8c66d9	Docker image release pipeline (#4682 ) * create orttraining-1p-linux-gpu-ci-pipeline.yml * fix syntax * fix file path * fix template path * publish docker image to test acr * use right task name * change parameter list * use variables * use python.version * remove --enable_onnx_tests due to segfault * add back --enable_onnx_tests * fix docker push command line * change docker login command * login differently * fix docker tag script * create password.txt * add ortrelease docker image * enable test in build.sh * add pipeline parameter * add pipeline parameter * change timeout * change timeout * fix run_dockerbuild.sh * use PR checkin build docker * fix strategy syntax * fix strategy syntax * change dockerfile * change run_dockerbuild.sh * change tag name * build with root user * use build id for docker image tag * remove all user lines * change docker tag * add mpi, mellanox * add missing args * use release dockerfile for ci build * remove install wheel * use release docker image * fix syntax * use different pool * add Dockerfile.training * remove sudo to run on Linux-Multi-GPU-V100 * change docker file path * update dockerfile * use latest dockerfile * change agent pool * remove --preserve-env * add back parameter * Add test_flag * use azuredevops docker * change repository * use cmd for docker login * echo build script * use ortrelrease ACR * change key vault connection * Move --build flag * change build command * add paramter for image tag * clean up for PR * remove unnecessary changes * whitespace changes * whitespace changes * change build flag * change flag name * change flag * use latest dockerfile * enable build tests * build builder stage and run test * Add back python.version * change build directory * always run build entire dockerfile * fix yml syntax * fix syntax * add en-UTF8 locale * rename * remove unused template * Update orttraining-linux-gpu-docker-release-pipeline.yml for Azure Pipelines * Update orttraining-linux-gpu-docker-release-pipeline.yml for Azure Pipelines * Test commit sha1 in pipeline * fix parameter * update docker file * fix --from=build * remove commented blocks * PR comments * fix syntax * fix syntax * use timestamp as build number * remove latest tag * add build_timestamp variable * remove wrong property * fix docker run command * test build id * Use datestamp build id * change build tags * add no-cache to docker build * rename BUILD_VERSION -> BUILD_CONFIG Co-authored-by: Jingyan Wang <jingywa@OrtDevTest2v100.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net> Co-authored-by: Jingyan Wang <jingywa@OrtTrainingDev3.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>	2020-08-12 13:29:37 -07:00
Sheil Kumar	8a66ad79a6	Add Experimental WinRT API IDL as placeholder for adding new winrt features (#4736 ) * Add experimental winrt api idl with dummy type to satisfy the build * remove experimental from the api_lib target * make experimental api available on windows builds also * remove /y /d * revert some pathing changes * remove experimental api call from tests * revert cppwinrt cmake changes * switch to stdapi Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2020-08-12 12:45:19 -07:00
Vincent Wang	7e955960f1	Optimize Slice Kernel by Removing If-statement (#4753 ) * Slice kernel optimization. * remove space Co-authored-by: Vincent Wang <weicwang@AiFramework2080ti2.corp.microsoft.com>	2020-08-12 16:36:03 +08:00
Josh Bradley	b7254551f0	Add new api function At() (#4457 ) * add modern standards to function arguments * add first version of At for better tensor element access	2020-08-11 18:34:03 -07:00
Scott Bonebrake	38c804a048	Fix broken link to ScoreMNIST.java in Java_API.md (#4213 )	2020-08-11 17:36:19 -07:00
Ryan Hill	ac725b53f6	Convert TensorRT provider into a shared library (#4721 ) Lots of changes to shared library interfaces, new lighter weight design.	2020-08-10 21:17:16 -07:00
Dmitri Smirnov	ac4997665a	Make Java Publishing and Java GPU pipelines to run nightly (#4749 ) Schedule Java daily Bump up iInux GPU build timeout	2020-08-10 17:38:45 -07:00
Yang Chen	f51385fd1e	Yanchen/nuphar/clip 11 (#4737 ) * [WIP] log unsupported ops in Nuphar * [Nuphar] added support for clip-11 also added some log information for unsupported ops in Nuphar	2020-08-10 15:45:21 -07:00
Jake Mathern	6037b41c17	Merged PR 5002755: Add Celu, GreaterOrEqual, LessOrEqual Adds onnx opset 12 operators elementwise GreaterOrEqual, LessOrEqual, and activation Celu [winml pr](https://microsoft.visualstudio.com/WindowsAI/_git/WindowsAI/pullrequest/5002669) Related work items: #27469675, #27469695, #27469701	2020-08-10 21:07:50 +00:00
Dmitri Smirnov	3530ce541c	Expose IOBinding features via C/C++/C# language bindings. (#4646 ) Expose I/O Binding in C/C++/C# Expose OrtAllocator, OrtMemoryAllocation, OrtMemoryInfo and OrtIoBinding	2020-08-10 13:33:49 -07:00
Scott McKay	6c33d7f5df	Fix bug in Loop optimization (#4210 ) * Fix bug where an optimization to avoid a copy resulted in the iteration num for a Loop subgraph * Update comments to clarify	2020-08-11 06:31:29 +10:00
Tiago Koji Castro Shibata	082a741636	Move DNNL workaround to EP (#4738 )	2020-08-10 13:06:22 -07:00
edgchen1	487665c21f	Transpose MatMul fusion fixes (#4728 ) Fix Transpose MatMul fusion handling of existing TransposeScaleMatMul node's attributes and enable support for missing Transpose perm attribute. Update expected test data to account for floating point calculation differences resulting from the fusion.	2020-08-10 13:00:22 -07:00
Tianlei Wu	316d1a9e69	Update benchmark for large model or model name with non-alphanumeric. (#4743 ) * Export model > 2GB using external data format	2020-08-10 12:58:01 -07:00
Vagif	6499a38b7d	Add the missing onnx_proto import (#4705 ) * add missing onnx_proto import * Fix TensorProto usage in calibrate.py * remove unused imports	2020-08-10 12:46:21 -07:00
Scott McKay	2e3ccc7518	Change order of some checks to workaround a linker issue when /LTCG:incremental is set. (#4713 )	2020-08-10 17:54:11 +10:00
Nat Kershaw (MSFT)	24d4f76436	Added explicit instructions to build for Jetson (#4714 ) * Added explicit instructions to build for Jetson. * Update after review	2020-08-09 20:28:20 -07:00
Bowen Bao	abbb7f6f5c	Avoid duplicated calls of postprocess in training frontend (#4579 )	2020-08-07 21:34:11 -07:00
stevenlix	77c69a0325	Upgrade TensorRT to v7.1.3.4 (#4704 ) * upgrade to TensorRT 7.1.3.4 * Upgrade onnx-tensorrt parser for TensorRT 7.1.3.4 * fix format issue * fix format issue * fix format issue * Update tensorrt_execution_provider.cc * change cmake version to 3.14 * Remove --msvc_toolset 14.16 * change to onnxruntime::make_unique * use onnxruntime::make_unique * disable some tests for TensorRT * disable some tests for TensorRT * Update upsample_op_test.cc * Update tile_op_test.cc * disable some tests for TensorRT * Update constant_of_shape_test.cc * update parser * Update Dockerfile.ubuntu_tensorrt	2020-08-07 17:43:56 -07:00
Oliver Rausch	9c3153acd6	Improve shape inference for OneHot (#4452 ) * Improve shape inference for OneHot Attempt to get the depth parameter before adding a new symbolic dimension. * Update symbolic shape infer * Nit	2020-08-07 14:05:20 -07:00
Tianlei Wu	9c729d1719	Update notebook for mac since onnxruntime 1.3 or 1.4 in mac does not have openmp (#4732 )	2020-08-07 14:01:48 -07:00
Marcus Turewicz	37c45c3d6b	C# ResNet50 v2 sample/tutorial (#4722 ) C# ResNet50 v2 sample Update samples README	2020-08-07 13:36:36 -07:00
Ye Wang	61726e58f0	fix (#4697 )	2020-08-07 13:08:41 -07:00
Sergii Dymchenko	c334b5738e	Remove docstring for removed parameter (#4734 )	2020-08-07 11:43:36 -07:00
Yufeng Li	b22091dc91	Add the framework to support prepack (#4413 ) * add support of prepack * add support for QAttention and DynamicQuantizeMatMul * add an use_prepacking option * add use_prepacking in c_sharp api	2020-08-07 09:39:19 -07:00
zhijxu-MS	33fe770037	Support log sigmoid gradient (#4719 ) * add log's gradient op and its related gradient test * support sigmoid's gradient op * resolve review comments	2020-08-07 11:21:36 +08:00
Wei-Sheng Chin	7905c57f43	Revert "Remove code which is not thread-safe. (#4454 )" (#4712 ) * Revert "Remove code which is not thread-safe. (#4454)" This reverts commit `5222b2c6c0`. * Resolve race condition * More thread-safe changes * Remove unused lock Polish comments	2020-08-06 18:42:05 -07:00
ashbhandare	fc2f36c608	Shape independent gradient builder for Concat (#4675 ) * Add gradient for ConcatTraining * Graph rewriter changes for concat * Add generated onnx graph, minor fixes * Revert unintended change * Fix for MaxPoolGradTest * Fix UT * Review comments, windows tests * Review comments	2020-08-06 14:39:33 -07:00
gwang-msft	8507bc1f48	[Android NNAPI EP] Enable test for BatchNormalization, enable dev_mode for Android, fix some issues in concat (#4715 ) * update batch_norm test, enable dev_mode for nnapi, ignore onnx protobuf warning for nnapi ep * fix some issues in concat and mark input without shape as not supported for now * address review comments * addressed comments	2020-08-06 14:11:59 -07:00
suffiank	4d39c6a6cb	Wire log(softmax) grad cuda kernel and add log(softmax) grad cpu kernel (#4726 ) * logsoftmax cuda kernel * add cpu logsoftmaxgrad * revert debug printout * revert disable for debug builds * use /alpha x + y instead * remove misleading log_softmax_ bool Co-authored-by: suffian khan <sukha@OrtTrainingDev1.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>	2020-08-06 10:49:08 -07:00
KeDengMS	9a73c8f448	ReshapeGrad optimization (#4708 ) * Reshape optimization * Refactor the Reshape optimization to be more generic Co-authored-by: Ke Deng <kedeng@OrtTrainingDev1.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>	2020-08-05 23:26:02 -07:00
suffiank	005fa5c3ae	Add initial Dockerfile for distributed training targets (#4578 ) * add training dockerfile tested for examples repo * forgot pytorch patch for build from source * make apt-get update -y adjacent apt-get install -y due to Docker caching rules * comment for mellanox libraries * mpi4py comment as I forgot where it came from * apparently curl not included anymore * grr.. nvidia change nccl location * dont need findnccl.patch after nvidia changed nccl location * pr comment /opt/ompi4 => /opt/openmpi-xxx * switch to pip install pytorch * use Release instead of RelWithDebInfo * comment wording * wordin * missed RelWithDebInfo => Release * replace Mellanox with libibverbs * stale comment * ordering * no more ninja * add / at end of copy * update cgmanifest.json * pr comments Co-authored-by: suffian khan <sukha@OrtTrainingDev1.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>	2020-08-05 18:54:54 -07:00
RandySheriffH	e802b0498f	EnrichPyOpUT (#4681 ) * cancel night build on pyop * enrich PyOp UTs * init script only once * remove space * update models * Show usage of kwargs in doc	2020-08-05 14:11:56 -07:00
Yang Chen	43142a8225	[Nuphar] added Gemm-to-MatMul conversion in model editor (#4691 ) * [Nuphar] added Gemm-to-MatMul conversion in model editor * added a mode gemm_to_matmul that turns Gemm Ops into MatMul Ops * enabled model_quantizer to quantize MatMul inside a Loop op * this PR also included Gemm-11 support from Ke Deng * Fixed a couple of existing bugs Fixed a couple of old bugs exposed by the newly-added tests and the support of Gemm-11, including: * correctly handle aliasing among states and outputs in Scan * fixed a transpose issue in building tvm IR for MatMul * fixed an issue related to generating IR for computing Gemm alpha * disabled several tests that triggered some deep issue (likely) in the graph partitioner. I think it might be better to have a separate PR to address the issue.	2020-08-05 13:31:30 -07:00

... 177 178 179 180 181 ...

11997 commits