onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-13 18:08:13 +00:00

Author	SHA1	Message	Date
edgchen1	999554cc53	CGManifest - add training entries and generate entries for submodules. (#3933 ) Add cgmanifest.json entries for training dependencies. Add script to generate git submodule cgmanifest.json entries.	2020-05-15 13:34:18 -07:00
ytaous	bc441b7e5c	Add cpu/mem usage for perf metrics (#3947 ) * add cpu/mem usage * on comments * on comments * renaming Co-authored-by: Ethan Tao <ettao@microsoft.com>	2020-05-15 12:29:40 -07:00
Tracy Sharpe	be003dbab7	Fix ACL build break (#3952 )	2020-05-14 23:14:15 -07:00
Pranav Sharma	47ae9691fd	Fix ordering of APIs. (#3951 )	2020-05-14 21:27:46 -07:00
Hariharan Seshadri	9ef376880b	Add test for If node with conditional branches only containing Constant nodes (#3949 )	2020-05-14 19:21:40 -07:00
Ryan Lai	38467f8c9a	DirectML Nuget package has different time stamp than Native and Managed Nuget (#3950 ) * Fix DirectML nuget creation in Nuget pipeline * DirectML Nuget package has different timestamp * remove accidentally changed file	2020-05-14 18:52:08 -07:00
Jeff Bloomfield	e6da5946d1	Update DML Nuget version and DML EP Doc (#3945 ) Update DML Nuget version and DML EP Doc	2020-05-14 17:33:46 -07:00
Tianlei Wu	782c6c24b2	Rename bert to transformers (#3946 ) * rename folder bert to transformers * rename bert_model_optimization.py to optimizer.py * update URL links in notebooks	2020-05-14 15:32:59 -07:00
Zhang Lei	3c4f3d01cd	Implement QLinearLeakyRelu (#3648 ) * Implement QLinearRelu and its unit test. * Add logic to compute table during constructor when all parameters is constant. * Fix test case rounding result related with rounding mode.	2020-05-14 14:52:55 -07:00
Scott McKay	5e0928a777	Enable running PEP8 on python scripts using flake8 (#3928 ) * Enable running PEP8 checks via flake8 as part of the build if flake8 is installed. Update scripts in \tools and \onnxruntime\python. Excluding \onnxruntime\python\tools which needs a lot more work to be PEP8 compliant. Also excluding orttraining\tools for the same reason. Install flake8 as part of the static_analysis build task in the Win-CPU CI so the checks are run in one CI build. Update coding standards doc.	2020-05-15 07:15:06 +10:00
Yulong Wang	39814319b2	[Node.js binding] fix linux build (#3927 )	2020-05-14 12:37:32 -07:00
Yufeng Li	50f798dad6	support non-zero zero point for matmulinteger u8s8 (#3883 ) * support non-zero zero point for matmulinteger u8s8	2020-05-14 12:02:28 -07:00
Changming Sun	9c989c8dd6	Update build doc for cross-compiling (#3672 )	2020-05-14 11:43:06 -07:00
manashgoswami	cab21223b3	Updated TPN for OpenMPI and cleanup (#3932 ) * Update README.md * Update ReleaseManagement.md * Updated Third Party Notice for training feature Added Open MPI license	2020-05-14 11:42:44 -07:00
gwang-msft	cba8bdc790	Make some compile change for Android NNAPI provider using DNNLibrary (#3935 ) * Change compile settings for NNAPI with DNNLib * update build.py * update build readme	2020-05-14 10:53:37 -07:00
Prasanth Pulavarthi	84c108a85e	link to folder instead of READMEs inside folder (#3938 ) otherwise hard to find the source code	2020-05-14 07:35:23 -07:00
Ryan Lai	48f69cfbb8	Fix DirectML nuget creation in Nuget pipeline (#3929 )	2020-05-13 19:34:38 -07:00
Chi Lo	f380460a9e	Update the build steps to support ORT on Jetson (#3869 ) * Update BUILD doc for ARM64 build for TensorRT support on Jetson device * minor revision * JetPack 4.4 is in developer preview stage, so we suggest to use JetPack 4.3	2020-05-13 19:30:27 -07:00
ytaous	93eb9bcfde	Add yaml/perf scripts for new perf test pipeline (#3909 ) * yaml/perf scripts for new pipeline * yaml/perf scripts for new pipeline * remove unused imports * testing some comments change * testing some comments change * testing jdbc * testing jdbc * testing jdbc * exclude pwd from jdbc properties * exclude pwd from jdbc properties * namedtuple * on comments Co-authored-by: Ethan Tao <ettao@microsoft.com>	2020-05-13 14:15:17 -07:00
Changming Sun	e86214e5c0	Fix the tensorflow performance test (#3847 )	2020-05-13 11:52:59 -07:00
Yufeng Li	7c774e967a	support quantization of optimized model with ir<4 (#3853 )	2020-05-13 11:16:37 -07:00
Prabhat	25257a661d	Added onnxruntime aarch64 wheel to pypi publishing pipeline (#3903 ) * Added onnxruntime aarch64 wheel to pypi publishing pipeline * Support nightly build flag * Add support for nightly build	2020-05-13 23:20:29 +05:30
Jeff Bloomfield	1c1685ad2b	Fix error handling in LearningModelSession.cpp (#3920 )	2020-05-13 10:13:16 -07:00
Tiago Koji Castro Shibata	385073e1cd	Fix DmlCopyTensor test (#3923 ) * Fix heap corruption * Cleanup	2020-05-13 09:14:55 -07:00
Zhang Lei	eab61e87ce	Fix quantization tool bugs when model nodes have no name. (#3854 ) Fix bugs when model nodes have no name.	2020-05-12 20:38:26 -07:00
liqunfu	9b5daa2039	patch torch onnx opset 10 (#3910 ) patch pytorch to export onnx nll_loss opset version 10. add mnist test to covert onnx opset version 10.	2020-05-12 18:11:25 -07:00
Ori Levari	7b858d60b0	Various changes for automated downlevel test pipeline (#3901 ) Co-authored-by: Ori Levari <orlevari@microsoft.com>	2020-05-12 17:22:47 -07:00
Hariharan Seshadri	3065219cc1	Changes related to the release binaries requiring Visual C++ 2019 runtime (#3871 )	2020-05-12 17:07:06 -07:00
Xiang Zhang	bccbdd03f1	User/xianz/enable batch tests (#3914 ) * enable batch tests in winml_image_test * copy batchGroundTruth folder * skip GPU tests when GPU is unavailable	2020-05-12 15:46:46 -07:00
Dmitri Smirnov	18dc0ec39f	Rework jar by creating os-arch folders (#3849 ) Detect os and arch and move the artifacts to a new folder. Remove unnecesary jars so we cam focus on those we publish. Add signing Make signature simlper. Fix indent. Halt on 32-bit arch. Credits: @Craigacp	2020-05-12 15:31:20 -07:00
Hariharan Seshadri	c00945ae81	Build ORT by default for Mac OS X versions 10.12+ (#3626 )	2020-05-12 14:43:32 -07:00
Scott McKay	99415f09fe	Fix bug where linear_output_ is not cleared when linear_before_reset is true and no bias input is provided. Requires a batch size of 3 or more to trigger if initial_h is not provided. (#3893 )	2020-05-13 07:18:32 +10:00
Andrews548	475ea382f9	Fix ACL EP convolution-activation fusion optimization (#3896 ) Co-authored-by: Andrei-Alexandru <andrei-alexandru.avram@nxp.com>	2020-05-12 14:06:48 -07:00
Jeff Bloomfield	f170f31e1d	Extend workaround with input name matching in DML fused graph kernel (#3918 )	2020-05-12 14:04:09 -07:00
Bowen Bao	0f82b42fed	Ensure pt model is set to cpu in ort_trainer (#3867 ) * Ensure pt model is set to cpu in ort_trainer * add note comment	2020-05-12 13:32:27 -07:00
Tianlei Wu	29496170d7	Add Benchmark Script for Bert Models (#3829 ) Add benchmark script for Transformer models * Set intra_op_num_threads=1 for cpu (version <= 1.2.0) * Add percentiles for latency * torch.set_num_threads (for intra op) to get fair comparison * Allow export ONNX model with specified number of inputs * Add fusion statistics * Install transformers from source	2020-05-12 12:26:22 -07:00
Scott McKay	2fed37c8eb	Fix bug in handling of an initializer that provides a graph output. (#3912 ) * Outputs from model execution should always be returned in a newly allocated buffer or an pre-allocated buffer provided in fetches. When an initializer is providing a graph output (e.g. constant folding may result in this) we were returning an OrtValue that pointed to the initializer and not a separately allocated buffer with a copy. This was wrong as: - value wasn't returned in a pre-allocated fetch so whilst the value returned was correct, it was returned in the wrong place - user could alter the data in the initializer via the returned value * Add unit test with and without pre-allocated fetch. * Add some extra info around why we're handling this special case.	2020-05-12 20:42:58 +10:00
George Wu	6f729b100f	use LOAD_WITH_ALTERED_SEARCH_PATH for LoadLibraryExA (#3908 )	2020-05-11 19:53:34 -07:00
Thiago Crepaldi	70abb120b3	Remove ORTModel from frontend API (#3825 ) * Resolve conflict * Address review	2020-05-11 18:20:33 -07:00
Tracy Sharpe	b12d35b679	MLAS: tune softmax kernels for partial vectors (#3906 )	2020-05-11 18:02:50 -07:00
M. Zeeshan Siddiqui	c46a9e8d65	Add numerical stability to SoftmaxGrad test inputs. (#3857 ) * Increase the tolerance for SoftmaxGrad CPU-GPU compare tests. * Increase the tolerance for SoftmaxGrad CPU-GPU compare tests. * Add 1e-2 to Y for numerical stability. * build break. * comments. * PR feedback. * PR feedback.	2020-05-11 17:59:24 -07:00
Jeff Bloomfield	af7d453435	Merge DML Execution Provider updates (#3885 ) * Merged PR 4616739: Update QLinear Ops fix 1D support layout Update QLinear Ops fix 1D support layout Related work items: #26011523 * Merged PR 4617257: Gather operator DML EP fails with scalar indices and 1D inputs Fix gather with scalar value. The ONNX conformance test case is in another PR: // 0D, axis 1, rank 0 indices tensor { "op_type": "Gather", "axis": 0, "data": [1,2,3], "indices": 0, "output": 1, "T": "float32" } * Merged PR 4632178: Re-enable ORT onnx_test_runner test case (DirectML ConvTranspose validation needs to be loosened to comply with ONNX definition of output_padding) Re-enable 1D convolution tests. Related work items: #23499747 * Merged PR 4656672: Make DML EP use Direct queue While a Compute queue has benefits, Direct is consistent with Winml. Related work items: #26324112 * Update DML nuget version * Merged PR 4662079: Update DmlDev branch again from github master Include Sheil's changes to fix namespace and header file include paths. Without this, the ONNX conformance tests all fail with E_NOTIMPL. * Increment DML nuget version Co-authored-by: Nick Feeney <nickfe@microsoft.com> Co-authored-by: Dwayne Robinson <dwayner@microsoft.com>	2020-05-11 17:57:01 -07:00
Scott McKay	0cdc1459c6	Add a few more things to the helper python scripts. (#3842 ) * Add a few more things to the helper python scripts. Add documentation with usage examples. * Address PR comments	2020-05-12 09:59:40 +10:00
stevenlix	28f693a8a1	Update TensorRT dynamic shape profile when input shape changed during runtime (#3904 ) * Update dynamic shape range when input shape changed during runtime * Update tensorrt_execution_provider.cc * Update tensorrt_execution_provider.cc	2020-05-11 16:11:41 -07:00
Ori Levari	6d2d927809	add test to api tests to ensure correct hresult is returned for corrupt model file (#3796 ) Co-authored-by: Ori Levari <orlevari@microsoft.com>	2020-05-11 13:22:22 -07:00
Sheil Kumar	c7da194313	remove winrt (#3899 ) Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2020-05-11 10:47:07 -07:00
Prabhat	ce3678ffaf	Added aarch64 build pipeline (#3841 ) * Added aarch64 build pipeline * Fix build error * Remove auditwheel repair which doesn't work with cross compiling * Statically link C++ * Added auditwheel repair back and fix stdlib.h * Remove extra space	2020-05-11 22:56:16 +05:30
Scott McKay	d7e39569da	Use MlasComputeSoftmax in traditional ML ops (#3892 ) * Use MlasSoftmax in ML ops * Refine when mlas is used based on perf testing.	2020-05-11 16:28:59 +10:00
Hariharan Seshadri	06985a9922	Stop proceeding with constant folding if a CPU kernel is not found found (#3836 ) * Stop proceeding with constant folding if a CPU kernel is not found for a node * Fix build * PR feedback * Fix typo * Refine * Remove unnecessary header inclusion * Refine * Fix build * More changes * More changes * More changes * Fix CentOS build	2020-05-10 19:08:37 -07:00
Ryan Lai	7fd2c8f9e8	Add signed GPU nuget package to publish ort-nightly nuget feed (#3834 ) * Add signed nuget package to publish ort-nightly nuget feed * Push managed nuget as well * Indentation fix * Indentation fix * Update gpu.yml to also publish directml nuget * Fix typo in naming of task	2020-05-10 16:24:45 -07:00

1 2 3 4 5 ...

2541 commits