onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-06-07 00:13:17 +00:00

Author	SHA1	Message	Date
edgchen1	024b92a970	Use path relative to script location to refer to symbolic_opset10.py from install_deps.sh. (#3975 ) Update install_deps.sh to use relative path from script directory to symbolic_opset10.py. This allows install_deps.sh to be called from different working directories.	2020-05-18 13:36:06 -07:00
Adam Pocock	9d2d1eb6f6	[java] Adds a CUDA test (#3956 ) * [java] - adding a cuda enabled test. * Adding --build_java to the windows gpu ci pipeline. * Removing a stray line from the unit tests that always enabled CUDA for Java.	2020-05-18 12:05:51 -07:00
Hariharan Seshadri	1a183784a8	Fix C# layer in the way it handles sequences (#3965 ) * Fix C# layer in the way it handles sequence of tensors * Revert comment	2020-05-18 11:10:13 -07:00
edgchen1	e259a13f8e	Initial training Python packaging pipeline (#3767 ) Add a pipeline to produce training-enabled ORT wheels.	2020-05-18 09:41:00 -07:00
edgchen1	e55f24364a	Disable LTO on Windows training CPU build (#3960 ) Disable LTO on Windows training CPU build. Add a parameter to the win-ci-2019.yml build template for enabling LTO with a default value of true.	2020-05-18 09:24:10 -07:00
M. Zeeshan Siddiqui	44731e88bb	Add comments for zero valued normalization factor in SoftmaxCrossEntropyLossGrad CUDA kernel. (#3972 )	2020-05-18 09:08:09 -07:00
Scott McKay	fd8ea4e466	Improve handling of symbolic dimensions in the onnxruntime_test.py script. (#3959 ) If a symbolic dimension is found allow the user to provide a value, or default to 1. `python .\onnxruntime_test.py --symbolic_dims batch=1,seqlen=4 onnxruntime\test\testdata\transform\fusion\fast_gelu_use_graph_input.onnx`	2020-05-18 16:51:09 +10:00
Tianlei Wu	523d70f667	Improve Transformer Benchmark for FP16 (#3970 ) Disable ORT in offline optimization script (ORT could generate some fused ops (like FusedGemm) which cannot be converted to fp16). Remove some models from benchmark until we have optimizations for them.	2020-05-17 21:50:45 -07:00
Wei-Sheng Chin	0d11649bb3	Address comments from #3823 and polish code (#3964 ) * Address comments from #3823 and polish code * One line	2020-05-17 14:08:33 -07:00
Prabhat	4ff73d00b0	Fix python pkg permission issue (#3957 ) * Fix python pkg permission issue * Run chown with sudo * Add workspace clean to arm pipeline * Run docker as current user	2020-05-17 14:06:55 +05:30
Tianlei Wu	07e9a4c164	Update benchmark to reflect those used in our latest results (#3967 ) Update optimizer for GPT2 models exported from PyTorch 1.5. Update benchmark to use GPT2 models without Past State inputs/outputs. Update bert_perf_test to allow setting omp_num_threads etc. to test only one setting.	2020-05-16 20:17:40 -07:00
Tianlei Wu	56700bec83	Add example of python code to readme of transformers tools (#3966 ) * Use shorter name for tools * Use optimizer_cli * Add comments about -i parameter	2020-05-16 20:13:24 -07:00
Tianlei Wu	769c11f217	Update doc for transformers tools (#3963 ) * update readme for onnxruntime-tools package * update license section in benchmark	2020-05-16 11:13:34 -07:00
M. Zeeshan Siddiqui	a296b16719	Prevent divide by zero in CUDA implementation of SoftmaxCrossEntropyLossGrad. (#3962 )	2020-05-16 00:33:25 -07:00
KeDengMS	132ce3a561	Fixes for quantizing a BERT from HuggingFace (#3939 ) * Fixes for quantizing a BERT from HuggingFace * Address CR and some other minor fixes	2020-05-15 23:41:29 -07:00
Wei-Sheng Chin	33208c9f6b	Modify Pipeline Facilities to Fix PipeDream Deadlock (#3823 ) * Prepare utils for adding Wait's and Record's * Have a running PipeDream * Add comments * Polish comments * Clean code * Fix test * Polish names * Polish names * Remove debug headers * Fix a shape inference bug (not related to pipeline code) * Fix a warning * Address some comments * Address comments * Only touch consumers of outputs when re-wire edges	2020-05-15 18:27:19 -07:00
edgchen1	999554cc53	CGManifest - add training entries and generate entries for submodules. (#3933 ) Add cgmanifest.json entries for training dependencies. Add script to generate git submodule cgmanifest.json entries.	2020-05-15 13:34:18 -07:00
ytaous	bc441b7e5c	Add cpu/mem usage for perf metrics (#3947 ) * add cpu/mem usage * on comments * on comments * renaming Co-authored-by: Ethan Tao <ettao@microsoft.com>	2020-05-15 12:29:40 -07:00
Tracy Sharpe	be003dbab7	Fix ACL build break (#3952 )	2020-05-14 23:14:15 -07:00
Pranav Sharma	47ae9691fd	Fix ordering of APIs. (#3951 )	2020-05-14 21:27:46 -07:00
Hariharan Seshadri	9ef376880b	Add test for If node with conditional branches only containing Constant nodes (#3949 )	2020-05-14 19:21:40 -07:00
Ryan Lai	38467f8c9a	DirectML Nuget package has different time stamp than Native and Managed Nuget (#3950 ) * Fix DirectML nuget creation in Nuget pipeline * DirectML Nuget package has different timestamp * remove accidentally changed file	2020-05-14 18:52:08 -07:00
Jeff Bloomfield	e6da5946d1	Update DML Nuget version and DML EP Doc (#3945 ) Update DML Nuget version and DML EP Doc	2020-05-14 17:33:46 -07:00
Tianlei Wu	782c6c24b2	Rename bert to transformers (#3946 ) * rename folder bert to transformers * rename bert_model_optimization.py to optimizer.py * update URL links in notebooks	2020-05-14 15:32:59 -07:00
Zhang Lei	3c4f3d01cd	Implement QLinearLeakyRelu (#3648 ) * Implement QLinearRelu and its unit test. * Add logic to compute table during constructor when all parameters are constant. * Fix test case rounding result related to rounding mode.	2020-05-14 14:52:55 -07:00
Scott McKay	5e0928a777	Enable running PEP8 on python scripts using flake8 (#3928 ) * Enable running PEP8 checks via flake8 as part of the build if flake8 is installed. Update scripts in \tools and \onnxruntime\python. Excluding \onnxruntime\python\tools which needs a lot more work to be PEP8 compliant. Also excluding orttraining\tools for the same reason. Install flake8 as part of the static_analysis build task in the Win-CPU CI so the checks are run in one CI build. Update coding standards doc.	2020-05-15 07:15:06 +10:00
Yulong Wang	39814319b2	[Node.js binding] fix linux build (#3927 )	2020-05-14 12:37:32 -07:00
Yufeng Li	50f798dad6	support non-zero zero point for matmulinteger u8s8 (#3883 ) * support non-zero zero point for matmulinteger u8s8	2020-05-14 12:02:28 -07:00
Changming Sun	9c989c8dd6	Update build doc for cross-compiling (#3672 )	2020-05-14 11:43:06 -07:00
manashgoswami	cab21223b3	Updated TPN for OpenMPI and cleanup (#3932 ) * Update README.md * Update ReleaseManagement.md * Updated Third Party Notice for training feature. Added Open MPI license.	2020-05-14 11:42:44 -07:00
gwang-msft	cba8bdc790	Make some compile change for Android NNAPI provider using DNNLibrary (#3935 ) * Change compile settings for NNAPI with DNNLib * update build.py * update build readme	2020-05-14 10:53:37 -07:00
Prasanth Pulavarthi	84c108a85e	link to folder instead of READMEs inside folder (#3938 ) otherwise hard to find the source code	2020-05-14 07:35:23 -07:00
Ryan Lai	48f69cfbb8	Fix DirectML nuget creation in Nuget pipeline (#3929 )	2020-05-13 19:34:38 -07:00
Chi Lo	f380460a9e	Update the build steps to support ORT on Jetson (#3869 ) * Update BUILD doc for ARM64 build for TensorRT support on Jetson device * minor revision * JetPack 4.4 is in developer preview stage, so we suggest to use JetPack 4.3	2020-05-13 19:30:27 -07:00
ytaous	93eb9bcfde	Add yaml/perf scripts for new perf test pipeline (#3909 ) * yaml/perf scripts for new pipeline * yaml/perf scripts for new pipeline * remove unused imports * testing some comments change * testing some comments change * testing jdbc * testing jdbc * testing jdbc * exclude pwd from jdbc properties * exclude pwd from jdbc properties * namedtuple * on comments Co-authored-by: Ethan Tao <ettao@microsoft.com>	2020-05-13 14:15:17 -07:00
Changming Sun	e86214e5c0	Fix the tensorflow performance test (#3847 )	2020-05-13 11:52:59 -07:00
Yufeng Li	7c774e967a	support quantization of optimized model with ir<4 (#3853 )	2020-05-13 11:16:37 -07:00
Prabhat	25257a661d	Added onnxruntime aarch64 wheel to pypi publishing pipeline (#3903 ) * Added onnxruntime aarch64 wheel to pypi publishing pipeline * Support nightly build flag * Add support for nightly build	2020-05-13 23:20:29 +05:30
Jeff Bloomfield	1c1685ad2b	Fix error handling in LearningModelSession.cpp (#3920 )	2020-05-13 10:13:16 -07:00
Tiago Koji Castro Shibata	385073e1cd	Fix DmlCopyTensor test (#3923 ) * Fix heap corruption * Cleanup	2020-05-13 09:14:55 -07:00
Zhang Lei	eab61e87ce	Fix quantization tool bugs when model nodes have no name. (#3854 ) Fix bugs when model nodes have no name.	2020-05-12 20:38:26 -07:00
liqunfu	9b5daa2039	patch torch onnx opset 10 (#3910 ) patch pytorch to export onnx nll_loss opset version 10. add mnist test to convert onnx opset version 10.	2020-05-12 18:11:25 -07:00
Ori Levari	7b858d60b0	Various changes for automated downlevel test pipeline (#3901 ) Co-authored-by: Ori Levari <orlevari@microsoft.com>	2020-05-12 17:22:47 -07:00
Hariharan Seshadri	3065219cc1	Changes related to the release binaries requiring Visual C++ 2019 runtime (#3871 )	2020-05-12 17:07:06 -07:00
Xiang Zhang	bccbdd03f1	User/xianz/enable batch tests (#3914 ) * enable batch tests in winml_image_test * copy batchGroundTruth folder * skip GPU tests when GPU is unavailable	2020-05-12 15:46:46 -07:00
Dmitri Smirnov	18dc0ec39f	Rework jar by creating os-arch folders (#3849 ) Detect os and arch and move the artifacts to a new folder. Remove unnecessary jars so we can focus on those we publish. Add signing. Make signature simpler. Fix indent. Halt on 32-bit arch. Credits: @Craigacp	2020-05-12 15:31:20 -07:00
Hariharan Seshadri	c00945ae81	Build ORT by default for Mac OS X versions 10.12+ (#3626 )	2020-05-12 14:43:32 -07:00
Scott McKay	99415f09fe	Fix bug where linear_output_ is not cleared when linear_before_reset is true and no bias input is provided. Requires a batch size of 3 or more to trigger if initial_h is not provided. (#3893 )	2020-05-13 07:18:32 +10:00
Andrews548	475ea382f9	Fix ACL EP convolution-activation fusion optimization (#3896 ) Co-authored-by: Andrei-Alexandru <andrei-alexandru.avram@nxp.com>	2020-05-12 14:06:48 -07:00
Jeff Bloomfield	f170f31e1d	Extend workaround with input name matching in DML fused graph kernel (#3918 )	2020-05-12 14:04:09 -07:00


2557 commits