onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-18 18:52:16 +00:00

Author	SHA1	Message	Date
pengwa	92b8a7a2be	GPT2 Gelu Fusion & Test (#3009 ) * GPT2 Gelu Fusion & Test * change header path * Refine code & add missing test onnx file * Fix builds & refine float/double/fp16 compare. * Fix builds * Add Bias Check and UTs * Fix build and uts * Fuse with second formula & test * minor change * disable FastGelu to see whether the builds can pass * Verify where is wrong * disable for debugging * Revert "disable for debugging" This reverts commit 535c0817fb36fb95a75773a7f00c8b969dd5362c. * Revert "Verify where is wrong" This reverts commit ffc43ec1d136636ba2cee30df49f563a75e84676. * disable the transformer for inference currently * Enable FastGeluFusion and fix segment fault when run bertsquad10.onnx test * Add more Unit tests covering Gelu subgraph use graph input/output (cherry picked from commit 0739ab985240c6d9acdb8f0afd40c5fb316166af) * Mode Bias Fusion in BiasGelu.cc Co-authored-by: Changming Sun <chasun@microsoft.com>	2020-02-21 18:25:43 +08:00
Scott McKay	932ecaea34	Some documentation updates. (#3060 )	2020-02-21 20:07:39 +10:00
William Tambellini	c033cfee92	Check python version (#356 ) Check python version in build.py	2020-02-20 23:26:25 -08:00
Changming Sun	45ba325fa6	Remove USE_NSYNC macro (#3052 )	2020-02-20 13:29:19 -08:00
Scott McKay	a1db87b382	Add SafeInt bounds checking to memory allocation size calculations. (#3022 ) * Add SafeInt bounds checking to memory allocation size calculations. * Fix TensorRT library includes	2020-02-20 11:41:03 -08:00
Pranav Sharma	21f9a8bdc2	Allow creation of string tensor sequence (#3048 )	2020-02-20 11:27:42 -08:00
Changming Sun	cb24e2a214	Update nsync	2020-02-20 11:25:34 -08:00
Changming Sun	85c0989e6f	Fix build errors in benchmark	2020-02-20 11:22:28 -08:00
Changming Sun	5306a1241b	Server: Add build options for the other EPs	2020-02-20 11:14:24 -08:00
smk2007	44d5eaf3d7	WinML exists in the nuget packages but does not publish its WinMD and headers (#3037 ) * publish winmd and raw headers * Add the lib too * add missing conditions * Fix copy/paste condition error	2020-02-20 10:24:29 -08:00
Scott McKay	bcbc5ee815	Improve error message for type mismatch between data in initializer and graph usage of the initializer.	2020-02-19 23:52:20 -08:00
James Yuzawa	dde4df148b	Add and run spotless formatter for Java and Gradle	2020-02-19 23:51:09 -08:00
Changming Sun	ef2bba316b	CUDA 10.1 for Windows(#3049 )	2020-02-19 23:26:47 -08:00
Xiang Zhang	18aa0940c2	checking condition is wrong (#3025 ) * checking condition is wrong * remove unnecessary check	2020-02-19 13:29:42 -08:00
Changming Sun	e3c27536d0	Python binding doesn't need to link to the python lib on Linux	2020-02-19 12:18:47 -08:00
Dmitri Smirnov	daf8c4bee4	Remove featurizers from CPU MLDNN and NoContribOps builds. (#3039 ) The first one is temp. The second one is permanent removal.	2020-02-19 06:23:36 -08:00
James Yuzawa	411b3aa801	Java build system enhancements (#2866 )	2020-02-18 15:41:49 -08:00
Tracy Sharpe	ecdcd682bb	NCHWc ReorderOutput->Transpose(NHWC) fusion (#3035 ) Add support to fuse ReorderOutput+Transpose(NHWC). Converting from NCHWc to NHWC tensors is a trivial copy of data and avoids the cost of a transpose node.	2020-02-18 10:23:48 -08:00
Changming Sun	71ca43b345	Revert an op version change (#3026 ) Revert an op version change, it was brought in from #2999	2020-02-18 09:43:18 -08:00
Tracy Sharpe	cc8adc87c3	fix NCHWc intermediates as graph outputs (#3033 ) This fixes a customer reported issue where the NCHWc optimizer was dropping graph outputs when an edge was used as both a graph output and an input to another NCHWc node.	2020-02-17 14:52:33 -08:00
daquexian	4ca50d9352	Update DNNLibrary to v0.9.0 and update NNAPI GetSupportedNodes	2020-02-17 13:24:10 -08:00
Tianlei Wu	aea76b0786	Bert optimization for onnx model exported by keras2onnx (#3014 ) * Optimization for Bert and DistilBert model exported by keras2onnx * Add model_type parameter for models from different export tools (pytorch, tf2onnx, keras2onnx). * Split LayerNormalization and SkipLayerNormalization fusions	2020-02-15 23:59:49 -08:00
Tracy Sharpe	3626c46fad	Optimize Math::Im2col + ConvInteger pointwise (#3017 ) Optimize the implementation of Math::Im2col that is currently used for ConvInteger/QLinearConv. Also, avoid Im2col for pointwise convolutions in ConvInteger.	2020-02-14 14:25:47 -08:00
Yulong Wang	f98f0f8717	fix type constraint name of Resize (#3019 )	2020-02-14 14:13:32 -08:00
ytaous	2b77cb19bd	merge training kernels to master (#2999 ) * merge training kernels to master * merge training kernels to master * revert two files * merge training kernels to master * merge training kernels to master * merge training kernels to master * merge training kernels to master * merge training kernels to master * merge training kernels to master * merge training kernels to master * merge training kernels to master * merge training kernels to master * merge training kernels to master * merge training kernels to master * merge training kernels to master * merge training kernels to master * merge training kernels to master * merge training kernels to master * merge training kernels to master * merge training kernels to master * merge training kernels to master	2020-02-13 14:52:35 -08:00
George Wu	584ba71485	TensorRT dockerfile updates (#3016 ) * change npy installation * update trt base image version * update build instructions for arm64/jetson	2020-02-13 14:34:00 -08:00
Hariharan Seshadri	c9f18756b9	Avoid unnecessary copies of ModelProto from being made in the InferenceSession class (#3012 ) * Avoid unnecessary copy creations of ModelProto * Comment nit * Nit * Comment refactoring * Comment refactoring * Fix build break * Fix a few more instances where copies take place	2020-02-12 18:54:11 -08:00
Changming Sun	69bc8ce3c2	Upgrade protobuf to 3.11.3	2020-02-12 14:47:00 -08:00
Changming Sun	5f29c9c366	Add some test cases (#3010 ) 1. Add test cases for the schema registry in onnxruntime_graph 2. Use ASSERT_STATUS_OK when appropriate	2020-02-12 14:00:50 -08:00
Dmitri Smirnov	c6b499bfc9	Fix batch mode output for TfIdfVectorizer (#3008 ) Fix weights indexing in TfIdfVectorizer Add a test case to cover batching.	2020-02-12 10:46:20 -08:00
stevenlix	da653ccdac	Upgrade TensorRT to version 7.0.0.11 (#2973 ) * update onnx-tensorrt submodule to trt7 branch * add fp16 option for TRT7 * switch to master branch of onnx tensorrt * update submodule * update to TensorRT7.0.0.11 * update to onnx-tensorrt for TensorRT7.0 * switch to private branch due to issues in master branch * remove trt_onnxify * disable warnings c4804 for TensorRT parser * disable warnings c4702 for TensorRT parser * add back sanity check of shape tensor input in the parser * disable some warnings for TensorRT7 * change fp16 threshold for TensorRT * update onnx-tensorrt parser * fix cycle issue in faster-rcnn and add cycle detection in GetCapability * Update TensorRT container to v20.01 * Update TensorRT image name * Update linux-multi-gpu-tensorrt-ci-pipeline.yml * Update linux-gpu-tensorrt-ci-pipeline.yml * disable rnn tests for TensorRT * disable rnn tests for TensorRT * disabled some unit test for TensorRT * update onnx-tensorrt submodule * update build scripts for TensorRT * formatting the code * Update TensorRT-ExecutionProvider.md * Update BUILD.md * Update tensorrt_execution_provider.h * Update tensorrt_execution_provider.cc * Update win-gpu-tensorrt-ci-pipeline.yml * use GetEnvironmentVar function to get env variables and switch to Win-GPU-2019 agent pool for win CI build * change tensorrt path * change tensorrt path * fix win ci build issue * update code based on the reviews * fix build issue * roll back to cuda10.0 * add RemoveCycleTest for TensorRT * fix windows ci build issues * fix ci build issues * fix file permission * fix out of range issue for max_workspace_size_env	2020-02-12 07:03:58 -08:00
Dmitri Smirnov	273868eaa5	Disable NuGetPackaging on Linux GPU and remove DML from the pipelines (#3006 )	2020-02-11 20:08:18 -08:00
Faith Xu	ed0875909e	Reorganize and update readme (#2916 ) * Reorganize and update readme * Minor fixes * Minor fixes based on comments * fix typo	2020-02-11 15:59:20 -08:00
Tiago Koji Castro Shibata	e5f7e04e6f	Use target_add_dml function (#3005 ) * Use target_add_dml function * Don't access DML if disabled	2020-02-11 12:54:38 -08:00
Changming Sun	abb626ff60	Provide alternative std::mutex implementation on Windows (#3000 ) Provide alternative std::mutex implementation on Windows. OrtMutex is no longer an alias of std::mutex. We do it because: 1. This new thing is faster and much much simpler. 2. Static constructors are considered harmful. We should avoid them as much as we can.	2020-02-11 11:46:08 -08:00
Dmitri Smirnov	c1997db85e	Exclude featurizers from Linux NuGet packaging.	2020-02-10 22:21:52 -08:00
Dmitri Smirnov	36915b3674	Temporarily remove Featurizers from packaging-pipelines	2020-02-10 22:21:52 -08:00
smk2007	ce713823cc	enable winml in the gpu ci pipeline (#2993 )	2020-02-10 22:21:13 -08:00
smk2007	5c5ac34b5c	Disable use_dml in nuget pipeline (#3001 )	2020-02-10 22:09:58 -08:00
Tiago Koji Castro Shibata	fb2182f3fc	Release ARM/ARM64 Nuget packages (#2987 ) * Enable ARM64 release builds * Add ARM release * Skip C# dll signing in ARM * Copy ARM binaries to Nuget * Restore nuget packages before ARM packaging * wip * Use host protoc at C# build * Set ProtocDirectory on cross-compiled builds * wip * Fix typo	2020-02-10 16:29:27 -08:00
Hariharan Seshadri	3afb83ac3c	Support a few new C/C++ APIs (#2794 ) * Initial commit * More changes * More changes * Changes * More changes * More changes * More changes * More changes * Updates * Fix break * PR feedback * Nit * Resolve conflicts * More changes	2020-02-10 16:18:42 -08:00
Dmitri Smirnov	7437928f47	Replace hardcoded State serialization for Featurizer kernel tests (#2992 ) Use in-flight serialization for transformers State instead of hard-coded values.	2020-02-10 10:02:09 -08:00
Changming Sun	64deb8030f	Update ABI_Dev_Notes.md (#2959 )	2020-02-07 20:09:56 -08:00
Xiang Zhang	b8d7d81506	Calling WinML enables telemetry by default for 1.2 (#2988 )	2020-02-07 15:52:27 -08:00
smk2007	ca7626d739	Winml should be built against v2 of the capi as that includes the new type info apis (#2991 )	2020-02-07 12:53:30 -08:00
Dmitri Smirnov	c8ea154e55	Package data_frame_tool, include featurizers into manylinux2010 (#2989 ) * Package data_frame_tool, exclude featurizers from manylinux2010 as they fail to build.	2020-02-07 11:38:42 -08:00
Tianlei Wu	62383b0328	Improve BERT optimization script: Gelu and LayerNorm for models from Tensorflow 2.* (#2957 ) * Add unit test. Add an option --use_onnxruntime to use onnxruntime to do optimization for pytorch model. Update layer norm and gelu for tensorflow 2.1 keras bert model. Add logging and use f-strings. Add extra checking for tensorflow model reshape fusion. Allow output model to json for test purpose. update match parent path utility function to return index * remove function not used.	2020-02-07 11:01:03 -08:00
Ori Levari	0beb75ce77	populate file metadata for onnxruntime.dll (#2978 )	2020-02-07 11:00:28 -08:00
Dmitri Smirnov	4f4f4bcd92	Handle dynamic shapes and reshape the input according to the model (#2986 ) Put type validation in separate method.	2020-02-06 16:46:07 -08:00
Changming Sun	ec07fdc5b0	Update ThirdPartyNotices.txt for eigen (#2985 )	2020-02-06 16:09:10 -08:00


1884 commits