onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-06-06 00:03:22 +00:00

Author	SHA1	Message	Date
Ori Levari	93569bf0f4	fix regex to populate dll version information correctly	2020-03-13 11:35:49 -07:00
Yufeng Li	c69194ec4c	fix the missing return in _get_quantize_input_nodes and format code with yapf (#3199 ) * fix the missing return for function _get_quantize_input_nodes * format quantization code with yapf	2020-03-13 09:28:41 -07:00
Xavier Dupré	d99554bea1	Improves implementation of tree ensemble regressor and classifier (4 to 5 times faster) (#2692 ) * Improves implementation of tree ensemble regressor (4 to 5 times faster) * Use ORT_THROW	2020-03-13 14:10:37 +01:00
Scott McKay	e9d5ed270f	Normalizer performance improvements (#3201 ) * Simplify Normalizer as the spec only requires support for 2D input. Tried using eigen (LpNorm<1>(), and norm()) on each row but that was much slower. * Remove unused variable	2020-03-13 22:15:44 +10:00
Scott McKay	890cb78b20	Use Eigen::logistic instead of manually computing values. (#3186 ) * Use MlasComputeLogistic instead of manually computing values. * Update test script to allow the tolerance to be specified when checking float output from logreg_iris.onnx.	2020-03-13 20:27:25 +10:00
Hariharan Seshadri	b8575dda7b	Avoid some heap allocations in the InferenceSession and Model classes (#3103 ) * Avoid some heap allocations in the InferenceSession and Model classes	2020-03-12 18:38:10 -07:00
Changming Sun	a02638eb46	Adjust the threading logic in ThreadPool::ParallelFor (#3178 ) 1. Do not reuse the main thread. 2. Do not plus one when mlas calculate the number of tasks to schedule. (It was me put the plus one there) This is the second try of #1839 It's known that this change has negative performance impact on some of the models.	2020-03-12 11:33:33 -07:00
Scott McKay	f49912c42a	Performance improvement to Transpose when moving single axis. (#3173 ) * Avoid use of vectors for tracking reader/writer offsets as it adds too much overhead if there are a lot of readers or writers. Tracy found improvements in resnet34-ssd1200 and BERT Squad with this approach.	2020-03-12 14:49:02 +10:00
Paul McDaniel	6791ed0217	Documentation updates for 1.2 for WinML (#3149 ) * api governance draft * Update CONTRIBUTING.md updated for ABI proposals * Update CONTRIBUTING.md * Update CONTRIBUTING.md * Incomplete, a draft iteration of 2 more changes - api docs and high level design * pushing to see how the picture size works on screen. * added 2 charts on api choice and distribution choice * details on contract checking * lint cleanup and links * PR feedback. * fixed markdown and lists * more markdown and lists * fixed broken links * PR feedback * commas * PR comments from nick * PR feedback * fixed build section Co-authored-by: Nick Geisler <36938193+ngeisler11@users.noreply.github.com>	2020-03-11 14:19:30 -07:00
Hariharan Seshadri	a912415bac	Support custom ops targeting the CUDA EP (#3165 ) * Initial commit * Minor nit * Comment * Fix build * Fix build	2020-03-11 00:49:01 -07:00
Hariharan Seshadri	3464801c3e	Explicitly specify NugetPackage parameter while validating nuget in some release pipelines (#3139 )	2020-03-10 15:14:09 -07:00
Yufeng Li	3de1fc096d	Move zero point inputs of MatmulInteger to CPU memory (#3159 )	2020-03-10 13:56:23 -07:00
Tianlei Wu	51a8c82908	Update bert optimization script for SQuAD model exported by keras2onnx (#3163 ) Update script to make it work on fine-tuned bert model exported by keras2onnx	2020-03-10 12:57:49 -07:00
Yufeng Li	876d0c5430	Make quantization parameters as constant weight instead of overrideable (#3160 )	2020-03-10 08:35:02 -07:00
Scott McKay	3d928de778	Use GEMM for LinearRegressor and LinearClassifier operators to improve performance (#3154 )	2020-03-10 20:24:25 +10:00
Dmitri Smirnov	f87b6913cd	Add package download step before pushing to feeds (#3162 ) Add package download step before publishing.	2020-03-09 14:32:18 -07:00
Changming Sun	6ed5d7c332	Update post_binary_sizes_to_dashboard.py (#3161 ) Discussed with Faith, because the data size is very small and changes are gradual, there is no need to delete the old data. We want to keep all the history.	2020-03-09 13:21:58 -07:00
Tiago Koji Castro Shibata	a59243090a	Publish release symbols (#3152 ) * Publish release symbols * Publish symbols if IsReleaseBuild	2020-03-05 22:32:18 -08:00
Andrew Kane	781a6ebb06	Updated Ruby supported versions	2020-03-05 19:50:41 -08:00
pranavm-nvidia	cfd18b583a	Help output typo fix Fixes a typo in the help output for `symbolic_shape_infer`	2020-03-05 19:50:13 -08:00
Tianlei Wu	5be6665b86	Update Gelu Fusion to support new graph pattern from PyTorch 1.4 (#3148 ) * update GeluFusion to support pattern from PyTorch 1.4; * Fix a bug that missing the check of an edge between mul2 and root. * update script to fuse gelu from PyTorch 1.4 * Add test for python optimizer	2020-03-05 18:31:52 -08:00
Dmitri Smirnov	e2894c5ffb	Fix package name overrides (#3150 ) Add env var with the package name.	2020-03-05 17:10:55 -08:00
Yufeng Li	1d2b8115e2	Support u8u8 in quantization tool (#3140 )	2020-03-05 14:42:46 -08:00
KeDengMS	ade4fa108f	Disable delayload for cuda dlls (#3147 ) This change fixes #3129. When running onnxruntime as dll on Windows, CUDA does some internal cleanups when process exits. After this, any call to CUDA would cause crash. Delayload makes thread_local destructor to happen after CUDA cleanup, thus the crash.	2020-03-05 14:40:22 -08:00
Dmitri Smirnov	2c446a7f2f	Add push to ORT-NIGHTLY. (#3146 )	2020-03-05 11:38:22 -08:00
Yufeng Li	fbb658e603	Implement QuantizeLinear and DequantizeLinear (#3098 ) * Implement QuantizeLinear and DequantizeLinear	2020-03-04 13:30:20 -08:00
take-cheeze	83753bcbe3	Suppress maybe uninitialized warning in gcc-9	2020-03-04 11:52:40 -08:00
Dmitri Smirnov	ef8768a53f	Override native package name. Preserve managed package name the same. (#3133 ) Override native package name. Preserve managed package name the same. Specify package name for validation purposes. Fix up validation package name parameter.	2020-03-04 10:12:55 -08:00
Prabhat	a2eeb126b9	Optimised kernel_dot() in SVM op (#3135 )	2020-03-04 16:30:40 +00:00
Tianlei Wu	9d874c1225	Add bert performance and correctness test tools (#3108 ) (1) Add performance test tool for bert model. (2) Add accuracy test tool to compare inference results of original and optimized bert models. (3) Add test data generator tool to create test data for onnxruntime_perf_test.exe (4) Improve bert optimization script: Verify model producer for model_type; Add warning if model is not fully optimized. (5) Add shape optimizer tool to assist developing optimization script. (6) Update readme.	2020-03-03 23:18:08 -08:00
Yufeng Li	84ad4eda8b	Implement MatmulInteger on GPU (#3070 ) * Implement MatmulInteger	2020-03-03 16:36:33 -08:00
Changming Sun	12605f05d1	Fix CUDA PATH (#3131 ) Previously, we put the "bin" folder of all the CUDA versions in the system PATH. And 10.2 is in the front. It's a mess. So I've removed all of them from the system PATH env. But I need to add one of them back through build scripts. (The problem only affects the C# test, not the C/C++ tests that forked from build.py).	2020-03-03 14:34:19 -08:00
smk2007	6cdd2b4934	Enable DML Nuget Package for x64 or x86 architectures (#3120 ) * add dml gpu pipelines * add x86 to the gpu dml dev build pipeline * Enable DML x86 builds * Fix uint64_t -> size_t warning * fix warnings * enable dml on x86 ci builds * operatorHelper 773 error uint32_t vs uint64_t * operatorHelper 773 error uint32_t vs uint64_t * make x86 pipeline use the gpu pool * more warnings * fix x86 directml path * make dml nuget package * disable tf_pnasnet_large * disable zfnet512 * make validation use wildcards * disable x86 dml gpu tests * add args. * update gpu.yml * change nupkg wildcard * add debug statements * package x86 dml nupkg * dont drop managed nuget again from dml pipeline build * Add DML EULA * directml license should be renamed to not clobber the existing license * casing on dml package.... * {} to () * fix license name * disable dml from x86 ci * typo and cr feedback * remove featurizers * ship the dml pdb as well	2020-03-02 20:18:46 -08:00
Dmitri Smirnov	e45326b5df	Create NuGet packaging pipeline for ORT Featurizers (#3125 ) Create a new pipeline to publish ORT with Featurizers Update pipeline for two separate packages. Change package names.	2020-03-02 17:00:56 -08:00
Tracy Sharpe	b538cb7e46	NCHWc Upsample/Mul optimizations (#3116 ) Extend the NCHWc layout optimizer to handle Resize(mode=nearest) and Mul.	2020-03-02 14:40:49 -08:00
Hariharan Seshadri	4188b1111a	Add a summary for each ExecutionProviderAppend methods in SessionOptions.cs (#3111 ) * Add a summary for each ExecutionProviderAppend methods in SessionOptions.cs OnnxRuntime managed dll is EP agnostic meaning it will expose all methods pertaining to all possible EPs supported by OnnxRuntime in general. Not all these methods are really "available" to use for a .NET developer unless they have the corresponding native onnxruntime shared library. Adding a summary line so that intellisense points that out. * remove empty line	2020-02-28 21:46:57 -08:00
edgchen1	37f5fd8fb8	Add support for loading TensorProtos with external data from optimizer Initializer (#3045 ) - Added support for loading TensorProtos with external data from the optimizer Initializer class. - Added some file path utilities.	2020-02-28 13:19:16 -08:00
Changming Sun	c6ed077441	Add d2FH4- flag to cuda (#3105 )	2020-02-27 20:22:07 -08:00
Hariharan Seshadri	86b755774f	Create a separate Nuget hosting just managed assemblies (#3020 ) * Initial commit * More changes * More changes * More changes 3 * More changes 4 * More changes 5 * More changes 5 * More changes 6 * More changes 7 * More changes 8 * Remove C# ifdefs * More changes 10 * More changes 11 * YAML changes for other release pipelines * Add release notes metadata * Props and Targets change * Add CSHarp proj * More changes 12 * More changes * Minor fix * Minor fix * Fix yaml * Some missing logic for winml * Minor update * Fix casing for winmd file * Fix casing * Add targets and props for managed section into native nuget * revert file * a	2020-02-27 18:00:17 -08:00
Dmitri Smirnov	5008fc5b00	Featurizers: Import fix for Linux build adjust linkage (#3089 ) Advance FeaturizersLibrary SetAbsError on Output	2020-02-27 15:49:18 -08:00
Changming Sun	d72639ef77	Fix CUDA 10.1 DLL names (#3102 )	2020-02-27 14:43:16 -08:00
daquexian	37a905f557	Make Java API available on Android (#3030 )	2020-02-27 08:23:50 -08:00
Yufeng Li	ca2ed17ba7	Bump up version number to 1.2 (#3097 )	2020-02-26 17:25:16 -08:00
Yufeng Li	f1ba531d9c	Disable test_zfnet512 and test_bvlc_reference_caffenet for x86 in C# tests (#3094 )	2020-02-26 14:40:55 -08:00
Changming Sun	04063aa324	optimizer shouldn't depend on inference session	2020-02-26 13:42:14 -08:00
Hariharan Seshadri	bf7afbef23	Changes in the props file to support .NET + AnyCPU configuration (#3091 )	2020-02-25 20:28:36 -08:00
Ori Levari	5e0f7412cd	Properly handle downlevel and WCOS scenarios (#3075 )	2020-02-25 17:47:02 -08:00
Dmitri Smirnov	57384d5077	Add disclaimer to DataFrameTool header (#3085 )	2020-02-25 10:11:45 -08:00
Changming Sun	d7500b26bd	Remove Publish Build Symbols from pre-checkin CI build (#3088 )	2020-02-25 08:02:36 -08:00
stevenlix	f4a5d17294	Upgrade to CUDA10.2 for TensorRT (#3084 ) * Switch to CUDA10.2 * Update win-gpu-tensorrt-ci-pipeline.yml * Update win-gpu-tensorrt-ci-pipeline.yml * remove dynamic_shape * update onnx-tensorrt submodule * check if input shape is specified for TensorRT subgraph input and enable some TensorRT unit tests * fix format issue * add shape inference instruction for TensorRT * update according to the reviews * Update win-gpu-tensorrt-ci-pipeline.yml	2020-02-25 05:36:01 -08:00

1955 commits