onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-13 18:08:13 +00:00

Author	SHA1	Message	Date
Xavier Dupré	edec8043d4	Fix python examples in documentation (#3379 )	2020-04-01 22:48:32 +02:00
Changming Sun	accffded5d	Build options for enabling AVX/AVX2/AVX512 (#3373 ) 1. Add build options for enabling AVX/AVX2/AVX512 2. Update eigen to a newer version, because the current one doesn't work with VC and AVX512.	2020-04-01 10:07:22 -07:00
Brian Martin	77c7d09ced	ERROR_NOT_SUPPORTED doesn't trigger Failed Hresult. Need E_NOTIMPL (#3396 )	2020-04-01 10:06:00 -07:00
Brian Martin	052c1fda44	fix some warnings in concurrency tests (#3395 )	2020-04-01 10:05:24 -07:00
Scott McKay	33d3239b67	Rework SVMClassifier to improve performance (#3363 ) * Rework SVMClassifier - use GEMM for initial scoring - minimize data allocations and copies - parallelize the second half of the scoring for larger batches	2020-04-01 22:00:01 +10:00
Tiago Koji Castro Shibata	a61400de01	Fix ARM cross compilation (related to #3378 , #3298 ) (#3385 )	2020-03-31 17:10:48 -07:00
Changming Sun	55fd283d20	Fix a bug in FunctionImpl::FunctionImpl (#3376 ) 1. Fix a bug in FunctionImpl::FunctionImpl. It set wrong name for the new attribute. 2. Set error code to NOT_IMPLEMENTED if a function contains a not implemented op.	2020-03-31 15:54:47 -07:00
Dmitri Smirnov	a4fe60c4d3	OpSet 12 ops (#3341 ) Advance ONNX commit to pick up the latest ArgMax, ArgMin, ReduceMax/ReduceMin, MaxPool. Declare new versions for CPU/CUDA. Implement infrastructure support for int8/uint8. Adjust GatherOp test for a new error. Adjust Scan9.BadShape test. Add exclusions for index out of bounds checks. Rework result verification for SVDTransformer.	2020-03-31 15:31:06 -07:00
manashgoswami	044c466158	Updated tags for v1.2.0 release (#3386 ) Updated the tags in the table to reflect the new images for Release v1.2	2020-03-31 14:54:56 -07:00
Tianlei Wu	ecbacd7d79	Add Benchmark of GPT2 CPU inference (#3351 ) * Add benchmark script and notebook for GPT2 * Update Reshape fusion for GPT2 model * Add opt_level option for bert_model_optimization to disable onnxruntime by setting --opt_level 0 * Fix keras optimization	2020-03-31 13:43:09 -07:00
Scott McKay	ace741680d	Constant-12 support (#3304 ) 1. Support the new fields for Constant in opset 12 2. Support SparseTensor in the Constant node by converting to dense tensor when lifting the Constant to an initializer. Will make a model with a sparse tensor in a Constant work but isn't an overly efficient approach.	2020-03-30 23:13:52 -07:00
stevenlix	2332a93db0	Update onnx-tensorrt parser (#3369 ) * sync onnx-tensorrt parser and update TensorRT doc * remove --msvc_toolset 14.16 in tensorrt ci pipeline	2020-03-30 20:31:59 -07:00
Jan Scholz	ce9acf0c21	iOS crosscompilation under linux (#3298 ) * added support for ios crosscompilation under linux * reverted cmake generator change * if --ios is added protoc can be compiled for host system * accidentally reverted change to compile protoc for host system for ios if protoc exe is not set * wdata is now used * accidentally pasted CMAKE_OSX_ARCHITECTURES into CmakeLists.txt, also made bad merge on build.py previously * removed print * fixed typo, deleted commented statements for earlier debugging * reverted accidental delete * added asmmacro.h for aarch64 asm now MlasSgemmKernel**** gets underscore added if needed no need anymore to differentiate between iOS arm64 and normal arm64 build onnxruntime.cmake: added check if iOSCross is set to properly set RPATH * removed 2 spaces * fix: logical error fixed, now protoc gets compiled if not supplied with --path_to_protoc_exe * removed unnecessarily added spaces * removed some more spaces	2020-03-30 19:39:17 -07:00
Yufeng Li	af618278f6	fix bugs in quantization and calibration tools (#3329 ) Fix 3 bugs: node names duplicate in calibration augment_graph if the name of node to quantize is empty. If output nodes are quantized, output value are quantized and not dequantized back Gather with data type int64 should not be quantized	2020-03-30 17:50:25 -07:00
Maxim Kalinin	f2ca2b2981	Avoid "infinite" loop in optimizer (#3321 ) * Avoid "infinite" loop in optimizer When symbolic dimensions are present and can be overridden, FreeDimensionOverrideTransformer always sets modified flag to true. As a consequence, the optimizer loops until the iteration limit is reached.	2020-03-31 08:37:00 +10:00
Changming Sun	06fc9506fd	Thread pool changes (#3153 ) 1. Copy tensorflow's thread pool class to ORT, so that we can get a better implementation of thread pool based parallelfor 2. Copy Eigen's thread pool class to ORT 3. Support thread affinity 4. Remove RNN kernel’s private thread pool 5. Modify pool kernels to use the thread pool when openmp is disabled.	2020-03-30 12:18:40 -07:00
Yulong Wang	0494036006	fix tensor location mismatch in allocation planner (#3249 )	2020-03-30 11:20:43 -07:00
Cassie	2b10e625f9	added public value variable to NamedOnnxValue (#3347 ) Co-authored-by: cassieview <cassie.siljander@microsoft.com>	2020-03-30 10:45:39 -07:00
George Wu	355f39ddee	fix cuda build for cmake >= 3.17.0 (#3362 )	2020-03-30 00:38:57 -07:00
Yang Chen	33b5010e62	skip optional inputs for scan subgraphs (#3349 ) * skip optional inputs for scan subgraphs We may have cases where the subgraph has optional inputs that appear in both subgraph's input and initializer, but not in the node's input. In such cases, the input model might be invalid, but let's not choke on it. Instead, let's issue a warning, skip the optional inputs, and keep going forward. * address CR feedback	2020-03-28 16:15:45 -07:00
Tiago Koji Castro Shibata	c3cea486d0	Port ConcurrencyTests from TAEF (#3086 ) * Add ConcurrencyTests * Make ConcurrencyTests compatible with TAEF * Use test PCH in concurrency tests * Fix include header * Ignore unused code warnings on WINML_SKIP_TEST * Remove BOM * Remove conflicting namespace in older SDK * Refactor duplicate code * Fix unused DELAYLOAD * Fix unused DELAYLOAD * Remove link to internal bug * Address code style fixes * Add new concurrency tests	2020-03-27 17:39:22 -07:00
Yang Chen	5278f73202	Fixed two issues in symbolic_shape_infer script (#3332 ) * Fixed two issues in symbolic_shape_infer script This change addressed #3293 There were two issues in the script: * We need to handle a special case for infer_Reshape, where input_shape is empty and target shape_value is [-1]. In such case, we need to get sympy data for the output dim (or create one if it doesn't exist). * We need to update computed dims for newly-created shape for Range op * also call _update_computed_dims for _infer_Expand addressed CR feedback * added ai.onnx into opset list * instead of manipulating _infer_Reshape, call _update_computed_dims from _infer_Expand to update newly-computed dims	2020-03-26 23:27:37 -07:00
Xiang Zhang	810a10b230	Enable Onnxruntime Telemetry by Default for 1.3 (#3338 )	2020-03-26 20:57:39 -07:00
Faith Xu	2e875f4e67	Delete outdated page (#3320 )	2020-03-26 18:24:02 -07:00
Pranav Sharma	497e83eda5	Minor update to the issue template. Add a line to attach model where applicable. (#3339 )	2020-03-26 14:28:27 -07:00
Hector Li	0e81962e98	correct the cmake version to 3.13 for Arm build (#3333 )	2020-03-26 10:20:18 -07:00
Changming Sun	5f6ec8ea6d	Fix a bug in Maxpool v8	2020-03-25 16:27:43 -07:00
Scott McKay	dee4fc8b8a	Apply the same check for no_transpose from the Reduce* ops to ArgMin and ArgMax (#3315 )	2020-03-26 07:41:16 +10:00
Sheil Kumar	51e95ea946	Make ort errors appear in winml exceptions (#3316 ) Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2020-03-25 12:20:40 -07:00
Scott McKay	4db01309cb	Use GEMM for SVMRegressor. (#3305 )	2020-03-25 11:49:44 +10:00
Tianlei Wu	19edad132c	Move AzureML Bert notebook from onnx tutorial (#3302 )	2020-03-24 12:31:02 -07:00
Weixing Zhang	fef7989866	Replacing CudaAsyncBuffer with TArray to improve perf (#3303 ) * removing using CudaAsyncBuffer * Keep CudaAsyncBuffer for these ops: non_max_suppression, cudnn_rnn_base, concat, split * fix windows build error * fix windows build error. * fix build error * fix windows build error Co-authored-by: Weixing Zhang <wezhan@microsoft.com>	2020-03-24 12:13:27 -07:00
Hariharan Seshadri	ef7b98f988	Support DisposableNamedOnnxValue inputs in c# Run() (#3175 ) * Initial commit * Update error message * Update * Updates to support holding onto onnxValue and pinnedmemoryBuffer * Updates * Minor updates * Comment out a portion of the tests * PR feedback * Minor nit update * Resolve comments * PR feedback * PR updates * PR feedback	2020-03-23 18:36:12 -07:00
Faith Xu	fb5ab858d2	Update BUILD instructions (#3282 ) Include guidance for building release packages per question from #3251	2020-03-23 18:35:22 -07:00
Sheil Kumar	b72fe13941	Update WinML Projection to accept sequence of tensors (#3287 ) * Enable sequence of tensor * add tests * small updates * There should only be 2 elements returned * CR feedback, and another 6->2 check update in the test. * missing semicolon... * Add explicit to constructor taking pointer parameter Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2020-03-23 15:55:20 -07:00
Weixing Zhang	843ee346a8	Implement struct TArray and simplify code. (#3291 ) * Implement operator[] for TArray and simplify the code. * fix a build error. * add a constructor with std::vector input * fix build error * update based on code review feedback Co-authored-by: Weixing Zhang <wezhan@microsoft.com>	2020-03-23 10:51:54 -07:00
Tracy Sharpe	57468c651c	QLinearMatMul speed up (#3283 ) The equivalent of PR#3196 but done for QLinearMatMul. Use MLAS to do a u8u8=s32 GEMM and then requantize this intermediate buffer.	2020-03-21 15:37:25 -07:00
Changming Sun	9c3b6d2e4b	Fix warnings in nuphar	2020-03-20 21:49:46 -07:00
Tianlei Wu	403f99cd77	Use yapf to format python (#3276 ) Update ReformatSourcePython.bat to use YAPF to format python code, and add onnxruntime\test directory to be formatted. Add onnxruntime\.style.yapf for configuration. The style is based on google, except max column width 120. Format python scripts using ReformatSourcePython.bat.	2020-03-20 14:34:10 -07:00
Pranav Sharma	84015d9491	Fix post merge test. This doesn't get triggered as part of gated PR checks. (#3277 )	2020-03-20 13:23:09 -07:00
Dmitri Smirnov	b880c48c4c	Make reduction ops handle Scalar input (#3260 ) Handle Scalar values for CPU and GPU. Ifdef CUDA and TVM as they require more changes.	2020-03-20 12:04:47 -07:00
Ye Wang	c5149e89d9	Wangye/shortgraindropper (#3273 ) (#3274 ) * Featurizer Library update * update Featurizer Library * add short_grain_dropper_transformer * resolve comments * resolve comments * resolve comments	2020-03-20 11:48:31 -07:00
Tianlei Wu	1d9be2baed	Add Notebook for Bert Model exported by Keras2onnx (#3271 ) * Add notebook for bert squad model exported by python 1.4 * update bert performance test tool: (1) set OpenMP environment variable before importing onnxruntime. (2) launch new process for each test. * Add notebook Reduce combinations in perf test * update readme * fix quote * Allow test multiple batch_size * Add latency percentile * Add warm up run Reset logger for notebook * refine default settings to test for cpu/gpu * Add script to dump machine info * Add notebooks for PyTorch SQuAD model GPU and CPU inference * Update machineinfo.py: add license header; format by yapf * Do not reset log handler. Skip adding handler if existed. * Add comments about GPU result diff. Filter rows of batch set to keep only one setting. * update according to review feedback * Download script from master branch * Add notebook for bert model exported by keras2onnx * format columns in result table * re-run and update notebook	2020-03-20 11:37:25 -07:00
Yufeng Li	a69d859912	fix quantize_bias (#3270 )	2020-03-20 11:36:47 -07:00
Scott McKay	6dc25a60f8	Make the reduction ops more consistent in checking if no transpose is required and skipping the copy of the input data if that is the case. Significantly better performance when this is done (2x faster for model calling ReduceSumSquare with input of {2048,10}). (#3265 )	2020-03-20 06:55:38 +10:00
Changming Sun	8f00147c14	Fix a few warnings	2020-03-19 09:22:28 -07:00
Tiago Koji Castro Shibata	3bdb0b620a	Fix WCOS/Win32 linking bugs (#3126 ) * Fix WCOS/Win32 linking bugs * Remove unused NODEFAULTLIB flags * Avoid plain target_link_libraries signature * Avoid plain target_link_libraries signature * Fix library list escaping * Use library list instead of string * Remove duplicate link to windowsapp.lib * Remove Win32 build workarounds * Specify CMake policies before initializing language * Expose Win32 header definitions during build * Force set API family * Enable Win32 APIs in featurizer * Use MT dynamic CRT * Expose Win32 specific functions * Disable app container globally * Disable default wide functions in featurizers * Add featurizers to test include path * Workaround https://gitlab.kitware.com/cmake/cmake/issues/19428 * Revert pipeline debugging hacks * Skip /FI in CUDA sources * Default to Win32 builds * Enable WCOS when using WinML * Use generator expression to apply CMAKE_MSVC_RUNTIME_LIBRARY to C++ only	2020-03-19 08:52:40 -07:00
Pranav Sharma	435f014d71	Add support for sessions to share a global threadpool. (#3177 ) * Add support for sessions to share a global threadpool. * Fix build issues * Add tests, fix build issues. * Added some documentation * Fix centos issue when threadpools become nullptr due to 1 core. * Fix mac and x86 build issues * Address some PR comments * Disabled test for android, added few more tests and addressed more PR comments. * const_cast	2020-03-18 15:42:46 -07:00
edgchen1	e03b8a1e2f	Move path_lib from onnxruntime/core/framework to onnxruntime/core/platform. (#3253 ) Moved path_lib.h/cc from onnxruntime/core/framework to onnxruntime/core/platform and from the onnxruntime_framework to the onnxruntime_common libraries.	2020-03-18 11:53:46 -07:00
Xiang Zhang	61621d4053	Add extra fields to ORT telemetry (#3234 ) * Add extra fields to ORT telemetry * fix linux build failure caused by using HRESULT * little refactor	2020-03-18 09:37:35 -07:00

2018 commits