onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-16 18:31:27 +00:00

Author	SHA1	Message	Date
Hector Li	2b8677b210	Enable Openvino nightly build on edge device (#1684 ) 1. Add openvino GPU nightly build pipeline, this test is running on Intel Up square Edge device. The device are host locally not from Azure VM. We persist a smaller model test data on Edge device. 2. Update the build condition for openvino GPU so it works for GPU_FP32, GPU_FP16 3. add option to install_ubuntu.sh to exclude the package used for nuphar, so that we can save some disk space as the Edge device usually have limited disk space.	2019-09-11 16:36:12 -07:00
Dmitri Smirnov	fe8915863c	Implement C API entry points for creating and fetching non-standard types to OrtValue (#1714 ) C/C++ Opage APIs Add new virtual interfaces for NonTensorType Implement entry points. Add shared header for the data container. Add export symbols. Add serialization/deserialization. Implement model with Opaque types. Rework opqaue_api_test as a standalone executable.	2019-09-11 14:52:47 -07:00
Chi Lo	d9fa632863	Add Cuda Kernel for Not operator (#1801 ) * Add Cuda Kernel for Not operator * Register Not CUDA Kernel	2019-09-11 14:30:44 -07:00
Dmitri Smirnov	a9e4de2cea	Follow up on proto3 compatiblity. (#1799 ) This provides additional has_*() methods abstraction/replacement for proto3 compatibility.	2019-09-11 11:36:13 -07:00
Scott McKay	3b7f047a49	General performance testing tooling improvements (#1577 ) * Miscellaneous updates to help with perf testing	2019-09-11 19:46:59 +10:00
Scott McKay	6586afc8eb	Refine the output shape calculation to avoid unnecessary re-allocations and vector insert operations. (#1781 )	2019-09-11 14:31:53 +10:00
Scott McKay	35c5c4d418	A subgraph may have no inputs (e.g. subgraph in If has no explicit inputs) or value infos (e.g. a subgraph with just an If node in it). (#1083 ) It should always have outputs but in case it doesn't (nothing fails currently if it doesn't even though that makes it meaningless) make sure it also has a node.	2019-09-11 11:03:55 +10:00
Hariharan Seshadri	206278ca44	Fix error message in Cast op (#1792 )	2019-09-10 15:40:53 -07:00
Pranav Sharma	f9d85d654a	Add GetDataTransfer() interface in the EP. (#1773 ) * Mention OrtCreateSessionFromArray in C API doc * Add GetDataTransfer() interface in the EP. * Check return status of RegisterDataTransfer * Address PR comments	2019-09-10 14:07:17 -07:00
ybrnathan	bd48660592	Add Cuda Kernel for Less operator (#1790 ) * Add Cuda Kernel for operator Less * Register Less CUDA Kernel	2019-09-10 11:33:57 -07:00
Ran Cohen	b32f24a3f9	added support for Less(double) (#1722 )	2019-09-10 11:15:01 -07:00
Pranav Sharma	0b609d3e68	Add make_unique implementation for use with C++11. (#1793 ) * Mention OrtCreateSessionFromArray in C API doc * Add make_unique implementation for use with C++11 * Add cgmanifest and TPN files as well * Add annotation to cgmanifest to identify the component that uses the dependency	2019-09-09 23:55:44 -07:00
Scott McKay	98dbdb1e0b	Rework the feed/fetch copy setup so that it can be calculated prior to subgraph execution (#1761 ) * Rework the feed/fetch copy setup so that it can be calculated upfront by the control flow nodes. Also simplifies how it all works. Update the control flow nodes to do the calculation prior to graph execution.	2019-09-10 15:46:00 +10:00
Scott McKay	2e242a4089	Clarify naming of the API involving the RunOptions terminate flag. (#1768 ) * Clarify naming of the RunOptions terminate flag. * Update C# code to use new names.	2019-09-10 08:32:33 +10:00
Dmitri Smirnov	75f241d02c	Enhance compatibility with proto3 and replace or abstract has_() methods. (#1778 ) Enhance proto3 compatibility. Replace has_() method to corresponding enum handling so we can deal with proto3 generated stream from proto2 code. Add utility wrappers for remaining has_*() methods so we can easily deal with them if/when we switch to proto3.	2019-09-09 14:07:30 -07:00
shahasad	6a5b11756b	Conditionally export execution provider apis in chsarp (#1724 )	2019-09-09 11:17:44 -07:00
Tracy Sharpe	071a0c2522	MLAS: MlasSgemm refactoring (#1749 ) Refactor the SGEMM kernels to resynchronize the code between Windows/Linux and remove unneeded binary bloat from a different zero/add mode kernel. Another goal is to get to a cleaner state for then doing a DGEMM kernel.	2019-09-06 22:26:28 -07:00
Tracy Sharpe	a324ad7b96	MLAS: clang u8u8 GEMM fix	2019-09-06 09:11:10 -07:00
Ashwini Khade	b2a2326a45	add dequantize and quantize back to contrib ops (#1712 )	2019-09-06 08:55:42 -07:00
Scott McKay	e1a12b1760	Fix some unnecessary copies of the Node attributes (#1763 )	2019-09-06 17:00:35 +10:00
Pranav Sharma	52fe574fed	Rename OrtAllocatorInfo to OrtMemoryInfo to make it more obvious. (#1758 ) * Mention OrtCreateSessionFromArray in C API doc * Rename OrtAllocatorInfo to OrtMemoryInfo to avoid confusion	2019-09-05 14:20:37 -07:00
KeDengMS	58fe5a6bf1	Enable Nuphar docker build, and reinstate Nuphar tests (#1757 ) Enable Nuphar EP docker build Revert back to LLVM 6.0.1 Reinstate disabled Softmax tests caused by LLVM 8.0.1 Reinstate Nuphar Python test due to stale sympy version Increase build timeout of Linux CI	2019-09-05 08:50:48 -07:00
Yang Chen	eddb9d78f9	fixed "unreachable code" warnings on Windows (#1755 ) When NUPHAR_USE_MKL or NUPHAR_USE_AVX2 is not defined, we got "unreachable code" warnings on Windows, which were truned into errors and broke the build.	2019-09-04 20:30:47 -07:00
Pranav Sharma	7c5b3a5ecc	Update coding guidelines to prefer using make_unique for heap allocations (unless where not possible). (#1730 ) * Mention OrtCreateSessionFromArray in C API doc * Fix perf test executable due to removal of certain C APIs * fix linux build * Avoid duplication * Update coding guidelines to prefer using make_unique for heap allocations (unless where not possible).	2019-09-04 19:16:16 -07:00
manashgoswami	3d44c55092	Updated docs related to base images (#1753 ) * Update README.md * Update onnx-inference-byoc-gpu-cpu-aks.ipynb * Update README.md	2019-09-04 10:33:41 -07:00
Tomasz Dołbniak	4ed8d4b30e	Put the initializers at the end of the cluster inputs list (#1751 ) Restore the missing variable	2019-09-03 15:09:37 -07:00
suryasidd	9523977cc2	Added emotion ferplus support (#1752 ) Signed-off-by: suryasidd <surya.siddharth.pemmaraju@intel.com>	2019-09-03 15:01:22 -07:00
Changming Sun	94d9161166	Add nuphar to Linux CI build (#1750 )	2019-09-03 11:39:27 -07:00
Ashwini Khade	0f6cf9a335	enable quantizing specific nodes (#1742 )	2019-09-03 11:04:17 -07:00
Pranav Sharma	ad7ab3d880	Enforce shape validation. (#1716 ) * Mention OrtCreateSessionFromArray in C API doc * Enforce shape validation. * Update broken models	2019-09-02 20:00:37 -07:00
KeDengMS	c9240f4e93	Implementation of Nuphar execution provider (#881 ) * Implement Nuphar execution provider Nuphar execution provider is a TVM-based compilation provider. It has shown great speedups for RNN models using Scan. This PR is mainly for a preview of the shared codegen library for other TVM-based providers. * Fix submodules * Fix TVM submodule * Update Nuphar to latest and resolve confliction * Remove stale files caused by merge -X theirs * Revert heap buffer change to not introduce onnxruntime_framework into onnxruntime_perf_test * Fix bad merge * Merge from Nuphar * Fix warning treated as error, revert some unnecessary changes * Revert some more test changes * Some more test revert or comments to make review easier New tests could be added later * One more revert of unnecessary changes * More change revert. Test could be added back later.	2019-09-01 23:01:47 -07:00
Sreekanth Yalachigere	f4a6d267c1	MKL-DNN EP: control flow fix (#1740 ) * moved subgraph_index to MklDnn Execution Provider * code cleanup	2019-08-31 09:58:59 -07:00
Takeshi Watanabe	259863758e	Fix typo in NMS code Fix typo in NMS code	2019-08-30 22:37:36 -07:00
Hector Li	dc9c89546d	Update the docker file for OpenVINO (#1741 ) Update the docker file for OpenVINO which is used for AML	2019-08-30 22:32:24 -07:00
shahasad	833e18345d	Publish perf tool with nightly build (#1728 )	2019-08-30 11:25:55 -07:00
Hector Li	810ee0068f	Fix a issue that CUDA EP fallback to much nodes to CPU for some case which cause huge data copy. If the node's inputs are all initializer, we shouldn't fallback the node to CPU. (#1727 ) Fix an issue that CUDA EP fallback too much nodes to CPU for some case which cause huge data copy. https://github.com/microsoft/onnxruntime/issues/1675 Currently, if the node's inputs are all as initialier, CUDA EP will fallback it to CPU. And it will also fallback some nodes under it. It could cause some huge data copy. for the case reported by a user, it has several Slices with input from initializer, and a Concat op to concat the output from Slice output. The data is huge 16MB after concat, which make the data copy from CPU to GPU quite costly because it's a sync copy. Fix If the node's inputs are all initializer, we shouldn't fallback the node to CPU.	2019-08-29 13:54:17 -07:00
Pranav Sharma	25d02a33c8	Fix reading of onnx domain causing one of the automl models to break in 0.5 release. (#1694 ) * Mention OrtCreateSessionFromArray in C API doc * Fix registration of Equal op causing one of the automl models to break in 0.5 release. * updates...	2019-08-29 12:18:39 -07:00
Ashwini Khade	e54904e6a3	add implementation for dynamic quantize linear (#1697 )	2019-08-29 11:40:19 -07:00
Hariharan Seshadri	4b5b037289	Support 'Bilinear' mode for 2D inputs in Resize and Upsample kernels (#1679 ) * Support bilinear mode with actual 2D inputs in Resize and upsample * Fix build break * Fix build break * Add test * CUDA changes * Resolve PR comments * Resolve comments	2019-08-29 11:34:31 -07:00
rakelkar	0f7c01b49b	Use exec form of ENTRYPOINT for docker server (#1690 ) * Use exec form of ENTRYPOINT for docker server # Issue The entrypoint currently uses the shell form - this prevents users from passing in any cmdline arguments... also passing a model_path in means the server only works in the envvar is set... however this is not what the error message says! ``` $ docker run -v /home/rakelkar/try/onnxzoo/style:/mnt/models -it mcr.microsoft.com/onnxruntime/server --model_path /mnt/models/model.onnx Version: local_build Commit ID: default model_path must be the location of a valid file Allowed options: -h [ --help ] Shows a help message and exits --log_level arg (=info) Logging level. Allowed options (case sensitive): verbose, info, warning, error, fatal --model_path arg Path to ONNX model --address arg (=0.0.0.0) The base HTTP address --http_port arg (=8001) HTTP port to listen to requests --num_http_threads arg (=4) Number of http threads --grpc_port arg (=50051) GRPC port to listen to requests ``` # Fix 1. remove the env var 2. use the exec form * Update readme to use model_path arg	2019-08-29 10:18:08 -07:00
KeDengMS	068b568472	Add support for int8 x uint8 for MatMulInteger, and int16 x int16 custom op (#1391 ) Description: The change adds necessary quantization support on CPU with mixed int8/uint8, as well as int16 for matrix multiply operations that outputs int32 Motivation and Context Integer operations are critical for quantized model's performance Current MatMulInteger implementation in CPU only supports uint8 x uint8, while the spec supports int8 x uint8. Having a default CPU implementation that fully support the spec would help accuracy verification. Besides, some model may need to quantize to int16, but MatMulInteger op does not support that yet. A custom op of MatMulInteger16 is added to satisfy such models.	2019-08-28 21:40:24 -07:00
KeDengMS	8fc8910a0e	Allow input used across execution providers as long as they use the same allocator device (#1715 ) as long as these providers use the same allocator device Description: Currently ORT throws error when one input is used in different EPs. The change removes that restriction Motivation and Context It is now possible to share inputs across EPs now that allocation are device-based, instead of EP based.	2019-08-28 20:30:00 -07:00
Changming Sun	81ad48080b	Remove TaskThreadPool (#1713 )	2019-08-28 18:00:10 -07:00
Tracy Sharpe	73312b8195	MLAS: Android sgemm kernel build fix (#1710 ) Fix the aarch64 kernel to build properly with the Android NDK (specifically clang).	2019-08-28 16:14:12 -07:00
Tracy Sharpe	14eae293bf	remove @PCGOTREL x64 usage (#1707 ) Avoid the need for @PCGOTREL relocations by annotating MLAS global data shared with assembly modules with attribute(visibility("hidden")).	2019-08-28 11:27:16 -07:00
Faith Xu	d9cdf4b4ed	Doc updates (#1522 ) * Updates * Remove preview texts * Update README.md * Updates * Update README.md * Update README.md * Minor wording update * Update README.md * Update doc on CUDA version * revert update * Update readme for issue #1558 * Clean up example section * Cosmetic updates - Add a index of build instructions for browsability - Update build CUDA version from 9.1 to 10 * Fix broken link * Update README to reflect upgrade to pip requirement * Update CuDNN version for Linux Python packages * Clean up content Updated ordering and add table of contents * Minor format fixes * Move Android NNAPI under EP section * Add link to operator support documentation * Fix typo * typo fix * remove todo section	2019-08-27 21:31:19 -07:00
Ashwini Khade	8813b79c5b	make gemmlowp default for arm (#1701 ) * make gemmlowp default for arm * force use_gemmlowp in header for default case * remove unnecessary white space	2019-08-27 15:52:03 -07:00
shahasad	121d308a33	Python API naming and other cleanup (#1678 ) - Make the naming of properties in python SessionOptions and RunOptions consistent with other apis. - Remove unnecessary apis	2019-08-27 12:48:46 -07:00
jywu-msft	938200de9b	fix typo in max batch size error msg. (#1687 )	2019-08-27 11:15:18 -07:00
Ashwini Khade	961b14ac4a	use MLAS for QGEMM in matmulInteger and convInteger (#1692 ) * use mlas qgemm for u8u8_s32 gemms * update test	2019-08-26 18:13:22 -07:00

1 2 3 4 5 ...

1216 commits