onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-11 17:48:34 +00:00

Author	SHA1	Message	Date
Sreekanth Yalachigere	4c996a8699	DNNL CMAKE update (#2548 )	2019-12-05 13:48:57 -08:00
Xiang Zhang	3e7aaf8fa1	User/xianz/telemetry (#2458 ) * enabme telemetry * enable telemetry * set enable telemetry as default * for debugging * remove log and set disable telemetry as default back * delete private file while testing * resolve comment: mainly add license header, rename macro and update docs * rewording in privacy.md	2019-12-03 23:34:53 -08:00
stevenlix	293b15480b	Add dynamic shape support in TensorRT execution provider (#2450 ) * remove onnx-tensorrt submodule * add new onnx-tensorrt submodule (experiment) for trt6 * update engine build for trt6 * update compile and compute for tensorrt6.0 * Update tensorrt_execution_provider.cc * Update tensorrt_execution_provider.cc * Update tensorrt_execution_provider.cc * Update tensorrt_execution_provider.cc * switch to onnx-tensorrt master for TensorRT6' * Update tensorrt_execution_provider.cc * Handle dynamic batch size and add memcpy in TensorRT EP * update test cases * Update tensorrt_execution_provider.cc * update onnx-tensorrt submodule * Update Dockerfile.ubuntu_tensorrt * Update Dockerfile.ubuntu_tensorrt * Update run_dockerbuild.sh * Update run_dockerbuild.sh * Update install_ubuntu.sh * Update concat_op_test.cc * Update tensorrt_execution_provider.cc * Upgrade TensorRT to version 6.0.1.5 * Update onnxruntime_providers.cmake * Update CMakeLists.txt * Update reduction_ops_test.cc * Update install_ubuntu.sh * Update Dockerfile.ubuntu_tensorrt * Update Dockerfile.tensorrt * Update BUILD.md * Update run_dockerbuild.sh * Update install_ubuntu.sh * Update onnxruntime_providers.cmake * Update install_ubuntu.sh * Update install_ubuntu.sh * Update gemm_test.cc * Update gather_op_test.cc * Update CMakeLists.txt * Removed submodule * update onnx-tensorrt submodule * update header file * Removed submodule * add submodule onnx-tensorrt kevin's branch shape-test' * add debugging code * Update tensorrt_execution_provider.cc * Update tensorrt_execution_provider.cc * merge master * Removed submodule * update onnx-tensorrt submodule * add more changes for dynamic shapes * Update tensorrt_execution_provider.cc * update for dynamic shape * update dynamic shape processing * fix logger issue * remove submodule onnx-tensorrt * add submodule onnx-tensorrt * add env variable min_subgraph_size * remove redundency * update document * use onnxruntime::make_unique * fix multi-run issue * remove some tests to save CI build time * Add dynamic shape test * Update TensorRT-ExecutionProvider.md * Add example of running Faster R-CNN model on TensorRT EP * Add more details on env variables * update environment variables * Update tensorrt_basic_test.cc * Update model tests * Update tensor_op_test.cc * remove --use_full_protobuf * Update build.py	2019-12-03 23:18:33 -08:00
Hariharan Seshadri	5c2e474751	Add provision in ORT for session options to be parsed when available via model file (#2449 ) * Initial commit * Fix gitmodules * Nits * Nits * Updates * Update * More changes * Updates * Update * Some updates * More changes * Update * Update * Merge * Update * Updates * More changes * Update * Fix nits * Updates * Fix warning * Fix build * Add comment * PR feedback * PR feedback * Updates * Updates * Update * More changes * Fix build break * Comment test for now * Updates * Updates * PR feedback * Updates * Nits * Add tests * Fix build * Fix build * Fix build * Fix build break * Fix build * Nits * PR feedback * More change * Expose GetSessionOptions in pybind logic and add unit test for python * Fix build * PR feedback * PR feedback	2019-12-03 16:56:07 -08:00
Sreekanth Yalachigere	31ea11a696	Renaming MKL-DNN as DNNL (#2515 ) * DNNL: Moving Files to rename file names * DNNL name change * azure pipeline updated * disable ceil/dialation and enable Opset10 * disable ceil/dialation tests in Python * mlperf_ssd_resnet34_1200 disabled	2019-12-03 07:34:23 -08:00
Changming Sun	4354023913	Make link time optimization work on Linux (#2477 )	2019-12-02 22:25:41 -08:00
KeDengMS	60208463a9	[NupharEP] Enable parallel schedule (#2505 ) * [NupharEP] Enable parallel schedule * Update TVM with the fix to TVM threadpool to use OpenMP if possible * Add parallel schedule when trying to vectorize With this change, BERT squad perf on a 4-core (8 HT) CPU goes from 187ms to 150ms * Address CR, docs and cmake update * Doc fix * Fix mkl * Fix TVM windows build when using mklml	2019-11-28 08:35:56 -08:00
shahasad	ca0ed96621	CSharp api and test for loading custom op shared library (#2420 ) - Added C-API test for loading custom op shared lib. - Made some changes in C++ api header and C-api implementation to get it working. - Added C# API and corresponding test for loading custom op shared library.	2019-11-21 15:45:49 -08:00
Changming Sun	109b3cb450	Avoid using the default logger in the graph lib and optimizers (#2361 ) 1. Use the session logger if it is available. 2. Don't disable warning 4100 globally. We should fix the warnings instead of disabling it.	2019-11-14 13:23:28 -08:00
Ilya Lavrenov	b90d55b7ea	Fixed compilation with ngraph (#2388 )	2019-11-13 17:49:00 -08:00
Changming Sun	fc6773a65b	Add Tracelogging for profiling (#1639 ) Enabled only if onnxruntime_ENABLE_INSTRUMENT is ON	2019-11-11 21:34:10 -08:00
avidiyal	3d3cf0e159	Openvino EP R3.1 onnxrt server (#2357 ) * onnxrt server with OVEP * onnxrt server with OVEP * Update Dockerfile.server.openvino * onnxrt server OVEP fix reviews * onnxrt server OVEP fix reviews	2019-11-11 12:22:19 -08:00
Changming Sun	080a0a3186	Nuget pipeline changes (#2305 ) 1. refactor the pipeline, remove some duplicated code 2. Move Windows_py_GPU_Wheels job to Win-GPU-CUDA10. We'll deprecated the "Win-GPU" pool 3. Delete cpu-nocontribops-esrp-pipeline.yml and cpu-nocontribops-pipeline.yml 4. In Linux nuget jobs, run "make install" before creating the package. So that extra RPAH info will be removed	2019-11-08 09:45:52 -08:00
Patrick Foley	151075790d	[OpenVINO-EP] Update to latest version: OpenVINO 2019 R3.1 (#2308 ) * Updates OpenVINO EP to latest version: 2019 R3.1 * Reviews fixed * Update Dockerfile.openvino * Addressed PR comments and disabled model tests temporarily * Update Dockerfile.ubuntu_openvino	2019-11-05 19:55:46 -08:00
George	8a102c6e99	apply eigen patch only for ACL.	2019-11-05 13:53:53 -08:00
mikecaraman	358b517d49	[v2] Add ACL (Arm Compute Library) execution provider (#2258 ) * Guard unused parameter Guard unused parameter for Linux Arm and other cases. * Add ACL (Arm Compute Library) execution provider Add a new execution provider targeting Arm architecture based on Arm Compute Library. Validated on NXP i.MX8QM CPU with ResNet50, MobileNetv2 and VGG models. All unit tests are passing. Comparative performance improvements for ResNet50v1 model obtained with onnxruntime_perf_test: A72 2xA72 A53 4xA53 ACL vs CPU 16% 9% 21% 13% Usage documentation available in ACL-ExecutionProvider. * Fix eigen unused parameter Fix eigen unused parameter error for Arm cross-compilation.	2019-10-31 12:25:36 -07:00
Changming Sun	a5da5ff6f4	Remove onnxruntime_USE_EIGEN_THREADPOOL cmake option	2019-10-30 21:51:54 -07:00
zhijxu	ce23d628a5	fix bug in cmake/onnxruntime_server.cmake	2019-10-28 10:03:18 -07:00
Yuri	a2596b706b	FreeBSD compatibility patch. * Treat the 'amd64' architecture the same way as 'x86_64' * Use thr_self() instead of gettid() on FreeBSD	2019-10-26 12:44:12 -07:00
edgchen1	6a27cb5ad6	Fixed tensor reference to const data and cleaned up Env API. (#1979 )	2019-10-24 10:28:13 -07:00
kile0	bede664af7	mimalloc allocator (#2071 )	2019-10-23 22:34:00 -07:00
Changming Sun	4b62241c77	Update ONNX to 1.6.1 (#2235 )	2019-10-23 13:47:45 -07:00
Paul McDaniel	02dc3a9dcb	build break for arm64, adding advapi32.lib (#2206 )	2019-10-21 08:48:28 -07:00
Scott McKay	5c86889beb	Fix linux build issue with debug dump of shapes and data. (#2202 ) Add option to dump just shapes or shapes and data.	2019-10-20 20:35:48 -07:00
Pranav Sharma	69970d1f2a	Include the new Privacy.md file in all release packages. (#2200 )	2019-10-20 07:58:36 -07:00
Changming Sun	cff7879d89	Update C API pipeline to use CentOS 6 (#2198 )	2019-10-19 22:25:42 -07:00
Paul McDaniel	d1159b7008	Adding platform telemetry (#2109 )	2019-10-19 18:25:57 -07:00
Changming Sun	021073b5e5	Update python packaging pipelines (#2167 )	2019-10-19 07:42:54 -07:00
Tomasz Dołbniak	72110d3508	Patch for the MKLDNN v1 segfaults (#2145 )	2019-10-17 12:10:00 -07:00
Scott McKay	3fcb4ee7d4	Refine optimizers (#1407 ) * Refine optimizers * Address PR comments * Changes from PR comments and discussion. * Fixed signed/unsigned mismatch * Address PR comments * Address PR comments * Fix linux build * Fix issue with mkldnn logic. * Turn off optimizers by default for operator unit tests. * Handle edge case of graph with no nodes in partitioner so all execution providers don't need to. * Comment out change to turn off optimizers for unit tests. Add details on what needs to be done to re-enable.	2019-10-15 14:49:59 -07:00
Sreekanth Yalachigere	485c24b62d	MKL-DNN 1.0 (#2134 ) * MKL-DNN 1.0 * changed libmkldnn version to 1	2019-10-15 12:06:34 -07:00
Adrian Tsai	4090d0d0de	Add DirectML Execution Provider (#2057 ) This change adds a new execution provider powered by [DirectML](https://aka.ms/DirectML). DirectML is a high-performance, hardware-accelerated DirectX 12 library for machine learning on Windows. DirectML provides GPU acceleration for common machine learning tasks across a broad range of supported hardware and drivers. The DirectML execution provider is capable of greatly improving evaluation time of models using commodity GPU hardware, without sacrificing broad hardware support or requiring vendor-specific extensions to be installed. Note that the DML EP code was moved verbatim from the existing WindowsAI project, which is why it doesn't yet conform to the onnxruntime coding style. This is something that can be fixed later; we would like to keep formatting/whitespace changes to a minimum for the time being to make it easier to port fixes from WindowsAI to ORT during this transition. Summary of changes: * Initial commit of DML EP files under onnxruntime/core/providers/dml * Add cmake entries for building the DML EP and for pulling down the DirectML redist using nuget * Add a submodule dependency on the Windows Implementation Library (WIL) * Add docs under docs/execution_providers/DirectML-ExecutionProvider.md * Add support for DML EP to provider tests and perf tests * Add support for DML EP to fns_candy_style_transfer sample * Add entries to the C ABI for instantiating the DML EP	2019-10-15 06:13:07 -07:00
Yufeng Li	8c5db7f973	use legacy stream mode (#2076 ) In ORT, there is only 3 cuda stream: default, HtoD, DtoH. And both HtoD and DtoH are non-blocking stream. Thus, per-thread stream mode doesn't have any benefit. I also tried in multiple thread env and the legacy mode is also better than per-thread model. Below is the perf of a 3 layer bert on v100. Unit is ms: batch size 1: concurrency \| c=1 \| c=2 \| c=4 legacy \| 0.54 \| 1.17 \| 2.68 per-thread \| 0.66 \| 1.37 \| 2.86 batch size 4: concurrency \| c=1 \| c=2 \| c=4 legacy \| 1.1 \| 2.22 \| 4.6 per-thread \| 1.21 \| 2.44 \| 4.98 batch size 64: concurrency \| c=1 \| c=2 \| c=4 legacy \| 8.09 \| 16.13 \| 32.37 per-thread \| 8.18 \| 16.26 \| 32.45	2019-10-14 16:03:04 -07:00
Tomasz Socha	f93be8af90	Update nGraph to version 0.26 (#1965 ) * Adjust ngraph cmake files to onnx 1.5.0 * Enable LSTM reverse direction mode in nGraph EP * Enable full support for the Split op in nGraph EP * Revert "Disable the unsigned input Shrink op tests for nGraph until the next update" This reverts commit 257b42a55bdd98f804d4846868542b8e3aeb4b4e. * Enable Gather and remove unused subgraph attribute * Remove the unused param from AppendClusterToSubGraph * Fix for the incorrect onnx opset version * Use the r0.26 release branch before the tag is created * Enable the quantizelinear and dequantizelinear for NGEP * Use the v0.26.0-rc.2 tag in ngraph.cmake * Add skip for modes others than default in Pad operator * Reenable negative axis tests for ngraph * Use temporary ngraph version * Use branch name instead of SHA for temporary ngraph branch * Use ngraph v0.26.0-rc.4 * Remove patch for missing symbol in MKLDNN * Use MKLDNN 1.0 in ngraph * Exclude the Pad op for opsets greater than 10 * Disable quantizelinear and dequantizelinear tests for ONNX 1.5.0 * Fix the onnx-headers related compilation errors * ONNX libs linking fix * Use a tag for ngraph and support more Pad modes * Use the v0.26.0 release tag for nGraph * Update ngraph to RC8 - bigobj flag for Windows builds * Fix the MKLDNN constexpr error on Windows	2019-10-14 10:37:48 -07:00
Changming Sun	c24d7a8a0a	Update eigen to the latest version (#1910 )	2019-10-11 10:44:19 -07:00
Changming Sun	a314402097	Downgrade python gpu package to CUDA 10.0 (#2086 )	2019-10-10 18:31:24 -07:00
Dmitri Smirnov	af9dbb70f2	Introduce a separate check and conditional for AVX512BW build (#2083 ) Separate checks for AVX512f and AVX512BW Make AVX512BW cmake instructions nested within AVX512F support.	2019-10-10 16:14:00 -07:00
Tracy Sharpe	57e0099425	MLAS: Implement U8S8 GEMV kernels (#2069 ) This implements an optimization for U8S8 MlasGemm when M=1, aka GEMV.	2019-10-09 11:54:16 -07:00
Dmitri Smirnov	cae571c713	Add a test for AVX512 compilation before compiling 512 asm (#2055 )	2019-10-08 21:18:04 -07:00
Scott McKay	db0dd09ded	Cleanup some aspects of the Initializer class used by optimizers (#2005 ) * Move check on data type outside of the Initializer class as it's specific to Conv processing. Use references for arguments that can't be null.	2019-10-09 10:37:44 +10:00
RandySheriffH	f501b6e234	pack pyop in nightly build (#2018 ) * pack pyop in nightly build * correct logic * add comment * exclude debug build * add dependency * reset postbuild rule * remove dep	2019-10-08 12:02:45 -07:00
Changming Sun	8f7657fa32	Ignore some gcc warnings (#1996 )	2019-10-07 16:32:34 -07:00
stevenlix	544e53e24e	Update TensorRT to version 6.0.1.5 (#1966 ) * remove onnx-tensorrt submodule * add new onnx-tensorrt submodule (experiment) for trt6 * update engine build for trt6 * update compile and compute for tensorrt6.0 * Update tensorrt_execution_provider.cc * Update tensorrt_execution_provider.cc * Update tensorrt_execution_provider.cc * Update tensorrt_execution_provider.cc * switch to onnx-tensorrt master for TensorRT6' * Update tensorrt_execution_provider.cc * Handle dynamic batch size and add memcpy in TensorRT EP * update test cases * Update tensorrt_execution_provider.cc * update onnx-tensorrt submodule * Update Dockerfile.ubuntu_tensorrt * Update Dockerfile.ubuntu_tensorrt * Update run_dockerbuild.sh * Update run_dockerbuild.sh * Update install_ubuntu.sh * Update concat_op_test.cc * Update tensorrt_execution_provider.cc * Upgrade TensorRT to version 6.0.1.5 * Update onnxruntime_providers.cmake * Update CMakeLists.txt * Update reduction_ops_test.cc * Update install_ubuntu.sh * Update Dockerfile.ubuntu_tensorrt * Update Dockerfile.tensorrt * Update BUILD.md * Update run_dockerbuild.sh * Update install_ubuntu.sh * Update onnxruntime_providers.cmake * Update install_ubuntu.sh * Update install_ubuntu.sh * Update gemm_test.cc * Update gather_op_test.cc * Update CMakeLists.txt * Removed submodule * update onnx-tensorrt submodule * Add Ubuntu18.04 build option * Add Ubuntu18.04 build option * Add Ubuntu18.04 build option * Add Ubuntu18.04 build option * Remove redundency * Fix issue that it does not add memcopy node correctly if some nodes fall back to CUDA EP. e.g. after partition, there's TRT_Node -> Cuda_node (with CPU memory expected), we still need to add memcpy node between them. * update for Trt Windows build * Update onnxruntime_providers.cmake * Disable opset11 tests on TensorRT * Update pad_test.cc * Update build.py * update scripts for ubuntu18.04 * Disable warning for Windows build	2019-10-06 10:40:53 -07:00
baowenlei	4bb6385dca	Weba/merge ngemm (#2021 ) * save status: add tiling layout; add avx512 skylake cpuid info * unit tests and matmul integer model passed on skylake, need to verify model * save commit before update master * fix check * address comments	2019-10-05 12:09:22 -07:00
Hariharan Seshadri	f528da35f2	Update ONNX to a newer commit (#2015 ) * Update ONNX to a newer version * PR comments	2019-10-04 19:41:00 -07:00
Dmitri Smirnov	f5a8a23951	Replace std::regex with re2 bc CentOS std::regex is broken (#2017 )	2019-10-04 18:47:03 -07:00
daquexian	e071a1249b	Android CI (#1600 )	2019-10-04 17:39:51 -07:00
Dmitri Smirnov	627f853a44	Downgrade compiler to CentOS 4.8.5 (#1985 ) Make onnxruntime CPU build and run on CentOS GCC 4.8.5	2019-10-03 15:40:46 -07:00
Dmitri Smirnov	d1b1cdc5c4	Replace GSL with GSL-LITE submodule and fix up refs (#1920 ) Remove gsl subodule and replace with a local copy of gsl-lite Refactor for onnxruntime::make_unique gsl::span size and index are now size_t Remove lambda auto argument type detection. Remove constexpr from fail_fast in gsl due to Linux not being happy. Comment out std::stream support due to MacOS std lib broken. Move make_unique into include/core/common so it is accessible for server builds. Relax requirements for onnxruntime/test/providers/cpu/ml/write_scores_test.cc due to x86 build. Add ONNXRUNTIME_ROOT to Server Lib includes so gsl is recognized	2019-10-01 12:43:29 -07:00
KeDengMS	e361174f78	Add nuphar python scripts to wheel, and notebook tutorial (#1952 ) * Fixed a bug of missing tvm in python wheel * Put Nuphar Python scripts into wheel * Add note book tutorial * Some improvements in symbolic shape inference for quantized models	2019-09-30 10:39:02 -07:00

1 2 3 4 5 ...

270 commits