onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-06 04:28:32 +00:00

Author	SHA1	Message	Date
Changyoung Koh	7666d130e5	Rename MKL-DNN to DNNL to fix broken link (#2730 )	2020-01-06 08:50:42 -10:00
Changming Sun	013642ed37	Revert "Change default optimization level to All (from Basic) (#2745 )" This reverts commit `56bb503c2f`.	2020-01-03 15:28:23 -08:00
Ashwini Khade	56bb503c2f	Change default optimization level to All (from Basic) (#2745 ) * change default optimization level to All (from Basic) * fix test * fix c# test	2019-12-30 12:31:44 -08:00
KeDengMS	71940c0915	Update Nuphar tutorial notebook (#2721 ) 1. Reflect int8 GEMV improvements for multi-threading from #2696 2. Add notes on multi-threading control using OpenMP 3. Add samples of running multi-isa AOT, and show int8 GEMM differences between AVX and AVX2 4. Add rnn_benchmark example to resolve #1993	2019-12-22 22:42:03 -08:00
Xavier Dupré	7c0235c15a	Propagate documentation modification from rel-1.0.0 (#2713 )	2019-12-21 00:25:45 +01:00
Faith Xu	bb7f43ee91	Documentation update: build instructions (#2636 ) * Spacing fix for code block * Update instructions Include java, acl, and nn api instructions on build page * Update build instructions to link to build.md * typo * Update build instructions to link to build.md * Include other minor build.md page updates * Update CUDA version * Fix dockerfile links	2019-12-19 13:40:34 -08:00
KeDengMS	c767e264c5	[NupharEP] update tutorial with GPT-2 (#2677 )	2019-12-16 17:57:34 -08:00
Adam Pocock	35ceb1a6a6	Java API for onnxruntime (#2215 )	2019-12-10 08:28:46 -08:00
daquexian	62de8fa841	Update docs for Android NNAPI EP (#2586 )	2019-12-09 14:37:03 -08:00
Ryan Hill	36eb1771ba	Update version (#2584 )	2019-12-08 18:00:12 -08:00
KeDengMS	0f12346d76	[Nuphar EP] fixes for some object detection models (#2581 ) Update notebook tutorial with multi-threaded int8 GEMM from #2517	2019-12-07 13:37:00 -08:00
Xiang Zhang	3e7aaf8fa1	User/xianz/telemetry (#2458 ) * enabme telemetry * enable telemetry * set enable telemetry as default * for debugging * remove log and set disable telemetry as default back * delete private file while testing * resolve comment: mainly add license header, rename macro and update docs * rewording in privacy.md	2019-12-03 23:34:53 -08:00
stevenlix	293b15480b	Add dynamic shape support in TensorRT execution provider (#2450 ) * remove onnx-tensorrt submodule * add new onnx-tensorrt submodule (experiment) for trt6 * update engine build for trt6 * update compile and compute for tensorrt6.0 * Update tensorrt_execution_provider.cc * Update tensorrt_execution_provider.cc * Update tensorrt_execution_provider.cc * Update tensorrt_execution_provider.cc * switch to onnx-tensorrt master for TensorRT6' * Update tensorrt_execution_provider.cc * Handle dynamic batch size and add memcpy in TensorRT EP * update test cases * Update tensorrt_execution_provider.cc * update onnx-tensorrt submodule * Update Dockerfile.ubuntu_tensorrt * Update Dockerfile.ubuntu_tensorrt * Update run_dockerbuild.sh * Update run_dockerbuild.sh * Update install_ubuntu.sh * Update concat_op_test.cc * Update tensorrt_execution_provider.cc * Upgrade TensorRT to version 6.0.1.5 * Update onnxruntime_providers.cmake * Update CMakeLists.txt * Update reduction_ops_test.cc * Update install_ubuntu.sh * Update Dockerfile.ubuntu_tensorrt * Update Dockerfile.tensorrt * Update BUILD.md * Update run_dockerbuild.sh * Update install_ubuntu.sh * Update onnxruntime_providers.cmake * Update install_ubuntu.sh * Update install_ubuntu.sh * Update gemm_test.cc * Update gather_op_test.cc * Update CMakeLists.txt * Removed submodule * update onnx-tensorrt submodule * update header file * Removed submodule * add submodule onnx-tensorrt kevin's branch shape-test' * add debugging code * Update tensorrt_execution_provider.cc * Update tensorrt_execution_provider.cc * merge master * Removed submodule * update onnx-tensorrt submodule * add more changes for dynamic shapes * Update tensorrt_execution_provider.cc * update for dynamic shape * update dynamic shape processing * fix logger issue * remove submodule onnx-tensorrt * add submodule onnx-tensorrt * add env variable min_subgraph_size * remove redundency * update document * use onnxruntime::make_unique * fix multi-run issue * remove some tests to save CI build time * Add dynamic shape test * Update TensorRT-ExecutionProvider.md * Add example of running Faster R-CNN model on TensorRT EP * Add more details on env variables * update environment variables * Update tensorrt_basic_test.cc * Update model tests * Update tensor_op_test.cc * remove --use_full_protobuf * Update build.py	2019-12-03 23:18:33 -08:00
Sreekanth Yalachigere	31ea11a696	Renaming MKL-DNN as DNNL (#2515 ) * DNNL: Moving Files to rename file names * DNNL name change * azure pipeline updated * disable ceil/dialation and enable Opset10 * disable ceil/dialation tests in Python * mlperf_ssd_resnet34_1200 disabled	2019-12-03 07:34:23 -08:00
KeDengMS	c1be615c45	[NupharEP] refine parallel schedule control (#2514 ) * [NupharEP] Add parallel schedule to JIT function name Update Nuphar docker to use Python 3.6 and ubuntu 18.04 * Update notebook * Avoid JIT cache file name conflict	2019-12-02 17:40:51 -08:00
KeDengMS	60208463a9	[NupharEP] Enable parallel schedule (#2505 ) * [NupharEP] Enable parallel schedule * Update TVM with the fix to TVM threadpool to use OpenMP if possible * Add parallel schedule when trying to vectorize With this change, BERT squad perf on a 4-core (8 HT) CPU goes from 187ms to 150ms * Address CR, docs and cmake update * Doc fix * Fix mkl * Fix TVM windows build when using mklml	2019-11-28 08:35:56 -08:00
avidiyal	95e8c3377e	onnxrt server documentation update (#2396 )	2019-11-18 15:31:07 -08:00
KeDengMS	aa7c79eac9	[NupharEP] Update notebook and docker image (#2416 ) Add BERT squad in Nuphar tutorial Enhance speed comparsion readability	2019-11-18 10:38:14 -08:00
Patrick Foley	151075790d	[OpenVINO-EP] Update to latest version: OpenVINO 2019 R3.1 (#2308 ) * Updates OpenVINO EP to latest version: 2019 R3.1 * Reviews fixed * Update Dockerfile.openvino * Addressed PR comments and disabled model tests temporarily * Update Dockerfile.ubuntu_openvino	2019-11-05 19:55:46 -08:00
Faith Xu	556bae17a5	Fix versions table (#2309 ) * Update table values * Fix onnxml opset version	2019-11-03 08:58:21 -08:00
mikecaraman	358b517d49	[v2] Add ACL (Arm Compute Library) execution provider (#2258 ) * Guard unused parameter Guard unused parameter for Linux Arm and other cases. * Add ACL (Arm Compute Library) execution provider Add a new execution provider targeting Arm architecture based on Arm Compute Library. Validated on NXP i.MX8QM CPU with ResNet50, MobileNetv2 and VGG models. All unit tests are passing. Comparative performance improvements for ResNet50v1 model obtained with onnxruntime_perf_test: A72 2xA72 A53 4xA53 ACL vs CPU 16% 9% 21% 13% Usage documentation available in ACL-ExecutionProvider. * Fix eigen unused parameter Fix eigen unused parameter error for Arm cross-compilation.	2019-10-31 12:25:36 -07:00
KeDengMS	ff64d1f55b	Relax check for optimized model saving (#2291 ) So user may save model with layout optimization.	2019-10-30 21:48:49 -07:00
Changming Sun	7b11f05a97	Update version number	2019-10-30 08:13:09 -07:00
suryasidd	f7b4bc15e1	Updated documentation for VAD-F (#2248 ) Signed-off-by: suryasidd <surya.siddharth.pemmaraju@intel.com>	2019-10-24 14:31:44 -07:00
Faith Xu	303a78c301	Update Python documentation (#2210 )	2019-10-21 16:56:31 -07:00
Nathan	0dd781fd57	Perf tuning doc update with latest API (#2128 ) * Update perf tuning md * Remove AppendExecutionProvider	2019-10-19 21:03:09 -07:00
stevenlix	a9f01a5f29	Fixed node index remapping issue in TensorRT graph partitioning (#2155 ) * Fixed node index mapping issue during graph partitioning * add test for node index mapping * Update BUILD.md * Update TensorRT-ExecutionProvider.md	2019-10-19 20:31:56 -07:00
Xavier Dupré	836d22cd4c	Update readme.rst for pypi, change documentation style (#1663 )	2019-10-19 18:26:34 -07:00
Paul McDaniel	d1159b7008	Adding platform telemetry (#2109 )	2019-10-19 18:25:57 -07:00
Ashwini Khade	fc3c168402	Graph Optimizations Doc (#2050 ) * Initial draft * updates per review * fix link * plus one more link fix * small changes to the optimizer documentation * some more changes * done * update C_API with doc link	2019-10-18 08:03:40 -07:00
Faith Xu	86af54ded8	Add roadmap file (#2127 ) * Add roadmap file * Minor updates * fixes based on feedback * Add IOT section	2019-10-17 13:03:25 -07:00
Faith Xu	ec136ac60f	Documentation Refresh (#1990 ) Various documentation updates, primarily for EP and main readme page	2019-10-15 15:58:02 -07:00
Adrian Tsai	4090d0d0de	Add DirectML Execution Provider (#2057 ) This change adds a new execution provider powered by [DirectML](https://aka.ms/DirectML). DirectML is a high-performance, hardware-accelerated DirectX 12 library for machine learning on Windows. DirectML provides GPU acceleration for common machine learning tasks across a broad range of supported hardware and drivers. The DirectML execution provider is capable of greatly improving evaluation time of models using commodity GPU hardware, without sacrificing broad hardware support or requiring vendor-specific extensions to be installed. Note that the DML EP code was moved verbatim from the existing WindowsAI project, which is why it doesn't yet conform to the onnxruntime coding style. This is something that can be fixed later; we would like to keep formatting/whitespace changes to a minimum for the time being to make it easier to port fixes from WindowsAI to ORT during this transition. Summary of changes: * Initial commit of DML EP files under onnxruntime/core/providers/dml * Add cmake entries for building the DML EP and for pulling down the DirectML redist using nuget * Add a submodule dependency on the Windows Implementation Library (WIL) * Add docs under docs/execution_providers/DirectML-ExecutionProvider.md * Add support for DML EP to provider tests and perf tests * Add support for DML EP to fns_candy_style_transfer sample * Add entries to the C ABI for instantiating the DML EP	2019-10-15 06:13:07 -07:00
Pranav Sharma	91db840b6b	Introduce execution mode enum for clarity and extensibility; Change Python, C and C# APIs accordingly; Removed EnableSequentialExecution, DisableSequentialExecution in favor of the more general SetExecutionModeAPI. (#2098 ) * Introduce execution mode for clarity and extensibility; Change Python APIs accordingly; Replace DisableSequentialExecution API with EnableParallelExecution for clarity. * Fix cuda build * Modify the test slightly * Make C and C# APIs consistent with Python.	2019-10-14 09:48:19 -07:00
baowenlei	b82de794d5	Weba/update nuphar doc (#2026 ) * update nuphar xp doc * address comments * address CR * update doc	2019-10-08 12:41:25 -07:00
Pranav Sharma	f13b66768a	Fix build for gcc 4.8.5. (#2036 )	2019-10-08 00:50:53 -07:00
KeDengMS	e361174f78	Add nuphar python scripts to wheel, and notebook tutorial (#1952 ) * Fixed a bug of missing tvm in python wheel * Put Nuphar Python scripts into wheel * Add note book tutorial * Some improvements in symbolic shape inference for quantized models	2019-09-30 10:39:02 -07:00
Emma Ning	02c122d6e4	Add OLive in perf tuning section (#1772 ) * Add OLive in perf tuning section * Add OLive to perf tuning section * Update README.md * Update ONNX_Runtime_Perf_Tuning.md	2019-09-27 13:10:40 -07:00
Xavier Dupré	2ecac41614	update python examples (#1935 )	2019-09-26 11:25:59 -07:00
Pranav Sharma	a9ce941579	Refine threading control options and move inter op thread pool to session state. (#1841 ) Description: Refine threading control options and move inter op thread pool to session state. Added thread_utils.h/cc to centralize the decision around the thread pool size under various conditions. Motivation and Context Currently the thread pool size of the parallel executor is hardcoded to 32 for some reason. This PR makes the options to configure the thread pool sizes clearer.	2019-09-18 22:36:23 -07:00
Faith Xu	a60283845b	Update link format and example sections in readme (#1729 ) * Fix broken link and minor wording updates * Update links to use relative paths * Update sample section organization * Fix a few more links * Update links to relative paths * Fix link urls * Update links to relative paths * Update link to perf test doc page * Update links to relative paths * Update to relative paths for links * Update link	2019-09-12 17:49:29 -07:00
Pranav Sharma	f8c3442880	Part 2 of renaming AllocatorInfo to MemoryInfo. (#1804 ) * Mention OrtCreateSessionFromArray in C API doc * Part 2 of renaming AllocatorInfo to MemoryInfo. * pr comments * fix comment	2019-09-12 08:19:29 -07:00
Pranav Sharma	7c5b3a5ecc	Update coding guidelines to prefer using make_unique for heap allocations (unless where not possible). (#1730 ) * Mention OrtCreateSessionFromArray in C API doc * Fix perf test executable due to removal of certain C APIs * fix linux build * Avoid duplication * Update coding guidelines to prefer using make_unique for heap allocations (unless where not possible).	2019-09-04 19:16:16 -07:00
manashgoswami	3d44c55092	Updated docs related to base images (#1753 ) * Update README.md * Update onnx-inference-byoc-gpu-cpu-aks.ipynb * Update README.md	2019-09-04 10:33:41 -07:00
KeDengMS	c9240f4e93	Implementation of Nuphar execution provider (#881 ) * Implement Nuphar execution provider Nuphar execution provider is a TVM-based compilation provider. It has shown great speedups for RNN models using Scan. This PR is mainly for a preview of the shared codegen library for other TVM-based providers. * Fix submodules * Fix TVM submodule * Update Nuphar to latest and resolve confliction * Remove stale files caused by merge -X theirs * Revert heap buffer change to not introduce onnxruntime_framework into onnxruntime_perf_test * Fix bad merge * Merge from Nuphar * Fix warning treated as error, revert some unnecessary changes * Revert some more test changes * Some more test revert or comments to make review easier New tests could be added later * One more revert of unnecessary changes * More change revert. Test could be added back later.	2019-09-01 23:01:47 -07:00
Faith Xu	d9cdf4b4ed	Doc updates (#1522 ) * Updates * Remove preview texts * Update README.md * Updates * Update README.md * Update README.md * Minor wording update * Update README.md * Update doc on CUDA version * revert update * Update readme for issue #1558 * Clean up example section * Cosmetic updates - Add a index of build instructions for browsability - Update build CUDA version from 9.1 to 10 * Fix broken link * Update README to reflect upgrade to pip requirement * Update CuDNN version for Linux Python packages * Clean up content Updated ordering and add table of contents * Minor format fixes * Move Android NNAPI under EP section * Add link to operator support documentation * Fix typo * typo fix * remove todo section	2019-08-27 21:31:19 -07:00
Emma Ning	d0d82432f3	Update PyTorch Section for supported onnx version (#1635 ) PyTorch exporter in Pytorch1.2 can natively support multiple opset now	2019-08-20 13:56:19 -07:00
Pranav Sharma	377dcf60ac	Update onnx test runner documentation (#1651 ) * Mention OrtCreateSessionFromArray in C API doc * Update perf tool documentation to reflect the new graph optimization enums. Relax constraint for enable_all. * Update one more doc * Update onnx test runner documentation * Add default in the docs	2019-08-19 18:28:09 -07:00
Pranav Sharma	6f3a835d38	Update perf tool documentation to reflect the new graph optimization enums. Relax constraint for enable_all. (#1650 )	2019-08-19 14:27:33 -07:00
shahasad	0c5d2c998b	Generate documentation from the registered operator kernels (#1395 ) - Added python script for generating markdown doc from the registered opkernels. - Made some conditional changes in the pybind to expose necessary python API - Added some missing type-constraints in the op kernel registrations	2019-08-14 18:12:24 -07:00

1 2 3

131 commits