onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-06-27 03:11:28 +00:00

Author	SHA1	Message	Date
Sheil Kumar	6255194659	All LearningModelSessions created from a common LearningModelDevice should share the same thread pool (#11457 ) * Share thread pools between devices * make tests reuse device * Change cpu thread pool options for dml sessions to use 1 thread with no spinning * fix test failure * Update missing type constraints for dft * Add comment and rename inference session parameter * default missing causing inconsistent test behavior Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2022-05-13 11:12:43 -07:00
Yi Zhang	5709ed2e16	Fix shellcheck warning (#11489 ) * fix shellcheck warning * Update java_linux_final_test.sh	2022-05-13 15:36:59 +08:00
RajalakshmiSR	b14c1fd479	POWER: Optimize MlasQLinearAddKernelHelper() (#11454 ) This patch uses vector intrinsics to optimize the MlasQLinearAddKernelHelper function for POWER processors. Co-authored-by: Rajalakshmi Srinivasaraghavan <rajis@linux.ibm.com>	2022-05-12 18:38:45 -07:00
George Wu	09590f013a	fix windows ci debug build break (#11495 ) * update msc version check * update comment * typo * whitespace	2022-05-12 16:54:00 -07:00
Rachel Guo	4aef7e3aab	[CoreML EP] Add DepthToSpace op support (#11468 ) * initial impl of depthtospace coreml support * fix build * address pr comments * minor update * minor pr comments Co-authored-by: rachguo <rachguo@rachguos-Mini.attlocal.net> Co-authored-by: rachguo <rachguo@rachguos-Mac-mini.local>	2022-05-12 13:48:51 -07:00
Yi Zhang	a3f05da338	Revert "[TVM EP] update set input to remove excess copying inside TVM (#11247 )" (#11504 ) This reverts commit `5ae461ec0a`.	2022-05-13 02:27:36 +08:00
Tianlei Wu	ece1274ffa	revert safeint version (#11500 )	2022-05-12 11:24:43 -07:00
Justin Chu	f94b25933a	ci(cpplint): Ignore runtime/references warnings (#11499 ) Allow non-const references `6f85d3e5c8/docs/Coding_Conventions_and_Standards.md (L11-L12)`	2022-05-12 07:51:45 -07:00
sumitsays	2660eb8364	DML EP: Gelu (#11483 ) Co-authored-by: Sumit Agarwal <sumitagarwal@microsoft.com>	2022-05-11 16:24:11 -07:00
Justin Chu	6f85d3e5c8	fix(onnx_export): Extract arg value from torch Value (#11471 ) Description: Extract arg value from torch Value Motivation and Context Input to gelu is `torch._C.Value` type values. This caused the `if approximate == "none"` check to always fail, preventing the optimized `com.microsoft::Gelu` op from being used.	2022-05-11 11:36:43 -07:00
Tianlei Wu	f5473596fa	Change longformer default kernel (#11470 ) * change default to compact memory kernel * Remove a cuda stream synchronize that is not needed * Update longformer benchmark tool	2022-05-11 10:54:59 -07:00
Changming Sun	48ae27d578	Update protobuf-java to 3.20.1 (#10420 )	2022-05-11 07:52:12 -07:00
Changming Sun	207ad7eef9	Remove spdlog from cgmanifest.json	2022-05-10 22:02:21 -07:00
Changming Sun	027fc1d391	Completely delete ORT server	2022-05-10 22:02:21 -07:00
Changming Sun	903743e823	Delete unused TRT docker files (#11486 ) * Delete unused TRT docker files * revert tools/ci_build/github/linux/docker/Dockerfile.manylinux2014_cuda11_4_tensorrt8_0	2022-05-10 22:00:53 -07:00
Dwayne Robinson	205b61c5d8	Fix bad merge in build.py	2022-05-10 17:17:55 -07:00
Dwayne Robinson	f82946c4a0	Merge branch 'master' into user/dwayner/WindowsRiTest2	2022-05-10 16:57:47 -07:00
Changming Sun	0ac2e6e546	Update install-entrypoint.sh: add version lock for NCCL (#11475 )	2022-05-10 15:37:55 -07:00
pengwa	d8a1531c37	CKPT API Implementation (On Device Training) (#11261 ) * Checkpoint API Implementation * fix build issues * fix undefined reference for ParseData of type string. * refinements * resolve some comments * expose python api * make save and load test pass * some clean up * make optimizer save/load test pass * make custom property save/load test pass * formatting * fix comments - fix wave - code placement, remove legacy ckpt logic dependency, remove external data support * fix comment - wave 2 - Remove ParseData/ParseStringData, Use UnpackTensor, Simplify CheckpointProperty usage * fix comment - wave 3 - rename all api_test namespace to api * fix comment - wave 4 - load/save trainable/nontrainable param separately. * Rename Load/SaveORTCheckpoint * renaming API && remove CheckpointUntils. api::LoadCheckpoint/SaveCheckpoint is the exposed interfaces. * revert unnecessary format change for onnxruntime/core/framework/tensorprotoutils.h/cc * formatting * re-org the class folders for better dependency management * save_checkpoint accepting TensorProto as inputs * More clean up * clean up the naming * refactor a bit type constraints on custom property * fix comment - file read/write && report error when file read/write failed * extract LoopDir to FilterFilesFromDirectory * fix build	2022-05-10 18:43:57 +08:00
Yulong Wang	3437967e63	[js/rn] fix CI packaging for react native E2E test (#11463 ) * [js/rn] fix ORTRN packaging in CI * fix env var setting	2022-05-09 18:09:52 -07:00
Edward Chen	738d9b153c	Consolidate several types into onnxruntime::ArgType. (#11430 )	2022-05-09 14:44:28 -07:00
Rachel Guo	288892335e	[NNAPI EP] Add support for DepthToSpace Op (#11354 ) * initial implementation for support nnapi depthtospace * modify depthtospace output tensor shape and enable test pass * minor update * minor update * modify input output layout order and hack nnapi instance to use nchw flag for optest * address pr comments * add depthtospace to layout logic * format length and revert UT log level * add nchw and android feature level check in opsupportchecker * minor fix * update * update * fix * minor update	2022-05-09 11:38:12 -07:00
Changming Sun	3b16fb2000	Delete java-test-final-jar-step.yml (#8894 )	2022-05-09 11:25:03 -07:00
Justin Chu	c541063245	Format coding conventions documentation (#11405 ) Add proper formatting to code blocks to make the doc more readable. - Wrap code blocks with ` - Fix typos	2022-05-09 10:19:15 -07:00
symphonylyh	c2de603c10	Contrib ops for TRT plugin: Disentangled Attention Plugin (#11287 ) * Add disentangled attention TRT plugin as contrib op * update plugin name & remove null character * update onnx-tensorrt submodule with my beta version * use suggested plugin name & simpler shape propagation * update onnx-tensorrt gitsubmodule to temporary fork * update onnx-tensorrt to temporary commit * redirect submodule back to latest 8.2-GA release of onnx-tensorrt repo Co-authored-by: HHH-ComputeLab <haohangh@nvidia.com>	2022-05-08 15:25:25 -07:00
George Wu	70e501866b	Revert "[TensorRT EP] reduce CI pipelines test execution time (#11440 )" (#11460 ) This reverts commit `8d6ade9e08`.	2022-05-07 11:41:11 -07:00
Dwayne Robinson	69b2fab810	Update DirectML from 1.8.0 to 1.8.2 (#11459 )	2022-05-06 17:52:52 -07:00
RandySheriffH	8467af832f	Fix reduced pipeline by excluding test case standalone op (#11458 ) * exclude reduce build from standalone op test * exclude test from reduced op build	2022-05-06 16:19:49 -07:00
Brian Popow	3624f7c5a5	Update samples (#11420 )	2022-05-06 13:32:16 -07:00
Hubert Lu	2a90922f01	Using vectorized loads (float2) for fp16 to improve performance (#11390 )	2022-05-05 14:19:21 -07:00
Changming Sun	d2ae0f49b2	Make Graph::InlineFunction be able to process initializers (#11443 )	2022-05-05 12:30:29 -07:00
George Wu	8d6ade9e08	[TensorRT EP] reduce CI pipelines test execution time (#11440 ) * add global builder placeholder to improve CI test time for TRT EP * fix build error * rename var, put in unnamed namespace * fix build error * fix	2022-05-05 09:25:54 -07:00
Tang, Cheng	3f3c5fcd68	Unify the Compile API for mobile build and normal build (#10632 ) * use the lightweight compile api as default; use dnnl ep for testing * apply to tensorrt ep * fix the missing files * fix build * fix the copy issue on linux * migrate migraphx and openvino ep * fix openvino build break * fix linux build * fix unused parameter * fix coreml build * use graph view's filtered initializers * fix openvino break * fix tvm compile api * fix tvm / rknpu / vitisai ep build * add IsInitializedTensor in graph_viewer; fix nuphar build * use serializer directly as tvm ep is still static lib * fix the type mismatch * fix the type mismatch * fix merge conflict * add a comment * fix minimal build * fix the DML EP's legacy approach * save type/shape in dnnl IR * fix linux break * fix tvm failure * dnnl ep: move initializer referenced out of dnnl subgraph * Revert "add IsInitializedTensor in graph_viewer; fix nuphar build" This reverts commit 1cc3c7f08c16fee4fe3309a67209eb769d479587. * add IsInitializedTensor to graph viewer * add the legacy code for nuphar build to temporarily make nuphar build work * ignore internal test for nuphar * remove the out of date tests * keep the legacy API in EP for a while * turn serializer into a static function * update comments * fix tvm build * Update include/onnxruntime/core/framework/execution_provider.h Co-authored-by: Pranav Sharma <prs@microsoft.com> * Update include/onnxruntime/core/framework/execution_provider.h Co-authored-by: Pranav Sharma <prs@microsoft.com> * Update onnxruntime/core/framework/execution_provider.cc Co-authored-by: Pranav Sharma <prs@microsoft.com> * update comments; add warning message for legacy compile call * add a flag to control out of scope arg in serialization * fix trt build; improve the test * resolve merge errors * fix a typo Co-authored-by: Cheng Tang <chenta@microsoft.com> Co-authored-by: Cheng Tang <chenta@microsoft.com@orttrainingdev9.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net> Co-authored-by: Pranav Sharma <prs@microsoft.com>	2022-05-05 08:30:07 -07:00
cloudhan	eca4cbc419	Avoid using word 'crazy' (#11396 ) Avoid using word 'crazy' and simplify the comment of else branch	2022-05-05 23:07:50 +08:00
Valery Chernov	5ae461ec0a	[TVM EP] update set input to remove excess copying inside TVM (#11247 ) * update TVM * small fixes * update TVM with new set_input and NDArray API * use set_input instead of set_one_input Co-authored-by: Valery Chernov <valery.chernov@deelvin.com>	2022-05-05 14:25:02 +02:00
Vincent Wang	084165c748	Change MinGrad/MaxGrad to Use Distributed Logic (#11388 ) * change min max grad * resolve comments	2022-05-05 11:49:40 +08:00
Yulong Wang	860ba8820b	[js/rn] fix ORTRN for iOS (#11425 ) * align ios version with onnxruntime-mobile-c * support 'file://' in iOS * fix lint error	2022-05-04 13:58:55 -07:00
Changming Sun	963e1ace4e	Fix SAL annotations for custom op (#11432 ) Fix SAL annotations for custom op. For example, "_In_" only applies to pointers, not integers.	2022-05-04 10:47:28 -07:00
Justin Chu	a1f9847b23	[Fix] Add the extra param to match gelu in PyTorch in the contrib symbolic function (#11318 ) Description: Add the extra param to match gelu in PyTorch in the contrib symbolic function Motivation and Context Why is this change required? What problem does it solve? The symbolic function in /onnxruntime/python/tools/pytorch_export_contrib_ops.py is missing a recently added parameter approximate. We add this parameter and use the exporter defined gelu if approximate is "tanh".	2022-05-04 10:36:38 -07:00
Hariharan Seshadri	1aad59fa49	Increase timeout for IOS packaging pipeline (#11431 )	2022-05-04 10:00:41 -07:00
Changming Sun	57b51e72d7	Linux CI: uninstall onnx before installing it (#11428 )	2022-05-04 08:49:37 -07:00
Yulong Wang	af21a04977	[js] upgrade async@3.2.3 /js/ (#11421 ) * [js] upgrade async@3.2.3 /js/ * format code	2022-05-03 23:41:36 -07:00
Sheil Kumar	85fa168dc1	Add optional dft_length input to the DFT and IDFT operators. (#11427 ) * Add optional dft_length input. * CR Feedback Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2022-05-03 16:17:43 -07:00
Tang, Cheng	ae043e3963	Support ort device tensor in ortmodule's inference (#11112 ) * support ort device tensor in ort module inference * fallback aten equal to cpu; add ortmodule inference test case * fix python format Co-authored-by: Cheng Tang <chenta@microsoft.com@orttrainingdev9.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>	2022-05-03 14:28:30 -07:00
RandySheriffH	8d69b9398b	APIs for custom op to invoke ort operator directly (#10713 ) * draft kernel creation * setup eager context * call into kernel in eager mode * redefine test case * refact eager context * add comment * remove header * rename argument * redefine API definition with types * list outputs as argument * switch to int to represent length * fix compile err * create attribute API * add test case for topk * remove bool from c api * add gru test case * remove var * fix compile warnings * rename status * fix compile err * exclude sparse tensor * fix comments * fix comments * fix build err * rename file and move location * format code * move file to session folder * fix comments Co-authored-by: Randy <Randy@randysmac.attlocal.net>	2022-05-03 14:16:30 -07:00
Yulong Wang	a3e38d7c90	[js] upgrade async@3.2.3 /js/web/ (#11426 )	2022-05-03 14:04:22 -07:00
Changming Sun	253c8b41ed	Move some of the transpose kernel code to onnxruntime_framework.lib (#11380 ) * Move some of the transpose kernel code to onnxruntime_framework.lib * Fix C4244 warnings in the transpose code * Rename IsMovingSingleAxis to IsTransposeMovingSingleAxis	2022-05-03 14:03:50 -07:00
Yulong Wang	308b605047	[wasm] increase timeout for Web Assembly static lib CI (#11306 ) * [wasm] increase timeout for Web Assembly static lib CI * update config format	2022-05-03 11:29:40 -07:00
Yulong Wang	d306e00351	[js/rn] set minSdkVersion to 21 for ORT-RN Android (#11403 )	2022-05-02 19:36:41 -07:00
Changming Sun	5023f6750b	Revert "Call pluggable EP's shutdown function in Environment::~Environment() (#11120 )" (#11393 ) This reverts commit `4983d6e5d6`. We can't destroy OrtEnv through python's atexit function, because at that time there might be many other ORT python objects alive.	2022-05-02 14:38:31 -07:00


7863 commits