onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-06 04:28:32 +00:00

Author	SHA1	Message	Date
ytaous	7abe1fd392	Identity elimination with graph output (#7312 ) * Identity removal * fix build * fix build * fix build * fix builld * UTs * fix UT * fix UTs * per comments * fix UTs * fix UTs * per comments Co-authored-by: Ethan Tao <ettao@OrtTrainingDev4.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>	2021-04-19 16:36:35 -07:00
Sheil Kumar	265db2ad96	Fix Microsoft.AI.MachineLearning .NET5 publishing and C# Store Release build (#7373 ) * fix .net publishing * make experimental api build with microsoft.ai.machinelearning.idl import Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2021-04-19 15:36:43 -07:00
satyajandhyala	bb1e417da0	Add logging support to Cast Propagation transformation from python (#7353 ) * Fixes needed to PropagateCast transformation. * Added number of passes to the logs. * Added logging support to OrtModuleGraphBuilder. * Added new testcases. * Added NodeArgToConsumerMap	2021-04-19 12:14:30 -07:00
M. Zeeshan Siddiqui	6dda1e0681	Flag for tensor memory re-use in allocation planner. (#7359 )	2021-04-16 17:53:25 -07:00
Guoyu Wang	96cdc65d57	Fix android CI failure after gradle updated to 7.0 (#7364 ) * Fix android ci failure after gradle updated to 7.0 * minor update	2021-04-16 15:28:28 -07:00
Yulong Wang	009f342caf	[JS] refactor Javascript/Typescript libraries in ONNX Runtime (#7308 ) * working on re-organizing js code for ortweb * remove dup files * move folder * fix common references * fix common es5 * add webpack to common * split interfact/impl * use cjs for node * add npmignore for common * update sourcemap config for common * update node * adjust folder/path in CI and build * update folder * nit: readme * add bundle for dev * correct nodejs paths * enable ORT_API_MANUAL_INIT * set name for umd library * correct name for commonjs export * add priority into registerBackend() * fix npm ci pwd * update eslintrc * revise code * revert package-lock lockfileVersion 2->1 * update prebuild * resolve comments * update document * revise eslint config * update eslint for typescript rules * revert changes by mistake in backend.ts * add env * resolve comments	2021-04-16 01:33:10 -07:00
Sunghoon	ded2b08380	WebAssembly multi-threads support. (#7326 ) * WebAssembly multi-threads support. * PROXY_TO_PTHREAD is not required for wasm library * Remove an unnecessary line commented out	2021-04-15 21:46:11 -07:00
Guoyu Wang	28e229ac4c	Enable build dynamic framework for macOS/iOS (#7343 ) * Enable build dynamic framework for macOS/iOS * Address CR comments	2021-04-15 16:47:53 -07:00
Chen Fu	ef1aaa367a	Adding interface for batched integer gemm (#7249) Parallelize MinMax, Quantize and batched quantize GEMM. A performance problem was identified in the quantized T5 decoder model, and the DynamicMatMul operator was identified as the culprit. This operator spends its time computing the MinMax of a tensor, quantizing a tensor, and performing a batched QGEMM, all of which can be parallelized. Currently GEMM is parallelized, but in batched GEMM we call GEMM sequentially multiple times. This starts and ends multiple parallel sections, which can be slow. So we made the following changes: Parallel task partitioning no longer depends on the degree of parallelism, only on the shape of the matrices. In a single GEMM, perform a 2D partition of the multiplication, along panel lines, to reduce repeated packing. For batched GEMM, all parallel tasks are executed in a single parallel section, reducing the cost of starting threads and waiting for them to finish.	2021-04-15 10:25:31 -07:00
Changming Sun	f1c1c38d44	Delete an unused var in nuget pipelines (#7345)	2021-04-15 07:29:52 -07:00
Tianlei Wu	aa9ab565f5	FastGelu fusion for Megatron model (#7344 ) * add a fastgelu pattern from Megatron model * update comment * add test	2021-04-15 00:39:33 -07:00
satyajandhyala	0da085ed48	Propagate Cast operations to maximize lower precision (float16) computation (#7191 ) * Added propagate_cast_ops option and PropagateCastOps transformation. * Added test cases to propagate Cast operations. * Expose GraphTransformerConfiguration to python interface and added propagate_cast_ops options. * Added functionality to propagate Cast operations. * Added logging. * Apply cast propagation to the subgraphs.	2021-04-14 20:54:24 -07:00
Jesse Benson	be79575c6a	Use built-in reduce_sum() for simple reduction cases, specifically reduce all to a scalar.	2021-04-14 08:55:35 -07:00
Brian Martin	3eb2d349a6	fix typo in scenariotestscppwinrt.cpp (#7334 ) the word is spelled, "resetting".	2021-04-14 08:26:55 -07:00
Oliver Rausch	87bd836886	Fixes in symbolic shape inference (#7258 ) * Add symbolic shape inference for Transpose * Support steps in symbolic shape inference for Slice * Add inference for BatchNormalization * Address review changes * Address review changes	2021-04-13 22:17:30 -07:00
liqunfu	75d8319286	Liqun/ort package name2 (#7337 )	2021-04-13 20:36:24 -07:00
Zhang Lei	f62db1a09c	quantization tools support qlinear average pool (#7309 )	2021-04-13 18:22:42 -07:00
liqunfu	4c862c73ed	for training to use new python package naming convention to explicitl… (#7204 )	2021-04-13 16:19:42 -07:00
ashbhandare	6ceee5d131	IsInf ReduceSum transform (#7188 ) * IsInf ReduceSum transform * Revert unnecessary changes, add isinf_only and isnan_only attr * add tests, review comments * Disable test for non-cuda * Move IsAllFinite from training to contrib op * review comments * Review comment, formatting * Enable test for ROCm EP	2021-04-13 16:05:21 -07:00
G. Ramalingam	f8a36dd6b3	Add DropoutGrad function body (#7310 ) * Add DropoutGrad function body * Add DropoutGrad function body * Fix documentation and add test cases * Fix template specialization * Check expansion for float16 and bfloat16	2021-04-13 14:31:53 -07:00
harshithapv	a5d3a52d1a	Add Tile grad (#7289 ) * tile grad * fixed bugs * added tile grad test * bug fix * Added tests. Addressed comments * added optimization recommended and addressed comments * fixed comment	2021-04-13 12:54:45 -07:00
Edward Chen	ce9cd6ad9a	Update usage of generator expression $<COMPILE_LANGUAGE:L1,L2> which is not available in CMake 3.14. (#7318 )	2021-04-13 11:18:34 -07:00
Hariharan Seshadri	2c96050336	Fix SDL warning (#7331 )	2021-04-13 11:14:43 -07:00
Ahmad Zakaria	f34468a309	Fix TRT EP memory leak (#7195 revisited) (#7276 ) * pass trt_profile by pointer pointer to avoid memory leak * have 1 optimization profile per state instead of 1 per provider instance	2021-04-13 09:43:19 -07:00
Zhang Lei	f616ea632e	remove mlas unittest.cpp which is already refactored. (#7319 )	2021-04-13 09:24:56 -07:00
Guoyu Wang	fce67e2b9b	Create Android Package pipeline (#7295 ) * Create Android Package pipeline * adress CR comments * Switch to jdk11	2021-04-12 17:56:25 -07:00
Sheil Kumar	b7c89ce78a	User/sheilk/add api usage telemetry (#7320 ) * winml telemetry * change name to ApiUsage Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2021-04-12 17:51:25 -07:00
Hariharan Seshadri	4971310d6a	Fix split op in the way it deals with the optional input (#7302 )	2021-04-12 10:26:08 -07:00
jeyblu	61ba9ac1bb	matmul in dnnl (#7311 ) * update dnnl to v2.2 * dnnl matmul	2021-04-12 08:03:03 -07:00
sfatimar	21c282ed54	yolov3 accuracy (#7235 ) Co-authored-by: sfatimar <sahar.fatima@intel/com>	2021-04-10 20:53:17 -07:00
Zhang Lei	6334c29240	Zhalei/mlas test (#7213 ) * Refactor mlas unittest. * Fix building issue on Linux (non msvc). * Fix unused variable CI issue seems for old gnuc. * Move to unittest foler one level down, and some other word change. * Fix typo cause some test wrong. * Correct some missing registered test_case count.	2021-04-09 17:02:38 -07:00
Weixing Zhang	75c0192e4f	enable more unit tests for ROCM EP (#7307 )	2021-04-09 15:15:13 -07:00
Tracy Sharpe	f27f5afd8a	NCHWc: Support "sizes" argument for Resize transform (#7290 )	2021-04-09 13:54:16 -07:00
jingyanwangms	2edf29552d	Add Optype to type mismatch message (#7305 ) * Include optype in error message Co-authored-by: Jingyan Wang <jingywa@OrtTrainingDev3.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>	2021-04-09 13:40:47 -07:00
baijumeswani	b221a4fd86	Better error message when ORTModule used with torch.DataParallel (#7287 ) * Better error message when ORTModule used with torch.DataParallel	2021-04-09 10:07:22 -07:00
Weixing Zhang	c22963c23d	Polish Lamb Kernel (#7299 )	2021-04-09 09:55:57 -07:00
Hariharan Seshadri	711cc99f4d	Improve logged message for nodes that are forced to execute on CPU rather than some other EP (usually CUDA) (#7297 )	2021-04-09 01:36:19 -07:00
Weixing Zhang	8ad5007f8f	Polish Adam kernel (#7294 ) * Polish Adam kernel	2021-04-09 01:11:09 -07:00
Tianlei Wu	274e2fea0c	change half gemm to use compute_32f as default (#7253 ) change half gemm to use compute_32f as default; add env variable for configuration	2021-04-08 20:54:37 -07:00
Zhang Lei	a4fdb4dbd9	Support transpose by merge Reshape etc into direct xint8 operators. (#7265 ) * Suppose transpose by merge Reshape etc into direct xint8 operators. * Add resize operator quantization support * Add QDQ tests for resize, reshape, maxpool, transpose.	2021-04-08 18:00:35 -07:00
RandySheriffH	42051c912a	Narrow profiling scope (#7281 ) * record endtime ealier * rename func * narrow down scope * rename args Co-authored-by: Randy Shuai <rashuai@microsoft.com>	2021-04-08 17:47:17 -07:00
Guoyu Wang	370f9b88c2	Enable CoreML EP for minimal extended mode (#7266 ) * Enable CoreML EP for minimal extended mode * minor code formatting * Fix CI run failure * Address CR comments * remove redundant ifdef	2021-04-08 17:45:22 -07:00
Thiago Crepaldi	7b4362c21a	Add support to dynamic positional/keyword input for ORTModule (#7189 )	2021-04-08 12:46:21 -07:00
Guoyu Wang	4969431eba	Fix codeql java warning (#7280 )	2021-04-08 11:08:12 -07:00
KeDengMS	0d49e53985	[Symbolic shape infer] fix scalar shape in Expand (#7285 )	2021-04-08 10:26:28 -07:00
Tracy Sharpe	bc6ef809bb	NCHWc: avoid buffer reordering around Add nodes (#7279 ) Use Reshape to handle more NCHWc Add cases without ReorderInput/ReorderOutput.	2021-04-08 09:57:23 -07:00
ytaous	e14b291ce7	Enable symbolic shape inference in ORTModule (#7282 ) Co-authored-by: Ethan Tao <ettao@OrtTrainingDev4.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>	2021-04-08 09:47:09 -07:00
baijumeswani	d272c8434d	Suppress tracer warnings from onnx export in ORTModule (#7221 ) * Suppress tracer warnings from onnx export in ORTModule	2021-04-08 03:41:38 -07:00
Maajid khan	27e778909d	[OpenVINO-EP] Enabling save/Load blob feature (#7054 ) * Enabling save/Load blob feature for OpenVINO-EP Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Added changes to enhance save/load feature ->This feature applies only for MYRIAD device target ->cleaned up the code and added error checks Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Enabled the feature only for MyriadX and only for Linux Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Fixed compilation issues on windows Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Added changes to fix const subgraph issue Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Fixed issues on windows Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Added changes for the feature -> Removed default location dir dump using cmake -> Enabled saving blob dumps at the executable path by default Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Made save/load dump path configurable -> The save/load blob dump path is now also made configurable using a c/python Api's. -> Introduced a flag named blob_dump_path Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Minor fixes added Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Fixed python API issues Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Using GetEnvironmentVar to get the path Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Fixed python runtime option issue Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Fixes import network issue on windows Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com>	2021-04-07 20:59:16 -07:00
Chen Fu	def4cc09c7	Add QGEMM benchmark (#7268 ) * Add QGEMM benchmark	2021-04-07 20:24:49 -07:00

4644 commits