onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-03 03:58:54 +00:00

Author	SHA1	Message	Date
baijumeswani	f3a70f1aec	Ignore invalid input argument to install_os_deps.sh (#7566 )	2021-05-05 14:33:31 -07:00
Zhang Lei	9465948715	Quantization tools using one more extra_options on interface. (#7293 ) handle nnapi special sigmoid options.	2021-05-05 13:51:50 -07:00
Changming Sun	a284eede64	Fix Linux CPU pipeline (#7584 )	2021-05-05 13:26:10 -07:00
Ye Wang	8a9ddfe963	Longformer Attention non-determinism issue fix (#7574 ) * Fix run-to-run not deterministic bug. * Remove non-deterministic logic in softmax * Fix value diff when removing non-deterministic issue. Co-authored-by: Lei Zhang <zhang.huanning@hotmail.com>	2021-05-05 09:54:25 -07:00
Derek Murray	94c97ac8c2	Fix compiler warnings treated as errors in GistEncodeDecode. (#7568 ) * Fix compiler warning in GistEncodeDecode. * Fix other use of member variable. * Make `compression_type_` const. * Change floor to floorf in CUDA code. * Statically cast size_t to int in GIST CUDA kernels * Add explicit cast to `long` in gist.cc Co-authored-by: Derek Murray <demurra@microsoft.com>	2021-05-05 09:05:11 -07:00
Sheil Kumar	91985ab03d	add use_dml (#7569 ) Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2021-05-05 08:55:13 -07:00
Xavier Dupré	2f0479780e	Improves NonMaxSuppression on CPU (#7557 ) * improves non max suppression * use pointer instead of boxes	2021-05-05 09:14:40 +02:00
Xavier Dupré	ade6ed51eb	Speed up Reduce operators for consecutive reduced axes (#7206 ) * Improves Reduction for three specific configurations * Support ReduceMean * add ReduceMax, ReduceMin * refactoring	2021-05-05 09:14:00 +02:00
Pranav Prakash	053bada30f	Add support for setting shape inference function on fused nodes (#7007 ) * Add support for setting shape inference function on fused nodes * Add test for fused node shape inference	2021-05-05 13:32:07 +10:00
Rachel Guo	d8cf960412	Add android test app to validate Java API for ORT-Mobile Android (#7477 ) * test * [gwang] make cmake compile work * [gwang] enble build apks * some build update * add simple sigmoid test android project and cmake * add build.py * refine and remove unused import lib * address CR comments * remove unnecessary files * add README.md * minor update * remove * minor change * fix ci failure and minor update * fix typo in project folder * remove * remove and minor update * refine * minor fix * fix * fix typo * add gradle spotlessApply task to fix CI failure * fix * enable spotlessApply in build gradle * revert some changes * minor fix * run spotless apply for format * address CR comments and fix CI version and format * refine * Refine * address comments * refine * refine * modify * reformat * resolve version conflicts * minor update * minor update * address comments * minor update Co-authored-by: Guoyu Wang <wanggy@outlook.com>	2021-05-04 15:39:14 -07:00
Zhang Lei	f6cefc92e2	Add quantized value map after quantize input node added. (#7558 )	2021-05-04 15:27:56 -07:00
Sergii Dymchenko	a647da3e1a	Fix 2 input Gemm grad (#7561 ) * Add test for 2 input Gemm grad. * Fix 2 input Gemm grad.	2021-05-04 12:00:14 -07:00
harshithapv	d812354ebd	Tile grad fix (#7556 ) * tile grad fix * code clean up	2021-05-04 11:16:26 -07:00
Guoyu Wang	e05528a365	Update Android AAR packaging pipeline script (#7559 ) * update android package pipeline * update shell script * update script * add kMSExperimentalDomain to reduction	2021-05-04 11:13:33 -07:00
Guoyu Wang	71ff6ff2ec	Disable NNAPI support for dynamic input shape, add warning logs (#7439 ) * disable nnapi for graph with dynamic input shape * Add warning for multiple paritions * minor update * update the message logging * Fix coreml ci failure	2021-05-04 11:09:23 -07:00
Fanny Nina Paravecino	c3c4db2c1b	Upgrade GIST memory compression nodes, kernels, optimizer rule, and cli (#6262 ) * Add gist nodes, kernels, optimizer rule, and cli * Add Gist CUDA kernels * Added/updated gist compression cli to bert, gpt2, mnist * Fix decode priority generator for large models * Fix hardcoded decode priority generator, update gist training test * Fix incomplete if/else sequence for CI build * Added MSFP15 for gist compression type * fix Msfp15 bug * Resolved azure pipeline errors - unsupported ORT_RETURN macro format, cudastream argument * Resolved hardcoded cudastream argument, Pack8 zero error * Resolved PR comments - except gist tests * Added TypeInference to Gist Nodes, To attribute to Gist Decoder, Updated Gist Test Cases * Reverted error in merge commit * Updated logger usage in Gist rule, Updated GistPackMSFP15 compressed tensor's explaination * Converted onnxruntime::make_unique to std::make_unique based on PR 7502 Co-authored-by: Fanny Nina Paravecino <faninapa@microsoft.com> Co-authored-by: Aayush Ankit <aayushankit@microsoft.com> Co-authored-by: Aayush Ankit <Aayush-Ankit@users.noreply.github.com> Co-authored-by: Fanny Nina Paravecino <fanny.nina@microsoft.com>	2021-05-04 10:33:35 -07:00
Sherlock	c1ed647170	ORTModule enable run_symbolic_shape_infer by default (#7423 ) * ORTModule enable run_symbolic_shape_infer by default * Fix UTs by replacing Relu with Softmax Co-authored-by: Sherlock Huang <bahuang@OrtTrainingDev3.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>	2021-05-04 10:08:14 -07:00
Chi Lo	a94a893d5e	Update SessionOptions.cs (#7540 ) Fix compile warning	2021-05-04 01:51:35 -07:00
Scott McKay	594dde2647	Validate that the conversion script from the python package can be used to convert models. (#7517 )	2021-05-04 16:25:04 +10:00
sfatimar	898fff702c	compatibility was broken for myriad config parameter (#7349 ) Co-authored-by: MaajidKhan <n.maajidkhan@gmail.com> Co-authored-by: sfatimar <sahar.fatima@intel/com> Co-authored-by: suryasidd <surya.siddharth.pemmaraju@intel.com>	2021-05-03 21:13:12 -07:00
Tianlei Wu	3c9ece4a11	[transformers optimizer] catch symbolic shape inference exception and clean up (#7560 ) catch symbolic shape inference exception. no prune graph when there is inner graph (Loop/If/Scan) add an wrapper for numpy_helper.to_array so that we can debug onnx graph without external data remove fuse_mask that is not used any more in onnx_model_bert_tf.py	2021-05-03 20:42:13 -07:00
George Wu	faea7a222d	linux trt package pipeline (#7537 )	2021-05-03 19:14:20 -07:00
Yulong Wang	8eaa4c33e2	[js] fix library bundling and some trivial improvement (#7550 ) * [js] fix library bundling * fix filename in code comments	2021-05-03 18:31:55 -07:00
Tianlei Wu	731f9e5033	Fix symbolic shape inference for Unsqueeze (#7555 ) * fix Unsqueeze shape inference * add tests	2021-05-03 18:06:59 -07:00
Yulong Wang	418623355a	disable logging for WASM in inference session ctor (#7545 )	2021-05-03 16:01:41 -07:00
Dmitri Smirnov	8b6602ae68	Refactor provider test utils to prepare for expansion. (#7538 ) Refactor provider test utils to prepare for expansion. Do not sort Tensors in place. This is reasonably rare.	2021-05-03 15:56:07 -07:00
baijumeswani	cab84d902e	Install and use conda on ortmodule CI pipelines (#7530 ) * Install and use conda on ortmodule CI pipelines * Update build script to install onnxruntime wheel before running unit tests * Remove python 3.5 from install_python_deps * Pinning deepspeed version to 0.3.15	2021-05-03 15:52:22 -07:00
Yufeng Li	ad15811ade	Add QDQ support for MatMulIntegerToFloat, Gather and Transpose (#7500 ) * add QDQ support for MatMulIntegerToFloat, Gather and Transpose	2021-05-03 15:51:25 -07:00
Scott McKay	830d9e54dd	Add script to dump initializer, NodeArg, Node and subgraph info from an ORT format model (#7516 )	2021-05-04 08:34:35 +10:00
Yulong Wang	3600c3e66e	[js/web] integrate latest changes from onnxjs (#7535 ) * [js/web] integrate latest changes from onnxjs * apply ESLint rules: filename-case and header * remove filename-case rule for wasm .d.ts	2021-05-03 15:03:25 -07:00
sumitsays	9c0e5954cb	Output Tensor Shape Validation b/w ONNX inference and ORT (#7252 ) * Adding Output Shape Validation for ORT-CPU execution flow * Skipping validation check in-case output is not a tensor. Fixed conv_transpose test. Ignoring pad and reduction test * Comparison b/w signed and un-signed int. Removed const for a primitive variable * Commented the un-used test function signature * Removed exception instead logging warning. Because there are lots of ORT tests which are failing because of this validation * Fixed warning condition and test * Fixed test and addressed comment on the PR * Output shape verification will happen only for final output nodes of the model * Changed variable name from camel case to underscore style * Enable the tests as the validation failure will now logs warning instead of throwing an exception * Adding Output Shape Validation for ORT-CPU execution flow * Resolve merge conflict * Comparison b/w signed and un-signed int. Removed const for a primitive variable * Commented the un-used test function signature * Removed exception instead logging warning. Because there are lots of ORT tests which are failing because of this validation * Fixed warning condition and test * Fixed test and addressed comment on the PR * Output shape verification will happen only for final output nodes of the model * Changed variable name from camel case to underscore style * Enable the tests as the validation failure will now logs warning instead of throwing an exception * Remove duplicate function "GetLogger()" Remove duplicate function "GetLogger()" * Fixed typo in method name TestConvTransposeOpInitializer Fixed typo in method name "TestConvTransposeOpInitializer" Co-authored-by: Sumit Agarwal <sumitagarwal@microsoft.com>	2021-05-03 12:56:09 -07:00
sumitsays	e344a583b0	updated sampleTolerance of model fp16_inception_v1 for GPU execution provider (#7533 ) Co-authored-by: Sumit Agarwal <sumitagarwal@microsoft.com>	2021-05-03 12:08:31 -07:00
G. Ramalingam	b0a3b501fe	Add function body for Gelu and FastGelu (#7496 ) * LayerNorm function body v1 * LayerNorm function body * layernorm function test * Minor fixes * Fix signed unsigned comparison * Move contrib ops test * Handle optional output parameters * Add test case for optional outputs * Handle float16 random generation * Add function body to Gelu and FastGelu * Add FastGelu test * Fix comments * Include cmath	2021-05-03 12:00:37 -07:00
Yulong Wang	7079dfb93d	[wasm] fix and unify webassembly target name (#7549 )	2021-05-03 10:37:25 -07:00
Tim Harris	2e09d9921a	"Sticky" allocation of worker threads (#7551 ) [ PR previously merged as https://github.com//pull/7372, then reverted pending investigation of lost-wake-up issue seen with ParallelExecutor. Issue was a missing test for new work pushed to thread concurrent with a worker blocking. Change from 7372 is the addition of: https://github.com/microsoft/onnxruntime/blob/tiharr/dev-sticky-4/include/onnxruntime/core/platform/EigenNonBlockingThreadPool.h#L1473-L1492 ] Description: This change updates the heuristics used when a thread selects which worker threads to push work to on entering a parallel loop. Previously, worker threads would maintain a best-effort bitmap of "good worker hints" indicating the threads that were likely to be spinning waiting for work. This change uses a simpler heuristic where a thread records which workers ran its previous loop, and then re-submits its next loop to those same workers. The aim is to retain affinity between a thread and a set of workers, and to avoid maintaining the "good worker hints" bitmaps. Motivation and Context: Profiling suggested that maintaining the "good worker hints" was taking unexpected time, particularly on NUMA systems. In addition, when running many concurrent workloads, the hints did not provide a way to help retain locality of workers and hence data in caches. Testing to confirm no regressions on microbenchmark (./build/Linux/Release/onnxruntime_benchmark --benchmark_filter=BM_ThreadPoolParallelFor) and on Linux mobilenet_v1_1.0_224.onnx, comparing p50 and p99 with vs without this change: 1 concurrent: p50 0.0172s vs 0.0181s p99 0.0204s vs 0.0216s 2 concurrent: p50 0.0172s vs 0.0181s p99 0.0213s vs 0.0221s	2021-05-03 18:28:13 +01:00
Sherlock	6714f2f85d	Improve tol value logging in ORTModule test (#7544 ) Co-authored-by: Sherlock Huang <bahuang@OrtTrainingDev3.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>	2021-05-03 09:43:40 -07:00
Ryota Tomioka	d1cb8c9dc9	Support negative indices and fix bound checking in symbolic shape inference for Slice (#7401 ) * Use positivity everywhere; handle negative index in Slice * limit positivity to inputs * make handle_negative_index private * strengthen sympy comparison * further strengthen compariso n and a minor refactoring * Add flip test * Fall through if -int_max in handle_negative_index() * minor fix for infer_Concat to include initializers * Add more tests * use simplify * more tests	2021-05-03 09:07:55 -07:00
Sheil Kumar	8e3cdf0452	Use unicode apis for loadlibrary (#7523 ) Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2021-05-03 07:24:40 -07:00
Yulong Wang	add4e4225b	[js/web] fix pacakge metadata of onnxruntime-web (#7543 )	2021-05-02 13:26:07 -07:00
Yulong Wang	97de078c24	[js/node] fix pacakge metadata of onnxruntime-node (#7542 )	2021-05-02 11:17:32 -07:00
Yulong Wang	79dc7d3e50	[js/common] revise TSDoc of some interfaces (#7541 )	2021-05-01 22:20:22 -07:00
Pranav Prakash	8ba6ed953f	Fix batch norm training op on CPU (#6946 ) * Fix batch norm training op on CPU * Add BatchNorm 14 Op Support * Update hashes for BN * Exclude TRT and OpenVINO for BatchNorm training test	2021-05-01 11:25:19 -07:00
Sheil Kumar	94c4c44bfc	Enable Microsoft.Ai.MachineLearning package to work on .NET5 down to 17763 Windows SDK (#7522 ) * upgrade cswinrt and downgrade target framework * fix sdk version references * cswinrt 1.1.0 Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2021-05-01 00:56:36 -07:00
satyajandhyala	9f1e61be92	Check whether nvcc supports -Wstrict-aliasing before adding the flag. (#7509 ) * Check whether nvcc supports -Wstrict-aliasing before adding the compiler flag in CMakeList.txt. * Removed reinterpret_cast to not cause strict aliasing violation errors or require -Wno-strict-aliasing when it is not available.	2021-05-01 00:14:50 -07:00
Changming Sun	00882ce495	Set CMAKE_CUDA_STANDARD to 14 because we are using std::make_unique (#7534 )	2021-04-30 20:20:00 -07:00
Dwayne Robinson	93e93d0851	Merge pull request #7519 from microsoft/user/dwayner/ort1.8dml1.5.1 Update DML EP changes related to DML 1.5.1 for ORT 1.8	2021-04-30 19:25:16 -07:00
Sherlock	668a65f1a7	Complete GetGlobalAveragePoolGradient (#7514 ) * Improve GetGlobalAveragePoolGradient Co-authored-by: Sherlock Huang <bahuang@OrtTrainingDev3.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>	2021-04-30 18:04:01 -07:00
Tim Harris	9c1900866a	Revert ""Sticky" allocation of worker threads (#7372 )" This reverts commit `3d92723d1c`.	2021-04-30 14:39:58 -07:00
Thiago Crepaldi	9ba9da0c95	Fix unused registered buffers issue on ORTModule (#7525 )	2021-04-30 13:50:23 -07:00
Tang, Cheng	54db6648af	kerne invoker api for eager mode (#7473 ) * initial draft for kernel invoke api * initial implementation of kernel invoker * [eager] fix build on Mac * [eager] increment input name in kernel invoker * temp fix for type in eager mode * use global default log manager * rollback the previous commit since it break linux build * Revert "rollback the previous commit since it break linux build" This reverts commit `58c2c3423a`. * Eager Mode: fix linking on macOS * optimizer_execution_frame: ignore unused lambda capture (model_path) * fix link issue * ORTInvoker: set correct input argument tensor element proto types Do not set a type proto on output arguments to allow ORT to deduce them * ORTInvoker: create only one logging manager * Minor fix to set execution provider type correctly. (#7000) Co-authored-by: Chandru Ramakrishnan <chandru-r@github.com> * training fix * support config output ml values in frame, so we can use it to implement inplace update * Fix range loop error while building. (#7087) Co-authored-by: Chandru Ramakrishnan <chandru-r@github.com> * Conditionally link with nsync_cpp if not windows. (#7151) Co-authored-by: Chandru Ramakrishnan <chandru-r@github.com> * Fixed initialization order in ORT kernel invoker (#7342) * Updated constructor of ort_kernel_invoker to take a logger. * Changed linking order. * Updated test. * add inplace ut * add build option * Update include/onnxruntime/core/eager/ort_kernel_invoker.h Co-authored-by: Derek Murray <Derek.Murray@microsoft.com> * resolve comments in pr * fix build break;merge from master * fix build break Co-authored-by: Cheng Tang <chenta@microsoft.com> Co-authored-by: Aaron Bockover <abock@microsoft.com> Co-authored-by: Chandru Ramakrishnan <41447659+chandru-r@users.noreply.github.com> Co-authored-by: Chandru Ramakrishnan <chandru-r@github.com> Co-authored-by: Derek Murray <Derek.Murray@microsoft.com>	2021-04-30 13:33:58 -07:00

1 2 3 4 5 ...

4783 commits