onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-20 19:12:24 +00:00

Author	SHA1	Message	Date
Ryan Hill	5da718a114	Formatting	2021-05-17 22:32:11 -07:00
ashbhandare	3c1cf82ff1	Bump to rel-1.9.1 (#7684 )	2021-05-16 23:24:16 -07:00
Ryan Hill	8d911833ef	Merge error	2021-05-16 20:50:27 -07:00
Ryan Hill	9ddbb046a2	Merge related issues and code review feedback.	2021-05-16 19:22:03 -07:00
Ryan Hill	67adb7bfe0	Merge with master	2021-05-16 18:41:57 -07:00
Tracy Sharpe	943ab9dcef	undo clang-format corruption (#7720 )	2021-05-16 15:32:30 -07:00
liqunfu	d604281a86	Liqun/training pkg to run tests (#7662 )	2021-05-16 09:10:57 -07:00
liqunfu	3ead2f2f39	update pt lightning version (#7711 ) Co-authored-by: liqun <liqun@OrtTrainingDev4.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>	2021-05-15 21:46:16 -07:00
Yulong Wang	017b94ab36	Specify correct dependency for CI pipeline of nodejs binding (#7717 )	2021-05-15 08:56:58 -07:00
Yufeng Li	6b0a7905ed	fix quant weight cleanup bug (#7707 )	2021-05-14 22:04:35 -07:00
Sergii Dymchenko	9ba8da65d2	Fix BiasDropoutFusion when there are multiple Dropout consumers. (#7708 ) * Add tests for multiple consimers bias dropout fusion. * Don't fuse resudial Add for multiple Dropout consumers. * Remove duplicate code. * Fix comment typo. Co-authored-by: Derek Murray <Derek.Murray@microsoft.com> Co-authored-by: Derek Murray <Derek.Murray@microsoft.com>	2021-05-14 21:03:23 -07:00
Hariharan Seshadri	53d1d55ea8	Add ability for pre-packed weights of shared initializers to be shared across sessions (#7421 )	2021-05-14 20:44:42 -07:00
Ashwini Khade	c78a40c0c7	add opset14 registrations for cuda provider (#7699 ) * add opset14 registrations for cuda provider * update macros * fix typo * updates to rocm provider	2021-05-14 19:08:45 -07:00
Yulong Wang	97d9bcd644	[js/web] fix bundle for multi-thread, add e2e test and support nodejs (#7688 ) * fix bundle for multi-thread, add e2e test and support nodejs * add copyright banner * resolve comments * add comments for isMultiThreadSupported()	2021-05-14 18:15:38 -07:00
Yufeng Li	a74e41e47d	Add non-zero zp support for quant matmul and attention (#7570 ) * add non-zero zp support * support A and B scale with any dimensions	2021-05-14 16:50:31 -07:00
Yulong Wang	c53b5be509	force multi steps to use the same commit in CI (#7697 )	2021-05-14 15:13:38 -07:00
Zhang Lei	0f7721a019	Fix bug for not checking original float value names when treat it as not existing. (#7695 )	2021-05-14 12:50:30 -07:00
Changming Sun	1d403ba03b	Fix a compile warning in EigenNonBlockingThreadPool.h (#7638 )	2021-05-14 11:38:34 -07:00
Zhang Lei	033f0b3b7c	fix typo. (#7690 )	2021-05-14 10:25:34 -07:00
Sunghoon	a9b47ca8e7	update debug.keystore for a public usage (#7696 )	2021-05-14 09:54:54 -07:00
liqunfu	359fe1d197	Liqun/ort training version (#7620 )	2021-05-14 09:54:19 -07:00
ashbhandare	bfbcc89db1	Add MLFloat16 support for SoftmaxCrossEntropyLoss for CUDA EP (#7679 ) * Forward op changes * Add tests, improve kernel * add opset 13 registration, remove unnecessary changes * Add fp16 grad for SCELoss, review comments	2021-05-14 09:00:27 -07:00
Changming Sun	39fac6d304	Fix a buffer reuse bug in allocation_planner.cc (#7645 )	2021-05-14 07:31:35 -07:00
Ashwini Khade	442c7300eb	add opset14 rnn ops (#7687 ) * add opset14 rnn ops * update kernel hashes	2021-05-14 05:52:54 -07:00
Yufeng Li	3725d0211f	support maxpool QDQ fusion from opset 12 (#7693 )	2021-05-13 23:58:15 -07:00
Zhang Lei	50c5edcf13	Add nhwc support for QLinearAveragePool operator (#7656 ) * Add nhwc support for QLinearAveragePool operator * Update ContribOperators.md * Update OperatorKernels.md with cpu,dnnl and cuda enabled.	2021-05-13 22:05:30 -07:00
baijumeswani	37f69fcee5	Regain performance by caching initializer names in ORTModule (#7685 )	2021-05-13 20:54:49 -07:00
Edward Chen	19704aedbb	Update Objective-C API (#7675 ) - Add session/run configuration - Add additional supported tensor data types - Clean up	2021-05-13 18:47:22 -07:00
ashbhandare	56e993a434	Bump to rel-1.9.1 (#7684 )	2021-05-13 18:41:28 -07:00
Xueyun Zhu	32d8278c2f	reshape fix (#7678 )	2021-05-13 16:34:03 -07:00
raviskolli	4b37901f10	Aten support for rocm (#7680 ) * Aten support for rocm * Removed aten_ops.cc as it is resued from cuda version	2021-05-13 15:56:03 -07:00
Aswin John Mathews	4afdc19958	ROCm optimized layernorm for MI100 (#7682 ) * layernorm optimizations * Changed HIP flag from HIP_VERSION to __HIP_PLATFORM_HCC__	2021-05-13 15:54:06 -07:00
satyajandhyala	d90a99aad5	Fix the build on dev machines by replacing std::tuple with two arguments with std::pair (#7683 )	2021-05-13 15:11:51 -07:00
Hariharan Seshadri	7bb3f243ff	Revert (#7663 )	2021-05-13 14:11:17 -07:00
Sunghoon	1ab8a95eb6	Bind existing SessionOptions and RunOptions in Javascript API with WebAssembly (#7621 ) * support session options and run options. use onnxruntime c api. * fix lint errors * add an error code on throwing an exception * resolve comments. change remaining C++ APIs to C API	2021-05-13 10:50:04 -07:00
Guoyu Wang	333318af04	[CoreML EP] add clip support (#7666 )	2021-05-13 10:41:30 -07:00
Ryan Hill	5f62d4bb3b	Oops, scope DistributedRunContext to just NCCL	2021-05-13 10:34:28 -07:00
Zhang Lei	46246f1bbd	Add nhwc transformer support and unittest for qlinear concat. (#7587 )	2021-05-13 09:50:37 -07:00
satyajandhyala	3a407b40dd	Add the missing source file to the target onnxruntime_test_debug_node… (#7676 ) * Add the missing source file to the target onnxruntime_test_debug_node_inputs_outputs	2021-05-13 09:14:12 -07:00
harshithapv	31ca21b782	Replace Where Grad "Mul" with "Where" (#7672 ) * replace where grad mul with where * clean up * auto formatting * remove not for second input	2021-05-13 08:54:43 -07:00
Ryan Hill	374ae71739	Missing static	2021-05-13 02:28:54 -07:00
Ryan Hill	8b538e7918	Test passed, now with fix	2021-05-13 02:28:49 -07:00
Ryan Hill	0a59bc3902	Test moving DistributedRunContext instance into shared provider layer (with purpose error to verify it's being built properly)	2021-05-13 02:28:44 -07:00
Ryan Hill	741e09a882	Convert aten_op to shared (due to marge with master)	2021-05-12 22:32:51 -07:00
Ryan Hill	86cb70a118	Sync with Master	2021-05-12 21:28:07 -07:00
George Nash	b4e8e9b004	Add DnnlOpManager (#7521 ) * Add DnnlOpManager The DnnlOpManager is able to more accurately check if a node is supported by the DNNLExecutionProvider. The DNNLExecutionProvider::GetCapability function has been updated to use the DnnlOpManager. This commit adds the ability to check if data type, attributes, and tensor dimensions of the node are supported. The IsDimensionSupported function is no longer needed since the checks it was doing have been moved into the individual implementations of the virtual class DnnlNodeCapability. Signed-off-by: George Nash <george.nash@intel.com> * Fix AveragePool entry in the DnnlOpManager Added check for ceil_mode attribute in the PoolNodeCapability check. DnnlExecutionProvider does not support ceil_mode other than the default value. Signed-off-by: George Nash <george.nash@intel.com>	2021-05-12 22:04:26 -05:00
Ryan Hill	06128b9677	Missed a logging reference in mpi_context.h	2021-05-12 19:39:52 -07:00
Ryan Hill	3263241baa	Update comments	2021-05-12 18:38:43 -07:00
Ryan Hill	1a20b6d13e	Add in a couple more missing GetType methods	2021-05-12 18:36:05 -07:00
Ryan Hill	b1924a9323	Remove LOGS statements as a test	2021-05-12 18:36:01 -07:00

1 2 3 4 5 ...

5033 commits