onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-02 03:55:34 +00:00

Author	SHA1	Message	Date
Changming Sun	fa27c19342	Delete create_nuspect.py and template.nuspec	2021-08-30 09:34:26 -07:00
Changming Sun	1b5909dea8	Delete download_cmake.py (#8885 )	2021-08-30 09:34:08 -07:00
liqun Fu	c8dd0bf37e	to publish stable wheel to ort channel (#8873 ) Co-authored-by: liqun <liqun@OrtTrainingDev4.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>	2021-08-30 09:33:01 -07:00
pengwa	36fa0de8b7	fix regression and enable custom autograd func tests in CIs (#8868 ) * fix regression and enable tests in CIs * Update orttraining/orttraining/python/training/ortmodule/_custom_autograd_function.py Co-authored-by: Wei-Sheng Chin <wschin@outlook.com> * fix Co-authored-by: Wei-Sheng Chin <wschin@outlook.com>	2021-08-30 09:34:18 +08:00
Sherlock	6e20eb7eb3	Stop gradient for Multinomial, RandomNormalLike, RandomUniformLike and EyeLike (#8836 )	2021-08-28 16:21:34 -07:00
baijumeswani	df9438192a	Re-introduce saving of optimized onnx model (#8860 ) * Re-introduce saving of optimized onnx model	2021-08-28 14:27:25 -07:00
satyajandhyala	31926176ac	Support external custom operator schemas on Ubuntu (#8807 ) * Expose symbols in onnx and protobuf namespaces in python when building with --enable_external_custom_op_schemas * Add external onnx and protobuf files to wheel * Added an example to demonstrate external custom ops use-case * Added a Linux build pipeline to test external custom ops	2021-08-28 11:05:21 -07:00
Zuwei Zhao	89e8bff121	Enable selecting custom ops in onnxruntime-extensions. (#8826 ) * Enable selecting custom ops in onnxruntime-extensions. * Move cmake_helper.py. * Remove over-indented spaces. * Add doc. * Remove onnxruntime-extensions from git submodules, and user should pass path of onnxruntime-extensions for build. * Modify doc. * Remove argument --enable_onnxruntime_extensions and use --onnxruntime_extensions_path. * Fix build error. * Fix build error. * Use onnxruntime_extensions_path. * support both submodule and external source folders * refinement * Update cgmanifest.json * Support building onnxruntime-extensions from either git submodule or pre-pulled path. * Update doc. * more standard name * update docs * add the copyright header Co-authored-by: Zuwei Zhao <zuzhao@microsoft.com> Co-authored-by: Wenbing Li <wenbingl@outlook.com> Co-authored-by: Wenbing Li <10278425+wenbingl@users.noreply.github.com>	2021-08-27 21:45:52 -07:00
Tianlei Wu	6ea9324f82	fix EmbedLayerNormalization shape inference (#8876 )	2021-08-27 19:18:45 -07:00
Tang, Cheng	ae7f2d824d	Share the execution provider instance for training (#8719 ) * seperate the training python module; share the execution proivder instance * fix build break * fix cuda test crash; reorg the python module code base * se correct env * use provider customized hash func * fixbuild break * fix rocm break * use const ref in argument * rename the file * move hash func to trainiing module	2021-08-27 16:23:35 -07:00
Guoyu Wang	6a1939252f	Fix Android java API failure (#8865 ) * Fix Android Package break * Without java fix -- pipeline should fail * With java fix, should pass now * address CR comments	2021-08-27 15:58:56 -07:00
Tianlei Wu	615df42b46	Add force_fp16_initializers in convert_float_to_float16 (#8871 )	2021-08-27 14:35:38 -07:00
Scott McKay	0034ad72e6	Minimize changes to fix missing symbols used from C# (#8867 ) * Revert "Cleanup C# bindings to add EP (#8810)" This reverts commit `b21ea00020`. * Add back in a minimal set of changes. Provide stubs in for a limited set of things - things called from C# using a static lib of ORT built for mac/ios - things in OrtApis that are not included in the build by default - things in OrtApis that are excluded in a minimal build * Cleanup order or EPs in test * Fix unused function in ROCM build	2021-08-28 07:10:14 +10:00
Dmitri Smirnov	f3083f4bf3	Support of sparse initializers with smaller indices data type (#8834 ) Support of sparse initializers with smaller indices data type to save space. Make the script more efficient by selecting indices data type and checking resulting sparse bytes Exclude new code from SPARSE_TENSORS	2021-08-27 14:02:48 -07:00
Sheil Kumar	775f862067	Add new option to disable cpu sync for tensors (#8490 ) * add options to disable cpu copy back * null check proprties * only affect gpu outputs * change name to disabletensorcpusync * slight refactoring * Globally enable ms-experimental ops * change meaning of ms_experimental to mean all ms_experimental ops. Some experimental ops will still be enabled globally without this flag like audio ops. * remove changes incorrectly merged * bad merge * add test Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2021-08-27 13:29:52 -07:00
Chi Lo	6a477acecf	Add tensorrt_provider_factory.h to artifact (#8869 )	2021-08-27 09:09:54 -07:00
Edward Chen	7e53a1df6f	Enable selector action transformer infrastructure in minimal build. (#8804 )	2021-08-27 17:16:05 +10:00
Rachel Guo	1886f1a737	Make SparseTensor infrastructure optional (#8802 ) Add cmake parameter and #ifdefs to allow for disabling sparse tensor support. This comes with a significant binary size cost so we want to be able to exclude it in a minimal build.	2021-08-27 17:12:26 +10:00
Tianlei Wu	cb59f46e04	Add gpt2 mixed precision conversion and parity tools (#8845 )	2021-08-26 15:34:45 -07:00
Yulong Wang	e8564d6597	[js/web] update emsdk to v2.0.26 (#8653 ) * update emsdk to v2.0.26 * fix pooling build warning * fix build break * use pragma diagnostic semantic only when __GNUC__ is defined * fix build break * disable AttentionPastState_dynamic	2021-08-26 15:31:34 -07:00
Sunghoon	a16c681103	[js/web] Prepare to integrate ONNX Runtime Web CI with BrowserStack (#8843 ) * Integrate BrowserStack with ONNX Runtime Web CI pipeline * Change to Linux command for BrowserStack CI * Set preferTriggeringPipeline as true * Fix a commit fetching script * Remove wasm binary download from the latest build * Use release build of WebAssembly * Disable check-out of commit for testing * Use commit of WebAssembly build CI pipeline * Need to issue two PRs to prevent build failure	2021-08-26 11:57:31 -07:00
Chi Lo	eb8f84e2a2	Fix issue of GPU tarball/zip/java package (#8850 ) * modify for test * modify for test * modify for test * modify for test * modify for test * modify for test * prepare for PR * Rename cuda directory to gpu directory in tarball * Fix gpu java package * fix bug * fix small bug	2021-08-26 10:16:16 -07:00
Edward Chen	0cfc4ec09d	[Objective-C] Enable static analysis (#8842 ) Add Objective-C API static analysis pipeline.	2021-08-26 09:13:52 -07:00
Sherlock	c325207f7a	Optimize MatmulGrad (#8846 ) Optimize two special cases of MatmulGrad using FusedMatMul.	2021-08-25 23:36:40 -07:00
Changming Sun	ced2d8e597	Clean up TRT docker files (#8847 )	2021-08-25 22:26:31 -07:00
Changming Sun	9cd7d836f7	Delete Dockerfile.ubuntu_for_android (#8848 )	2021-08-25 22:25:14 -07:00
Scott McKay	b21ea00020	Cleanup C# bindings to add EP (#8810 ) Fix C# add EP bindings. Add stubs to ORT so that if EP is not included in the build we return a graceful error message. Move declaration of stubs into C API and out for EP so they're in one place and are easier to use (no extra header required in the C/C++ world and consistent with the CUDA EP setup). Fix inconsistency in ROCM EP. Cleanup a few other things.	2021-08-26 13:59:40 +10:00
Guoyu Wang	613a600471	relax android ci timeout to 180 minutes (#8844 )	2021-08-25 19:59:48 -07:00
Chi Lo	32ecbf4691	Create combined GPU tarball and zip file package (#8827 ) * Add onnxruntime_providers_shared.dll into gpu nuget package * Modify for test * Temporarily remove for test * Modify for test * Modify for test * Test packging Windows combined GPU * Test packging Windows combined GPU * Test packging Windows combined GPU * Test packging Windows combined GPU * modify for test * modify for test * fix bug * Modify for test * Modify for test * Modify for test * Modify for test * Modify for test * Modify for test * Modify for test * Modify for test * Prepare for PR * Prepare for PR * Code refactor * Rename proper Artifact name * Rename intermediate Artifact names * Revert Artifact Names * Rename Artifact Names * Modify Artifact name * Modify Artifact name * Modify Artifact name * Update Java package * Update Java package * fix bug to change artifact name * Fix bug for the wrong file path * Fix no fetching correct artifact and test * temporarily modify for test * undo the change for test	2021-08-25 13:51:18 -07:00
Hariharan Seshadri	cee79526fd	Add opset 15 kernels for Pow, BatchNorm, and Shape (#8442 )	2021-08-25 12:04:20 -07:00
Rajalakshmi Srinivasaraghavan	33a97e995b	POWER: Fix compilation issues with clang This patch fixes some compilation errors when using clang11 on POWER processors.	2021-08-25 11:40:29 -07:00
Sherlock	73fe7bfa0f	Add ATenOp at::diagonal (#8838 ) * Register at::diagonal for ATenOp	2021-08-25 09:45:53 -07:00
Tianlei Wu	237076a660	Add option to disable FastGelu half2 cuda kernel (#8819 ) Allow FastGelu half2 kernel to build without --cmake_extra_defines CMAKE_CUDA_ARCHITECTURES=xx Add environment variable ORT_TRANSFORMER_OPTIONS=4 to disable half2 FastGelu kernel for testing purpose Test parity of FastGelu operator with fp16 inputs.	2021-08-25 08:37:41 -07:00
Chandru Ramakrishnan	98ed235fc7	Removed MSNPU code from eager. (#8832 )	2021-08-25 09:40:25 -04:00
ashari4	4251e04eae	Removed assert (#8779 )	2021-08-24 20:26:08 -07:00
Ye Wang	56b37e55e5	Add new transformers model type: Bart (#8698 ) * update * bart-base encoder attention fusion * update * update * update * update * update * yapf * review comments	2021-08-24 18:13:46 -07:00
Changming Sun	3837027506	Remove pyopenssl from installation (#8830 )	2021-08-24 17:07:22 -07:00
KeDengMS	ddd4586a2f	[Symbolic Shape Infer] add more ops for auto merge (#8824 ) As Less/Equal/Greater/LessOrEqual/GreaterOrEqual ops can broadcast	2021-08-24 16:33:23 -07:00
ashari4	7f1e880649	Reorder ORT eager headers (#8813 )	2021-08-24 14:48:43 -07:00
Guoyu Wang	8992e31c85	Move iOS package from framework to xcframework (#8805 ) * additional changes * test package run * minor fix * minor fix * minor fix * Get around no arm64 simulator * fix objc pod build failure * downgrade_eigen * update objc podspec template	2021-08-24 13:38:14 -07:00
Yufeng Li	e25986781f	Fallback to default quantization if quantization params is not found (#8788 )	2021-08-24 11:20:19 -07:00
Hariharan Seshadri	17b0664e34	Optimize sequence type usage on CUDA [2/n] (#8720 )	2021-08-24 10:40:28 -07:00
Jorn Tuyls	9053e1522d	Check for Python_EXECUTABLE in pyxir.cmake to fix Vitis AI EP build (#8631 ) Co-authored-by: Jorn Tuyls <jornt.tuyls@gmail.com>	2021-08-24 08:39:50 -07:00
Changming Sun	4bfff45859	Downgrade Eigen (#8817 )	2021-08-23 18:06:23 -07:00
Chandru Ramakrishnan	2693af9799	Ported changes / bug fixes from torch/ort. (#8784 ) * Ported changes / bug fixes from torch/ort. * Fixed formatting * Renamed function * Renamed module_ to module. * Revert "Renamed module_ to module." This reverts commit b17fc114b3db20d174283811d90592b5b8154c19. * Include pybind common header to fix linker errors on windows debug. * Fix to generation of > 1 custom op. Co-authored-by: Ashwin Hari <ashari@microsoft.com>	2021-08-23 17:45:40 -04:00
Chandru Ramakrishnan	f51f2bad66	Fix for doxygen doc errors. (#8814 )	2021-08-23 15:52:15 -04:00
Tiago Koji Castro Shibata	62c0d24340	Fix Windows Store build (#8753 ) * Remove APIs unavailable in Store in #8349, #8178, #8065 * Add UWP stubs of C runtime functions * Remove UWP incompatible tests from UWP build * Remove incompatible tests from Store * Use UWP stubs in store only * Skip partition check outside of Windows * Remove unused WRL include * Workaround Windows header not including what it uses * Fix precompiled header name clash * Workaround SDK bugs * DXCore workaround in Win7 * Fix warning * Fix more warnings * Bump WinML to target Windows 8 * Fix more warnings * Remove unnecessary workarounds * Remove Desktop only APIs from DML adapter	2021-08-23 11:19:03 -07:00
Edward Chen	ea68955c71	Add more info to kernel registry manager hash lookup error message. (#8801 )	2021-08-23 11:09:30 -07:00
George Nash	d4a88cfe3f	Add Gemm op to DNNL Exectution provider (#8799 ) * Implement Gemm op for DNNL execution provider Signed-off-by: George Nash <george.nash@intel.com> * Remove KernelRegistry and Gemm op for dnnl ep The KernelRegistry for the dnnl execution provider only registered a Gemm op that as best we can tell was never actually used and also was not using the dnnl library. We have implemented a Gemm op in the DNNL execution provider subgraph code and thus are removing the unused Gemm op that was in the dnnl KernelRegistry. Signed-off-by: George Nash <george.nash@intel.com> * Fix duplicated output and kernelshape inference fix getcapability to make sure subgraph outputs do not have duplicates fix kernelshape inference in pool Signed-off-by: Wang <zhaoyang.wang@intel.com> * Removed most dnnl specialized ifdefs from gradient_ops_test code Re-enable GlobalAveragePoolGrad test for dnnl ep The bugs that were exposed by the GlobalAveragePoolGrad test have been fixed and this test no longer needs to be disabled for DNNL. Removed the ReluGradDnnl test. We are getting the testing from the already existing ReluGrad test. MaxPoolGrad test no longer has specialized execution provider enabling for DNNL execution provider. It will now run without the extra enabling. ConvGrad is the only test that still has dnnl specialized ifdefs However, the ConvGrad code was not being executed by the code unless it was listed first in the list of execution providers. Signed-off-by: George Nash <george.nash@intel.com> * Fix transpose issue on Gemm On transposing square matrices, getmemoryandreshape will fail to reshape fix by adding a bool Signed-off-by: Wang <zhaoyang.wang@intel.com> * Save memory space by reusing internal tensor for output The intermediat matmul output tensor can be used as the output tensor for the binary calculation. Remove the unused IsAttributeSupported from the DnnlGemmNodeCapability class since we now support all of the Gemm attributes in our implementation. Signed-off-by: George Nash <george.nash@intel.com> Co-authored-by: Wang <zhaoyang.wang@intel.com>	2021-08-23 08:45:34 -07:00
Guoyu Wang	89656bb712	[CoreML/NNAPI EPs] Move direct use of initializer data to unpacked tensor data (#8780 )	2021-08-21 14:58:41 -07:00

5446 commits