saymrwulf/onnxruntime: ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-03 03:58:54 +00:00

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

Find a file

George Nash d0b08af37a Implementation of QAttention for the DNNL execution provider (#10004 ) * Add QAttention to DNNL EP Add QAttention to DNNL EP (limited support and disable for gpu) update ONEDNN version to 2.4.4 bug fix in getcapability add memory debug print Signed-off-by: Wang <zhaoyang.wang@intel.com> * Address Code Review + MatMulInteger Fix clean up code and add comments fix matmulinteger and add fusion rule to enable initialized vector weight zero points of 0s update DNNL_TAG to v2.5 Signed-off-by: Wang <zhaoyang.wang@intel.com> * Linux Compile Fix + rollback ONEDNN to 2.4.4 Signed-off-by: Zhaoyang Wang <zhaoyang.wang@intel.com> * Fix QAttention Debug build Signed-off-by: Wang <zhaoyang.wang@intel.com> * Fix QAttention build if USE_DNNL not specified Signed-off-by: George Nash <george.nash@intel.com> Co-authored-by: Wang <zhaoyang.wang@intel.com> Co-authored-by: MTC <63478620+jeyblu@users.noreply.github.com>		2021-12-10 21:50:13 -08:00
.gdn	Merged PR 6524907: Fix merge conflicts from public ORT to WindowsAI ORT	2021-10-01 22:47:52 +00:00
.github	Automate generation of C/C++ API docs (#9997 )	2021-12-10 17:45:50 -08:00
cgmanifests	add copyright (#9943 ) (#9970 )	2021-12-08 14:34:53 -08:00
cmake	Implementation of QAttention for the DNNL execution provider (#10004 )	2021-12-10 21:50:13 -08:00
csharp	Update Xamarin sample code (#9925 )	2021-12-07 16:18:58 +10:00
dockerfiles	Merged PR 6718335: RI 11/30 from github	2021-11-30 21:29:25 +00:00
docs	Automate generation of C/C++ API docs (#9997 )	2021-12-10 17:45:50 -08:00
include/onnxruntime/core	Fix some documentation errors plus ones generating doxygen warnings (#9993 )	2021-12-09 17:42:34 -08:00
java	Support optional type in ORT (#8339 )	2021-11-04 15:01:42 -07:00
js	[js/web] rename build-def.ts to build-def.d.ts (#9954 )	2021-12-09 14:17:42 -08:00
objectivec	Merged PR 6524907: Fix merge conflicts from public ORT to WindowsAI ORT	2021-10-01 22:47:52 +00:00
onnxruntime	Implementation of QAttention for the DNNL execution provider (#10004 )	2021-12-10 21:50:13 -08:00
orttraining	Change assert on a null value to an ort_enforce (#9982 )	2021-12-09 14:44:58 -08:00
package/rpm	Merged PR 6524907: Fix merge conflicts from public ORT to WindowsAI ORT	2021-10-01 22:47:52 +00:00
samples	Add Python checks pipeline (#7032 )	2021-08-09 10:37:05 -07:00
server	Remove redundant inline specifiers, sync server IsLittleEndianOrder with runtime core (#9856 )	2021-11-29 08:32:16 -08:00
tools	Update default opset to 14 in ORTModule (#9743 )	2021-12-09 12:45:35 +01:00
winml	Merge pull request #9917 from microsoft/user/dwayner/FnsCandyTolerance30696168	2021-12-02 22:45:45 -08:00
.clang-format
.clang-tidy
.dockerignore
.flake8	Merged PR 6524907: Fix merge conflicts from public ORT to WindowsAI ORT	2021-10-01 22:47:52 +00:00
.gitattributes
.gitignore	Improve reduced ops and types build (#9908 )	2021-12-07 13:02:05 -08:00
.gitmodules	Remove optional-lite (#9424 )	2021-10-22 16:45:45 -07:00
build.amd64.1411.bat
build.bat
build.sh
CODEOWNERS	Update ORTTraiing frontend codeowner (#9427 )	2021-10-18 23:56:21 -07:00
CONTRIBUTING.md	Merged PR 6524907: Fix merge conflicts from public ORT to WindowsAI ORT	2021-10-01 22:47:52 +00:00
LICENSE	Remove year from license (#6658 )	2021-02-12 00:25:56 -08:00
NuGet.config	Delete nuget extra configs (#6477 )	2021-01-27 20:25:45 -08:00
ort.wprp
packages.config	Merged PR 6718335: RI 11/30 from github	2021-11-30 21:29:25 +00:00
README.md	Merged PR 6524907: Fix merge conflicts from public ORT to WindowsAI ORT	2021-10-01 22:47:52 +00:00
requirements-dev.txt	Add post-install command to build PyTorch CPP extensions from within onnxruntime package (#8027 )	2021-06-28 18:11:58 -07:00
requirements-doc.txt	Add auto doc gen for ORTModule API during CI build (#7046 )	2021-03-22 10:20:33 -07:00
requirements-training.txt	Add post-install command to build PyTorch CPP extensions from within onnxruntime package (#8027 )	2021-06-28 18:11:58 -07:00
requirements.txt.in	Chang how numpy version is handled. (#8130 )	2021-06-23 14:08:37 -07:00
setup.py	Integrate TensorRT into GPU Python package (#9785 )	2021-11-18 13:26:51 -08:00
ThirdPartyNotices.txt	add copyright (#9943 ) (#9970 )	2021-12-08 14:34:53 -08:00
VERSION_NUMBER	Merged PR 6524907: Fix merge conflicts from public ORT to WindowsAI ORT	2021-10-01 22:47:52 +00:00

README.md

ONNX Runtime is a cross-platform inference and training machine-learning accelerator.

ONNX Runtime inference can enable faster customer experiences and lower costs, supporting models from deep learning frameworks such as PyTorch and TensorFlow/Keras as well as classical machine learning libraries such as scikit-learn, LightGBM, XGBoost, etc. ONNX Runtime is compatible with different hardware, drivers, and operating systems, and provides optimal performance by leveraging hardware accelerators where applicable alongside graph optimizations and transforms. Learn more →

ONNX Runtime training can accelerate the model training time on multi-node NVIDIA GPUs for transformer models with a one-line addition for existing PyTorch training scripts. Learn more →

Get Started

General Information: onnxruntime.ai

Usage documention and tutorials: onnxruntime.ai/docs

Companion sample repositories:

ONNX Runtime Inferencing: microsoft/onnxruntime-inference-examples
ONNX Runtime Training: microsoft/onnxruntime-training-examples

Build Pipeline Status

System	CPU	GPU	EPs
Windows
Linux
Mac
Android
iOS
WebAssembly

Data/Telemetry

Windows distributions of this project may collect usage data and send it to Microsoft to help improve our products and services. See the privacy statement for more details.

Contributions and Feedback

We welcome contributions! Please see the contribution guidelines.

For feature requests or bug reports, please file a GitHub Issue.

For general discussion or questions, please use GitHub Discussions.

Code of Conduct

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact opencode@microsoft.com with any additional questions or comments.

License

This project is licensed under the MIT License.