onnxruntime/cmake
Ankit Maheshkar a6ea57b8f3
OpenVINO EP Weights Sharing Feature (#23553)
### Description
These changes are done to ensure that weight sharing happens between two model using session context option ep_weight_sharing.

Key changes introduced in this feature are:

Creating a shared context between two models Extracting external constant initializers and re labelling them back as
inputs to the model to allow weight loading in the direct blob. Creating EP Context Nodes when Subgraph partitioning is happening.

### Motivation and Context
This change was required to ensure that LLM with prefill and kvcache models can use the same share
The change was also required to ensure EP Context nodes can be formed even when model is being subgraph partitioned.

---------

Co-authored-by: jatinwadhwa921 <jatin.wadhwa@intel.com>
Co-authored-by: jatinwadhwa921 <110383850+jatinwadhwa921@users.noreply.github.com>
Co-authored-by: saurabh <saurabh1.kale@intel.com>
Co-authored-by: TejalKhade28 <tejal.khade@intel.com>
Co-authored-by: sfatimar <sahar.fatima@intel.com>
Co-authored-by: Javier E. Martinez <javier.e.martinez@intel.com>
Co-authored-by: Preetha Veeramalai <preetha.veeramalai@intel.com>
Co-authored-by: Eric Crawford <eric.r.crawford@intel.com>
2025-02-06 14:57:38 -08:00
..
external Enable VCPKG in more pipelines (#23590) 2025-02-06 10:10:31 -08:00
patches remove log spam from cpuinfo (#23548) 2025-01-31 18:16:24 -08:00
tensorboard
vcpkg-ports Enable VCPKG in CI build (#23426) 2025-02-05 10:58:53 -08:00
vcpkg-triplets Enable VCPKG in CI build (#23426) 2025-02-05 10:58:53 -08:00
adjust_global_compile_flags.cmake Update ORT extension to the latest (#23314) 2025-01-13 18:59:42 -08:00
arm64x.cmake Dev/mookerem/arm64x update (#20536) 2024-05-07 12:50:38 -07:00
CMakeLists.txt Enable VCPKG in CI build (#23426) 2025-02-05 10:58:53 -08:00
CMakePresets.json Correct ONNX and Protobuf version in vcpkg build (#23285) 2025-01-08 12:25:17 -08:00
CMakeSettings.json
codeconv.runsettings
deps.txt [TensorRT EP] support TensorRT 10.8-GA (#23592) 2025-02-06 10:05:57 -08:00
EnableVisualStudioCodeAnalysis.props
gdk_toolchain.cmake
hip_fatbin_insert [MIGraphX EP/ ROCm EP] add gfx1200, gfx1201 to CMAKE_HIP_ARCHITECTURES (#22348) 2024-10-11 17:31:36 -07:00
Info.plist.in
libonnxruntime.pc.cmake.in
linux_arm32_crosscompile_toolchain.cmake Add a build validation for Linux ARM64 cross-compile (#18200) 2023-11-08 13:03:18 -08:00
linux_arm64_crosscompile_toolchain.cmake Add a build validation for Linux ARM64 cross-compile (#18200) 2023-11-08 13:03:18 -08:00
maccatalyst_prepare_objects_for_prelink.py Support xcframework for mac catalyst builds. (#19534) 2024-03-20 10:55:19 -07:00
nuget_helpers.cmake Update nuget.exe used in WindowsAI nuget packaging so readme property is supported. (#22141) 2024-09-19 19:06:47 +10:00
onnxruntime.cmake [QNN EP] Make QNN EP a shared library (#23120) 2025-01-22 12:11:00 -08:00
onnxruntime_common.cmake Enable QNN HTP support for Node (#20576) 2024-05-09 13:11:07 -07:00
onnxruntime_compile_triton_kernel.cmake [CUDA] Add SparseAttention operator for Phi-3-small (#20216) 2024-04-30 09:06:29 -07:00
onnxruntime_config.h.in Pre-requisites of upgrading EMSDK (#23347) 2025-01-14 11:07:21 -08:00
onnxruntime_csharp.cmake Cleanup code (#22827) 2024-11-19 14:13:33 -08:00
onnxruntime_flatbuffers.cmake
onnxruntime_framework.cmake Enable VCPKG in more pipelines (#23590) 2025-02-06 10:10:31 -08:00
onnxruntime_framework.natvis
onnxruntime_fuzz_test.cmake [Fuzzer] Add two new ORT libfuzzer (Linux clang support for now) (#22055) 2024-09-12 11:50:34 -07:00
onnxruntime_graph.cmake [Apple framework] Fix minimal build with training enabled. (#19858) 2024-03-12 11:33:30 -07:00
onnxruntime_ios.toolchain.cmake Support visionos build (#20365) 2024-04-23 18:15:07 -07:00
onnxruntime_java.cmake [QNN EP] Make QNN EP a shared library (#23120) 2025-01-22 12:11:00 -08:00
onnxruntime_java_unittests.cmake [Java] Add API for appending QNN EP (#22208) 2024-10-01 10:18:04 -07:00
onnxruntime_kernel_explorer.cmake [ROCm] prefer hip interfaces over roc during hipify (#22394) 2024-10-14 20:34:03 -07:00
onnxruntime_lora.cmake Multi-Lora support (#22046) 2024-09-30 15:59:07 -07:00
onnxruntime_mlas.cmake [ARM CPU] hgemm optimized for gqa (#23107) 2025-01-24 15:25:24 -08:00
onnxruntime_nodejs.cmake Fix delay load for WebGPU EP and DML EP (#23111) 2024-12-19 10:23:48 -08:00
onnxruntime_objectivec.cmake Initial WebGPU EP checkin (#22318) 2024-10-08 16:10:46 -07:00
onnxruntime_opschema_lib.cmake
onnxruntime_optimizer.cmake Update Linux docker images (#23244) 2025-01-09 10:20:33 -08:00
onnxruntime_providers.cmake [QNN EP] Make QNN EP a shared library (#23120) 2025-01-22 12:11:00 -08:00
onnxruntime_providers_acl.cmake Split onnxruntime_providers.cmake to multiple (#17853) 2023-10-09 20:33:44 -07:00
onnxruntime_providers_armnn.cmake Split onnxruntime_providers.cmake to multiple (#17853) 2023-10-09 20:33:44 -07:00
onnxruntime_providers_azure.cmake Split onnxruntime_providers.cmake to multiple (#17853) 2023-10-09 20:33:44 -07:00
onnxruntime_providers_cann.cmake Remove nsync (#20413) 2024-10-21 15:32:14 -07:00
onnxruntime_providers_coreml.cmake Enable VCPKG in CI build (#23426) 2025-02-05 10:58:53 -08:00
onnxruntime_providers_cpu.cmake Enable VCPKG in more pipelines (#23590) 2025-02-06 10:10:31 -08:00
onnxruntime_providers_cuda.cmake Cleanup code (#22827) 2024-11-19 14:13:33 -08:00
onnxruntime_providers_dml.cmake Refactor the cmake code that is related to delay loading (#22646) 2024-11-04 16:30:50 -08:00
onnxruntime_providers_dnnl.cmake Remove nsync (#20413) 2024-10-21 15:32:14 -07:00
onnxruntime_providers_js.cmake Split onnxruntime_providers.cmake to multiple (#17853) 2023-10-09 20:33:44 -07:00
onnxruntime_providers_migraphx.cmake Remove nsync (#20413) 2024-10-21 15:32:14 -07:00
onnxruntime_providers_nnapi.cmake Make partitioning utils QDQ aware so it does not break up QDQ node units (#19723) 2024-03-12 10:55:49 +10:00
onnxruntime_providers_openvino.cmake OpenVINO EP Weights Sharing Feature (#23553) 2025-02-06 14:57:38 -08:00
onnxruntime_providers_qnn.cmake [QNN EP] Make QNN EP a shared library (#23120) 2025-01-22 12:11:00 -08:00
onnxruntime_providers_rknpu.cmake Split onnxruntime_providers.cmake to multiple (#17853) 2023-10-09 20:33:44 -07:00
onnxruntime_providers_rocm.cmake Remove nsync (#20413) 2024-10-21 15:32:14 -07:00
onnxruntime_providers_tensorrt.cmake Enable VCPKG in CI build (#23426) 2025-02-05 10:58:53 -08:00
onnxruntime_providers_vitisai.cmake [VitisAI] Cache node subgraph when necessary (#22073) 2024-11-08 23:17:16 -08:00
onnxruntime_providers_vsinpu.cmake Remove nsync (#20413) 2024-10-21 15:32:14 -07:00
onnxruntime_providers_webgpu.cmake [WebGPU] allow build WebGPU EP for WebAssembly (#23364) 2025-01-16 10:52:17 -08:00
onnxruntime_providers_webnn.cmake Split onnxruntime_providers.cmake to multiple (#17853) 2023-10-09 20:33:44 -07:00
onnxruntime_providers_xnnpack.cmake Make partitioning utils QDQ aware so it does not break up QDQ node units (#19723) 2024-03-12 10:55:49 +10:00
onnxruntime_python.cmake Enable VCPKG in more pipelines (#23590) 2025-02-06 10:10:31 -08:00
onnxruntime_rocm_hipify.cmake [ROCm] redo hipify of version controlled files (#22449) 2024-10-18 12:40:54 -07:00
onnxruntime_session.cmake Multi-Lora support (#22046) 2024-09-30 15:59:07 -07:00
onnxruntime_snpe_provider.cmake
onnxruntime_training.cmake Multi-Lora support (#22046) 2024-09-30 15:59:07 -07:00
onnxruntime_unittests.cmake Enable VCPKG in CI build (#23426) 2025-02-05 10:58:53 -08:00
onnxruntime_util.cmake
onnxruntime_visionos.toolchain.cmake Support visionos build (#20365) 2024-04-23 18:15:07 -07:00
onnxruntime_webassembly.cmake [WebGPU] allow build WebGPU EP for WebAssembly (#23364) 2025-01-16 10:52:17 -08:00
precompiled_header.cmake
riscv64.toolchain.cmake Enable RISC-V 64-bit Cross-Compiling Support for ONNX Runtime on Linux (#19238) 2024-01-24 16:27:05 -08:00
Sdl.ruleset
set_winapi_family_desktop.h
target_delayload.cmake Refactor the cmake code that is related to delay loading (#22646) 2024-11-04 16:30:50 -08:00
uwp_stubs.h
vcpkg-configuration.json Enable VCPKG in CI build (#23426) 2025-02-05 10:58:53 -08:00
vcpkg.json Enable VCPKG in CI build (#23426) 2025-02-05 10:58:53 -08:00
wcos_rules_override.cmake Stop using apiset in OneCore build: use onecoreuap.lib instead of onecoreuap_apiset.lib (#19632) 2024-02-23 22:31:57 -08:00
winml.cmake Fix a tiny problem in winml.cmake (#23173) 2024-12-20 11:48:43 -08:00
winml_cppwinrt.cmake
winml_sdk_helpers.cmake
winml_unittests.cmake Multi-Lora support (#22046) 2024-09-30 15:59:07 -07:00