[OVEP] ORT 1.24 Release Patch by ankitm3k · Pull Request #27238 · microsoft/onnxruntime

ankitm3k · 2026-02-04T07:26:04Z

Description

Re-use weight files and their underlying memory maps across shared contexts.

Motivation and Context

This reduces resident memory when different ep shared context sets reference the same weight file.

* Reuse weight files across shared contexts. * fix format

ankitm3k · 2026-02-04T07:26:59Z

@adrianlizarraga & @HectorSVC Please review & merge. FYI @MayureshV1

tianleiwu · 2026-02-04T17:47:15Z

/azp run Linux QNN CI Pipeline, Win_TRT_Minimal_CUDA_Test_CI, Windows ARM64 QNN CI Pipeline, Windows GPU Doc Gen CI Pipeline

azure-pipelines · 2026-02-04T17:47:35Z

Azure Pipelines successfully started running 4 pipeline(s).

MayureshV1

Looks good !

MayureshV1 · 2026-02-04T18:47:31Z

@adrianlizarraga , @yuslepukhin . Can you please review and have this merged.
If there is any opportunity to have it merged in ORT 1.24 or a bug fix release please consider.

### Description Re-use weight files and their underlying memory maps across shared contexts. ### Motivation and Context This reduces resident memory when different ep shared context sets reference the same weight file. Co-authored-by: Eric Crawford <eric.r.crawford@intel.com>

Copilot

Pull request overview

This PR introduces memory optimization for the OpenVINO Execution Provider by implementing a WeightFileManager singleton that enables sharing of weight file instances and their underlying memory maps across multiple SharedContext instances. This reduces memory footprint when different execution provider shared contexts reference the same weight files.

Changes:

Introduced WeightFileManager singleton to globally manage weight file instances
Changed WeightsFile storage from unique_ptr to shared_ptr to enable sharing across contexts
Modified SharedContext constructor to accept bin_path by const reference instead of by value

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

File	Description
onnxruntime/core/providers/openvino/ov_shared_context.h	Added `WeightFileManager` singleton class, updated `SharedContext` to use shared weight files, and adjusted constructor signature
onnxruntime/core/providers/openvino/ov_shared_context.cc	Updated constructor to initialize `weight_file_manager_` and modified weight file acquisition to use the global manager

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-02-04T21:16:21Z

onnxruntime/core/providers/openvino/ov_shared_context.h

+  void
+  LoadTensorFromFile(


The return type and function name are split across two lines, which is inconsistent with the rest of the codebase. All other function declarations in this file (e.g., lines 62, 76-79) keep the return type and function name on the same line. This should be reformatted to match the existing code style.

Suggested change

void

LoadTensorFromFile(

void LoadTensorFromFile(

Copilot · 2026-02-04T21:16:21Z

onnxruntime/core/providers/openvino/ov_shared_context.h

@@ -104,7 +106,9 @@ class SharedContext : public std::enable_shared_from_this<SharedContext> {
    std::map<std::string, MappingContainer> imported_device_tensors_;
  };


The WeightsFile struct lacks thread synchronization for its member access. With the introduction of WeightFileManager, WeightsFile instances are now shared across multiple SharedContext objects (line 128-137). This means multiple threads can concurrently call LoadWeights() and TryGetOrCreateDeviceMapping() on the same WeightsFile instance. The LoadWeights() method performs non-thread-safe operations on the file_ member (seekg and read), and TryGetOrCreateDeviceMapping() modifies imported_device_tensors_ without synchronization. Add a mutex member to WeightsFile and protect all member accesses to prevent race conditions.

### Description Re-use weight files and their underlying memory maps across shared contexts. ### Motivation and Context This reduces resident memory when different ep shared context sets reference the same weight file. Co-authored-by: Eric Crawford <eric.r.crawford@intel.com>

CVS-180418 Reuse weight files across shared contexts (#927)

386a00d

* Reuse weight files across shared contexts. * fix format

tianleiwu added the release:1.24.2 label Feb 4, 2026

MayureshV1 approved these changes Feb 4, 2026

View reviewed changes

tianleiwu requested a review from Copilot February 4, 2026 21:11

Copilot started reviewing on behalf of tianleiwu February 4, 2026 21:11 View session

adrianlizarraga approved these changes Feb 4, 2026

View reviewed changes

adrianlizarraga merged commit 8abbfda into microsoft:main Feb 4, 2026
92 checks passed

Copilot AI reviewed Feb 4, 2026

View reviewed changes

tianleiwu mentioned this pull request Feb 4, 2026

[OVEP] ORT 1.24 Release Patch #27243

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[OVEP] ORT 1.24 Release Patch#27238

[OVEP] ORT 1.24 Release Patch#27238
adrianlizarraga merged 1 commit intomicrosoft:mainfrom
intel:ovep_1.24_patch

ankitm3k commented Feb 4, 2026

Uh oh!

ankitm3k commented Feb 4, 2026

Uh oh!

tianleiwu commented Feb 4, 2026

Uh oh!

azure-pipelines bot commented Feb 4, 2026

Uh oh!

MayureshV1 left a comment

Uh oh!

MayureshV1 commented Feb 4, 2026

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Feb 4, 2026

Uh oh!

Copilot AI Feb 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

		@@ -104,7 +106,9 @@ class SharedContext : public std::enable_shared_from_this<SharedContext> {
		std::map<std::string, MappingContainer> imported_device_tensors_;
		};

Conversation

ankitm3k commented Feb 4, 2026

Description

Motivation and Context

Uh oh!

ankitm3k commented Feb 4, 2026

Uh oh!

tianleiwu commented Feb 4, 2026

Uh oh!

azure-pipelines bot commented Feb 4, 2026

Uh oh!

MayureshV1 left a comment

Choose a reason for hiding this comment

Uh oh!

MayureshV1 commented Feb 4, 2026

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Feb 4, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 4, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants