fast-get: fix partition leak for multi-thread usage by tmleman · Pull Request #10606 · thesofproject/sof

tmleman · 2026-03-06T15:08:54Z

Each thread that calls fast_get() on a large buffer gets a partition added to its memory domain. Previously, fast_put() only removed the partition from the allocating thread's domain (entry->thread), leaking partitions for any additional threads that were granted access.

Fix by having each thread remove its OWN partition on fast_put() using k_current_get() instead of entry->thread. The partition removal now happens unconditionally (not just on last reference), ensuring each thread cleans up after itself.

Order of operations:

1. Free buffer if last reference (while partition still grants access)
2. Remove current thread's partition (prevents leaks)
3. Clear entry if last reference
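
The ordering above can be illustrated with a small host-side model. This is only a sketch: `struct model_entry`, `model_get()`, `model_put()`, `model_partition_count()` and `MAX_THREADS` are hypothetical names that do not exist in SOF. A boolean per thread stands in for the partition in that thread's memory domain, and `malloc()`/`free()` stand in for the SRAM allocation.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdlib.h>

#define MAX_THREADS 4	/* hypothetical limit, for the model only */

/* Hypothetical stand-in for a fast-get entry; not the SOF structure. */
struct model_entry {
	void *sram_ptr;				/* cached copy, NULL when unused */
	unsigned int refcount;			/* outstanding model_get() calls */
	bool has_partition[MAX_THREADS];	/* models one partition per thread domain */
};

/* Models fast_get() on a large buffer: take a reference and grant access. */
static void model_get(struct model_entry *e, int thread, size_t size)
{
	if (!e->refcount)
		e->sram_ptr = malloc(size);
	e->refcount++;
	e->has_partition[thread] = true;
}

/* Models the fast_put() ordering described in this PR. */
static void model_put(struct model_entry *e, int thread)
{
	bool last;

	assert(e->refcount > 0);
	last = (--e->refcount == 0);

	/* 1. Free the buffer while the caller's partition still grants access. */
	if (last)
		free(e->sram_ptr);

	/* 2. Remove the calling thread's own partition, on every put. */
	e->has_partition[thread] = false;

	/* 3. Clear the entry once the last reference is gone. */
	if (last)
		e->sram_ptr = NULL;
}

/* Counts the partitions that are still mapped, i.e. potential leaks. */
static int model_partition_count(const struct model_entry *e)
{
	int i, n = 0;

	for (i = 0; i < MAX_THREADS; i++)
		n += e->has_partition[i] ? 1 : 0;
	return n;
}
```

In this model, two threads that each get and then put the buffer leave no partition mapped, which is the leak the PR targets. The model deliberately ignores repeated gets from one thread and threads that share a memory domain.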

Signed-off-by: Tomasz Leman <tomasz.m.leman@intel.com>

Copilot

Pull request overview

This PR aims to fix a userspace memory-domain partition leak when multiple threads call fast_get() on large buffers by ensuring partitions are removed on fast_put() for all participating threads/domains.

Changes:

Adjust fast_put() to free the SRAM buffer when the last reference is released.
Change partition removal logic to remove a large-buffer partition from the current thread’s memory domain (instead of entry->thread), and do so outside the “last ref” block.


Copilot · 2026-03-06T15:14:12Z

zephyr/lib/fast-get.c

+#if CONFIG_USERSPACE
+	if (entry->size > FAST_GET_MAX_COPY_SIZE) {
+		struct k_mem_partition part = {
+			.start = (uintptr_t)entry->sram_ptr,
+			.size = ALIGN_UP(entry->size, CONFIG_MM_DRV_PAGE_SIZE),
+			.attr = K_MEM_PARTITION_P_RO_U_RO | XTENSA_MMU_CACHED_WB,
+		};
+		struct k_mem_domain *domain = k_current_get()->mem_domain_info.mem_domain;
+
+		LOG_DBG("removing partition %p size %#zx from thread %p",
+			(void *)part.start, part.size, k_current_get());
+		int err = k_mem_domain_remove_partition(domain, &part);
+
+		if (err)
+			LOG_WRN("partition removal failed: %d", err);
+	}


Partition removal is now executed on every fast_put() for large buffers, even when entry->refcount remains > 0. This breaks fast_get()/fast_put() reference counting: a thread can call fast_get() multiple times and still expect access after only some puts, but the first put would revoke access by removing the partition while the buffer is still in use. Partition removal needs to be tied to the last reference for the relevant owner (e.g., per-thread/per-domain refcount), not performed unconditionally on each put.
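
One way to read the per-thread refcount suggestion is sketched below as a host-side model. All names (`struct pt_entry`, `pt_get()`, `pt_put()`, `PT_MAX_THREADS`) are hypothetical and are not part of SOF or Zephyr. A boolean stands in for the partition, and the array index stands in for the calling thread.

```c
#include <assert.h>
#include <stdbool.h>

#define PT_MAX_THREADS 4	/* hypothetical limit, for the model only */

/* Hypothetical entry that tracks references per thread. */
struct pt_entry {
	unsigned int refcount;				/* all outstanding gets */
	unsigned int thread_refs[PT_MAX_THREADS];	/* gets held by each thread */
	bool has_partition[PT_MAX_THREADS];		/* models the mapped partition */
};

/* Grant access only on a thread's first reference. */
static void pt_get(struct pt_entry *e, int thread)
{
	e->refcount++;
	if (e->thread_refs[thread]++ == 0)
		e->has_partition[thread] = true;
}

/* Revoke access only when the thread drops its last reference. */
static void pt_put(struct pt_entry *e, int thread)
{
	assert(e->refcount > 0 && e->thread_refs[thread] > 0);

	e->refcount--;
	if (--e->thread_refs[thread] == 0)
		e->has_partition[thread] = false;
}
```

With this bookkeeping, a thread that calls `pt_get()` twice keeps its partition after the first `pt_put()` and loses it only after the second, which is the behaviour the comment asks for.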

Copilot · 2026-03-06T15:14:12Z

zephyr/lib/fast-get.c

+		};
+		struct k_mem_domain *domain = k_current_get()->mem_domain_info.mem_domain;
+
+		LOG_DBG("removing partition %p size %#zx from thread %p",
+			(void *)part.start, part.size, k_current_get());
+		int err = k_mem_domain_remove_partition(domain, &part);
+


k_mem_domain_remove_partition() operates on a memory domain, which can be shared by multiple threads. Removing the partition from k_current_get()->mem_domain_info.mem_domain can therefore revoke access for other threads in the same domain that still hold references (entry->refcount > 0). To avoid use-after-revoke faults, this code should track granted partitions per-domain (and only remove when the last user in that domain releases), or otherwise enforce/document that each fast_get() user has a unique mem_domain.
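
The per-domain tracking proposed here can be sketched the same way. This is a host-side model under stated assumptions: `struct dom_entry`, `dom_get()`, `dom_put()` and the `thread_domain[]` table are hypothetical. In the real code the domain would come from the thread's `mem_domain_info.mem_domain`.

```c
#include <assert.h>
#include <stdbool.h>

#define DOM_MAX_DOMAINS 2	/* hypothetical limit, for the model only */

/* Hypothetical mapping: threads 0 and 1 share domain 0, thread 2 uses domain 1. */
static const int thread_domain[] = { 0, 0, 1 };

/* Hypothetical entry that counts users per memory domain. */
struct dom_entry {
	unsigned int domain_users[DOM_MAX_DOMAINS];	/* references held per domain */
	bool mapped[DOM_MAX_DOMAINS];			/* models the partition in each domain */
};

/* Add the partition when the first user in a domain takes a reference. */
static void dom_get(struct dom_entry *e, int thread)
{
	int d = thread_domain[thread];

	if (e->domain_users[d]++ == 0)
		e->mapped[d] = true;
}

/* Remove the partition only when the last user in that domain releases it. */
static void dom_put(struct dom_entry *e, int thread)
{
	int d = thread_domain[thread];

	assert(e->domain_users[d] > 0);
	if (--e->domain_users[d] == 0)
		e->mapped[d] = false;
}
```

Here a put from thread 0 does not unmap the buffer for thread 1, because both share domain 0 and the count only reaches zero after both have released it.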

Copilot · 2026-03-06T15:14:13Z

zephyr/lib/fast-get.c

+#if CONFIG_USERSPACE
+	if (entry->size > FAST_GET_MAX_COPY_SIZE) {
+		struct k_mem_partition part = {
+			.start = (uintptr_t)entry->sram_ptr,
+			.size = ALIGN_UP(entry->size, CONFIG_MM_DRV_PAGE_SIZE),
+			.attr = K_MEM_PARTITION_P_RO_U_RO | XTENSA_MMU_CACHED_WB,
+		};
+		struct k_mem_domain *domain = k_current_get()->mem_domain_info.mem_domain;
+
+		LOG_DBG("removing partition %p size %#zx from thread %p",
+			(void *)part.start, part.size, k_current_get());
+		int err = k_mem_domain_remove_partition(domain, &part);
+
+		if (err)
+			LOG_WRN("partition removal failed: %d", err);
+	}


This change targets CONFIG_USERSPACE behavior (partition add/remove), but existing fast_get/fast_put unit tests don’t appear to exercise userspace + large-buffer paths (e.g., multi-thread, partition add/remove, and the intended leak prevention). Consider adding a Zephyr ztest that enables CONFIG_USERSPACE and validates that: (1) multiple gets/puts from the same thread keep access until the final put, and (2) multiple threads (potentially in different domains) don’t leak partitions after puts.

tmleman requested review from abonislawski, dbaluta, iuliana-prodan, kv2019i and lyakh as code owners March 6, 2026 15:08

Copilot AI review requested due to automatic review settings March 6, 2026 15:08

Copilot started reviewing on behalf of tmleman March 6, 2026 15:09

tmleman mentioned this pull request Mar 6, 2026

fast-get: fix crash by freeing buffer before removing partition #10585

Merged

tmleman requested review from serhiy-katsyuba-intel, softwarecki and wjablon1 March 6, 2026 15:11

Copilot AI reviewed Mar 6, 2026

serhiy-katsyuba-intel approved these changes Mar 6, 2026

tmleman wants to merge 1 commit into thesofproject:main from tmleman:topic/upstream/pr/fast_get/fix/partition_leak
