Fixes memory leak when the job crashes before it's freed by sarthakaggarwal97 · Pull Request #3178 · valkey-io/valkey

sarthakaggarwal97 · 2026-02-09T17:50:47Z

If there is a crash between the time the job is popped and freed, we technically leak memory. This change allows us to peek, and pop just before we are about to free the job.

Fixes valgrind errors: https://github.com/valkey-io/valkey/actions/runs/21969557125/job/63467641572#step:6:8648

codecov · 2026-02-09T18:09:40Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 0.00%. Comparing base (fd57c21) to head (971258f).
⚠️ Report is 10 commits behind head on unstable.

Additional details and impacted files

@@       Coverage Diff        @@
##   unstable   #3178   +/-   ##
================================
================================

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

sarthakaggarwal97 · 2026-02-09T21:06:12Z

@madolson looks like valgrind is green with this PR: https://github.com/sarthakaggarwal97/valkey/actions/runs/21770227707

Do you have any feedbacks on this change on a high level? I know we discussed that this is a bit inefficient.

src/bio.c

sarthakaggarwal97 · 2026-02-11T23:05:10Z

@madolson valgrind run is green: https://github.com/sarthakaggarwal97/valkey/actions/runs/21887850029/job/63288991006

Signed-off-by: Sarthak Aggarwal <[email protected]>

Nikhil-Manglore

LGTM

src/unit/test_mutexqueue.c

src/unit/test_files.h

Signed-off-by: Madelyn Olson <[email protected]>

src/unit/test_files.h

Signed-off-by: Madelyn Olson <[email protected]>

) If there is a crash between the time the job is popped and freed, we technically leak memory. This change allows us to peek, and pop just before we are about to free the job. Fixes valgrind errors: https://github.com/valkey-io/valkey/actions/runs/21969557125/job/63467641572#step:6:8648 --------- Signed-off-by: Sarthak Aggarwal <[email protected]>

JimB123 · 2026-02-23T18:43:02Z

@madolson Good to see that this change resolves the test issue.

However, I'm not keen on the API change. It's common/typical for a mutex queue to support multiple readers. By adding a peek API, it becomes confusing for this common case. A second reader can pop the item that was peeked - or 2 readers can peek the same thing.

I'll look into alternatives.

madolson · 2026-02-24T05:47:27Z

However, I'm not keen on the API change. It's common/typical for a mutex queue to support multiple readers. By adding a peek API, it becomes confusing for this common case. A second reader can pop the item that was peeked - or 2 readers can peek the same thing.

Another thing we considered was to have a global/static pointer which keeps track of the per-thread objects. So even when the thread is killed, there is still a global reference so valgrind is happy.

I am indifferent to the new API, but we needed a fix for the valgrind issue.

…3256) ## Problem The peek-then-pop pattern introduced in #3178 solved the valgrind false-positive leak report (https://github.com/valkey-io/valkey/actions/runs/21969557125/job/63467641572#step:6:8648) for in-flight BIO jobs, but `peek` is a problematic API on a mutexqueue: multiple readers can peek the same item, and one reader can pop what another peeked. ## Fix Instead, store the in-flight job pointer in `bio_worker_data.current_job` before processing and clear it after freeing. Since `bio_workers[]` is a static array, valgrind can always trace from global memory to the job allocation, even after the worker thread is cancelled. Removed `mutexQueuePeek` from mutexqueue.{c,h} and its test. ## Test Manually run Daily workflow: All four valgrind jobs are green: `test-valgrind-test` passed, `test-valgrind-misc` passed, `test-valgrind-no-malloc-usable-size-test` passed, `test-valgrind-no-malloc-usable-size-misc` passed. Signed-off-by: Alina Liu <[email protected]>

github-actions bot assigned sarthakaggarwal97 Feb 9, 2026

sarthakaggarwal97 added the run-extra-tests Run extra tests on this PR (Runs all tests from daily except valgrind and RESP) label Feb 9, 2026

github-actions bot removed the run-extra-tests Run extra tests on this PR (Runs all tests from daily except valgrind and RESP) label Feb 9, 2026

sarthakaggarwal97 requested a review from madolson February 10, 2026 21:18

madolson reviewed Feb 11, 2026

View reviewed changes

src/bio.c Outdated Show resolved Hide resolved

src/bio.c Outdated Show resolved Hide resolved

sarthakaggarwal97 marked this pull request as ready for review February 11, 2026 23:04

sarthakaggarwal97 added 5 commits February 13, 2026 14:29

adds support for peek

31a6d1f

Signed-off-by: Sarthak Aggarwal <[email protected]>

some API fixes

37c07bb

Signed-off-by: Sarthak Aggarwal <[email protected]>

some API fixes

75f5005

Signed-off-by: Sarthak Aggarwal <[email protected]>

remove extra api

31987f2

Signed-off-by: Sarthak Aggarwal <[email protected]>

test file update

b992da4

Signed-off-by: Sarthak Aggarwal <[email protected]>

sarthakaggarwal97 force-pushed the fix-valgrind-error branch from 94d1513 to b992da4 Compare February 13, 2026 22:30

Nikhil-Manglore approved these changes Feb 16, 2026

View reviewed changes

madolson reviewed Feb 16, 2026

View reviewed changes

src/unit/test_mutexqueue.c Outdated Show resolved Hide resolved

src/unit/test_files.h Outdated Show resolved Hide resolved

Apply suggestions from code review

fda4b0d

Signed-off-by: Madelyn Olson <[email protected]>

madolson reviewed Feb 16, 2026

View reviewed changes

src/unit/test_files.h Outdated Show resolved Hide resolved

Apply suggestion from @madolson

971258f

Signed-off-by: Madelyn Olson <[email protected]>

madolson approved these changes Feb 16, 2026

View reviewed changes

madolson merged commit 051a5eb into valkey-io:unstable Feb 16, 2026
56 checks passed

dvkashapov mentioned this pull request Feb 23, 2026

Fix bio jobs leak on server crash #3011

Closed

madolson mentioned this pull request Feb 24, 2026

Workaround for fifo to better support memcheck #3061

Closed

This was referenced Feb 25, 2026

Replace mutexQueuePeek with current_job field for BIO memcheck fix JimB123/valkey#18

Merged

Replace mutexQueuePeek with current_job field for BIO memcheck fix #3256

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fixes memory leak when the job crashes before it's freed#3178

Fixes memory leak when the job crashes before it's freed#3178
madolson merged 7 commits intovalkey-io:unstablefrom
sarthakaggarwal97:fix-valgrind-error

sarthakaggarwal97 commented Feb 9, 2026 •

edited

Loading

Uh oh!

codecov bot commented Feb 9, 2026 •

edited

Loading

Uh oh!

sarthakaggarwal97 commented Feb 9, 2026

Uh oh!

Uh oh!

Uh oh!

sarthakaggarwal97 commented Feb 11, 2026

Uh oh!

Nikhil-Manglore left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

JimB123 commented Feb 23, 2026

Uh oh!

madolson commented Feb 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

sarthakaggarwal97 commented Feb 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov bot commented Feb 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

sarthakaggarwal97 commented Feb 9, 2026

Uh oh!

Uh oh!

Uh oh!

sarthakaggarwal97 commented Feb 11, 2026

Uh oh!

Nikhil-Manglore left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

JimB123 commented Feb 23, 2026

Uh oh!

madolson commented Feb 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

sarthakaggarwal97 commented Feb 9, 2026 •

edited

Loading

codecov bot commented Feb 9, 2026 •

edited

Loading