Fixed crashes due to missed slotToKeyInit() and missed expires_cursor reset by sundb · Pull Request #13315 · redis/redis

sundb · 2024-06-03T04:19:05Z

this PR fixes two crashes:

Fix missing slotToKeyInit() when using flushdb async under cluster mode.
[CRASH] Redis 7.2.3 crashed in slotToKeyReplaceEntry #13205
Fix missing expires_cursor reset when stopping active defrag in the middle of defragment.
[CRASH] Redis 7.2.x crashes in activeDefragCycle when activedefrag disabled while running and re-enabled #13307
If we stop active defrag in the middle of defragging db->expires, if expires_cursor is not reset to 0, the next time we enable active defrag again, defragLaterStep(db, ...) will be entered. However, at this time, db has been reset to NULL, which results in crash.

The affected code were removed by #11695 and #13058 in usntable, so we just need backport this to 7.2.

oranagra · 2024-06-03T15:25:05Z

tests/unit/memefficiency.tcl


 run_solo {defrag} {
-start_server {tags {"defrag external:skip"} overrides {appendonly yes auto-aof-rewrite-percentage 0 save ""}} {
+    proc test_active_defrag {type} {


so now all the defrag tests are running twice?
isn't that a little excessive?
if we keep it, or find a better one, do we want to reflect that change in unstable?

but these two are not the same, one is for cluster, and another is for standalone.

i know.
i'm saying that in the past we had a bunch of tests: [plain, aof, large keys, edge case, eval]
and now we run nearly all of them twice.
i think it's excessive, it's probably enough to just run one or two of them twice, or just add a dedicated test for cluster defrag (which i think we already have in unstable)

or just add a dedicated test for cluster defrag (which i think we already have in unstable)

i'll do it.

@sundb so to be sure: with this PR getting merged, the tests in both 7.2 and 7.4 are now able to detect a bug similar to the one being fixed here (which doesn't exist in 7.4)?

no, 7.4 doesn't covert it unless we add the same test for it.

ohh, now i actually looked at the test code:

# It repeatedly enables and disables active defragmentation, # and checks if it crashes, see issue #13307.

well, i suppose it's pointless to test the expires_cursor reset thing, since that code is now simplified.

and the flushdb async bug isn't really related to the defrag, so if we wanted, we could have added another simpler explicit check for it.
truth be told, that emptyDbAsync does have some duplicated logic in it that can someday get out of sync.
but i suppose that's completely unrelated to this PR.

stevelipinski · 2024-07-19T14:26:59Z

Is there a plan to get a 7.2.x release out with this fix (for issues #13205 and #13307) included?

sundb · 2024-07-20T05:14:34Z

@stevelipinski yes, this fix will appear in 7.2.6.

stevelipinski · 2024-07-22T13:13:19Z

Thanks - any ETA on timeframe for 7.2.6 release?

stevelipinski · 2024-09-18T15:08:34Z

can we request a 7.2.6 be released?

sundb · 2024-09-19T00:30:26Z

@stevelipinski sorry for late reply, we are not sure about the release date yet, but i will call you if any news, thanks.

sundb added 2 commits June 3, 2024 11:57

Fix missing slotToKeyInit when empting db async

4d62cf5

Fix missing expires_cursor reset when stopping active defrag

f48315f

sundb requested a review from oranagra June 3, 2024 04:19

This was linked to issues Jun 3, 2024

[CRASH] Redis 7.2.3 crashed in slotToKeyReplaceEntry #13205

Closed

[CRASH] Redis 7.2.x crashes in activeDefragCycle when activedefrag disabled while running and re-enabled #13307

Closed

sundb added the release-notes indication that this issue needs to be mentioned in the release notes label Jun 3, 2024

sundb added 3 commits June 3, 2024 12:41

Free db->slots_to_keys

7dfcc9f

Print defrag type

aead69b

Skip the tests that failed in cluster mode

d184127

oranagra reviewed Jun 3, 2024

View reviewed changes

Test for cluster mode

b829dd3

oranagra approved these changes Jun 17, 2024

View reviewed changes

sundb merged commit 2ad2548 into redis:7.2 Jun 18, 2024

This was referenced Jun 21, 2024

[CRASH] Redis 7.2.3 crashed in slotToKeyReplaceEntry #13205

Closed

[CRASH] Redis 7.2.x crashes in activeDefragCycle when activedefrag disabled while running and re-enabled #13307

Closed

sundb deleted the init_slot_emptydb_async branch July 20, 2024 05:14

oranagra mentioned this pull request Oct 2, 2024

Release 7.2.6 #13582

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fixed crashes due to missed slotToKeyInit() and missed expires_cursor reset#13315

Fixed crashes due to missed slotToKeyInit() and missed expires_cursor reset#13315
sundb merged 6 commits intoredis:7.2from
sundb:init_slot_emptydb_async

sundb commented Jun 3, 2024 •

edited

Loading

Uh oh!

oranagra Jun 3, 2024

Uh oh!

sundb Jun 4, 2024

Uh oh!

oranagra Jun 4, 2024

Uh oh!

sundb Jun 4, 2024

Uh oh!

oranagra Jun 17, 2024

Uh oh!

sundb Jun 17, 2024

Uh oh!

oranagra Jun 17, 2024

Uh oh!

stevelipinski commented Jul 19, 2024

Uh oh!

sundb commented Jul 20, 2024

Uh oh!

stevelipinski commented Jul 22, 2024

Uh oh!

stevelipinski commented Sep 18, 2024

Uh oh!

sundb commented Sep 19, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

sundb commented Jun 3, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

oranagra Jun 3, 2024

Choose a reason for hiding this comment

Uh oh!

sundb Jun 4, 2024

Choose a reason for hiding this comment

Uh oh!

oranagra Jun 4, 2024

Choose a reason for hiding this comment

Uh oh!

sundb Jun 4, 2024

Choose a reason for hiding this comment

Uh oh!

oranagra Jun 17, 2024

Choose a reason for hiding this comment

Uh oh!

sundb Jun 17, 2024

Choose a reason for hiding this comment

Uh oh!

oranagra Jun 17, 2024

Choose a reason for hiding this comment

Uh oh!

stevelipinski commented Jul 19, 2024

Uh oh!

sundb commented Jul 20, 2024

Uh oh!

stevelipinski commented Jul 22, 2024

Uh oh!

stevelipinski commented Sep 18, 2024

Uh oh!

sundb commented Sep 19, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

sundb commented Jun 3, 2024 •

edited

Loading