Make tracking invalidation messages always after command's reply by huangzhw · Pull Request #9422 · redis/redis

huangzhw · 2021-08-29T03:44:47Z

Tracking invalidation messages were sometimes sent in inconsistent order, before the command's reply rather than after.
In addition to that, they were sometimes embedded inside other commands responses, like MULTI-EXEC and MGET.
Fix #8206 and #8935

oranagra

i would like to ask @madolson to take a look too.
and i would like to ask to perform some trivial benchmark to check the impact of this.

src/server.c

tests/unit/tracking.tcl

oranagra · 2021-09-14T15:30:46Z

@huangzhw what's the content of that last force-push? is it just a rebase from unstable or anything else?

p.s. we usually merge PRs with squash-merge, so it's more convenient to review if there are no force-pushes, and i'd rather merge unstable into the PR instead of a rebase.

huangzhw · 2021-09-14T23:48:44Z

I just rebase unstable as there are conflicts. Only the last commit is new.

oranagra

LGTM.
@madolson @yossigo @soloestoy does one of you want to run a second pair of eyes on this?
in theory there's a potential for a temp memory spike, since we flush that list quite frequently, i don't think that's a realistic concern.
There's also an additional lookupClientByID, but that's kinda trivial too i think.

oranagra · 2021-09-19T14:21:33Z

triggered daily CI: https://github.com/redis/redis/actions/runs/1250884079 (failures are unrelated to this PR)

madolson

I suppose I'm also not clear about the direction where we are deferring the sending of the invalidation, and not deferring the invalidation itself until outside of the command execution, which seems a little more straight forward of an implementation.

src/server.c

src/tracking.c

src/server.c

src/tracking.c

oranagra · 2021-09-29T06:24:20Z

I suppose I'm also not clear about the direction where we are deferring the sending of the invalidation, and not deferring the invalidation itself until outside of the command execution, which seems a little more straight forward of an implementation.

@madolson assuming i understand what you mean, and i remember the details:
deferring the invalidation (the decision of whatever to send a message or not), rather than just the message itself would cause us to make the wrong decisions (skip sending messages that we should have sent)

src/server.c

src/tracking.c

response are in the same connection.

src/tracking.c

src/server.c

Co-authored-by: Oran Agra <[email protected]>

oranagra

I guess this is ready to be merged.
any last concerns?

madolson

I have no concerns, took another pass through and it LGTM.

oranagra · 2021-11-02T08:48:08Z

@huangzhw one of the new tests in this PR seems to have a rare timing issue:
https://github.com/redis/redis/runs/4068817198?check_suite_focus=true

*** [err]: Tracking invalidation message of eviction keys should be before response in tests/unit/tracking.tcl
Expected '0' to be equal to 'invalidate volatile-key' (context: type eval line 21 cmd {assert_equal $res {invalidate volatile-key}} proc ::test)

can you please take a look

oranagra · 2021-11-02T10:41:23Z

interestingly, it looks like the failure is semi consistent now, happening a lot in one of the external tests (either clustered or not clustered)
https://github.com/redis/redis/runs/4078424493?check_suite_focus=true

huangzhw · 2021-11-02T12:47:03Z

I have a question. In external test between every start_server is the data set clean?

oranagra · 2021-11-02T12:50:53Z

yes, look for flushall in server.tcl

oranagra · 2021-11-02T13:28:16Z

looking at the test code i think i may realized the problem.
the line that does set used [s used_memory] at the beginning of the test may get the wrong idea if there's some lazy eviction going on in the background.

so maybe we need to copy this code from lazyfree.tcl:

        # make the previous test is really done before sampling used_memory
        wait_for_condition 50 100 {
            [s lazyfree_pending_objects] == 0
        } else {
            fail "lazyfree isn't done"
        }

or maybe even promote that to a function in util.tcl to be shared between them and maybe others in the future...

huangzhw · 2021-11-02T13:37:50Z

I check the code before this test in tracking.tcl. I think no tests cause this. I runned this test with --loop and external, I don't get failures. So I suspect other tests cause this.

oranagra · 2021-11-02T13:39:21Z

yes, the fact it only happens in external mode, is also a good indication it depends on other tests.
i posted a hypothesis above, and a PR to attempt to fix it: #9722

oranagra · 2021-11-02T18:51:04Z

Looks like my hypothesis was wrong https://github.com/redis/redis/actions/runs/1412769681
Still failing...
I guess we must add some prints and reproduce it so we know what's going on..

oranagra · 2021-11-02T21:53:04Z

found the problem: #9726

Tracking invalidation messages were sometimes sent in inconsistent order, before the command's reply rather than after. In addition to that, they were sometimes embedded inside other commands responses, like MULTI-EXEC and MGET. (cherry picked from commit fd135f3)

oranagra linked an issue Sep 5, 2021 that may be closed by this pull request

Expiration based invalidation PUSH messages sometimes embedded within other replies. #8935

Closed

oranagra reviewed Sep 5, 2021

View reviewed changes

src/server.c Outdated Show resolved Hide resolved

tests/unit/tracking.tcl Outdated Show resolved Hide resolved

tests/unit/tracking.tcl Outdated Show resolved Hide resolved

huangzhw added 3 commits September 14, 2021 18:52

Make tracking invalidation messages always after command's reply

af66ede

change test

50c3628

Add afterCommand in handleClientsBlockedOnKeys

eb0ae0e

huangzhw force-pushed the tracking branch from 15ba4c0 to eb0ae0e Compare September 14, 2021 10:53

Merge branch 'unstable' into tracking

efec777

oranagra reviewed Sep 19, 2021

View reviewed changes

oranagra added release-notes indication that this issue needs to be mentioned in the release notes state:to-be-merged The PR should be merged soon, even if not yet ready, this is used so that it won't be forgotten approval-needed Waiting for core team approval to be merged labels Sep 19, 2021

yossigo approved these changes Sep 22, 2021

View reviewed changes

oranagra requested review from madolson and soloestoy September 26, 2021 08:39

madolson reviewed Sep 29, 2021

View reviewed changes

src/server.c Outdated Show resolved Hide resolved

src/tracking.c Outdated Show resolved Hide resolved

src/server.c Outdated Show resolved Hide resolved

src/tracking.c Outdated Show resolved Hide resolved

src/tracking.c Outdated Show resolved Hide resolved

huangzhw added 2 commits September 30, 2021 07:49

Merge branch 'unstable' into tracking

7bd3121

rename

d00eb87

oranagra reviewed Sep 30, 2021

View reviewed changes

src/server.c Outdated Show resolved Hide resolved

huangzhw added 2 commits October 1, 2021 13:26

fix nestInCall

6b7f01e

test eviction before command

2347cfc

huangzhw commented Oct 1, 2021

View reviewed changes

src/tracking.c Outdated Show resolved Hide resolved

huangzhw added 2 commits October 3, 2021 15:19

Schedule tracking invalidation only when invalidation messages and

898587b

response are in the same connection.

use current_client instead of recording client

81ea899

oranagra reviewed Oct 3, 2021

View reviewed changes

src/tracking.c Show resolved Hide resolved

src/server.c Outdated Show resolved Hide resolved

huangzhw and others added 2 commits October 3, 2021 20:19

Update src/server.c

22e164e

Co-authored-by: Oran Agra <[email protected]>

comment

f744cc3

oranagra approved these changes Oct 4, 2021

View reviewed changes

madolson approved these changes Oct 7, 2021

View reviewed changes

oranagra merged commit fd135f3 into redis:unstable Oct 7, 2021

huangzhw deleted the tracking branch October 7, 2021 12:44

oranagra mentioned this pull request Oct 12, 2021

Sort out the mess around writable replicas and lookupKeyRead/Write #9572

Merged

oranagra mentioned this pull request Nov 2, 2021

Fix not updating backlog histlen when trimming repl backlog #9713

Merged

oranagra mentioned this pull request Nov 2, 2021

attempt to fix tracking test issue with external tests due to lazy free #9722

Merged

joshleeb mentioned this pull request Dec 14, 2021

Delay *SUBSCRIBE inside transactions #9928

Open

rueian mentioned this pull request Mar 3, 2022

v0.0.37 ,in the use case would happen panic situation redis/rueidis#12

Closed

yossigo mentioned this pull request Apr 23, 2022

Handle push notifications before or after reply. redis/hiredis#1062

Merged

This was referenced Apr 27, 2022

Redis 6.2.7 #10653

Closed

Redis 6.2.7 #10654

Merged

madolson mentioned this pull request Feb 17, 2023

Prevent Redis from crashing from key tracking invalidations #11814

Merged

oranagra mentioned this pull request Jun 18, 2023

Fix broken protocol when PUBLISH emits local push inside MULTI #12326

Merged

rueian mentioned this pull request Apr 18, 2024

Inconsistent local/remote cache hits redis/rueidis#534

Closed

Conversation

huangzhw commented Aug 29, 2021 • edited by oranagra Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

oranagra left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

oranagra commented Sep 14, 2021

Uh oh!

huangzhw commented Sep 14, 2021

Uh oh!

oranagra left a comment

Choose a reason for hiding this comment

Uh oh!

oranagra commented Sep 19, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

madolson left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

oranagra commented Sep 29, 2021

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

oranagra left a comment

Choose a reason for hiding this comment

Uh oh!

madolson left a comment

Choose a reason for hiding this comment

Uh oh!

oranagra commented Nov 2, 2021

Uh oh!

oranagra commented Nov 2, 2021

Uh oh!

huangzhw commented Nov 2, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

oranagra commented Nov 2, 2021

Uh oh!

oranagra commented Nov 2, 2021

Uh oh!

huangzhw commented Nov 2, 2021

Uh oh!

oranagra commented Nov 2, 2021

Uh oh!

oranagra commented Nov 2, 2021

Uh oh!

oranagra commented Nov 2, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

huangzhw commented Aug 29, 2021 •

edited by oranagra

Loading

oranagra commented Sep 19, 2021 •

edited

Loading

huangzhw commented Nov 2, 2021 •

edited

Loading