Fix propagation of entries_read by calling streamPropagateGroupID unconditionally by enjoy-binbin · Pull Request #12898 · redis/redis

enjoy-binbin · 2023-12-28T09:07:33Z

In XREADGROUP ACK, because streamPropagateXCLAIM does not propagate
entries-read, entries-read will be inconsistent between master and replicas.
I.e. if no entries were claimed, it would have propagated correctly, but if some
were claimed, then the entries-read field would be inconsistent on the replica.

The fix was suggested by guybe7, call streamPropagateGroupID unconditionally,
so that we will normalize entries_read on the replicas. In the past, we would
only set propagate_last_id when NOACK was specified. And in #9127, XCLAIM did
not propagate entries_read in ACK, which would cause entries_read to be
inconsistent between master and replicas.

Another approach is add another arg to XCLAIM and let it propagate entries_read,
but we decided not to use it. Because we want minimal damage in case there's an
old target and new source (in the worst case scenario, the new source doesn't
recognize XGROUP SETID ... ENTRIES READ and the lag is lost. If we change XCLAIM,
the damage is much more severe).

In this patch, now if the user uses XREADGROUP .. COUNT 1 there will be an additional
overhead of MULTI, EXEC and XGROUPSETID. We assume the extra command in case of
COUNT 1 (4x factor, changing from one XCLAIM to MULTI+XCLAIM+XSETID+EXEC), is probably
ok since reading just one entry is in any case very inefficient (a client round trip
per record), so we're hoping it's not a common case.

Issue was introduced in #9127.

…onditionally In XREADGROUP ACK, because streamPropagateXCLAIM does not propagate entries-read, entries-read will be inconsistent between master and replicas. The fix was suggested by guybe7, call streamPropagateGroupID unconditionally, so that we will normalize entries_read on the replicas. Issue was introduced in redis#9127.

src/commands/xgroup-create.json

tests/unit/type/stream-cgroups.tcl

guybe7 · 2023-12-28T09:56:57Z

revisiting the decision not to add another arg to XCLAIM: why is that a problem? it's only a problem if the master is new and replica is old (major-version-wise), which anyway isn't supported

@oranagra

oranagra · 2023-12-28T12:07:39Z

revisiting the decision not to add another arg to XCLAIM: why is that a problem? it's only a problem if the master is new and replica is old (major-version-wise), which anyway isn't supported

@oranagra

@guybe7 you're right, but there are also other cases (like RL's replica-of, or loading an AOF), which can replicate a new version to and old one.
what bothers me is also the fact that even if we ignore the command failure, it means that the target skips the entire XCLAIM, not just the unrecognized option.

so if the current approach is viable, maybe it's a better one...
WDYT?

enjoy-binbin · 2024-02-19T14:51:31Z

@oranagra @guybe7 avoid delaying for too long and losing the context. Do we have a final decision?

guybe7 · 2024-02-20T04:25:19Z

@oranagra ok, so what happens with the current code when the user uses NOACK? the server propagates the new form of XGROUP SETID (with ENTRIESREAD) and the receiving server (assuming it's an older version) will fail to execute it...

I mean, we probably have the same problem (old server fails to execute commands from a new server) in many other places, so I see no reason to avoid adding a new arg to XCLAIM

oranagra · 2024-02-22T10:06:46Z

i already lost context 😞
trying to move forward without regaining it..

the difference could be if the command / combination that breaks it is new, or rarely used, in which case the replication won't break unless the user uses a certain feature.
does that help? or are both approaches similar in that respect?

guybe7 · 2024-02-29T06:42:15Z

talked about this with @oranagra and we decided to keep the current approach because we want minimal damage in case there's an old target and new source (in the worst case scenario, the new source doesn't recognize XGROUP SETID ... ENTRIES READ and the lag is lost. if we change XCLAIM, the damage is much more severe)

we also understand that if the user uses XREADGROUP .. COUNT 1 the will be an additional overhead of MULTI, EXEC and XGROUPSETID

oranagra · 2024-02-29T06:47:48Z

to elaborate on that, we assume the extra command in case of COUNT 1 (4x factor, changing from one XCLAIM to MULTI+XCLAIM+XSETID+EXEC), is probably ok since reading just one entry is in any case very inefficient (a client round trip per record), so we're hoping it's not a common case.

oranagra · 2024-02-29T06:48:22Z

@enjoy-binbin please ping me that it's ready for merge from your perspective

enjoy-binbin · 2024-02-29T07:17:03Z

@oranagra I browsed through it again, it is ready to merge.

…onditionally (redis#12898) In XREADGROUP ACK, because streamPropagateXCLAIM does not propagate entries-read, entries-read will be inconsistent between master and replicas. I.e. if no entries were claimed, it would have propagated correctly, but if some were claimed, then the entries-read field would be inconsistent on the replica. The fix was suggested by guybe7, call streamPropagateGroupID unconditionally, so that we will normalize entries_read on the replicas. In the past, we would only set propagate_last_id when NOACK was specified. And in redis#9127, XCLAIM did not propagate entries_read in ACK, which would cause entries_read to be inconsistent between master and replicas. Another approach is add another arg to XCLAIM and let it propagate entries_read, but we decided not to use it. Because we want minimal damage in case there's an old target and new source (in the worst case scenario, the new source doesn't recognize XGROUP SETID ... ENTRIES READ and the lag is lost. If we change XCLAIM, the damage is much more severe). In this patch, now if the user uses XREADGROUP .. COUNT 1 there will be an additional overhead of MULTI, EXEC and XGROUPSETID. We assume the extra command in case of COUNT 1 (4x factor, changing from one XCLAIM to MULTI+XCLAIM+XSETID+EXEC), is probably ok since reading just one entry is in any case very inefficient (a client round trip per record), so we're hoping it's not a common case. Issue was introduced in redis#9127.

enjoy-binbin requested review from guybe7 and oranagra December 28, 2023 09:07

enjoy-binbin commented Dec 28, 2023

View reviewed changes

src/commands/xgroup-create.json Show resolved Hide resolved

enjoy-binbin commented Dec 28, 2023

View reviewed changes

tests/unit/type/stream-cgroups.tcl Outdated Show resolved Hide resolved

Update tests/unit/type/stream-cgroups.tcl

ca2cb46

enjoy-binbin mentioned this pull request Dec 28, 2023

Add stream consumer group lag tracking and reporting #9127

Merged

13 tasks

fix multi propagation test

eda6771

fix test

16015fd

enjoy-binbin requested review from guybe7 and removed request for guybe7 February 29, 2024 02:17

guybe7 approved these changes Feb 29, 2024

View reviewed changes

Merge remote-tracking branch 'upstream/unstable' into fix_entries_read

1fcea46

oranagra merged commit f17381a into redis:unstable Feb 29, 2024

oranagra added the release-notes indication that this issue needs to be mentioned in the release notes label Feb 29, 2024

enjoy-binbin deleted the fix_entries_read branch February 29, 2024 07:56

sundb mentioned this pull request Jun 27, 2024

[BUG] Stream lag seems to not be correctly replicated on slave #13336

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix propagation of entries_read by calling streamPropagateGroupID unconditionally#12898

Fix propagation of entries_read by calling streamPropagateGroupID unconditionally#12898
oranagra merged 5 commits intoredis:unstablefrom
enjoy-binbin:fix_entries_read

enjoy-binbin commented Dec 28, 2023 •

edited by oranagra

Loading

Uh oh!

Uh oh!

Uh oh!

guybe7 commented Dec 28, 2023

Uh oh!

oranagra commented Dec 28, 2023

Uh oh!

enjoy-binbin commented Feb 19, 2024

Uh oh!

guybe7 commented Feb 20, 2024

Uh oh!

oranagra commented Feb 22, 2024

Uh oh!

guybe7 commented Feb 29, 2024

Uh oh!

oranagra commented Feb 29, 2024

Uh oh!

oranagra commented Feb 29, 2024

Uh oh!

enjoy-binbin commented Feb 29, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

enjoy-binbin commented Dec 28, 2023 • edited by oranagra Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

guybe7 commented Dec 28, 2023

Uh oh!

oranagra commented Dec 28, 2023

Uh oh!

enjoy-binbin commented Feb 19, 2024

Uh oh!

guybe7 commented Feb 20, 2024

Uh oh!

oranagra commented Feb 22, 2024

Uh oh!

guybe7 commented Feb 29, 2024

Uh oh!

oranagra commented Feb 29, 2024

Uh oh!

oranagra commented Feb 29, 2024

Uh oh!

enjoy-binbin commented Feb 29, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

enjoy-binbin commented Dec 28, 2023 •

edited by oranagra

Loading