if diskless repl child is killed, make sure to reap the pid by oranagra · Pull Request #7742 · redis/redis

oranagra · 2020-09-02T10:53:48Z

Starting with Redis 6.0 and the changes we made to the diskless master
to make it suitable for TLS, I made the master avoid reaping (wait3) the
pid of the child until we know all replicas are done reading their rdb.

I did that in order to avoid a state where rdb_child_pid is -1 but
we don't yet want to start another fork (we're still busy serving that
data to replicas).

It turns out that the solution used so far was problematic when the
fork child is killed (e.g. by the kernel OOM killer). In that case
there's a chance that the read event on the rdb pipe is currently
disabled, since we're waiting for a replica to become writable again.
In that scenario the master would never realize the child exited, and
the replica would remain hung too.
Note that there's no mechanism to detect a hung replica while it's in
rdb transfer state.

The solution here is to add another pipe, which is used by the parent to
tell the child it is safe to exit. This means that when the child exits,
for whatever reason, it is safe to reap it.
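The "safe to exit" pipe described above can be illustrated with a small standalone sketch. This is not the Redis implementation: the function name `run_child_with_exit_pipe`, the pipe variable names, and the use of a blocking `waitpid` are all assumptions made for illustration. The child streams a small payload through one pipe, then blocks reading a second pipe until the parent closes its write end; only then does it exit, so any exit the parent observes is safe to reap.

```c
#include <string.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/* Hypothetical helper (not Redis code): fork a child that writes `payload`
 * to a data pipe and then waits on a second "exit" pipe until the parent
 * says it is safe to exit. Assumes the payload fits in the pipe buffer.
 * Returns 0 if the child exited normally with status 0, -1 otherwise. */
int run_child_with_exit_pipe(const char *payload, char *out, size_t outlen) {
    int data_pipe[2], exit_pipe[2];
    if (outlen == 0) return -1;
    if (pipe(data_pipe) == -1) return -1;
    if (pipe(exit_pipe) == -1) {
        close(data_pipe[0]); close(data_pipe[1]);
        return -1;
    }

    pid_t pid = fork();
    if (pid == -1) {
        close(data_pipe[0]); close(data_pipe[1]);
        close(exit_pipe[0]); close(exit_pipe[1]);
        return -1;
    }

    if (pid == 0) {
        /* Child: keep only the data write end and the exit read end. */
        close(data_pipe[0]);
        close(exit_pipe[1]);
        size_t len = strlen(payload);
        if (write(data_pipe[1], payload, len) != (ssize_t)len) _exit(1);
        close(data_pipe[1]);
        /* Block until the parent closes its end of the exit pipe (EOF). */
        char dummy;
        while (read(exit_pipe[0], &dummy, 1) > 0) { }
        _exit(0);
    }

    /* Parent: keep only the data read end and the exit write end. */
    close(data_pipe[1]);
    close(exit_pipe[0]);

    size_t total = 0;
    ssize_t n;
    while (total < outlen - 1 &&
           (n = read(data_pipe[0], out + total, outlen - 1 - total)) > 0) {
        total += (size_t)n;
    }
    out[total] = '\0';
    close(data_pipe[0]);

    /* All data consumed: tell the child it is safe to exit. */
    close(exit_pipe[1]);

    /* The child can only be gone because we released it or because it was
     * killed; in both cases reaping it now is safe. */
    int status;
    if (waitpid(pid, &status, 0) == -1) return -1;
    return (WIFEXITED(status) && WEXITSTATUS(status) == 0) ? 0 : -1;
}
```

Calling `run_child_with_exit_pipe("rdb-payload", buf, sizeof(buf))` with a 64-byte `buf` should return 0 and leave the payload in `buf`. In a real event loop the parent would not block in `waitpid`; the point of the sketch is only the ordering: consume the data, release the child, then reap.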

oranagra · 2020-09-06T10:51:39Z

@yossigo I had to rebase due to a conflict (two adjacent changes).
While doing that I noticed a compilation warning and made an additional adjustment; please review.

This adjustment was part of the original #6271 and was removed before merging (#6271 (review)) because it was no longer relevant after the introduction of the TLS fork/pipe changes (5a47794). With this change it becomes relevant again.

P.S. Testing the diskless loading short read still manages to run 100 tests in about 3 seconds regardless of this adjustment, but that's probably only because it sets the cron hz to 500.
P.P.S. It looks like this test is even slightly faster with this PR than before it.

I'll update the commit comment to reflect the above when squashing.

Starting with Redis 6.0 and the changes we made to the diskless master to make it suitable for TLS, I made the master avoid reaping (wait3) the pid of the child until we know all replicas are done reading their rdb.

I did that in order to avoid a state where rdb_child_pid is -1 but we don't yet want to start another fork (we're still busy serving that data to replicas).

It turns out that the solution used so far was problematic when the fork child is killed (e.g. by the kernel OOM killer). In that case there's a chance that the read event on the rdb pipe is currently disabled, since we're waiting for a replica to become writable again. In that scenario the master would never realize the child exited, and the replica would remain hung too. Note that there's no mechanism to detect a hung replica while it's in rdb transfer state.

The solution here is to add another pipe, which is used by the parent to tell the child it is safe to exit. This means that when the child exits, for whatever reason, it is safe to reap it.

Besides that, I'm re-introducing an adjustment to REPLCONF ACK which was part of redis#6271 (Accelerate diskless master connections) but was dropped when that PR was rebased after the TLS fork/pipe changes (5a47794). Now that RdbPipeCleanup no longer calls checkChildrenDone, and the ACK has a chance to detect that the child exited, it should be the one to call it so that we don't have to wait for cron (server.hz) to do that.

(The same commit message as above, cherry picked from commit 573246f.)
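The REPLCONF ACK adjustment in the commit message relies on a child-exit check that is cheap enough to call from an event handler, rather than only from the periodic cron. A rough sketch of such a non-blocking check follows. It is not Redis's `checkChildrenDone`: the names `child_state`, `check_child_done`, and `demo_kill_and_reap` are hypothetical, and `waitpid` with `WNOHANG` is used here as a stand-in for the `wait3` call the description mentions. The demo forks an idle child, kills it with SIGKILL (loosely imitating the OOM killer), and polls until the exit is observed.

```c
#include <signal.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

typedef enum { CHILD_RUNNING, CHILD_EXITED, CHILD_KILLED, CHILD_ERROR } child_state;

/* Hypothetical non-blocking check: returns immediately whether or not the
 * child has terminated. On CHILD_EXITED, *detail is the exit status; on
 * CHILD_KILLED, *detail is the terminating signal number. */
child_state check_child_done(pid_t pid, int *detail) {
    int status;
    pid_t r = waitpid(pid, &status, WNOHANG);
    if (r == 0) return CHILD_RUNNING;   /* still alive, nothing to reap */
    if (r == -1) return CHILD_ERROR;
    if (WIFSIGNALED(status)) {
        *detail = WTERMSIG(status);
        return CHILD_KILLED;
    }
    if (WIFEXITED(status)) {
        *detail = WEXITSTATUS(status);
        return CHILD_EXITED;
    }
    return CHILD_ERROR;
}

/* Demo: fork an idle child, kill it, and poll until it has been reaped.
 * Returns the terminating signal number, or -1 on any unexpected result. */
int demo_kill_and_reap(void) {
    pid_t pid = fork();
    if (pid == -1) return -1;
    if (pid == 0) {
        for (;;) pause();               /* child idles until it is killed */
    }

    int detail = 0;
    if (check_child_done(pid, &detail) != CHILD_RUNNING) return -1;
    if (kill(pid, SIGKILL) == -1) return -1;

    /* Poll with a short sleep; an event loop would instead call the check
     * from whichever handler first has a reason to suspect the exit. */
    for (int i = 0; i < 1000; i++) {
        child_state st = check_child_done(pid, &detail);
        if (st == CHILD_KILLED) return detail;
        if (st != CHILD_RUNNING) return -1;
        usleep(1000);
    }
    return -1;
}
```

Here `demo_kill_and_reap()` should return `SIGKILL`. The design point is that the check never blocks, so invoking it from an extra call site costs little and removes the dependency on the timer frequency.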

oranagra requested a review from yossigo September 2, 2020 10:53

oranagra changed the title ~~if Diskless repl child killed, make sure to reaping the pid~~ if Diskless repl child killed, make sure to reap the pid Sep 2, 2020

oranagra changed the title ~~if Diskless repl child killed, make sure to reap the pid~~ if diskless repl child is killed, make sure to reap the pid Sep 2, 2020

yossigo previously approved these changes Sep 6, 2020

oranagra added 2 commits September 6, 2020 13:19

squashme - final fixes

36b8b92

oranagra dismissed yossigo’s stale review via 36b8b92 September 6, 2020 10:41

yossigo approved these changes Sep 6, 2020

oranagra merged commit 573246f into redis:unstable Sep 6, 2020

oranagra deleted the reap-killed-diskless-fork branch September 6, 2020 13:44

oranagra mentioned this pull request Jan 13, 2021

Redis 6.2 RC1. #8187

Merged

oranagra mentioned this pull request Jul 21, 2021

Release 6.0.15 #9266

Merged

soloestoy mentioned this pull request Mar 25, 2022

Attempt to fix a rare crash in cluster tests. #10265

Merged

oranagra merged 2 commits into redis:unstable from oranagra:reap-killed-diskless-fork
