Fix the wrong reisze of querybuf by sundb · Pull Request #9003 · redis/redis

sundb · 2021-05-29T09:25:29Z

The initialize memory of querybuf is PROTO_IOBUF_LEN(1024*16) * 2 (due to sdsMakeRoomFor being greedy), under jemalloc, the allocated memory will be 40k.
This will most likely result in the querybuf being resized when call clientsCronResizeQueryBuffer unless the client requests it fast enough.

Note that this bug existed even before #7875, since the condition for resizing includes the sds headers (32k+6).

Changes

Use non-greedy sdsMakeRoomFor when allocating the initial query buffer (of 16k).
Also use non-greedy allocation when working with BIG_ARG (we won't use that extra space anyway)
in case we did use a greedy allocation, read as much as we can into the buffer we got (including internal frag), to reduce system calls.
introduce a dedicated constant for the shrinking (same value as before)
Add test for querybuf.
improve a maxmemory test by ignoring the effect of replica query buffers (can accumulate many ACKs on slow env)
improve a maxmemory by disabling slowlog (it will cause slight memory growth on slow env).

oranagra · 2021-05-29T11:59:34Z

@sundb AFAIR you started to investigate that to solve the replica buffer don't induce eviction test, but i see it fails in this PR.
is that related or unrelated to this test?

sundb · 2021-05-29T12:12:42Z

@oranagra This pr solves the problem of replica buffer don't induce eviction test, but now the test fails because my modification causes the memory to cause the slave's querybuf to no longer be shrunk, and when the slave is killed, delta_no_repl does not count the memory added by the querybuf, I'm still testing.

oranagra · 2021-05-30T09:18:33Z

Let me see if i get the the full story here.

by default when we read a query we use:

    readlen = PROTO_IOBUF_LEN;
....
    c->querybuf = sdsMakeRoomFor(c->querybuf, readlen);

PROTO_IOBUF_LEN is 16k, and sdsMakeRoomFor being greedy doubles that.
so that, together with the (not so recent) change in ~~#7864~~ #7875. caused the sds size to be 48k (instead of 32k which it was before)

so when it was combined with this check (PROTO_MBULK_BIG_ARG is set to 32k, and using >) and avoid shrinking the default buffer size.

    if (querybuf_size > PROTO_MBULK_BIG_ARG &&

so in fact this is a "regression" from ~~#7864~~ #7875.

sundb · 2021-05-30T09:30:38Z

@oranagra It's introduced by #7875.
sdsMakeRoomFor will change sds size to be 40k(http://jemalloc.net/jemalloc.3.html#size_classes).

oranagra · 2021-05-30T10:03:04Z

ohh, yeah, that's the PR i meant (took the wrong one since they had a similar title)

oranagra

sorry i wasn't paying close attention so far while you were investigating failures in this test. only now i bothered to read my test and remember what it was attempting to achieve.

tests/unit/maxmemory.tcl

tests/unit/querybuf.tcl

src/server.c

tests/unit/maxmemory.tcl

tests/unit/querybuf.tcl

tests/unit/maxmemory.tcl

Co-authored-by: Oran Agra <[email protected]>

src/server.c

tests/test_helper.tcl

oranagra · 2021-06-03T09:54:05Z

i take it back, this is not a regression from #7875.
before 7875, we used to ask for 16k, and the greenness of sdsMakeRoomFor gave us 32k, but since serverCron uses sdsAllocSize rather than sdsalloc (what #5013 wants to solve), it would readk 32k+6, so the fact the code there uses > and not >= doesn't help.

so even before 7875 we would have shrieked the query buff right after it was allocated.

sundb · 2021-06-03T10:05:03Z

):8 It's been around for 9 years.

src/sds.c

src/server.c

src/networking.c

Co-authored-by: Oran Agra <[email protected]>

src/sds.c

oranagra · 2021-06-13T07:14:52Z

@sundb @yoav-steinberg let me know if we're good to merge this one.

sundb · 2021-06-15T01:09:53Z

@oranagra Sorry, I can't be in front of the PC these days.
I make the following changes.

yoav-steinberg

Comment on using sdsavail for updating readlen.

src/networking.c

oranagra

much of these recent changes aren't really doing anything (e.g. the one in scripting.c). we can consider them a cleanup, but maybe since this PR is already quite big and confusing, we wanna leave that cleanup for a separate PR?

the one in tls.c seems to reveal another bug (again i think subject for another PR).
WDYT?

src/tls.c

…size_t

oranagra

so are we all good to merge this now?

sundb · 2021-06-15T10:37:26Z

@oranagra Yes.

Fix test failure which introduced by #9003. The following case will occur when querybuf expansion will allocate memory equal to (16*1024)k. 1) make use ```CFLAGS=-DNO_MALLOC_USABLE_SIZE```. 2) ```malloc``` will not allocate more under ```alpine```.

…Followup for #9003) (#9100) Due to the change in #9003, a long-standing bug was raised under `valgrind`. This bug can cause the master-slave sync to take a very long time, causing the `pendingquerybuf.tcl` test to fail. This problem does not only occur in master-slave sync, it is triggered when the big arg is greater than 32k. step: ```sh dd if=/dev/zero of=bigfile bs=1M count=32 ./src/redis-cli -x hset a a < bigfile ``` 1) Make room for querybuf in processMultibulkBuffer, now the alloc of querybuf will be more than 32k. 2) If this happens to trigger the `clientsCronResizeQueryBuffer`, querybuf will be resized to 0. 3) Finally, in readQueryFromClient, we expand the querybuf non-greedily, from 0 to 32k. Old code, make room for querybuf is greedy, so it only needs 11 times to expand to 32M(16k*(2^11)), but now we need 2048(32*1024/16) times to reach it, due to the slow allocation under valgrind that exposed the problem. The fix for the excessive shrinking of the query buf to 0, will be handled in #5013 (that other change on it's own can fix failing test too), but the fix in this PR will also fix the failing test. The fix in this PR will makes the reading in `readQueryFromClient` more aggressive when working on a big arg (so that it is in par with the same code in `processMultibulkBuffer` (i.e. the two calls to `sdsMakeRoomForNonGreedy` should both use the bulk size). In the code before this fix the one in readQueryFromClient always has `readlen = PROTO_IOBUF_LEN`

The initialize memory of `querybuf` is `PROTO_IOBUF_LEN(1024*16) * 2` (due to sdsMakeRoomFor being greedy), under `jemalloc`, the allocated memory will be 40k. This will most likely result in the `querybuf` being resized when call `clientsCronResizeQueryBuffer` unless the client requests it fast enough. Note that this bug existed even before redis#7875, since the condition for resizing includes the sds headers (32k+6). ## Changes 1. Use non-greedy sdsMakeRoomFor when allocating the initial query buffer (of 16k). 1. Also use non-greedy allocation when working with BIG_ARG (we won't use that extra space anyway) 2. in case we did use a greedy allocation, read as much as we can into the buffer we got (including internal frag), to reduce system calls. 3. introduce a dedicated constant for the shrinking (same value as before) 3. Add test for querybuf. 4. improve a maxmemory test by ignoring the effect of replica query buffers (can accumulate many ACKs on slow env) 5. improve a maxmemory by disabling slowlog (it will cause slight memory growth on slow env).

Fix test failure which introduced by redis#9003. The following case will occur when querybuf expansion will allocate memory equal to (16*1024)k. 1) make use ```CFLAGS=-DNO_MALLOC_USABLE_SIZE```. 2) ```malloc``` will not allocate more under ```alpine```.

…Followup for redis#9003) (redis#9100) Due to the change in redis#9003, a long-standing bug was raised under `valgrind`. This bug can cause the master-slave sync to take a very long time, causing the `pendingquerybuf.tcl` test to fail. This problem does not only occur in master-slave sync, it is triggered when the big arg is greater than 32k. step: ```sh dd if=/dev/zero of=bigfile bs=1M count=32 ./src/redis-cli -x hset a a < bigfile ``` 1) Make room for querybuf in processMultibulkBuffer, now the alloc of querybuf will be more than 32k. 2) If this happens to trigger the `clientsCronResizeQueryBuffer`, querybuf will be resized to 0. 3) Finally, in readQueryFromClient, we expand the querybuf non-greedily, from 0 to 32k. Old code, make room for querybuf is greedy, so it only needs 11 times to expand to 32M(16k*(2^11)), but now we need 2048(32*1024/16) times to reach it, due to the slow allocation under valgrind that exposed the problem. The fix for the excessive shrinking of the query buf to 0, will be handled in redis#5013 (that other change on it's own can fix failing test too), but the fix in this PR will also fix the failing test. The fix in this PR will makes the reading in `readQueryFromClient` more aggressive when working on a big arg (so that it is in par with the same code in `processMultibulkBuffer` (i.e. the two calls to `sdsMakeRoomForNonGreedy` should both use the bulk size). In the code before this fix the one in readQueryFromClient always has `readlen = PROTO_IOBUF_LEN`

The initialize memory of `querybuf` is `PROTO_IOBUF_LEN(1024*16) * 2` (due to sdsMakeRoomFor being greedy), under `jemalloc`, the allocated memory will be 40k. This will most likely result in the `querybuf` being resized when call `clientsCronResizeQueryBuffer` unless the client requests it fast enough. Note that this bug existed even before redis#7875, since the condition for resizing includes the sds headers (32k+6). 1. Use non-greedy sdsMakeRoomFor when allocating the initial query buffer (of 16k). 1. Also use non-greedy allocation when working with BIG_ARG (we won't use that extra space anyway) 2. in case we did use a greedy allocation, read as much as we can into the buffer we got (including internal frag), to reduce system calls. 3. introduce a dedicated constant for the shrinking (same value as before) 3. Add test for querybuf. 4. improve a maxmemory test by ignoring the effect of replica query buffers (can accumulate many ACKs on slow env) 5. improve a maxmemory by disabling slowlog (it will cause slight memory growth on slow env).

Fix the wrong reisze of querybuf

d3395cb

sundb marked this pull request as draft May 29, 2021 10:07

oranagra added the state:to-be-merged The PR should be merged soon, even if not yet ready, this is used so that it won't be forgotten label May 29, 2021

sundb added 2 commits May 30, 2021 14:39

Fix maxmemory test fail

6c211e8

Add test for querybuf

7f070e7

Fix comment

e9c950c

sundb marked this pull request as ready for review May 30, 2021 09:32

oranagra reviewed May 30, 2021

View reviewed changes

tests/unit/maxmemory.tcl Outdated Show resolved Hide resolved

tests/unit/maxmemory.tcl Outdated Show resolved Hide resolved

tests/unit/querybuf.tcl Outdated Show resolved Hide resolved

tests/unit/querybuf.tcl Outdated Show resolved Hide resolved

huangzhw reviewed May 30, 2021

View reviewed changes

src/server.c Outdated Show resolved Hide resolved

Fix CR

f8bea24

oranagra reviewed Jun 1, 2021

View reviewed changes

tests/unit/maxmemory.tcl Outdated Show resolved Hide resolved

tests/unit/querybuf.tcl Outdated Show resolved Hide resolved

tests/unit/querybuf.tcl Outdated Show resolved Hide resolved

sundb added 4 commits June 1, 2021 19:06

Fix CR

4f0ceb9

Incr theshold

69c9021

Fix test error

7234a5a

Add more assert

e85e4dd

oranagra reviewed Jun 1, 2021

View reviewed changes

tests/unit/maxmemory.tcl Outdated Show resolved Hide resolved

Update tests/unit/maxmemory.tcl

0f8ae6c

Co-authored-by: Oran Agra <[email protected]>

oranagra mentioned this pull request Jun 2, 2021

Improve test suite to handle external servers better. #9033

Merged

oranagra reviewed Jun 2, 2021

View reviewed changes

src/server.c Show resolved Hide resolved

tests/test_helper.tcl Outdated Show resolved Hide resolved

Add sdsMakeRoomForExact to expand querybuf

65851e4

oranagra reviewed Jun 6, 2021

View reviewed changes

src/sds.c Outdated Show resolved Hide resolved

src/sds.c Outdated Show resolved Hide resolved

src/sds.c Outdated Show resolved Hide resolved

src/server.c Outdated Show resolved Hide resolved

src/networking.c Outdated Show resolved Hide resolved

sundb and others added 2 commits June 7, 2021 08:46

Update src/sds.c

fdc61e5

Co-authored-by: Oran Agra <[email protected]>

Update src/sds.c

0ca9308

Co-authored-by: Oran Agra <[email protected]>

sundb added 3 commits June 9, 2021 08:13

Change slow tag and temp change ci

7884d82

Revert ci

5d05356

Remove redundant space in comment

e925ae3

oranagra reviewed Jun 10, 2021

View reviewed changes

src/sds.c Outdated Show resolved Hide resolved

src/sds.c Outdated Show resolved Hide resolved

improve comments.

2099bf1

oranagra previously approved these changes Jun 13, 2021

View reviewed changes

sundb dismissed oranagra’s stale review via dc64916 June 15, 2021 01:31

sundb force-pushed the fix-querybuf-resize branch from d539de1 to 2099bf1 Compare June 15, 2021 03:00

yoav-steinberg reviewed Jun 15, 2021

View reviewed changes

src/networking.c Show resolved Hide resolved

oranagra reviewed Jun 15, 2021

View reviewed changes

src/tls.c Outdated Show resolved Hide resolved

sundb force-pushed the fix-querybuf-resize branch from 9e8b5b7 to 2099bf1 Compare June 15, 2021 09:32

sundb added 2 commits June 15, 2021 02:32

Use sdsvalid to read from socket, change return type of connRead to s…

cb37ba5

…size_t

Revert changes of connRead

17d1855

oranagra approved these changes Jun 15, 2021

View reviewed changes

oranagra merged commit e5d8a5e into redis:unstable Jun 15, 2021

sundb mentioned this pull request Jun 16, 2021

Fix querybuf test failure #9091

Merged

This was referenced Jun 17, 2021

resize query buffer more accurately #5013

Merged

Make readQueryFromClient more aggressive when reading big arg again (Followup for #9003) #9100

Merged

sundb deleted the fix-querybuf-resize branch June 30, 2021 02:00

oranagra mentioned this pull request Oct 4, 2021

Release 6.2.6 #9583

Merged

oranagra mentioned this pull request Mar 15, 2022

optimize(remove) usage of client's pending_querybuf #10413

Merged

Conversation

sundb commented May 29, 2021 • edited by oranagra Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes

Uh oh!

oranagra commented May 29, 2021

Uh oh!

sundb commented May 29, 2021

Uh oh!

oranagra commented May 30, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sundb commented May 30, 2021

Uh oh!

oranagra commented May 30, 2021

Uh oh!

oranagra left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

oranagra commented Jun 3, 2021

Uh oh!

sundb commented Jun 3, 2021

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

oranagra commented Jun 13, 2021

Uh oh!

sundb commented Jun 15, 2021

Uh oh!

yoav-steinberg left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

oranagra left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

oranagra left a comment

Choose a reason for hiding this comment

Uh oh!

sundb commented Jun 15, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

sundb commented May 29, 2021 •

edited by oranagra

Loading

oranagra commented May 30, 2021 •

edited

Loading