Improve performance of `Ring.shuffleShard()` by charleskorn · Pull Request #281 · grafana/dskit

charleskorn · 2023-04-04T07:09:11Z

What this PR does:

This PR improves the performance of Ring.shuffleShard(), and specifically the evaluation of the list of all tokens in the shard. It also introduces a new benchmark for shuffle sharding, BenchmarkRing_ShuffleShard_LargeShardSize, which better mirrors the worst-case parameters we see in production Mimir cells.

There are two main optimisations:

to compute the per-zone lists of tokens, use a loser tree rather than a heap to merge per-host lists of tokens
to compute the overall list of tokens, merge the per-zone lists of tokens rather than merging the per-host lists of tokens, and use a simplified merge algorithm to perform this merge step (benchmarks show that it performs better than using either a heap or a loser tree when merging such a small number of lists)

The loser tree implementation is based on @bboreham's implementation in Loki, which we are relicensing as part of this PR. I've modified it to work on slices directly, rather than Sequence interfaces, as this improved overall shuffle shard computation performance by ~15%, and incorporated the fix in grafana/loki#9057.

Benchmark results:

goos: darwin
goarch: arm64
pkg: github.com/grafana/dskit/ring
                                                                          │  before.txt   │             final.txt              │
                                                                          │    sec/op     │   sec/op     vs base               │
Ring_ShuffleShard/num_instances_=_50,_num_zones_=_1,_shard_size_=_3-10       28.02µ ±  1%   17.82µ ± 0%  -36.41% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_50,_num_zones_=_1,_shard_size_=_10-10     105.36µ ±  0%   51.41µ ± 0%  -51.20% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_50,_num_zones_=_1,_shard_size_=_30-10      418.0µ ±  1%   159.6µ ± 0%  -61.81% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_50,_num_zones_=_3,_shard_size_=_3-10       48.00µ ±  1%   45.07µ ± 0%   -6.10% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_50,_num_zones_=_3,_shard_size_=_10-10     134.43µ ±  0%   86.19µ ± 1%  -35.89% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_50,_num_zones_=_3,_shard_size_=_30-10      381.7µ ±  1%   202.3µ ± 6%  -46.99% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_100,_num_zones_=_1,_shard_size_=_3-10      29.08µ ±  2%   18.31µ ± 2%  -37.04% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_100,_num_zones_=_1,_shard_size_=_10-10    106.16µ ±  2%   52.62µ ± 0%  -50.43% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_100,_num_zones_=_1,_shard_size_=_30-10     423.7µ ±  1%   161.1µ ± 1%  -61.99% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_100,_num_zones_=_3,_shard_size_=_3-10      48.99µ ±  1%   45.88µ ± 0%   -6.36% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_100,_num_zones_=_3,_shard_size_=_10-10    136.33µ ±  3%   87.78µ ± 1%  -35.61% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_100,_num_zones_=_3,_shard_size_=_30-10     385.8µ ±  0%   203.4µ ± 0%  -47.28% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_1000,_num_zones_=_1,_shard_size_=_3-10     26.77µ ±  2%   16.46µ ± 0%  -38.50% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_1000,_num_zones_=_1,_shard_size_=_10-10   103.64µ ±  0%   49.62µ ± 0%  -52.13% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_1000,_num_zones_=_1,_shard_size_=_30-10    418.8µ ± 32%   157.1µ ± 0%  -62.48% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_1000,_num_zones_=_3,_shard_size_=_3-10     45.33µ ±  0%   42.24µ ± 1%   -6.82% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_1000,_num_zones_=_3,_shard_size_=_10-10   131.38µ ±  1%   81.63µ ± 1%  -37.87% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_1000,_num_zones_=_3,_shard_size_=_30-10    378.6µ ±  0%   194.9µ ± 1%  -48.51% (p=0.002 n=6)
Ring_ShuffleShard_512Tokens-10                                               286.6µ ±  0%   170.8µ ± 0%  -40.40% (p=0.002 n=6)
Ring_ShuffleShard_LargeShardSize-10                                         22.624m ±  0%   7.956m ± 1%  -64.83% (p=0.002 n=6)
geomean                                                                      162.9µ         91.56µ       -43.81%

                                                                          │  before.txt   │              final.txt              │
                                                                          │     B/op      │     B/op      vs base               │
Ring_ShuffleShard/num_instances_=_50,_num_zones_=_1,_shard_size_=_3-10      11.039Ki ± 0%   9.391Ki ± 0%  -14.93% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_50,_num_zones_=_1,_shard_size_=_10-10      21.12Ki ± 0%   15.51Ki ± 0%  -26.56% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_50,_num_zones_=_1,_shard_size_=_30-10      51.46Ki ± 0%   34.21Ki ± 0%  -33.52% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_50,_num_zones_=_3,_shard_size_=_3-10       21.61Ki ± 0%   21.55Ki ± 0%   -0.25% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_50,_num_zones_=_3,_shard_size_=_10-10      33.49Ki ± 0%   33.03Ki ± 0%   -1.38% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_50,_num_zones_=_3,_shard_size_=_30-10      62.54Ki ± 0%   61.41Ki ± 0%   -1.79% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_100,_num_zones_=_1,_shard_size_=_3-10     11.039Ki ± 0%   9.391Ki ± 0%  -14.93% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_100,_num_zones_=_1,_shard_size_=_10-10     21.12Ki ± 0%   15.51Ki ± 0%  -26.56% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_100,_num_zones_=_1,_shard_size_=_30-10     51.46Ki ± 0%   34.21Ki ± 0%  -33.52% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_100,_num_zones_=_3,_shard_size_=_3-10      21.61Ki ± 0%   21.55Ki ± 0%   -0.25% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_100,_num_zones_=_3,_shard_size_=_10-10     33.49Ki ± 0%   33.03Ki ± 0%   -1.37% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_100,_num_zones_=_3,_shard_size_=_30-10     62.53Ki ± 0%   61.42Ki ± 0%   -1.79% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_1000,_num_zones_=_1,_shard_size_=_3-10    11.039Ki ± 0%   9.391Ki ± 0%  -14.93% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_1000,_num_zones_=_1,_shard_size_=_10-10    21.12Ki ± 0%   15.51Ki ± 0%  -26.56% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_1000,_num_zones_=_1,_shard_size_=_30-10    51.46Ki ± 0%   34.21Ki ± 0%  -33.52% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_1000,_num_zones_=_3,_shard_size_=_3-10     21.61Ki ± 0%   21.55Ki ± 0%   -0.25% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_1000,_num_zones_=_3,_shard_size_=_10-10    33.49Ki ± 0%   33.03Ki ± 0%   -1.37% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_1000,_num_zones_=_3,_shard_size_=_30-10    62.54Ki ± 0%   61.42Ki ± 0%   -1.78% (p=0.002 n=6)
Ring_ShuffleShard_512Tokens-10                                               56.92Ki ± 0%   56.56Ki ± 0%   -0.63% (p=0.002 n=6)
Ring_ShuffleShard_LargeShardSize-10                                          1.203Mi ± 0%   1.193Mi ± 0%   -0.80% (p=0.002 n=6)
geomean                                                                      35.69Ki        31.10Ki       -12.86%

                                                                          │ before.txt  │             final.txt             │
                                                                          │  allocs/op  │ allocs/op   vs base               │
Ring_ShuffleShard/num_instances_=_50,_num_zones_=_1,_shard_size_=_3-10       32.00 ± 0%   22.00 ± 0%  -31.25% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_50,_num_zones_=_1,_shard_size_=_10-10      62.00 ± 0%   31.00 ± 0%  -50.00% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_50,_num_zones_=_1,_shard_size_=_30-10     143.00 ± 0%   52.00 ± 0%  -63.64% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_50,_num_zones_=_3,_shard_size_=_3-10       44.00 ± 0%   37.00 ± 0%  -15.91% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_50,_num_zones_=_3,_shard_size_=_10-10      86.00 ± 0%   52.00 ± 0%  -39.53% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_50,_num_zones_=_3,_shard_size_=_30-10     164.00 ± 0%   76.00 ± 0%  -53.66% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_100,_num_zones_=_1,_shard_size_=_3-10      32.00 ± 0%   22.00 ± 0%  -31.25% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_100,_num_zones_=_1,_shard_size_=_10-10     62.00 ± 0%   31.00 ± 0%  -50.00% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_100,_num_zones_=_1,_shard_size_=_30-10    143.00 ± 0%   52.00 ± 0%  -63.64% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_100,_num_zones_=_3,_shard_size_=_3-10      44.00 ± 0%   37.00 ± 0%  -15.91% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_100,_num_zones_=_3,_shard_size_=_10-10     86.00 ± 0%   52.00 ± 0%  -39.53% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_100,_num_zones_=_3,_shard_size_=_30-10    164.00 ± 0%   76.00 ± 0%  -53.66% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_1000,_num_zones_=_1,_shard_size_=_3-10     32.00 ± 0%   22.00 ± 0%  -31.25% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_1000,_num_zones_=_1,_shard_size_=_10-10    62.00 ± 0%   31.00 ± 0%  -50.00% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_1000,_num_zones_=_1,_shard_size_=_30-10   143.00 ± 0%   52.00 ± 0%  -63.64% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_1000,_num_zones_=_3,_shard_size_=_3-10     44.00 ± 0%   37.00 ± 0%  -15.91% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_1000,_num_zones_=_3,_shard_size_=_10-10    86.00 ± 0%   52.00 ± 0%  -39.53% (p=0.002 n=6)
Ring_ShuffleShard/num_instances_=_1000,_num_zones_=_3,_shard_size_=_30-10   164.00 ± 0%   76.00 ± 0%  -53.66% (p=0.002 n=6)
Ring_ShuffleShard_512Tokens-10                                               74.00 ± 0%   49.00 ± 0%  -33.78% (p=0.002 n=6)
Ring_ShuffleShard_LargeShardSize-10                                         1134.0 ± 0%   326.0 ± 0%  -71.25% (p=0.002 n=6)
geomean                                                                      85.71        46.49       -45.76%

Which issue(s) this PR fixes:

(none)

Checklist

[n/a] Tests updated
CHANGELOG.md updated - the order of entries should be [CHANGE], [FEATURE], [ENHANCEMENT], [BUGFIX]

charleskorn · 2023-04-04T07:16:32Z

ring/ring.go

+		}
+
+		if !haveSeenGroupWithRemainingToken {
+			return merged


Note to reviewers: this should never happen, but I couldn't think of a nicer way to handle this - open to suggestions.

This reduces CPU time for the shuffle sharding benchmarks by ~15%.

This saves ~5-7% of CPU time compared to using a loser tree for three groups.

pracucci

I'm assuming the loser tree implementation is correct, given it's getting copied from Loki and I also assume the fuzz test has been run manually given it doesn't run in CI. Based on these assumptions, I've checked the rest of the code and LGTM (modulo a couple of nits).

ring/ring.go

charleskorn · 2023-04-12T03:46:49Z

I'm assuming the loser tree implementation is correct, given it's getting copied from Loki

I have modified it slightly to not use an interface, but the tests exercise the same test cases, so I believe this hasn't broken anything.

I also assume the fuzz test has been run manually given it doesn't run in CI.

Yep.

pracucci

LGTM, thanks!

This brings in grafana/dskit#280 and grafana/dskit#281.

* Upgrade to latest dskit version. This brings in grafana/dskit#280 and grafana/dskit#281. * Add changelog entry.

Upgrades to a newer version (v1.53.0) of google.golang.org/grpc that includes breaking changes to the resolver.Target type (see Cleanup usages of resolver.Target.Endpoint grpc/grpc-go#5796) Upgrade from v2.4.0 to v.4.0.0 of https://github.com/sercand/kuberesolver which is compatible to grpc to v1.53.0.

charleskorn commented Apr 4, 2023

View reviewed changes

charleskorn marked this pull request as ready for review April 4, 2023 07:18

charleskorn added 4 commits April 5, 2023 13:27

Add benchmark to replicate conditions in prod cells.

2181bde

Improve performance of Ring.shuffleShard().

40abe0e

Fix linting warning.

4a38be4

Add changelog entry.

60e2e2a

charleskorn force-pushed the charleskorn/improve-shuffleshard-performance branch from 39c479a to 60e2e2a Compare April 5, 2023 03:28

charleskorn added 6 commits April 5, 2023 14:30

Initial import of loser tree implementation from Loki.

da37dcc

Remove unused method.

ca7d8ff

Initial naive implementation with loser trees.

f4fc326

Use loser trees to merge groups as well.

b91c2f2

Remove Sequence type from loser tree implementation.

fbcc909

This reduces CPU time for the shuffle sharding benchmarks by ~15%.

Reintroduce specialised code for merging groups together.

b028530

This saves ~5-7% of CPU time compared to using a loser tree for three groups.

charleskorn marked this pull request as draft April 5, 2023 06:04

charleskorn added 6 commits April 5, 2023 16:35

Fix issue where loser tree does not include values equal to maximum.

d68acf6

Add further test cases and fuzz test for loser tree merging.

f80225a

Clarify test case.

2c602fb

Bring in Bryan's changes from grafana/loki#9057.

80b3192

Rename benchmark.

d48dd8c

Sort imports.

fb36dad

charleskorn marked this pull request as ready for review April 11, 2023 04:40

charleskorn requested review from bboreham and pracucci April 11, 2023 04:41

pracucci approved these changes Apr 11, 2023

View reviewed changes

ring/ring.go Show resolved Hide resolved

ring/ring.go Outdated Show resolved Hide resolved

ring/ring.go Show resolved Hide resolved

charleskorn added 2 commits April 12, 2023 13:34

Address PR feedback: clarify logic.

888fe15

Address PR feedback: add tests for mergeTokenGroups.

6a915cd

pracucci approved these changes Apr 12, 2023

View reviewed changes

charleskorn merged commit 86cce08 into main Apr 12, 2023

charleskorn deleted the charleskorn/improve-shuffleshard-performance branch April 12, 2023 07:20

charleskorn added a commit to grafana/mimir that referenced this pull request Apr 12, 2023

Upgrade to latest dskit version.

5b8b533

This brings in grafana/dskit#280 and grafana/dskit#281.

charleskorn mentioned this pull request Apr 12, 2023

Upgrade to latest dskit version grafana/mimir#4711

Merged

1 task

charleskorn added a commit to grafana/mimir that referenced this pull request Apr 12, 2023

Upgrade to latest dskit version (#4711)

4f4738d

* Upgrade to latest dskit version. This brings in grafana/dskit#280 and grafana/dskit#281. * Add changelog entry.

charleskorn mentioned this pull request Apr 14, 2023

Cache subrings returned by Ring.ShuffleShardWithLookback() #283

Merged

2 tasks

bboreham mentioned this pull request Sep 5, 2023

loser-tree: add sequence abstraction #376

Draft

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve performance of `Ring.shuffleShard()`#281

Improve performance of `Ring.shuffleShard()`#281
charleskorn merged 18 commits intomainfrom
charleskorn/improve-shuffleshard-performance

charleskorn commented Apr 4, 2023 •

edited

Loading

Uh oh!

charleskorn Apr 4, 2023

Uh oh!

pracucci left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

charleskorn commented Apr 12, 2023 •

edited

Loading

Uh oh!

pracucci left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

charleskorn commented Apr 4, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

charleskorn Apr 4, 2023

Choose a reason for hiding this comment

Uh oh!

pracucci left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

charleskorn commented Apr 12, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pracucci left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

charleskorn commented Apr 4, 2023 •

edited

Loading

charleskorn commented Apr 12, 2023 •

edited

Loading