Add benchmarks for tracking relayout boundaries #150539

gnprice · 2024-06-20T05:25:53Z

Fixes part of #150524.

As part of #150524, I'll be sending a couple of PRs that simplify and reduce the bookkeeping we do for tracking relayout boundaries. To measure the performance effect of that reduced bookkeeping, I wrote some microbenchmarks that exercise that tracking.

This PR adds the microbenchmarks. That should give them a clean baseline, before the PRs that change how the bookkeeping works and are meant to change the results of the microbenchmarks.

The "reparenting accordion" benchmark is a bit pathological: it involves N GlobalKeys all getting reparented on each frame. It takes quadratic time, and I think we're OK with that. Still, constant-factor improvements and regressions there seem worth knowing about, because real apps may contain smaller versions of the same pattern.

Pre-launch Checklist

I read the Contributor Guide and followed the process outlined there for submitting PRs.
I read the Tree Hygiene wiki page, which explains my responsibilities.
I read and followed the Flutter Style Guide, including Features we expect every widget to implement.
I signed the CLA.
I listed at least one issue that this PR fixes in the description above.
I updated/added relevant documentation (doc comments with ///).
I added new tests to check the change I am making, or this PR is test-exempt.
I followed the breaking change policy and added Data Driven Fixes where supported.
All existing and new tests are passing.

If you need help, consider asking for advice on the #hackers-new channel on Discord.

Fixes part of flutter#150524. As part of flutter#150524, I'll be sending a couple of PRs that simplify and reduce the bookkeeping we do for tracking relayout boundaries. To measure the performance effect of that reduced bookkeeping, I wrote some microbenchmarks that exercise that tracking. This PR adds the microbenchmarks. That should give them a clean baseline, before the PRs that change how the bookkeeping works and are meant to change the results of the microbenchmarks. The "reparenting accordion" benchmark is a bit pathological: it involves N `GlobalKey`s all getting reparented on each frame. It takes quadratic time, and I think we're OK with that. Still, constant-factor improvements and regressions there seem worth knowing about, because real apps may contain smaller versions of the same pattern.

gnprice · 2024-06-20T06:44:36Z

The failures in Mac build_ios_framework_module_test and Mac tool_integration_tests_1_4 were timeouts, and seem like they shouldn't be related — those checks don't sound like they run benchmarks. Re-running those checks.

LongCatIsLooong

As a microbenchmark for relayout boundaries, I feel testing using widgets will be way too noisy. Clearing the relayout boundary for each node takes no more than a few clockcycles I would assume, but reparenting an Element tree is much much more expensive than that.

LongCatIsLooong · 2024-06-26T21:02:07Z

I guess you could do the same benchmark tests using the rendering layer only, by putting markNeedsDirty and flushLayout in a tight loop. I believe ultimately it's the overall layout performance that we care about, so existing layout benchmarks should reflect the performance impact of the layout boundary implementation change you'd like to make?

gnprice · 2024-06-26T21:12:54Z

Sure, I can try reworking these to be just in terms of render objects with no widgets. (In your second comment, do you mean markNeedsLayout, or is there something elsewhere I'm missing?)

That should make the benchmarks even more micro-, so I expect it'll make the measured speedups from my upcoming changes even steeper. I guess I went for widgets initially because I didn't know if the even-more-micro-benchmark version might be seen as less realistic, and so less convincing.

gnprice · 2024-06-26T21:14:11Z

Which existing benchmarks are the ones you'd look at for evaluating a layout change?

Last week I tried looking at the benchmark results in Skia Perf, but found it hard to browse among different benchmarks.

LongCatIsLooong · 2024-06-27T01:01:04Z

do you mean markNeedsLayout

Ah sorry yeah I meant markNeedsLayout

Which existing benchmarks are the ones you'd look at for evaluating a layout change?

Something like microbenchmarks/lib/stocks/layout_bench.dart (https://flutter-flutter-perf.skia.org/e/?begin=1719363564&end=1719449964&queries=sub_result%3Dstock_layout_iteration)? It's kinda macro-y to me.

I think it's probably justifiable to land the relayout boundary change with no additional benchmarks, since it would remove the treewalk no?

gnprice · 2024-06-27T06:35:11Z

Cool, thanks. I just tried that benchmark on my draft changes, and any effect seemed to be within the noise (it varied a few percent from run to run, due perhaps to random apps running background tasks on my phone). On the microbenchmarks in this PR, the effect is much bigger, like 40% for relayout_boundary_toggle and 25-30% for the other two.

As you say, clearing (or updating) the relayout boundary for each node should be very fast, so I expect the speedup to only be material when there's a large same-relayout-boundary subtree that the current logic has to sweep through. Also only when it actually does have to do that sweep — so when the relayout boundary changes, or that subtree is removed from the render tree. I'm not sure if the stocks benchmark exercises those cases, but if it does it probably just doesn't have any really large same-relayout-boundary subtrees.

I think it's probably justifiable to land the relayout boundary change with no additional benchmarks, since it would remove the treewalk no?

Yeah, I wouldn't disagree. Indeed it removes the treewalks — the two in layout when the relayout boundary changes, and then the one in dropChild. So just from reading the changes, even without any benchmark results, one would expect it certainly ought to be a speedup, even if usually a small one in practice.

I think converting these to pure rendering-layer terms would be a fun exercise for me, though, so I'll make a go of it even if only for my own edification. But it might be a few days before I get to that.

I believe they'd be the first examples in the tree of a benchmark of that kind, too. So that might be a bonus reason to land at least one of them.

gnprice · 2024-08-20T22:44:35Z

Status update: this is in the same state as the related #150905:

I'm still planning to return to this. The summer has been a busy time, but I'm hoping to pick it back up in September or October.

Piinks · 2024-10-22T22:44:13Z

(PR triage): Since we have not heard back on this or #150905 I am going to close this to remove it from the review queue. I have added the framework label though, so it does end up in the right team review queue if you decide to reopen it. If you'd like to reopen these PRs, but not have them in the review queue, please mark them as a draft.
Thanks!

gnprice mentioned this pull request Jun 20, 2024

Microbenchmarks flaky; "Null check operator used on a null value" #150542

Closed

gnprice requested review from LongCatIsLooong and goderbauer June 25, 2024 18:18

LongCatIsLooong reviewed Jun 26, 2024

View reviewed changes

This was referenced Jun 27, 2024

_relayoutBoundary is more complex than it could be, and undocumented #150524

Open

Reduce relayout-boundary tracking to a bool-or-null per RenderObject #150905

Closed

Piinks closed this Oct 22, 2024

Piinks added framework flutter/packages/flutter repository. See also f: labels. team: benchmark Performance issues found by inspecting benchmarks labels Oct 22, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add benchmarks for tracking relayout boundaries #150539

Add benchmarks for tracking relayout boundaries #150539

Uh oh!

gnprice commented Jun 20, 2024

Uh oh!

gnprice commented Jun 20, 2024

Uh oh!

LongCatIsLooong left a comment

Uh oh!

LongCatIsLooong commented Jun 26, 2024

Uh oh!

gnprice commented Jun 26, 2024

Uh oh!

gnprice commented Jun 26, 2024

Uh oh!

LongCatIsLooong commented Jun 27, 2024 •

edited

Loading

Uh oh!

gnprice commented Jun 27, 2024

Uh oh!

gnprice commented Aug 20, 2024

Uh oh!

Piinks commented Oct 22, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Add benchmarks for tracking relayout boundaries #150539

Add benchmarks for tracking relayout boundaries #150539

Uh oh!

Conversation

gnprice commented Jun 20, 2024

Pre-launch Checklist

Uh oh!

gnprice commented Jun 20, 2024

Uh oh!

LongCatIsLooong left a comment

Choose a reason for hiding this comment

Uh oh!

LongCatIsLooong commented Jun 26, 2024

Uh oh!

gnprice commented Jun 26, 2024

Uh oh!

gnprice commented Jun 26, 2024

Uh oh!

LongCatIsLooong commented Jun 27, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gnprice commented Jun 27, 2024

Uh oh!

gnprice commented Aug 20, 2024

Uh oh!

Piinks commented Oct 22, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

LongCatIsLooong commented Jun 27, 2024 •

edited

Loading