Sped up Element._sort #104103

gaaclarke · 2022-05-18T17:48:18Z

Redid the logic for Element._sort to reduce operations:

metric      before   after
branches    4        3
getters     8        5
ops         6        3

This will make the biggest difference with hot reload.
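
For illustration, the kind of reduction described above can be sketched roughly as follows. This is a minimal standalone sketch, not the code from the PR: `Node`, `compareBefore`, and `compareAfter` are hypothetical names, and the "before" version is only an assumption about the general shape of a straightforward comparator (order by depth, then non-dirty before dirty).

```dart
// Hypothetical stand-in for Element: only the two properties the
// comparator reads.
class Node {
  Node(this.depth, this.dirty);
  final int depth;
  final bool dirty;
}

// Assumed shape of a straightforward comparator: four separate checks,
// each re-reading the getters it needs.
int compareBefore(Node a, Node b) {
  if (a.depth < b.depth) return -1;
  if (b.depth < a.depth) return 1;
  if (b.dirty && !a.dirty) return -1;
  if (a.dirty && !b.dirty) return 1;
  return 0;
}

// Reduced version: one subtraction settles the depth ordering, and one
// inequality test settles the dirty ordering.
int compareAfter(Node a, Node b) {
  final int diff = a.depth - b.depth;
  if (diff != 0) return diff;
  final bool isBDirty = b.dirty;
  if (a.dirty != isBDirty) return isBDirty ? -1 : 1;
  return 0;
}
```

The two versions can return different magnitudes (a depth difference rather than -1 or 1), but the sign is the same, which is all a sort comparator needs.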

Here is the profile of rebuilding a large app over and over again, which is what led me to look at Element._sort:

[Screenshot: profile of the rebuild benchmark showing Element._sort]

Pre-launch Checklist

I read the Contributor Guide and followed the process outlined there for submitting PRs.
I read the Tree Hygiene wiki page, which explains my responsibilities.
I read and followed the Flutter Style Guide, including Features we expect every widget to implement.
I signed the CLA.
I listed at least one issue that this PR fixes in the description above.
I updated/added relevant documentation (doc comments with ///).
I added new tests to check the change I am making, or this PR is test-exempt.
All existing and new tests are passing.

If you need help, consider asking for advice on the #hackers-new channel on Discord.

flutter-dashboard · 2022-05-18T17:53:07Z

It looks like this pull request may not have tests. Please make sure to add tests before merging. If you need an exemption to this rule, contact Hixie on the #hackers channel in Chat (don't just cc him here, he won't see it! He's on Discord!).

If you are not sure if you need tests, consider this rule of thumb: the purpose of a test is to make sure someone doesn't accidentally revert the fix. Ask yourself, is there anything in your PR that you feel it is important we not accidentally revert back to how it was before your fix?

Reviewers: Read the Tree Hygiene page and make sure this patch meets those guidelines before LGTMing.

gaaclarke · 2022-05-18T17:53:32Z

I'm not sure if we want a benchmark beyond the ones we already have. A hot reload benchmark would probably capture this best.

jonahwilliams

How much does this speed things up on the rebuild benchmarks?

gaaclarke · 2022-05-18T18:25:22Z

> How much does this speed things up on the rebuild benchmarks?

I have a number, but I'm not sure how much I trust it until I've played around with the new benchmark a bit more. It can be demonstrated that no path through this code as it was originally written performs fewer operations than the new version.

gaaclarke · 2022-05-18T19:37:04Z

I wrote a microbenchmark for this, but the results showed only a 1% improvement. I suspect the slowness is elsewhere in Dart (like List.operator[]=, or function invocation). I can add the microbenchmark if you want, but I don't think it's worth the effort when we can demonstrate logically that it is faster and that it is actually a hot spot for some usages of the framework.

jonahwilliams · 2022-05-18T19:46:30Z

what if you make Element.depth not late?

gaaclarke · 2022-05-18T20:01:09Z

> what if you make Element.depth not late?

In my microbenchmarks I was comparing without _depth being late: https://gist.github.com/gaaclarke/98ab1e38fcea413693afbb2f8a9e0f60

gaaclarke · 2022-05-18T20:43:56Z

(added screenshot in description that shows Element._sort showing up in the rebuild benchmark profile)

jonahwilliams · 2022-05-18T20:59:32Z

sorry I'm not really following, did you see a performance improvement from this change?

gaaclarke · 2022-05-18T21:30:44Z

> sorry I'm not really following, did you see a performance improvement from this change?

Yes, the linked microbenchmark does show an improvement. It is small, however, to the point where I didn't run statistics on it, since we can evaluate the code's performance logically. Is there a doubt that it is faster? Let me know what your concern is and I can address it better. We can look at the generated assembly if you want, or I can just explain it.

goderbauer · 2022-05-18T22:30:34Z

This does reduce the readability of the code a lot. :/

If this is actually proven faster, there should probably be a comment in the code explaining what this is doing.

gaaclarke · 2022-05-18T22:30:59Z

I've tweaked the rebuild benchmark to avoid scheduling frames, and rebuild time goes from 5336.5437 us to 5220.361 us, a 2% improvement. I'll make a separate CL for the change to the benchmark.

bernaferrari · 2022-05-18T22:38:59Z

> This does reduce the readability of the code a lot. :/

If there were a Wikipedia URL for the technique you are using (this is probably not the first time someone has used ^ for sorting), that might be more useful.

gaaclarke · 2022-05-18T22:42:46Z

> This does reduce the readability of the code a lot. :/
>
> If this is actually proven faster, there should probably be a comment in the code explaining what this is doing.

I added comments. I think that clears it up. It isn't really a trick, it's just that people don't xor often.

jonahwilliams · 2022-05-19T17:08:47Z

I don't really think this sort of change is worth landing. I think it's OK if we were speeding up something that is slow, or if we didn't speed something up but made it easier to read or simpler. But in this case we're rewriting code that isn't really on any critical path, and the rewrite doesn't make it any clearer. What is the goal here?

packages/flutter/lib/src/widgets/framework.dart

gaaclarke · 2022-05-19T18:26:23Z

Talked to @jonahwilliams offline, he's fine with the change as long as @goderbauer is happy and the comments make up for any deficiencies in clarity of the code.

bernaferrari · 2022-05-19T18:33:11Z

packages/flutter/lib/src/widgets/framework.dart

+    // If the `dirty` values are not equal, sort with non-dirty elements being
+    // less than dirty elements.
+    final bool isBDirty = b.dirty;
+    if (a.dirty ^ isBDirty) {
+      return isBDirty ? -1 : 1;
+    }


> sort with non-dirty elements being less than dirty elements.

I can kiiiiind of understand what you mean, but it's still kind of hard to follow lol: the comment says to sort with non-dirty elements being less than dirty elements, and then you have to look at isBDirty and the xor.

BTW, is isBDirty really needed?

b.dirty is a method call, so the compiler can't optimize away the second call because it can't know whether dirty always returns the same value.
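
A minimal sketch of why the local variable matters, assuming a hypothetical class hierarchy (`Base`, `Counting`, `readTwice`, and `readOnce` are illustrative names, not from the PR). Because a getter can be overridden, each read is a call, and nothing in general guarantees that two calls return the same value:

```dart
// Hypothetical base class with an overridable getter, standing in for Element.
class Base {
  bool get dirty => false;
}

// A subclass whose getter has a visible side effect: it counts its own calls.
class Counting extends Base {
  int reads = 0;

  @override
  bool get dirty {
    reads += 1;
    return reads.isOdd; // The answer alternates between reads.
  }
}

// Reads b.dirty twice; both calls must actually happen.
int readTwice(Base a, Base b) {
  if (a.dirty != b.dirty) {
    return b.dirty ? -1 : 1;
  }
  return 0;
}

// Reads b.dirty once and reuses the local.
int readOnce(Base a, Base b) {
  final bool isBDirty = b.dirty;
  if (a.dirty != isBDirty) {
    return isBDirty ? -1 : 1;
  }
  return 0;
}
```

With a `Counting` instance as `b`, `readTwice` invokes the getter twice and `readOnce` invokes it once; the two functions can even disagree on the result, which is why the second read cannot simply be assumed away.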

gaaclarke · 2022-05-20T16:08:12Z

I switched the xor for not-equals; they are functionally equivalent, and I think this removes a lot of the confusion in the code.
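
The equivalence can be checked exhaustively, since there are only four input combinations. A minimal sketch (the function name `xorMatchesNotEquals` is hypothetical):

```dart
// Returns true if `^` and `!=` agree on every pair of bool inputs.
bool xorMatchesNotEquals() {
  const List<bool> values = <bool>[false, true];
  for (final bool a in values) {
    for (final bool b in values) {
      if ((a ^ b) != (a != b)) {
        return false;
      }
    }
  }
  return true;
}
```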

goderbauer · 2022-05-20T17:41:30Z

I have the same questions that Jonah raised in #104103 (comment). Have those been answered somewhere?

gaaclarke · 2022-05-20T23:28:25Z

> I have the same questions that Jonah raised in #104103 (comment). Have those been answered somewhere?

@goderbauer Yes, we had a discussion offline where he agreed it is worth merging as long as your concerns about making it clear are addressed (mentioned here: #104103 (comment)). I hate to speak for him, so feel free to correct me if you want, @jonahwilliams. I imagine those concerns are even less now that I've removed the xor. It doesn't affect clarity at all.

This function is called thousands of times (n log n, where n is the number of Elements) when we do hot reload. The work was done to make it faster. It would be a shame to throw out the savings that can be demonstrated by looking at the generated code, counting operations, or running benchmarks, just because it doesn't look how we'd first think about it.
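
To see how quickly comparator calls add up, one can count them directly. This is a rough sketch under stated assumptions: `countComparisons` and `scrambled` are hypothetical helpers, and plain integers stand in for element depths.

```dart
// Hypothetical helper: sorts a copy of `depths` and reports how many times
// the comparator ran.
int countComparisons(List<int> depths) {
  int calls = 0;
  final List<int> copy = List<int>.of(depths);
  copy.sort((int a, int b) {
    calls += 1;
    return a - b;
  });
  return calls;
}

// A deterministic, unsorted input of n values.
List<int> scrambled(int n) =>
    List<int>.generate(n, (int i) => (i * 7919) % n);
```

Calling `countComparisons(scrambled(n))` for increasing `n` shows the count growing faster than the number of elements, which is why small per-call savings compound.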

jonahwilliams · 2022-05-20T23:56:38Z

I have a very high tolerance for "unreadable code" so I don't feel comfortable making the call on whether or not this is worth it. I agree with Aaron that lots of small changes do add up, but of course there is always the risk we make something worse too 🤣

goderbauer · 2022-05-21T00:07:03Z

My readability concerns are mostly addressed.

> This function is called thousands of times (n log n, where n is the number of Elements) when we do hot reload. The work was done to make it faster. It would be a shame to throw out the savings that can be demonstrated by looking at the generated code, counting operations, or running benchmarks, just because it doesn't look how we'd first think about it.

So, there are benchmarks showing that this is faster? What are the numbers?

gaaclarke · 2022-05-23T16:57:43Z

> So, there are benchmarks showing that this is faster? What are the numbers?

@goderbauer With the following microbenchmark it is 27% faster on arm64 iOS profile builds. This is the minimum benefit, since Element uses polymorphism and the benchmark uses the fastest possible implementation of the getters: just a straight field. I didn't think checking in a microbenchmark in this case was worth the maintenance, since it's clearly faster (to my eyes).

// Copyright 2014 The Flutter Authors. All rights reserved.
// Use of this source code is governed by a BSD-style license that can be
// found in the LICENSE file.

import 'dart:developer';

import 'package:flutter/material.dart';

import '../common.dart';

const int _kNumIterations = 100000;

class _Widget extends Widget {
  @override
  Element createElement() {
    throw UnimplementedError();
  }
}

class _Element extends Element {
  _Element(this.depth, this.dirty) : super(_Widget());

  @override
  final int depth;

  @override
  final bool dirty;

  @override
  bool get debugDoingBuild => throw UnimplementedError();

  @override
  void performRebuild() {}
}

void main() {
  assert(false,
      "Don't run benchmarks in debug mode! Use 'flutter run --release'.");
  final BenchmarkResultPrinter printer = BenchmarkResultPrinter();

  final Element a = _Element(0, false);
  final Element b = _Element(1, false);
  final Element c = _Element(0, true);
  final Element d = _Element(1, true);

  final Stopwatch watch = Stopwatch();
  int tally = 0;
  watch.start();
  for (int i = 0; i < _kNumIterations; i += 1) {
    tally += Element.sort(a, a);
    tally += Element.sort(a, b);
    tally += Element.sort(a, c);
    tally += Element.sort(a, d);
    tally += Element.sort(b, c);
    tally += Element.sort(b, d);
    tally += Element.sort(c, d);
  }
  watch.stop();
  if (tally < 0) {
    print("this shouldn't happen.");
  }

  printer.addResult(
    description: 'Element.sort',
    value: watch.elapsedMicroseconds.toDouble() / _kNumIterations,
    unit: 'us per iteration',
    name: 'element_sort',
  );

  printer.printToStdout();
}

goderbauer

LGTM

Thanks for getting some numbers for this!

Sped up Element._sort

1148030

flutter-dashboard bot added the framework label May 18, 2022

gaaclarke marked this pull request as ready for review May 18, 2022 17:53

gaaclarke requested review from dnfield and jonahwilliams May 18, 2022 17:53

jonahwilliams reviewed May 18, 2022

Added comments.

e1ce399

gaaclarke requested a review from jonahwilliams May 19, 2022 17:02

jonahwilliams reviewed May 19, 2022

updated comments

e0ed1a3

bernaferrari reviewed May 19, 2022

switched xor for !=

142da24

gaaclarke requested a review from goderbauer May 20, 2022 16:08

goderbauer approved these changes May 23, 2022

gaaclarke added the waiting for tree to go green label May 23, 2022

fluttergithubbot merged commit 41c6063 into flutter:master May 23, 2022

engine-flutter-autoroll mentioned this pull request May 23, 2022

Roll Flutter from ec20ea80ad98 to 41c606335646 (1 revision) flutter/plugins#5809

Closed

engine-flutter-autoroll added a commit to engine-flutter-autoroll/plugins that referenced this pull request May 23, 2022

41c6063 Sped up Element._sort (flutter/flutter#104103)

b3f2b6d

engine-flutter-autoroll mentioned this pull request May 23, 2022

Roll Flutter from ec20ea80ad98 to 41c606335646 (1 revision) flutter/packages#2041

Merged

engine-flutter-autoroll added a commit to engine-flutter-autoroll/packages that referenced this pull request May 23, 2022

41c6063 Sped up Element._sort (flutter/flutter#104103)

1914604

engine-flutter-autoroll mentioned this pull request May 23, 2022

Roll Flutter from ec20ea80ad98 to 0015ed2b7541 (2 revisions) flutter/plugins#5810

Closed

engine-flutter-autoroll added a commit to engine-flutter-autoroll/plugins that referenced this pull request May 23, 2022

41c6063 Sped up Element._sort (flutter/flutter#104103)

b44112b

engine-flutter-autoroll mentioned this pull request May 23, 2022

Roll Flutter from ec20ea80ad98 to 7ece8f9f9435 (3 revisions) flutter/plugins#5813

Merged

engine-flutter-autoroll added a commit to engine-flutter-autoroll/plugins that referenced this pull request May 23, 2022

41c6063 Sped up Element._sort (flutter/flutter#104103)

be8ee4f

camsim99 pushed a commit to camsim99/flutter that referenced this pull request Aug 10, 2022

Sped up Element._sort (flutter#104103)

252b940

engine-flutter-autoroll added a commit to engine-flutter-autoroll/packages that referenced this pull request Aug 30, 2022

41c6063 Sped up Element._sort (flutter/flutter#104103)

cabaf65

engine-flutter-autoroll added a commit to engine-flutter-autoroll/plugins that referenced this pull request Aug 30, 2022

41c6063 Sped up Element._sort (flutter/flutter#104103)

53f06c7
