Improve UI-thread animation performance on busy screens (~10x reduction depending on your use-case) #138481
Conversation
@gaaclarke really interesting PR; it seems related to some of the small changes you make that radically improve things.
I am a little wary of changing the behavior here. This is used in lots and lots of places. It looks like the change is not failing any customer tests, which is a good start. I'll see if I can run Google's internal tests on this so we can get a better understanding of how breaking this change would be.
I scrolled through the data and I can't see a clear pattern. It seems to be a wash: sometimes the new implementation is faster, sometimes it's slower. I am actually not sure whether this is a clear win.
Appreciated. I agree that we need to respect Hyrum's law here. That said, the implemented semantics seem rather ad-hoc. For multiple registrations it is virtually impossible to employ the semantics correctly, especially when listeners mutate the list of listeners. I'm also not convinced that the updated semantics are great either; I'd say they're about as bad. Personally, I'd argue that multiple registrations should be rejected (at least for the animations use-case). For a listener it's trivial to run code N times w/o having to register itself N times. However, this discussion probably goes beyond the scope of this PR.
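As a minimal illustration of that last point (a Python toy model, not Flutter code; all names here are made up): a listener that wants to run N times per notification doesn't need N registrations, so rejecting duplicates loses no expressive power:

```python
class Notifier:
    """Toy listener registry modeling the old behavior: duplicate
    registrations are allowed and a listener fires once per registration."""

    def __init__(self):
        self._listeners = []

    def add_listener(self, fn):
        self._listeners.append(fn)

    def notify(self):
        # Iterate over a copy so listeners may mutate the registry.
        for fn in list(self._listeners):
            fn()


calls = []
n = Notifier()
listener = lambda: calls.append("tick")
n.add_listener(listener)
n.add_listener(listener)   # duplicate registration
n.notify()                 # fires twice
assert calls == ["tick", "tick"]

# Equivalent effect with a single registration: the listener itself
# simply repeats its work N times.
calls.clear()
m = Notifier()
m.add_listener(lambda: [calls.append("tick") for _ in range(2)])
m.notify()
assert calls == ["tick", "tick"]
```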
Yes and no, but really yes :). It's a bit hard to see given the various dimensions and quite a bit of trial-by-trial variation (I'm going to blame GC and my notebook's thermals :) ). Note that when the new cases are flagged as slower, it's only by a very small percentage. In most cases it's just the same +/- noise; I merely picked a conservative threshold for labeling as SLOWER vs SAME. In these cases, looking at the absolute runtime can also provide a sense of how noisy the measurement may be. I did share the benchmark code, so you can simply run it yourself (the script has zero deps). Let me try to break it down: There are 3 comparisons:
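The SLOWER/SAME/FASTER labeling described above boils down to a runtime ratio with a noise threshold. A sketch in Python (the 5% threshold is an assumption for illustration, not necessarily the one the benchmark script uses), checked against three lines from the reported results:

```python
def classify(old_s, new_s, threshold=0.05):
    """Label a comparison, treating ratios within +/- threshold as noise.
    The 5% threshold is an illustrative assumption."""
    ratio = old_s / new_s
    if ratio > 1 + threshold:
        return "FASTER"
    if ratio < 1 - threshold:
        return "SLOWER"
    return "SAME"


# Values taken from the benchmark output quoted later in this thread:
assert classify(1.373926, 1.376916) == "SAME"     # "takes: 1.0x, saved: -0%"
assert classify(22.614892, 0.536188) == "FASTER"  # "takes: 42.2x, saved: 98%"
assert classify(1.375987, 2.254769) == "SLOWER"   # "takes: 0.6x, saved: -64%"
```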
Then we have two more dimensions. The changes covered by 1. and 2. are strict improvements: they don't change any semantics, which is why I pulled them out into a separate commit. They should be fairly straightforward. I pulled the `toList()` case out into a separate benchmark to make this clearer. (With N=1000 it's probably a lot more noisy than N=10, due to its significantly shorter runtime; I only wanted to spend so much time tweaking the iteration numbers for every combo.) "3." covers the changes within the second commit, i.e. moving from ObserverList to HashedObserverList for animations. This is where the real meat is, i.e. 100+x improvements depending on the number of listeners. At the end of the day it's just array vs hash map, no magic involved. My assumption is that whoever wrote the current implementation assumed that AnimationControllers would only drive a small, static set of animations. In those cases, sure, the array is slightly faster. In the general case (a changing, not necessarily small set of Animations), the array is atrocious.
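The "array vs hash map" point can be sketched with a small Python model (Python stands in for the Dart code here; a dict plays the role of Dart's insertion-ordered LinkedHashSet): draining an array-backed registry costs O(N) per removal, while a hash-backed registry removes in O(1):

```python
import timeit

N = 2000
listeners = [object() for _ in range(N)]

def drain_array_registry():
    regs = []
    for l in listeners:
        regs.append(l)
    # Remove in reverse registration order: each remove() scans the list
    # for the element, so draining the registry is O(N^2) overall.
    for l in reversed(listeners):
        regs.remove(l)

def drain_hash_registry():
    regs = {}  # a dict preserves insertion order, roughly like LinkedHashSet
    for l in listeners:
        regs[l] = None
    for l in reversed(listeners):
        del regs[l]  # O(1) per removal, O(N) overall

t_list = timeit.timeit(drain_array_registry, number=5)
t_hash = timeit.timeit(drain_hash_registry, number=5)
print(f"array: {t_list:.3f}s  hash: {t_hash:.3f}s  ratio: {t_list / t_hash:.0f}x")
assert t_hash < t_list  # the hash-based registry wins as N grows
```

The crossover mentioned elsewhere in this thread (around N of three to five) is where the plain array's smaller constant factor stops paying for its quadratic draining cost.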
(Outside the scope of this change, but the attempted band-aid on ObserverList is dangerous: for certain access patterns it may help, for others it completely destroys performance.) Generally, I don't think we should get too hung up on the performance differences for small N. As a personal data point, my specific use-case that surfaced this issue has hundreds of animations popping on and off screen. These gains are not theoretical for me. At this point, I'm actually shipping a forked version of Flutter incorporating these changes, which significantly reduces frame drops for me.
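The kind of band-aid being criticized can be illustrated with a purely hypothetical Python sketch (not the actual ObserverList source; class and field names are invented): a registry that lazily caches a lookup set and throws it away on every mutation helps or hurts depending entirely on the access pattern:

```python
class CachedContainsList:
    """Illustrative only: an array-backed registry with a lazily built
    set for contains checks, invalidated on every mutation."""

    def __init__(self):
        self._items = []
        self._cache = None

    def add(self, item):
        self._items.append(item)
        self._cache = None            # any mutation invalidates the cache

    def remove(self, item):
        self._items.remove(item)
        self._cache = None

    def __contains__(self, item):
        if self._cache is None:
            self._cache = set(self._items)   # O(N) rebuild
        return item in self._cache


reg = CachedContainsList()
for i in range(500):
    reg.add(i)

# Friendly pattern: many lookups between mutations pay for one O(N) rebuild.
assert all(i in reg for i in range(500))

# Hostile pattern: alternating mutation and lookup rebuilds the cache on
# every step, i.e. O(N^2) overall -- worse than having no cache at all.
for i in range(500):
    reg.remove(i)
    assert i not in reg
```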
This pull request executed golden file tests, but it has not been updated in a while (20+ days). Test results from Gold expire after as many days, so this pull request will need to be updated with a fresh commit in order to get results from Gold. For more guidance, visit Writing a golden file test. Reviewers: Read the Tree Hygiene page and make sure this patch meets those guidelines before LGTMing.
Regarding 1 (changes to ObserverList.remove): In practice, this sounds like a regression to me. I would expect the case where you remove a non-existent listener to be very, very rare (I would almost consider it a bug in the code if somebody does that). So optimizing for that is odd to me, since you will pay for the "optimization" in the far more common case with an additional check.

Regarding 2 (optimizing HashedObserverList.toList): This does seem like a nice improvement, and the benchmarks appear to back that up. We could probably land just this change without much trouble.

Regarding 3 (changing AnimationLocalListenersMixin): This is actually less risky than I initially thought, since it only affects this one mixin. For some reason I initially thought this would affect all ChangeNotifiers. We still need to run this change through our Google testing suite to see if this would be a breaking change. I was finally able to kick off a test run within Google; let's see what it reports back. I'll keep you posted.
Also, can you please resolve the issues reported by the analyze check? Thanks! |
Maybe all use-cases are buggy (in which case, we should probably surface it, e.g. assert or at least log), maybe there are some valid edge cases 🤷♀️. I'm not sure that ignoring it is the best way to deal with it in either case. Meanwhile, it's the typical insurance risk-severity evaluation: the check is basically free and its cost independent of the size, whereas invalidating the internal optimization is non-obvious to users and very costly, especially for larger sets. In terms of cost, given any modern CPU with speculative execution and accurate branch prediction, I would expect this check to take <1ns :handwave:. Personally, I can very much live w/o this "optimization"; I'm merely surprised by your cost argument in the light of the benchmark results. I could certainly buy into your bug argument, however one should probably look into how to surface such bugs (independent of this change). As I was rambling earlier, I think there are other semantics that might be unintentional, like FIFO (rather than LIFO) removals of duplicate listeners. Maybe duplicate listeners in general should be considered a bug? 🤷♀️ There would be nice benefits to that too, e.g. one could use cheaper data structures that don't need to keep track of insertion order.
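On the FIFO observation: Python's `list.remove` has the same first-occurrence semantics and serves as a handy model here (a model only, not the Dart code). Removing a twice-registered listener takes out the oldest registration, not the newest:

```python
a, b = "listener_a", "listener_b"
registrations = [a, b, a]  # 'a' was registered twice

registrations.remove(a)    # removes the FIRST occurrence (FIFO), not the last

assert registrations == [b, a]  # the older registration of 'a' is gone
```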
Agreed, it shouldn't cause any surprises.
Thanks! Independently, it could be interesting to evaluate if other constructs would also benefit from other data structures but ultimately it will always depend on your use case.
Once I get word from you on how we will or won't proceed, I'll urgently address any issues. Thanks!
This is not causing any issues internally at Google. I think we should be able to land this if the reported analyzer issues get resolved.
you forgot the type here, that's the only remaining thing
Thanks. Done
Force-pushed cfc20bb to 60e6080
Happy to hear. Thanks!
This PR is still failing the analyzer check. Can you please take a look? Thanks. |
Force-pushed 02341e1 to a620001
Thanks. The analyzer seems to pass now, and I rebased against latest main.
goderbauer
left a comment
LGTM
LongCatIsLooong
left a comment
LGTM modulo nits. In case someone else wants to make a different trade-off in the future based on their own use-cases, could you add a benchmark for this? It would also be nice to see the performance improvements this PR introduces.
Force-pushed a620001 to 35398fc
I like the idea, even more so for continuous performance monitoring. It's probably less likely that someone would stumble upon a benchmark and then decide to just implement their own AnimationListener. I couldn't find any other micro-benchmarks in this repo, which is also why I hadn't added mine. I did share them above: https://gist.github.com/ignatz/dba6293d0fef41e3c6e1ed3c51364de6. I had hoped that this change would be validated by existing e2e benchmarks, but if you feel adding micro-benchmarks makes sense, we can talk about it. That said, I understand that you folks are busy, and therefore I'd like to avoid expanding this simple change further. It has already been more than an OKR cycle.
If you're looking for microbenchmarks: https://github.com/flutter/flutter/tree/master/dev/benchmarks/microbenchmarks
Added a microbenchmark following the examples linked above. Could you also provide me a link to a dashboard where I can see/monitor the continuous evolution of these microbenchmarks? A Google search didn't bring anything up, and I'm wondering how this setup is being put to use.
Force-pushed 53c3c25 to e6da0bd
I've spent countless hours complying with any remotely reasonable request. I've provided extensive benchmarks from the get-go. We all know and acknowledge that this is a small change with significant performance impact.
This sounds more like an opinion than a best practice. I'm always down for a good argument, just not big on process for the sake of process. AFAICT, there's nothing to be gained from this proposal unless you believe the CI server must execute the benchmarks to be trusted. If you think I should spend a few more weeks on back-and-forths with moving goal posts: fine. If you don't have the bandwidth to spend 1 minute on a cherry-pick: fine. If you don't want to improve Flutter's performance: fine. If you're ok with the opportunity cost for your users: fine. If you're comfortable with disrespecting your contributors' time: fine. If you change your mind: also fine. Otherwise, thank you for your time and good luck! And just for the record
could be misconstrued as gaslighting.
I am sorry if you misinterpreted my statement; that wasn't my intention. And there is no disrespecting of anyone's time. Getting any change landed sometimes requires a couple of feedback cycles and back-and-forth discussions before it is ready to land. That's just part of open-source teamwork. We do appreciate your contribution, but we also expect contributors to get changes over the line and not just "throw them over the fence" for someone else to finish when they are done with the fun bits. I still hope that we can land this with your help.
I'm not sure how to read this. Are you talking generally? Are you talking about this same PR? Maybe some context faded over time? Are you insinuating something? Personally, I didn't think that addressing every request, answering every question, and writing and running benchmarks for hours were the fun bits. I'm sorry you feel like I'm just throwing code over the fence. In any case, thanks for explaining teamwork to me. I wish you had given me a rationale instead 🤷♀️
I'm still not sure what stands to be gained from the extra work but it sure seems important to you. Good luck and thank you for your time |
Hey @ignatz, if you allow me, I have pushed the benchmarks to a different PR so your contribution can be landed as part 1 (the benchmark) and part 2 (the improvements).
Hey @bernaferrari. Thanks for jumping in, much appreciated.
I don't know how to remove the benchmark commit and merge this; could you do that, @ignatz? Else I can reopen the PR without it...
Reverts #143542
Initiated by: goderbauer
Reason for reverting: Failing post-submit, see https://logs.chromium.org/logs/flutter/buildbucket/cr-buildbucket/8755531866181499153/+/u/run_microbenchmarks/stdout
Original PR Author: bernaferrari
Reviewed By: {goderbauer}
This change reverts the following previous change:
Original Description: Part 1 of #138481. Cherry-picking @ignatz's work.
…r small N, by overriding `toList()`, which is critical for `.notifyListeners()`. This change is meant to shift the scales. Previously, `ObserverList` was chosen for generic implementations such as `AnimationController` to optimize for the common case: small N. However, the penalty in the general case, including larger N, is tremendous. Also slightly improves `ObserverList.remove` performance for some use-cases by not always invalidating the set.

Some benchmark results (happy to share the benchmark code as well). I was trying to mimic a realistic ObserverList life-cycle:

2 add/remove SAME ObserverList vs MyObserverList takes: 1.0x, saved: -0% (0:00:01.373926 vs 0:00:01.376916)
2 add/remove SAME HashedObserverList vs MyHashedObserverList takes: 1.0x, saved: -0% (0:00:02.170364 vs 0:00:02.177635)
2 add/remove SLOWER ObserverList vs MyHashedObserverList takes: 0.6x, saved: -64% (0:00:01.375987 vs 0:00:02.254769)
2 notifyListeners SAME ObserverList vs MyObserverList takes: 1.0x, saved: 1% (0:00:01.183753 vs 0:00:01.167889)
2 notifyListeners FASTER HashedObserverList vs MyHashedObserverList takes: 1.6x, saved: 38% (0:00:01.703029 vs 0:00:01.054944)
2 notifyListeners FASTER ObserverList vs MyHashedObserverList takes: 1.1x, saved: 10% (0:00:01.173942 vs 0:00:01.061108)
2 notifyListeners self-mod SAME ObserverList vs MyObserverList takes: 1.0x, saved: 1% (0:00:00.413579 vs 0:00:00.409012)
2 notifyListeners self-mod FASTER HashedObserverList vs MyHashedObserverList takes: 1.2x, saved: 16% (0:00:00.861495 vs 0:00:00.725754)
2 notifyListeners self-mod SLOWER ObserverList vs MyHashedObserverList takes: 0.6x, saved: -79% (0:00:00.410618 vs 0:00:00.734759)
2 mix nl/rm/add SLOWER ObserverList vs MyObserverList takes: 1.0x, saved: -4% (0:00:00.382266 vs 0:00:00.398285)
2 mix nl/rm/add FASTER HashedObserverList vs MyHashedObserverList takes: 1.3x, saved: 25% (0:00:00.696625 vs 0:00:00.522181)
2 mix nl/rm/add SLOWER ObserverList vs MyHashedObserverList takes: 0.8x, saved: -25% (0:00:00.363710 vs 0:00:00.454669)
4 add/remove SAME ObserverList vs MyObserverList takes: 1.0x, saved: -1% (0:00:03.210935 vs 0:00:03.240136)
4 add/remove SAME HashedObserverList vs MyHashedObserverList takes: 1.0x, saved: -2% (0:00:04.000753 vs 0:00:04.088699)
4 add/remove SLOWER ObserverList vs MyHashedObserverList takes: 0.9x, saved: -18% (0:00:03.457464 vs 0:00:04.062586)
4 notifyListeners SLOWER ObserverList vs MyObserverList takes: 0.8x, saved: -22% (0:00:01.857049 vs 0:00:02.266333)
4 notifyListeners FASTER HashedObserverList vs MyHashedObserverList takes: 1.3x, saved: 21% (0:00:03.031229 vs 0:00:02.395746)
4 notifyListeners FASTER ObserverList vs MyHashedObserverList takes: 1.2x, saved: 14% (0:00:03.723434 vs 0:00:03.213373)
4 notifyListeners self-mod SLOWER ObserverList vs MyObserverList takes: 0.8x, saved: -28% (0:00:03.809080 vs 0:00:04.857361)
4 notifyListeners self-mod FASTER HashedObserverList vs MyHashedObserverList takes: 1.3x, saved: 24% (0:00:03.694135 vs 0:00:02.797687)
4 notifyListeners self-mod FASTER ObserverList vs MyHashedObserverList takes: 1.3x, saved: 23% (0:00:03.691153 vs 0:00:02.850335)
4 mix nl/rm/add SLOWER ObserverList vs MyObserverList takes: 0.9x, saved: -6% (0:00:01.665852 vs 0:00:01.759793)
4 mix nl/rm/add SAME HashedObserverList vs MyHashedObserverList takes: 1.0x, saved: 1% (0:00:01.430239 vs 0:00:01.422435)
4 mix nl/rm/add FASTER ObserverList vs MyHashedObserverList takes: 2.6x, saved: 62% (0:00:03.076555 vs 0:00:01.176919)
7 add/remove FASTER ObserverList vs MyObserverList takes: 1.2x, saved: 16% (0:00:08.613052 vs 0:00:07.209422)
7 add/remove SAME HashedObserverList vs MyHashedObserverList takes: 1.0x, saved: 1% (0:00:08.334085 vs 0:00:08.291406)
7 add/remove SAME ObserverList vs MyHashedObserverList takes: 1.0x, saved: -1% (0:00:07.939038 vs 0:00:07.982456)
7 notifyListeners FASTER ObserverList vs MyObserverList takes: 1.1x, saved: 11% (0:00:03.537551 vs 0:00:03.162703)
7 notifyListeners FASTER HashedObserverList vs MyHashedObserverList takes: 1.2x, saved: 19% (0:00:04.266710 vs 0:00:03.441823)
7 notifyListeners SAME ObserverList vs MyHashedObserverList takes: 1.0x, saved: 2% (0:00:03.562112 vs 0:00:03.489668)
7 notifyListeners self-mod FASTER ObserverList vs MyObserverList takes: 1.3x, saved: 20% (0:00:07.683331 vs 0:00:06.141342)
7 notifyListeners self-mod FASTER HashedObserverList vs MyHashedObserverList takes: 1.1x, saved: 6% (0:00:02.777192 vs 0:00:02.607791)
7 notifyListeners self-mod FASTER ObserverList vs MyHashedObserverList takes: 2.3x, saved: 57% (0:00:06.138749 vs 0:00:02.623466)
7 mix nl/rm/add SLOWER ObserverList vs MyObserverList takes: 1.0x, saved: -4% (0:00:01.527072 vs 0:00:01.591417)
7 mix nl/rm/add FASTER HashedObserverList vs MyHashedObserverList takes: 1.1x, saved: 10% (0:00:01.424173 vs 0:00:01.274727)
7 mix nl/rm/add FASTER ObserverList vs MyHashedObserverList takes: 1.8x, saved: 43% (0:00:01.718774 vs 0:00:00.972161)
10 add/remove SAME ObserverList vs MyObserverList takes: 1.0x, saved: 1% (0:00:06.705254 vs 0:00:06.652850)
10 add/remove SLOWER HashedObserverList vs MyHashedObserverList takes: 0.9x, saved: -8% (0:00:06.127039 vs 0:00:06.634606)
10 add/remove FASTER ObserverList vs MyHashedObserverList takes: 1.1x, saved: 6% (0:00:06.621233 vs 0:00:06.242699)
10 notifyListeners SLOWER ObserverList vs MyObserverList takes: 0.8x, saved: -21% (0:00:02.542935 vs 0:00:03.065810)
10 notifyListeners FASTER HashedObserverList vs MyHashedObserverList takes: 1.2x, saved: 20% (0:00:03.038072 vs 0:00:02.441563)
10 notifyListeners FASTER ObserverList vs MyHashedObserverList takes: 1.1x, saved: 10% (0:00:02.708075 vs 0:00:02.429304)
10 notifyListeners self-mod SLOWER ObserverList vs MyObserverList takes: 0.6x, saved: -63% (0:00:04.636833 vs 0:00:07.569618)
10 notifyListeners self-mod SLOWER HashedObserverList vs MyHashedObserverList takes: 0.7x, saved: -44% (0:00:01.926254 vs 0:00:02.768144)
10 notifyListeners self-mod FASTER ObserverList vs MyHashedObserverList takes: 3.9x, saved: 75% (0:00:19.460013 vs 0:00:04.935028)
10 mix nl/rm/add SAME ObserverList vs MyObserverList takes: 1.0x, saved: -3% (0:00:03.122031 vs 0:00:03.207405)
10 mix nl/rm/add FASTER HashedObserverList vs MyHashedObserverList takes: 1.1x, saved: 12% (0:00:02.104239 vs 0:00:01.853132)
10 mix nl/rm/add FASTER ObserverList vs MyHashedObserverList takes: 1.7x, saved: 43% (0:00:03.167834 vs 0:00:01.817996)
100 add/remove FASTER ObserverList vs MyObserverList takes: 2.1x, saved: 53% (0:00:29.435696 vs 0:00:13.740486)
100 add/remove SLOWER HashedObserverList vs MyHashedObserverList takes: 1.0x, saved: -4% (0:00:01.810957 vs 0:00:01.874549)
100 add/remove FASTER ObserverList vs MyHashedObserverList takes: 7.0x, saved: 86% (0:00:13.562779 vs 0:00:01.932154)
100 notifyListeners FASTER ObserverList vs MyObserverList takes: 1.1x, saved: 11% (0:00:00.824788 vs 0:00:00.735041)
100 notifyListeners FASTER HashedObserverList vs MyHashedObserverList takes: 1.1x, saved: 10% (0:00:00.780762 vs 0:00:00.701437)
100 notifyListeners FASTER ObserverList vs MyHashedObserverList takes: 1.1x, saved: 8% (0:00:00.797545 vs 0:00:00.732023)
100 notifyListeners self-mod FASTER ObserverList vs MyObserverList takes: 1.1x, saved: 11% (0:00:24.091372 vs 0:00:21.355631)
100 notifyListeners self-mod FASTER HashedObserverList vs MyHashedObserverList takes: 1.0x, saved: 4% (0:00:00.560706 vs 0:00:00.536952)
100 notifyListeners self-mod FASTER ObserverList vs MyHashedObserverList takes: 42.2x, saved: 98% (0:00:22.614892 vs 0:00:00.536188)
100 mix nl/rm/add SAME ObserverList vs MyObserverList takes: 1.0x, saved: -1% (0:00:00.399696 vs 0:00:00.403067)
100 mix nl/rm/add FASTER HashedObserverList vs MyHashedObserverList takes: 1.1x, saved: 12% (0:00:00.168415 vs 0:00:00.148730)
100 mix nl/rm/add FASTER ObserverList vs MyHashedObserverList takes: 2.7x, saved: 63% (0:00:00.395535 vs 0:00:00.145706)
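For reading these results: "takes" is the old-over-new runtime ratio, and "saved" is the relative runtime saved. A quick Python check reproduces one of the reported lines ("100 notifyListeners self-mod ... takes: 42.2x, saved: 98% (0:00:22.614892 vs 0:00:00.536188)"):

```python
from datetime import timedelta

# Runtimes taken from one entry of the benchmark output above.
old = timedelta(seconds=22, microseconds=614892).total_seconds()
new = timedelta(microseconds=536188).total_seconds()

takes = old / new       # how many times longer the old version runs
saved = 1 - new / old   # fraction of runtime saved by the new version

print(f"takes: {takes:.1f}x, saved: {saved:.0%}")  # takes: 42.2x, saved: 98%
assert round(takes, 1) == 42.2
assert round(saved * 100) == 98
```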
…lly, with `HashedObserverList<T>` now being a good 20% to 30% faster for small N, the inversion point for performance is reached even sooner. For common use-cases, this means N around three to five (see below).

WARNING: This is a potentially breaking change (as witnessed by the test change), due to listeners only being notified once (in first-insertion order) rather than once per registration. I'd be more than happy to make the behavior configurable on the AnimationController, so that folks could opt into the new behavior or construct an AnimationController depending on their needs. I'd also be interested in understanding the motivation for the behavior at hand a bit better. Given that listeners can be mutating, preserving the order seems quite intuitive. Whereas multiple registrations may make sense, I just can't think of a good use-case. I can easily think of unintentional gotchas like advancing an animation multiple times per step/frame.

Context: I'm using a library that draws markers onto a map and then, depending on the zoom level, clusters (collapses/expands) these markers. This often results in tens or hundreds of markers being animated (position & opacity) with a single AnimationController. After the animation, no-longer-visible markers are taken off screen, resulting in large numbers of listeners being `removed`. With the implementation at HEAD, my app is spending the majority of its UI-thread time (tens of milliseconds per frame) just deleting listeners. I'd argue that a generic tool like AnimationController should not micro-optimize for the case N=1. With this change, I'm seeing some ~50x improvements depending on how many markers are on screen.

There's an argument to be made that `HashedObserverList` could subsume `ObserverList` in more cases. However, I'd prefer to let you decide this on a case-by-case basis, given the slightly different iteration semantics.

Here are some benchmark results for typical use-cases (the same results quoted in the revert description above). Happy to share the exact benchmark code.
Force-pushed 143d894 to f969ca8
Of course, done. That's also what I offered before. To be clear, I'm happy to help with virtually any request I can remotely reason about. I merely don't like spending my time on something that has zero tangible value just because someone, unsure why, called it a "best practice". In general, most engineers will be allergic to corporate processes and arguments from authority. It doesn't feel like teamwork; it feels at best like someone wanting to check the "demonstrates leadership at the next level" box, and it triggers my inner Hixie. I love Flutter ❤️

EDIT: I would also like to explicitly express my gratitude to you, Bernardo. You're clearly caught in the crossfire - sorry - but you also genuinely seem to want to help. Thanks!
Reverts #143799
Initiated by: goderbauer
Reason for reverting: The microbenchmark failed to run two times in a row after this was committed (see https://logs.chromium.org/logs/flutter/buildbucket/cr-buildbucket/8755379171959141089/+/u/run_microbenchmarks/stdout).
Original PR Author: bernaferrari
Reviewed By: {goderbauer}
This change reverts the following previous change:
Original Description: Re-lands #143542 (which is part 1 of #138481). The filename was wrong 🫠
Greetings from PR triage, checking in on this. It looks like the benchmark will need to be checked back in before we can land this PR? For reference, #143799 was reverted in #143946. Performance improvements like this are difficult to test, and therefore difficult to catch when they regress. The benchmark is intended to establish a baseline before this lands, so we know (1) it has made an improvement on that baseline as expected, and (2) the improvement persists, so we will know if it regresses in the future. Also, reading through the comments here, just wanted to provide a friendly tone check: we should all be working together respectfully, per our code of conduct. 🙂
I'll reland the benchmarks for the third time. It is hard because they are not checked by the CI, so bugs slid through twice and I got tired/traumatized of getting things reverted. Then I forgot, but I'll do it again shortly. I also have no idea why they failed last time. Will see what happens locally.
Thanks @bernaferrari. I appreciate you volunteering your time to help with this change. 🙏
Since this was closed, I filed #146211 to follow up on this. |
The following PR (#138481) got split in 2, this is part 2. We now have the microbenchmarks to compare this change against (and hopefully see improvements). Close: #146211 Part 1: #153368 --------- Co-authored-by: Nate Wilson <[email protected]>
This (finally) got merged! 🥳 |

This change can significantly reduce UI-thread load for animations when a controller drives many animations that come in and go out of existence. 100x improvements are easily possible, depending on how many things are popping on and off the screen (e.g. hundreds).
WARNING: This is a somewhat breaking change (as shown by the test change), due to listeners only being notified once (in first-insertion order) rather than once per registration. I'd be more than happy to make the behavior configurable on the AnimationController, so that folks could opt into the new behavior or construct an AnimationController depending on what their use-case demands.
Tangentially, I'd also be interested in understanding the motivation for the implemented behavior a bit better. Given that listeners can be mutating, preserving the order seems quite intuitive. Whereas multiple registrations may make sense, I just can't think of a good use-case, and I can easily think of unintentional gotchas like advancing an animation multiple times per step/frame. (EDIT: it's also surprising that despite its ordering it's not a stack, i.e. you push new listeners to the end but remove them from the front, which probably makes multiple registrations conceptually hard to work with 🤷♀️)
CONTEXT: I'm using a library that draws markers onto a map and then, depending on the zoom level, clusters (collapses/expands) these markers. This often results in tens or hundreds of markers being animated (position & opacity) with a single AnimationController. After the animation, no-longer-visible markers are taken off screen, resulting in large numbers of listeners being `removed`. With the implementation at HEAD, my app is spending the majority of its UI-thread time (tens of milliseconds per frame) just deleting listeners. I'd argue that a generic tool like AnimationController should not micro-optimize for the case N=1.

I also did a performance comparison based on N to support the proposal. I tried to model realistic use-cases, i.e. add/remove, `notifyListeners()` w/o any `_set` invalidation, with invalidation, and a cycle of notifyListeners/rm/add. The benchmark code can be found here: https://gist.github.com/ignatz/dba6293d0fef41e3c6e1ed3c51364de6

Pre-launch Checklist
If you need help, consider asking for advice on the #hackers-new channel on Discord.