[flutter tools] Don't return success if we trigger runZoned's error callback #58474

jamesderlin · 2020-06-02T09:04:40Z

Description

Triggering runZoned/runZonedGuarded's error callback does not
necessarily mean that the body stops executing. (Consequently, that
also means that the error callback can be triggered multiple times.)

This allowed some types of crashes to not be properly reported. In
such cases we would undesirably continue along the success path,
which calls exit() and prematurely terminates the process before
the error path completes. (It'd be nice to stop calling exit()
entirely, but I digress.)

Fix this by checking if the zone's error callback has fired before
exiting with a success code. This also prevents prematurely
terminating before the error callback can finish reporting the crash.
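
To illustrate the behavior described above, here is a minimal, self-contained sketch. It is not the actual change in runner.dart: the helper name `runAndGetExitCode`, the `errorCallbackFired` flag, and the 0/1 exit codes are hypothetical. Only `runZonedGuarded`, `Completer`, and `scheduleMicrotask` are real dart:async APIs. The point is that the body keeps running after the error callback fires, so the success path has to check the flag before reporting success.

```dart
import 'dart:async';

/// Runs [body] in a guarded zone and completes with 0 only if the zone's
/// error callback never fired. Hypothetical helper for illustration.
Future<int> runAndGetExitCode(Future<void> Function() body) {
  bool errorCallbackFired = false;
  final Completer<int> result = Completer<int>();

  runZonedGuarded<void>(() async {
    await body();
    // Reaching this line does not prove success: an uncaught asynchronous
    // error may already have triggered the error callback.
    if (!errorCallbackFired && !result.isCompleted) {
      result.complete(0);
    }
  }, (Object error, StackTrace stackTrace) {
    // This callback can fire more than once, so guard the completion.
    errorCallbackFired = true;
    if (!result.isCompleted) {
      result.complete(1);
    }
  });

  return result.future;
}
```

With a body that throws from a microtask and then carries on, for example `scheduleMicrotask(() => throw StateError('simulated crash'))` followed by `await Future<void>.delayed(Duration.zero)`, the returned future completes with 1 even though the body runs to its end.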

Related Issues

#56406

Tests

I added a test that simulates the type of crash that triggers runZoned's error callback. I verified that the test fails without the rest of my change and passes with it.

Checklist

Before you create this PR confirm that it meets all requirements listed below by checking the relevant checkboxes ([x]). This will ensure a smooth and quick review process.

I read the Contributor Guide and followed the process outlined there for submitting PRs.
I signed the CLA.
I read and followed the Flutter Style Guide, including Features we expect every widget to implement.
I read the Tree Hygiene wiki page, which explains my responsibilities.
I updated/added relevant documentation (doc comments with ///).
All existing and new tests are passing.
The analyzer (flutter analyze --flutter-repo) does not report any problems on my PR.
I am willing to follow-up on review comments in a timely manner.

Breaking Change

Did any tests fail when you ran them? Please read Handling breaking changes.

No, no existing tests failed, so this is not a breaking change.

zanderso

Good catch. Just a comment about the test.

zanderso · 2020-06-03T21:13:31Z

packages/flutter_tools/test/general.shard/runner/runner_test.dart

+    Timer.run(() => throw error);
+
+    // Give the Timer time to fire.
+    await Future<void>.delayed(const Duration(milliseconds: 100));


It might work in this case, but it's generally not a good idea to have explicit timers like this in tests. There are two options. The first option is to use a FakeAsync block. I think that might be overkill here. Instead, it looks like you might be able to pass around some Futures to await and some Completers to complete to achieve the execution order that you're trying to exercise.

jamesderlin · 2020-06-03

I've replaced this one with a Completer to wait on. I didn't think that particular case would be controversial since it involves a 0-delay timer and a non-zero delay timer, and I expect that timer callbacks would fire in order.

I really don't like the Timer in SlowCrashReporter. The point of that is to make runner.run's failure path take longer than the success path if they're racing. I don't think there's anything along the success path that I can wait on; I can't add a Completer to the test-implementation of exit() since the fix is to not call exit() in this situation.

I could add an explicit, global, @visibleForTesting-only Completer to runner.dart that will be completed when we continue down the success path. That also feels gross, but I suppose it's better than the hard-coded delay. Is that acceptable?
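
A minimal sketch of the Completer-based approach for the first timer, assuming a hypothetical helper named `throwFromTimer` (this is not the test code from the PR). Instead of sleeping for a fixed duration and hoping the timer has fired, the caller awaits a future that the timer callback completes just before it throws.

```dart
import 'dart:async';

/// Throws [error] from a zero-delay timer. The returned future completes
/// immediately before the throw, so a caller can await the timer firing
/// without a hard-coded delay. Hypothetical helper for illustration.
Future<void> throwFromTimer(Object error) {
  final Completer<void> timerFired = Completer<void>();
  Timer.run(() {
    timerFired.complete();
    // Listeners on timerFired.future run in a later microtask, so the
    // uncaught error below has been raised by the time they resume.
    throw error;
  });
  return timerFired.future;
}
```

The helper is meant to be called inside a guarded zone (for example one created with `runZonedGuarded`), because the thrown error is otherwise unhandled.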

Remove hard-coded wait durations. Also adjust some names.

zanderso · 2020-06-05T16:03:23Z

packages/flutter_tools/test/general.shard/runner/runner_test.dart

+    });
+
+    await completer.future;
+    return FlutterCommandResult.success();


Does anything go wrong if you do runCompleted.complete() here instead?

jamesderlin · 2020-06-05

It seems to work, but it would be a bit more brittle. If asynchronous work ever occurs after FlutterCommand.runCommand() returns, the test could pass when it should fail.

zanderso · 2020-06-05

Sorry, I'm not quite getting it. Could you spell this out for me a bit more? What's the sequence of events that would be problematic?

jamesderlin · 2020-06-05

If CrashingFlutterCommand.runCommand completed runCompleted instead, the desired sequence of events would be:

1. CrashingFlutterCommand.runCommand generates an asynchronous error.

2. The onError callback from runner.run's runZoned call fires. We proceed down the crash reporting path. WaitingCrashReporter waits for runCompleted.

3. CrashingFlutterCommand.runCommand continues, completes runCompleted, and returns.

4. runner.run continues in its try block.

5. WaitingCrashReporter is unblocked and successfully reports the crash.

If execution yields between steps 3 and 4, then step 5 could run first. If step 5 runs before step 4, a regression where runner.run's try block makes its own call to exit() would not be detected.
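
The ordering of the crash path relative to the command can be reproduced in isolation. This is a simplified sketch, not the PR's test: `simulateCrashOrdering` and the log strings are hypothetical stand-ins for CrashingFlutterCommand and WaitingCrashReporter, and the step in which runner.run continues in its try block is left out.

```dart
import 'dart:async';

/// Records the order in which the crash path and the command body run.
/// Hypothetical stand-in for the classes discussed in this thread.
Future<List<String>> simulateCrashOrdering() async {
  final List<String> log = <String>[];
  final Completer<void> runCompleted = Completer<void>();
  final Completer<void> crashReported = Completer<void>();

  runZonedGuarded<void>(() async {
    // The command generates an asynchronous error...
    scheduleMicrotask(() => throw StateError('simulated crash'));
    await Future<void>.delayed(Duration.zero);
    // ...then continues, signals completion, and returns.
    log.add('command returned');
    runCompleted.complete();
  }, (Object error, StackTrace stackTrace) async {
    // The error callback fires first; the reporter waits for the command.
    log.add('error callback fired');
    await runCompleted.future;
    // The reporter is unblocked and reports the crash.
    log.add('crash reported');
    crashReported.complete();
  });

  await crashReported.future;
  return log;
}
```

In this sketch the reporter cannot finish before the command signals completion, which is the property the waiting reporter relies on. What it cannot show is whether other work runs between the command returning and the report finishing, which is the gap described above.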

zanderso · 2020-06-11

Okay, thanks. I understand the concern now. Other than a stray call in analyze_continuously.dart, the tool only calls exit() in runner.dart and base/signals.dart. We should guard against proliferating calls to exit() with a more on-the-nose test, but not in this PR. For this PR, I think it would be best to move the runCompleted completer from runner.dart into the test.

zanderso · 2020-06-12

I filed #59338

Move the command Completer into the test to avoid polluting the tool code. Note that this weakens the test and might prevent it from catching regressions.

zanderso

Thanks!

jamesderlin requested a review from zanderso June 2, 2020 09:04

fluttergithubbot added the tool Affects the "flutter" command-line tool. See also t: labels. label Jun 2, 2020

googlebot added the cla: yes label Jun 2, 2020

zanderso reviewed Jun 3, 2020

Update with review feedback from zanderso

fed31c7

zanderso reviewed Jun 5, 2020

jamesderlin added 2 commits June 10, 2020 10:15

Merge branch 'master' of github.com:flutter/flutter into jamesderlin/…

f8ef957

…premature-exit

Update with more review feedback from zanderso

08d11c7

zanderso mentioned this pull request Jun 12, 2020

Limit calls to exit() #59338

Closed

Fix analysis error

e7222f3

jamesderlin requested a review from zanderso June 12, 2020 21:35

zanderso approved these changes Jun 12, 2020

jamesderlin added the waiting for tree to go green label Jun 12, 2020

fluttergithubbot merged commit c21b323 into flutter:master Jun 15, 2020

jamesderlin mentioned this pull request Jun 20, 2020

flutter_tools crash might exit before reporting the crash #56406

Closed

zljj0818 pushed a commit to zljj0818/flutter that referenced this pull request Jun 22, 2020

[flutter tools] Don't return success if we trigger runZoned's error c…

7c8bbaa

…allback (flutter#58474)

mingwandroid pushed a commit to mingwandroid/flutter that referenced this pull request Sep 6, 2020

[flutter tools] Don't return success if we trigger runZoned's error c…

ee80f40

…allback (flutter#58474)

github-actions bot locked as resolved and limited conversation to collaborators Jul 30, 2021
