Remove sources of unreliablility in extended functional tests #10072

jnewbery · 2017-03-24T20:28:46Z

This PR removes two sources of unreliability that were causing extended test cases to fail intermittently on travis:

bip9-softforks stop-starts bitcoind twice, but does not wait for the p2p connection to mininode to re-open. This would lead to race conditions where the following call to getblocktemplate() would sometimes fail because there were no p2p connections open to the node. This commit also removes the shutil.rmtree() call that was blatting the test_framework.log file.
forknotify asserts that an alert has been written to a file and fails immediately if the file is empty. This would lead to race conditions where the assert would sometimes be hit before bitcoind had written to the file.

JeremyRubin

utack! A couple suggestions but nothing major.

JeremyRubin · 2017-03-28T17:43:08Z

test/functional/bip9-softforks.py

Do we want the same behavior on L44?

wait_for_verack() is called by the comparison test framework at the start of the test run (comptest.py line 300).

JeremyRubin · 2017-03-28T17:44:08Z

test/functional/bip9-softforks.py

Is there any work we can do in-between this and the wait for verack?

Maybe have a comment here about having to wait for verack rather than just deleting the comment.

No, we need to wait for the network thread to start before receiving the verack (the network thread is what catches the verack and calls the callback).

JeremyRubin · 2017-03-28T17:44:22Z

test/functional/bip9-softforks.py

Is there a more idiomatic way to get the dirname?

Not that I know of.

I don't like the fact that this test is using shutil and removing files itself. This change just makes it slightly less damaging (ie it's only removing the datadir rather than the entire tmpdir)

JeremyRubin · 2017-03-28T17:46:18Z

test/functional/forknotify.py

Maybe raise the error yourself rather than assert, unless you only want it in debug mode?

I don't think there's any difference. No-one runs the test cases with -O

using asserts as control flow is generally an antipattern (I think that the for/else loop is a bit clearer in any case).

ah. I see what you're saying now. Yes, I'll remove the assert from the control flow.

JeremyRubin · 2017-03-28T18:04:49Z

test/functional/forknotify.py

Maybe a bit cleaner to do something like (a couple of changes, feel free to pick and choose -- I was torn on the os.path.exists+getsize v.s. continual reading with open).

for t in xrange(100): if os.path.exists(self.alert_filename) and os.path.getsize(self.alert_filename): break if t != 99: time.sleep(0.1) else: raise AssertionError("-alertnotify did not warn of up-version blocks") with open(self.alert_filename, 'r', encoding='utf8') as f: alert_text = f.read()

I think this is equivalent.

yeah; mostly this was motivated by being able to get rid of the stateful timeout = 10; assert timeout > 0; timeout -= 0.1 combo.
This is also fine.

for t in xrange(100): with open(self.alert_filename, 'r', encoding='utf8') as f: alert_text = f.read() if alert_text: break time.sleep(0.1) else: raise AssertionError("-alertnotify did not warn of up-version blocks")

JeremyRubin · 2017-03-28T18:07:35Z

test/functional/forknotify.py

This should rarely actually take 10 seconds, if the code is correct, right? Maybe 1 second will be a bit better in the case where it is actually broken & you're trying to fix it.

Depends. Travis catches a lot of timing edge cases. I don't think there's any problem making the timeout something high like 10 seconds since the while loop will break as soon as the file is written.

bip9-sofforks.py stop-starts the bitcoind node twice during the test run, but it doesn't wait for the connection from mininode to open before continuing with the test. This leads to race conditions where the test can fail getblocktemplate() because it has no p2p connections.

forknotify would intermittently fail because the alert file was not being written fast enough. This commit adds a timeout so the test does not fail immediately.

jnewbery · 2017-03-28T20:23:23Z

Pushed a new version using while/else and not using assert for control flow.

maflcko · 2017-04-02T10:51:02Z

utACK a4fd89f. Going to merge this, so that the failures are a thing of the past.

maflcko · 2017-04-02T10:53:21Z

test/functional/forknotify.py

+            time.sleep(0.1)
+            timeout -= 0.1
+        else:
+            assert False, "-alertnotify did not warn of up-version blocks"


@jnewbery nit: Any reason you changed the raise AssertionError to assert False?

No good reason. Equivalent behaviour but you're probably right that raise AssertionError is better style here.

… tests a4fd89f Make forknotify.py more robust (John Newbery) 1f3d78b Wait for connection to open in bip9-softforks.py (John Newbery) Tree-SHA512: de7d0002ee62ad97059b6f6c89b11f6e9901e3b4164ef6906bcd61e4ca499c277d9034784755966e5baf599869fad611b0b18f5547a384ceb5b7db3cc5bbd132

jnewbery · 2017-04-02T19:53:15Z

🎉 first successful daily run of extended tests on Travis: https://travis-ci.org/bitcoin/bitcoin/builds/217786785

Thanks for merging @MarcoFalke

…ctional tests a4fd89f Make forknotify.py more robust (John Newbery) 1f3d78b Wait for connection to open in bip9-softforks.py (John Newbery) Tree-SHA512: de7d0002ee62ad97059b6f6c89b11f6e9901e3b4164ef6906bcd61e4ca499c277d9034784755966e5baf599869fad611b0b18f5547a384ceb5b7db3cc5bbd132

jnewbery mentioned this pull request Mar 24, 2017

[test] Run extended tests once daily in Travis #10052

Merged

fanquake added the Tests label Mar 25, 2017

jnewbery force-pushed the extended_test_unreliablility branch from 447e5ec to baeecbb Compare March 25, 2017 22:57

JeremyRubin approved these changes Mar 28, 2017

View reviewed changes

jnewbery added 2 commits March 28, 2017 16:15

Make forknotify.py more robust

a4fd89f

forknotify would intermittently fail because the alert file was not being written fast enough. This commit adds a timeout so the test does not fail immediately.

jnewbery force-pushed the extended_test_unreliablility branch from baeecbb to a4fd89f Compare March 28, 2017 20:22

maflcko reviewed Apr 2, 2017

View reviewed changes

maflcko merged commit a4fd89f into bitcoin:master Apr 2, 2017

jnewbery mentioned this pull request Aug 2, 2017

Add blocknotify and walletnotify functional tests #10941

Merged

sickpig mentioned this pull request Jun 1, 2018

[PORT] Remove sources of unreliablility in extended functional tests BitcoinUnlimited/BitcoinUnlimited#1113

Merged

bitcoin locked as resolved and limited conversation to collaborators Sep 8, 2021

Remove sources of unreliablility in extended functional tests #10072

Remove sources of unreliablility in extended functional tests #10072

Uh oh!

Conversation

jnewbery commented Mar 24, 2017

Uh oh!

JeremyRubin left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jnewbery commented Mar 28, 2017

Uh oh!

maflcko commented Apr 2, 2017

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jnewbery commented Apr 2, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants