test: Fix RANDOM_CTX_SEED use with parallel tests #30737

hodlinator · 2024-08-28T13:43:58Z

To fix the issue with test directories clashing when running in parallel with the env var RANDOM_CTX_SEED set, we now mix in the test name.

Issue was introduced by 97e16f5, and comment for g_insecure_rand_ctx_temp_path became invalid with that commit. Make the determinism clear through removing the no longer useful g_insecure_rand_ctx_temp_path.

Fixes #30696.

DrahtBot · 2024-08-28T13:44:02Z

The following sections might be updated with supplementary metadata relevant to reviewers and maintainers.

Code Coverage

For detailed information about the code coverage, see the test coverage report.

Reviews

See the guideline for information on the review process.
A summary of reviews will appear here.

Conflicts

Reviewers, this pull request conflicts with the following ones:

#30748 (test: Pin and document TEST_DIR_PATH_ELEMENT, SeedRand::FIXED_SEED by maflcko)
#30618 (test: support std::optional in BOOST_CHECK_* and increase FromUserHex fuzz feature coverage by l0rinc)

If you consider this pull request important, please also help to review the conflicting pull requests. Ideally, start with the one that should be merged first.

hodlinator · 2024-08-28T13:49:43Z

Can be verified through running

RANDOM_CTX_SEED=21 make -j10 check

maflcko · 2024-08-28T14:59:13Z

src/test/util/setup_common.cpp

This isn't set in the fuzz tests, IIRC. So, as explained in #30696 (comment) it would fail if there were two fuzz tests that set up a (Basic)TestingSetup each. No?

Also, obviously, the issue would remain if the same test is run in parallel

Thank you for your detailed linked comment @maflcko. Apologies for not going back to re-read it before publishing this PR.

Before publishing, I did recall that you were worried about fuzz tests and ran:

RANDOM_CTX_SEED=21 ./test/fuzz/test_runner.py -j40 .

..with this added temporarily:

printf("%s\n", rand_str.c_str());

The hashes outputted were unique. I admit it surprised me and I also printed test_path and saw they were empty. Should have dug deeper at that point to figure out the cause of randomness before publishing the PR.

After your latest comment I dug deeper, discovering:

test/fuzz/test_runner.py does not automatically forward on environment variables, I guess it wants to keep the environment under tighter control. Making it explicitly forward RANDOM_CTX_SEED makes the deterministic randomness of the fuzz tests start from the same seed.

Outputting getpid() in the BasicTestingSetup-ctor now shows unique PIDs per fuzz test. I hesitate to add the PID to the seed since the intent of 97e16f5 seems to be to decrease determinism). In the case of benchmarks, the PID would probably not help either, as the intent seems to be to run them single-threaded(?) - which would at the same time mean directories being created/destroyed should not be an issue for benchmarks.

I've opted in the latest push to implement forwarding of RANDOM_CTX_SEED and implemented G_TEST_GET_FULL_NAME for bench+fuzz. Please let me know what you think about that approach.

(Noticed in the debugger that the BasicTestingSetup-ctor, after creating the test directories, creates the kernel::Context which calls RandomInit() which calls ProcRand(..., /*always_use_real_rng=*/true) ...which should be harmless as long as RNGState::m_deterministic_prng has been set).

To offer another alternative fix, beside the ones proposed in #30696 (comment):

Another possible fix would be to ensure the g_rng_temp_path is seeded before any call to SeedRandomForTest.

I think it would help why you didn't think that any of the 4 alternative fixes are worth a try and instead went for this, which is an incomplete fix, as explained in my previous comment.

I think it would help why you didn't think that any of the 4 alternative fixes are worth a try and instead went for this, which is an incomplete fix, as explained in my previous comment.

In general I think determinism should be increased, not decreased. Increased determinism appears to be the original intent behind moving SeedRandomForTest earlier in 97e16f5, please let me know if you have a different interpretation. All your suggestions lead to decreased determinism, to avoid repetition, I only left additional comments:

"Use GetStrongRandBytes, however, I am not sure how slow that'd be in the worst case." - You pointed out that it may be a bit slow.

"Move the SeedRandomForTest after the fuzz target's init function, but before ResetCoverageCounters. However that'd still break if the fuzz target spun up a TestingSetup." - As init often creates BasicTestingSetup, that approach would also need to have the setup_common.cpp part of 97e16f5 reverted. As you said any later created BasicTestingSetup after init would become deterministic.

"Use the hash of the current folder generator, as well as the hash of the pid. Which may obviously break once the kernel wraps the pids, but I am not sure how likely that is." - Addressed in my previous comment.

Ensure the g_rng_temp_path is seeded before any call to SeedRandomForTest. - Just the general determinism concern above.

Also, obviously, the issue would remain if the same test is run in parallel

The same test being run in parallel with itself could use -testdatadir. Is that a common case?

Edit: Could use different RANDOM_CTX_SEED if wanting to run the same test in parallel with itself as well.

Ah, I see. There may be a misunderstanding here. According to the comment you removed, the datadir should actually be "unique", not "deterministic". ("Random context to get unique temp data dirs.")

Aha, time to bring out the git log -S "Random context to get unique temp data dirs" command. ;)

I see it was your fae43a9 from 2019, so it's hard to argue that the author may have meant "unique" as in simply different from other tests. :) See you introduced the later seeding of the RNG in that commit too.

On the other hand, you (together with me) ACKed #29625 which contains 97e16f5, moving the seeding prior to directory naming randomization. Do you now disagree with it?

Once you make it deterministic, there will be a chance of collision, which may lead to a crash, which this pull request is trying to fix, no?

The PR does fix collisions between different tests that would otherwise happen when RANDOM_CTX_SEED was set to the same value, which was resulting in the same folder name being used. (RANDOM_CTX_SEED=21 make -j10 check).

I don't think anything here is a common case. So in theory an alternative "fix" would be to simply say in a comment that RANDOM_CTX_SEED can only be used when running a single test process at a time.

Combining tests with setting RANDOM_CTX_SEED is indeed an edge case. This PR arguably implements missing test names for bench+fuzz, missing support for forwarding RANDOM_CTX_SEED to fuzz tests, simplifies the code through removing g_rng_temp_path which had a no longer valid comment after 97e16f5, the main fix/complexity lies in adding entropy to the random folder path using the test name.

The extremely paranoid would say that the folder name generation could influence test success/failure, so having it deterministic is helpful. Unit tests are poorly written if that is ever the case, but for fuzzing it's still an open question to me.

On the other hand, you (together with me) ACKed #29625 which contains 97e16f5, moving the seeding prior to directory naming randomization. Do you now disagree with it?

I think it was simply an oversight. At least for me.

Once you make it deterministic, there will be a chance of collision, which may lead to a crash, which this pull request is trying to fix, no?

The PR does fix collisions between different tests that would otherwise happen when RANDOM_CTX_SEED was set to the same value, which was resulting in the same folder name being used. (RANDOM_CTX_SEED=21 make -j10 check).

Yes, I understand the code diff. I am just saying that it would be nicer if running the same test also didn't crash for that reason, when run in parallel. It seems plausible that someone would want to do that, given that it is possible and supported right now (when not setting RANDOM_CTX_SEED).

Otherwise, someone else will report that as an issue and the code has to be changed again.

The extremely paranoid would say that the folder name generation could influence test success/failure

Ok, that helps to understand your concern. However, I'd say that the folder name is printed. So if it lead to an issue, it would be trivial to reproduce.

(If it is not printed on failure, then that should be fixed)

On the other hand, you (together with me) ACKed #29625 which contains 97e16f5, moving the seeding prior to directory naming randomization. Do you now disagree with it?

I think it was simply an oversight. At least for me.

Alright, thanks for addressing this. I think it would be good to wait for sipa to have time as this PR continues in the same direction as his commit.

Yes, I understand the code diff. I am just saying that it would be nicer if running the same test also didn't crash for that reason, when run in parallel. It seems plausible that someone would want to do that, given that it is possible and supported right now (when not setting RANDOM_CTX_SEED).

Otherwise, someone else will report that as an issue and the code has to be changed again.

If they don't want to set different RANDOM_CTX_SEED they can set different -testdatadir (and probably should, for clarity in case there are failures).

The extremely paranoid would say that the folder name generation could influence test success/failure

Ok, that helps to understand your concern. However, I'd say that the folder name is printed. So if it lead to an issue, it would be trivial to reproduce.

It will be even more trivial to reproduce if one can just set the RANDOM_CTX_SEED env var and re-run the same binary without modifying and recompiling the code (and potentially having to set up a git & build environment).

It will be even more trivial to reproduce if one can just set the RANDOM_CTX_SEED env var and re-run the same binary without modifying and recompiling the code (and potentially having to set up a git & build environment).

I just think this optimized in the wrong direction. Has there ever been a known issue due to a fixed-size ascii-hex-encoded path element? Stuff like variable-size or non-hex-ascii are known to possibly find issues, but a fixed size [a-f0-9] element, I am not aware of.

Moreover, adding code like fuzz_env['RANDOM_CTX_SEED'] = random_seed to the fuzz runner also seems to go in the wrong direction a bit. The goal of the fuzz tests is to be fully deterministic and stable, within a single run and across different runs. If the fuzz randomness is seeded with anything other than stuff provided by the fuzz engine, it is easy to get non-determinism, or coverage instability.

An having different types of "seed" in the fuzz context is also confusing.

If they don't want to set different RANDOM_CTX_SEED they can set different -testdatadir (and probably should, for clarity in case there are failures).

Well, the only point of setting RANDOM_CTX_SEED is to reproduce a past failure. However, if you want to go down the route of requiring to set a different testdatadir when wanting to run the same test in parallel, it may be best to fully go that route. Instead of a hash, just use the test name as-is? This would also be parallel to the functional tests.

However, if you want to go down the route of requiring to set a different testdatadir when wanting to run the same test in parallel, it may be best to fully go that route. Instead of a hash, just use the test name as-is? This would also be parallel to the functional tests.

For reference, #31000 should be one step into that direction.

jonatack · 2024-08-28T16:35:51Z

Will review after rebase.

To fix the issue with test directories clashing when running in parallel with the env var RANDOM_CTX_SEED set, we now mix in the test name. Issue was introduced by 97e16f5, and comment for g_insecure_rand_ctx_temp_path became invalid with that commit. Make the determinism clear through removing the no longer useful g_insecure_rand_ctx_temp_path. Fixes bitcoin#30696.

DrahtBot · 2024-09-06T09:12:48Z

🐙 This pull request conflicts with the target branch and needs rebase.

maflcko · 2024-10-16T14:49:09Z

Cross-ref to #30748 (comment), depending on how this evolves.

hodlinator · 2024-11-14T13:52:50Z

Closing since the main issue was fixed in a different way by the now merged #31000.

maflcko · 2024-11-14T16:19:02Z

I think the issue in the fuzz tests still persists.

hodlinator · 2024-11-14T21:42:54Z

I think the issue in the fuzz tests still persists.

That's correct. Fuzzing was not my main concern, but could maybe resurrect that part of this PR.

I've rebased on current master and re-pushed the branch I had deleted, but am not allowed to reopen this one.
master...hodlinator:bitcoin:2024-08/RANDOM_CTX_SEED_jobs_fix_alt

Are you still against passing through RANDOM_CTX_SEED in the test_runner.py? In that case I can drop that commit.

maflcko · 2024-11-15T07:56:04Z

If you want you can just open a new pull, it will have to be reviewed from scratch anyway.

Are you still against passing through RANDOM_CTX_SEED in the test_runner.py? In that case I can drop that commit.

Yes, adding code like fuzz_env['RANDOM_CTX_SEED'] = random_seed to the fuzz runner also seems to go in the wrong direction a bit. The goal of the fuzz tests is to be fully deterministic and stable, within a single run and across different runs. If the fuzz randomness is seeded with anything other than stuff provided by the fuzz engine, it is easy to get non-determinism, or coverage instability. (Copied from #30737 (comment))

If you disagree, it would be good to explain why.

maflcko · 2024-11-18T11:55:01Z

For reference, the fuzz issue can be hit more frequent on machines that do not have nanosecond precision. GCE may not have it, according to the oss-fuzz crashes of the form:

libc++abi: terminating due to uncaught exception of type std::__1::__fs::filesystem::filesystem_error: filesystem error: in remove_all: No such file or directory ["/tmp/test_common bitcoin/1731742732339168000/regtest/fuzzed_i2p_private_key"]

Ref: https://oss-fuzz.com/testcase-detail/5425475725361152

https://issues.oss-fuzz.com/issues/379418970

Edit: Fixed in #31317

maflcko · 2024-11-20T07:15:00Z

Edit: Fixed in #31317

My understanding is that with the current state of that pull, all outstanding observable issues mentioned in this thread are fixed? If not, please leave a comment.

There may or may not be some small style-fixups/follow-ups such as #31317 (comment), but no real issue to be fixed.

hodlinator · 2024-11-20T10:30:17Z

Agree that issues here are fixed. For orderliness, maybe the fuzz-part of dfef02f from this PR could be done as a new PR just to ensure named folders?

maflcko · 2024-11-20T10:53:13Z

Ah yes. I am happy to review a pull for named folders when fuzzing

hodlinator · 2024-11-20T14:19:39Z

New PR #31333.

DrahtBot added the Tests label Aug 28, 2024

maflcko reviewed Aug 28, 2024

View reviewed changes

DrahtBot added the Needs rebase label Aug 28, 2024

DrahtBot mentioned this pull request Aug 28, 2024

test: [refactor] Use m_rng directly #30571

Merged

hodlinator added 3 commits August 29, 2024 10:54

test: bench+fuzz - Implement G_TEST_GET_FULL_NAME

dfef02f

test: Make fuzz/test_runner pass on RANDOM_CTX_SEED if set

8e5f8a4

hodlinator force-pushed the 2024-08/RANDOM_CTX_SEED_jobs_fix_alt branch from eeeb56b to 065f0f6 Compare August 29, 2024 09:08

DrahtBot removed the Needs rebase label Aug 29, 2024

This was referenced Aug 29, 2024

test: Pin and document TEST_DIR_PATH_ELEMENT, SeedRand::FIXED_SEED #30748

Merged

test: support std::optional in BOOST_CHECK_* and increase FromUserHex fuzz feature coverage #30618

Merged

DrahtBot added the Needs rebase label Sep 6, 2024

maflcko mentioned this pull request Oct 1, 2024

bench: add support for custom data directory #31000

Merged

hodlinator closed this Nov 14, 2024

hodlinator deleted the 2024-08/RANDOM_CTX_SEED_jobs_fix_alt branch November 14, 2024 13:53

hodlinator mentioned this pull request Nov 19, 2024

test: Revert to random path element #31317

Merged

bitcoin locked and limited conversation to collaborators Nov 20, 2025

test: Fix RANDOM_CTX_SEED use with parallel tests #30737

test: Fix RANDOM_CTX_SEED use with parallel tests #30737

Uh oh!

Conversation

hodlinator commented Aug 28, 2024

Uh oh!

DrahtBot commented Aug 28, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Code Coverage

Reviews

Conflicts

Uh oh!

hodlinator commented Aug 28, 2024

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

maflcko Aug 29, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

hodlinator Aug 29, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jonatack commented Aug 28, 2024

Uh oh!

DrahtBot commented Sep 6, 2024

Uh oh!

maflcko commented Oct 16, 2024

Uh oh!

hodlinator commented Nov 14, 2024

Uh oh!

maflcko commented Nov 14, 2024

Uh oh!

hodlinator commented Nov 14, 2024

Uh oh!

maflcko commented Nov 15, 2024

Uh oh!

maflcko commented Nov 18, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

maflcko commented Nov 20, 2024

Uh oh!

hodlinator commented Nov 20, 2024

Uh oh!

maflcko commented Nov 20, 2024

Uh oh!

hodlinator commented Nov 20, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

DrahtBot commented Aug 28, 2024 •

edited

Loading

maflcko Aug 29, 2024 •

edited

Loading

hodlinator Aug 29, 2024 •

edited

Loading

maflcko commented Nov 18, 2024 •

edited

Loading