Improve chainstate/blockindex disk writing policy #5241

sipa · 2014-11-07T16:48:21Z

There are 3 pieces of data that are maintained on disk. The actual block and undo data, the block index (which can refer to positions on disk), and the chainstate (which refers to the best block hash).

Earlier, there was no guarantee that blocks were written to disk before block index entries referring to them were written. This commit introduces dirty flags for block index data, and delays writing it until the actual block data is flushed.

With this stricter ordering in writes, it is now safe to not always flush after every block, so there is no need for the IsInitialBlockDownload() check there - instead we just write whenever enough time has passed or the cache size grows too large.

In addition, only do a write inside the block processing loop if necessary (because of cache size exceeded). Otherwise, move the writing to a point after processing is done, after relaying.

This should improve block relay speed, by not writing chainstate and index entries before announcing the new hash.

Future work: move block and undo data writing into FlushStateToDisk, and move it to a background thread.

sipa · 2014-11-12T14:37:34Z

Added wallet tip updating and block/undo file flushing to the FlushStateToDisk() method. Very preliminary testing on my VPS (with very slow I/O) shows a ~100ms decrease in block processing time.

laanwj · 2014-11-13T09:35:42Z

src/main.cpp

Parametrization on multiple bool parameters leads to hard-to-read code; I'd prefer to pass an enum, or bit field

Agree, I'll add a commit for changing it.

laanwj · 2014-11-13T09:36:15Z

Concept ACK, will test

laanwj · 2014-11-16T14:50:09Z

Found a small issue. When AppInit2 exits prematurely, bitcoind will crash in shutdown with the following:

#0  CCoinsViewCache::GetCacheSize (this=0x0) at coins.cpp:196
#1  0x54afe528 in FlushStateToDisk (state=..., fast=fast@entry=false, forceWrite=forceWrite@entry=true) at main.cpp:1774
#2  0x54afe906 in FlushStateToDisk () at main.cpp:1811
#3  0x54ad5c16 in Shutdown () at init.cpp:153
#4  0x54aca9d6 in AppInit (argc=6, argv=<optimized out>) at bitcoind.cpp:170
#5  0x54acad20 in main (argc=6, argv=0x7efff714) at bitcoind.cpp:182

You need a check pcoinsTip!=0 in Shutdown() before calling FlushStateToDisk.

sipa · 2014-11-16T15:04:56Z

Fixed.

laanwj · 2014-11-17T14:52:05Z

What about the pcoinstip->Flush() in gettxoutsetinfo https://github.com/bitcoin/bitcoin/blob/master/src/rpcblockchain.cpp#L322 - does this need to be some form of FlushStateToDisk?

sipa · 2014-11-17T14:56:27Z

@laanwj Ah right, it does. That's sort of inconvienient but inevitable until there's a way to iterate the UTXO entries in a CCoinsView. Writing pcoinsTip to disk means a reference to the tip block hash in the index, so that needs to be written, which on itself refers to disk positions, so blocks need to be written too...

sipa · 2014-11-17T14:58:43Z

Fixed.

sipa · 2014-11-20T14:02:39Z

Tested by kill -KILL'ing a node while it was flushing. No problems.

gmaxwell · 2014-11-23T07:19:10Z

This seems to greatly improves our handling of unclean shutdowns during the IBD. Previously we'd reliably corrupt the database, requring the user to manually delete things. I've tested this with a bunch of killing while syncing and not been able to break it.

rdponticelli · 2014-11-23T15:40:57Z

I performed a thorough synchronization of a testnet node from scratch using this patch, sending term or kill signals randomly every 180-640 seconds. It worked like a charm. Such test easily triggers #5156 without it.

gmaxwell · 2014-11-23T19:51:48Z

I think we should seriously consider this for 0.10. It's a big reliablity improvement during the initial sync, beyond its performance implications.

laanwj · 2014-11-24T09:25:22Z

I've subjected this to some horrible crashes, and a full sync w/ DEBUG_LOCKORDER. ACK commithash 9dd92a12fe92b4a81e612acf1c6245b85fa12a73 for 0.10.
https://dev.visucore.com/bitcoin/acks/5241

There are 3 pieces of data that are maintained on disk. The actual block and undo data, the block index (which can refer to positions on disk), and the chainstate (which refers to the best block hash). Earlier, there was no guarantee that blocks were written to disk before block index entries referring to them were written. This commit introduces dirty flags for block index data, and delays writing entries until the actual block data is flushed. With this stricter ordering in writes, it is now safe to not always flush after every block, so there is no need for the IsInitialBlockDownload() check there - instead we just write whenever enough time has passed or the cache size grows too large. Also updating the wallet's best known block is delayed until this is done, otherwise the wallet may end up referring to an unknown block. In addition, only do a write inside the block processing loop if necessary (because of cache size exceeded). Otherwise, move the writing to a point after processing is done, after relaying.

sipa · 2014-11-24T14:17:25Z

Sorry, needed to rebase after BIP23 block proposals.

gmaxwell · 2014-11-24T22:42:22Z

ACK.

a206950 Introduce separate flushing modes (Pieter Wuille) 51ce901 Improve chainstate/blockindex disk writing policy (Pieter Wuille)

sipa force-pushed the noflush branch from d8c2ebc to 5a280c0 Compare November 12, 2014 13:17

sipa force-pushed the noflush branch 2 times, most recently from 46cec70 to 56e8308 Compare November 12, 2014 21:48

laanwj reviewed Nov 13, 2014
View reviewed changes

sipa mentioned this pull request Nov 13, 2014

Bitcoin core start-up issue with undo data #5156

Closed

sipa force-pushed the noflush branch from 17b8eac to be84f22 Compare November 16, 2014 15:00

sipa force-pushed the noflush branch from be84f22 to 766dded Compare November 17, 2014 14:58

sipa mentioned this pull request Nov 17, 2014

Fix IsInitialBlockDownload which was broken by headers first. #5158

Merged

laanwj added the Refactoring label Nov 17, 2014

sipa mentioned this pull request Nov 20, 2014

cs_main lock not held when reindexing blocks on disk? #5330

Closed

sipa force-pushed the noflush branch from 766dded to ce155ac Compare November 20, 2014 16:17

sipa mentioned this pull request Nov 21, 2014

Add 'blacklistblock' and 'reconsiderblock' RPC commands. #5316

Merged

sipa force-pushed the noflush branch from ce155ac to 9dd92a1 Compare November 23, 2014 14:49

sipa added 2 commits November 24, 2014 15:15

Introduce separate flushing modes

a206950

sipa force-pushed the noflush branch from 9dd92a1 to a206950 Compare November 24, 2014 14:17

laanwj added this to the 0.10.0 milestone Nov 24, 2014

laanwj merged commit a206950 into bitcoin:master Nov 25, 2014

laanwj added a commit that referenced this pull request Nov 25, 2014

Merge pull request #5241

397b901

a206950 Introduce separate flushing modes (Pieter Wuille) 51ce901 Improve chainstate/blockindex disk writing policy (Pieter Wuille)

sipa mentioned this pull request Nov 25, 2014

Do all block index writes in a batch #5367

Merged

bitcoin locked as resolved and limited conversation to collaborators Sep 8, 2021

Improve chainstate/blockindex disk writing policy #5241

Improve chainstate/blockindex disk writing policy #5241

Uh oh!

Conversation

sipa commented Nov 7, 2014

Uh oh!

sipa commented Nov 12, 2014

Uh oh!

laanwj Nov 13, 2014

Choose a reason for hiding this comment

Uh oh!

sipa Nov 14, 2014

Choose a reason for hiding this comment

Uh oh!

laanwj commented Nov 13, 2014

Uh oh!

laanwj commented Nov 16, 2014

Uh oh!

sipa commented Nov 16, 2014

Uh oh!

laanwj commented Nov 17, 2014

Uh oh!

sipa commented Nov 17, 2014

Uh oh!

sipa commented Nov 17, 2014

Uh oh!

sipa commented Nov 20, 2014

Uh oh!

gmaxwell commented Nov 23, 2014

Uh oh!

rdponticelli commented Nov 23, 2014

Uh oh!

gmaxwell commented Nov 23, 2014

Uh oh!

laanwj commented Nov 24, 2014

Uh oh!

sipa commented Nov 24, 2014

Uh oh!

gmaxwell commented Nov 24, 2014

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants