Parallel ThreadMessageHandler #9488

TheBlueMatt · 2017-01-08T03:35:08Z

Based on a (now outdated) #9441.

This runs multiple ThreadMessageHandlers, but only allows them to do (relatively limited) work - it has a whitelisted list of commands which are expected to not take cs_main, and only runs those in a secondary thread, but running anything in the "main thread" (a concept based on randomly acquiring an atomic_bool at the top of the processing loop).

Additionally, it will never be in the ProcessMessages/SendMessages part of the loop for one node in both threads.

With this, #9419, and #9375 (plus changing the whitelisted list of messages) we can respond to getblocktxn requests while another ProcessMessages is busy connecting the block.

Surprisingly this hasn't been causing me any issues while testing, probably because it requires lots of large blocks to be flying around. Send/Recv corks need tests!

This will be needed so that the message processor can cork incoming messages

These conditions are problematic to check without locking, and we shouldn't be relying on the refcount to disconnect.

when vRecvMsg becomes a private buffer, it won't make sense to allow other threads to mess with it anymore.

This is left-over from before there was proper accounting. Hitting 2x the sendbuffer size should not be possible.

…eserialize We'll soon no longer have access to vRecvMsg, and this is more intuitive anyway.

This allows locking to be pushed down to only where it's needed Also reuse the current time rather than checking multiple times.

This may be used publicly in the future

In order to sleep accurately, the message handler needs to know if _any_ node has more processing that it should do before the entire thread sleeps. Rather than returning a value that represents whether ProcessMessages encountered a message that should trigger a disconnnect, interpret the return value as whether or not that node has more work to do. Also, use a global fProcessSleep value that can be set by other threads, which takes precedence (for one cycle) over the messagehandler's decision. Note that the previous behavior was to only process one message per loop (except in the case of a bad checksum or invalid header). That was added in The only change here in that regard is that the current node now falls to the back of the processing queue for the bad checksum/invalid header cases.

This separates the storage of messages from the net and queued messages for processing, allowing the locks to be split.

Messages are dumped very quickly from the socket handler to the processor, so it's the depth of the processing queue that's interesting. The socket handler checks the process queue's size during the brief message hand-off and pauses if necessary, and the processor possibly unpauses each time a message is popped off of its queue.

Similar to the recv flag, but this one indicates whether or not the net's send buffer is full. The socket handler checks the send queue when a new message is added and pauses if necessary, and possibly unpauses after each message is drained from its buffer.

This is representative of how messages will be sent out once processing is abstracted out. Makes it clear that the processor _must_ accept all messages. It also pushes the use of cs_vProcessMsg to a function with narrow scope.

It's now only used for message size/time accounting

gmaxwell · 2017-01-08T05:29:59Z

src/net_processing.cpp

cs_main is taken by ProcessGetData at like 2456 above. (github won't let me comment there).

gmaxwell · 2017-01-08T05:35:16Z

I am somewhat dubious that it's feasible to determine and maintain that various bits of code do not take cs_main anywhere in their call graph.

I think code like this might need a new debug tool where you can poison_lock(cs_main) which increments a thread local counter while it is in scope. And if the counter is positive when cs_main is taken it causes an assertion/debug print.

TheBlueMatt · 2017-01-08T05:45:48Z

Yea, I thought about adding something like that and never got around to it...I'll try to add it to this in the coming days.

This uses the new return value from ProcessMessages which indicates whether the next message might not need cs_main, trying to process any messages which do not need cs_main, even while another thread is running which takes cs_main for messages.

TheBlueMatt · 2017-01-10T19:57:14Z

Added a DEBUG_LOCKORDER check to prevent cs_main.

TheBlueMatt · 2017-01-17T15:42:52Z

Closing for now. Will work towards a more whole solution for 0.15.

theuni and others added 18 commits January 4, 2017 09:29

net: fix typo causing the wrong receive buffer size

53ad9a1

Surprisingly this hasn't been causing me any issues while testing, probably because it requires lots of large blocks to be flying around. Send/Recv corks need tests!

net: make vRecvMsg a list so that we can use splice()

e5bcd9c

net: make GetReceiveFloodSize public

5b4a8ac

This will be needed so that the message processor can cork incoming messages

net: only disconnect if fDisconnect has been set

f6315e0

These conditions are problematic to check without locking, and we shouldn't be relying on the refcount to disconnect.

net: wait until the node is destroyed to delete its recv buffer

6042587

when vRecvMsg becomes a private buffer, it won't make sense to allow other threads to mess with it anymore.

net: remove redundant max sendbuffer size check

0e973d9

This is left-over from before there was proper accounting. Hitting 2x the sendbuffer size should not be possible.

net: set message deserialization version when it's actually time to d…

56212e2

…eserialize We'll soon no longer have access to vRecvMsg, and this is more intuitive anyway.

net: handle message accounting in ReceiveMsgBytes

cb3456e

This allows locking to be pushed down to only where it's needed Also reuse the current time rather than checking multiple times.

net: record bytes written before notifying the message processor

b8f8d61

net: Add a simple function for waking the message handler

3d23d9f

This may be used publicly in the future

net: remove useless comments

13234c2

net: add a new message queue for the message processor

b5ea4f2

This separates the storage of messages from the net and queued messages for processing, allowing the locks to be split.

net: split the message queueing into its own function

90bfeb0

This is representative of how messages will be sent out once processing is abstracted out. Makes it clear that the processor _must_ accept all messages. It also pushes the use of cs_vProcessMsg to a function with narrow scope.

net: drop the receive lock during message processing

1c779a8

It's now only used for message size/time accounting

Add a new lock to CNode to process one node at once

cf35386

fanquake added the P2P label Jan 8, 2017

gmaxwell reviewed Jan 8, 2017

View reviewed changes

TheBlueMatt force-pushed the 2017-01-parallel-processmessages branch from e8351a2 to 73e696c Compare January 8, 2017 05:43

TheBlueMatt added 4 commits January 8, 2017 00:48

Make ProcessMessages return whether the next message needs cs_main

c4b6c91

Give ProcessMessages a boolean to tell it to avoid cs_main locking

6b4a883

Run multiple (default 2) ProcessMessages threads

ac0a3ad

TheBlueMatt force-pushed the 2017-01-parallel-processmessages branch from 73e696c to ac0a3ad Compare January 8, 2017 05:49

TheBlueMatt added 2 commits January 8, 2017 16:56

Add the ability to disallow a lock in DEBUG_LOCKORDER

93bbd0a

Disallow cs_main in ProcessMessages if we have fAvoidLocking

2e036fe

TheBlueMatt mentioned this pull request Jan 12, 2017

Stop Using cs_main for CNodeState/State() #9419

Closed

TheBlueMatt closed this Jan 17, 2017

bitcoin locked as resolved and limited conversation to collaborators Sep 8, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Parallel ThreadMessageHandler #9488

Parallel ThreadMessageHandler #9488

Uh oh!

TheBlueMatt commented Jan 8, 2017

Uh oh!

gmaxwell Jan 8, 2017

Uh oh!

TheBlueMatt Jan 8, 2017

Uh oh!

gmaxwell commented Jan 8, 2017

Uh oh!

TheBlueMatt commented Jan 8, 2017

Uh oh!

TheBlueMatt commented Jan 10, 2017

Uh oh!

TheBlueMatt commented Jan 17, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Parallel ThreadMessageHandler #9488

Parallel ThreadMessageHandler #9488

Uh oh!

Conversation

TheBlueMatt commented Jan 8, 2017

Uh oh!

gmaxwell Jan 8, 2017

Choose a reason for hiding this comment

Uh oh!

TheBlueMatt Jan 8, 2017

Choose a reason for hiding this comment

Uh oh!

gmaxwell commented Jan 8, 2017

Uh oh!

TheBlueMatt commented Jan 8, 2017

Uh oh!

TheBlueMatt commented Jan 10, 2017

Uh oh!

TheBlueMatt commented Jan 17, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants