feat(auto_reconnect): allow runtime configuration of give_up_ms (v2) #5701

mbroadst · 2016-04-21T14:49:36Z

Currently the auto_reconnect instance used to reconnect lost nodes
is hardcoded to give up reconnecting after 24 hours. This is not
ideal in some scenarios where a user may want to remove a node from
a cluster, without having to reset all other participating nodes in
the cluster. This patch allows the user to change that value.

mbroadst · 2016-04-21T14:50:07Z

sorry had to open a new PR because I seem to have lost the branch the original PR was opened on (most likely due to trying to solve issues with gitattributes! 😄)

danielmewes · 2016-04-21T20:10:14Z

I'll take another look today.

danielmewes · 2016-04-22T21:15:13Z

src/clustering/administration/main/command_line.cc

+            node_reconnect_timeout_secs * 1000 > std::numeric_limits<int>::max()) {
+            throw std::runtime_error(strprintf(
+                    "ERROR: cluster-reconnect-timeout is too large. Must be at most %d",
+                    std::numeric_limits<int>::max()));


The maximum in the error message needs to take the * 1000 into account.

I think the maximum in the error message is still consistently the actual maximum (numeric_limits::max === biggest supported integer right?). So the error message is correct, it was the checks that needed to ensure that:

a) the input number is indeed within that threshold
b) secondarily, since we intend to use it as ms the number multiplied by 1000 should still be within that range

Perhaps you think the error message should be more descriptive in the event that someone enters a number that is too big only once we are testing for conversion to ms?

It should be:

throw std::runtime_error(strprintf( "ERROR: cluster-reconnect-timeout is too large. Must be at most %d", std::numeric_limits<int>::max() / 1000));

oh right right

danielmewes · 2016-04-22T21:19:43Z

Left just two comments. The rest of it looks great!

Currently the auto_reconnect instance used to reconnect lost nodes is hardcoded to give up reconnecting after 24 hours. This is not ideal in some scenarios where a user may want to remove a node from a cluster, without having to reset all other participating nodes in the cluster. This patch allows the user to change that value.

danielmewes · 2016-04-25T22:10:10Z

Looks good, thanks @mbroadst .

danielmewes · 2016-04-25T22:20:30Z

I'm waiting for some small style improvements to go through review, and will cherry-pick this into v2.3.x then. It will ship with RethinkDB 2.3.2.

mbroadst · 2016-04-25T22:24:54Z

@danielmewes fantastic, thanks!

Currently the auto_reconnect instance used to reconnect lost nodes is hardcoded to give up reconnecting after 24 hours. This is not ideal in some scenarios where a user may want to remove a node from a cluster, without having to reset all other participating nodes in the cluster. This patch allows the user to change that value.

danielmewes · 2016-04-26T00:30:58Z

Cherry-picked into v2.3.x via c64cbe4, with small style improvements for the help output (for this and also other commands) in 057518d.

mbroadst mentioned this pull request Apr 21, 2016

feat(auto_reconnect): allow runtime configuration of give_up_ms #5596

Closed

danielmewes self-assigned this Apr 21, 2016

danielmewes added this to the 2.3.x milestone Apr 21, 2016

danielmewes reviewed Apr 22, 2016
View reviewed changes

mbroadst force-pushed the auto-reconnect-config branch 2 times, most recently from 9e140ca to f677d3e Compare April 25, 2016 16:50

danielmewes merged commit 332d6b6 into rethinkdb:next Apr 25, 2016

mbroadst deleted the auto-reconnect-config branch April 25, 2016 22:24

danielmewes modified the milestones: 2.3.x, 2.3.2 Apr 28, 2016

danielmewes mentioned this pull request May 5, 2016

Document --cluster-reconnect-timeout option rethinkdb/docs#1115

Closed

atomicules mentioned this pull request Jul 19, 2022

Connecting a re-provisioned server brings down the entire cluster #6880

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(auto_reconnect): allow runtime configuration of give_up_ms (v2) #5701

feat(auto_reconnect): allow runtime configuration of give_up_ms (v2) #5701

Uh oh!

mbroadst commented Apr 21, 2016

Uh oh!

mbroadst commented Apr 21, 2016

Uh oh!

danielmewes commented Apr 21, 2016

Uh oh!

danielmewes Apr 22, 2016

Uh oh!

mbroadst Apr 25, 2016

Uh oh!

lbguilherme Apr 25, 2016

Uh oh!

mbroadst Apr 25, 2016

Uh oh!

danielmewes commented Apr 22, 2016

Uh oh!

danielmewes commented Apr 25, 2016

Uh oh!

danielmewes commented Apr 25, 2016

Uh oh!

mbroadst commented Apr 25, 2016

Uh oh!

danielmewes commented Apr 26, 2016

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

feat(auto_reconnect): allow runtime configuration of give_up_ms (v2) #5701

feat(auto_reconnect): allow runtime configuration of give_up_ms (v2) #5701

Uh oh!

Conversation

mbroadst commented Apr 21, 2016

Uh oh!

mbroadst commented Apr 21, 2016

Uh oh!

danielmewes commented Apr 21, 2016

Uh oh!

danielmewes Apr 22, 2016

Choose a reason for hiding this comment

Uh oh!

mbroadst Apr 25, 2016

Choose a reason for hiding this comment

Uh oh!

lbguilherme Apr 25, 2016

Choose a reason for hiding this comment

Uh oh!

mbroadst Apr 25, 2016

Choose a reason for hiding this comment

Uh oh!

danielmewes commented Apr 22, 2016

Uh oh!

danielmewes commented Apr 25, 2016

Uh oh!

danielmewes commented Apr 25, 2016

Uh oh!

mbroadst commented Apr 25, 2016

Uh oh!

danielmewes commented Apr 26, 2016

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants