round_robin: periodically retry connecting to failed subchannels

As [discussed](https://github.com/grpc/grpc/issues/11578#issuecomment-311057963) in #11578, the `round_robin` load-balancer should periodically attempt to reconnect to failed subchannels.

As it stands, if a service lives on _n_ remote servers, all of which eventually undergo downtime/maintenance, then a (long-lived) client will gradually lose connectivity to more and more servers, until only one is left, and only when that one goes down, does it reconnect to all of them. Not especially great for load-balancing.

I'm creating this issue for tracking, as advised by @dgquintas. Hopefully this can get fixed for 1.5.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

round_robin: periodically retry connecting to failed subchannels #11643

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

round_robin: periodically retry connecting to failed subchannels #11643

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions