buffer: Port idleness changes to tokio-0.2#505

Merged
olix0r merged 5 commits into master-tokio-0.2 from ver/idle-buffer-0.2
May 11, 2020

Conversation


@olix0r olix0r commented May 5, 2020

Applies the changes in #502 to the tokio-0.2 branch.

@olix0r olix0r requested review from a team and hawkw May 5, 2020 18:17
Contributor

@hawkw hawkw left a comment


Overall, this looks like it was ported to std::future correctly! I had a couple minor nits, but they're not blockers.

Also, I noticed that the formatting of the IdleError appears to be incorrect when the timeout is >= 1 second. We should probably fix that on master as well.

Once that's fixed, this LGTM!

// .push_spawn_buffer_with_idle_timeout(
// buffer_capacity,
// cache_max_idle_age,
// )
Contributor


Thanks for copying over the changes to the commented-out code as well; that will make my life much easier later :)

@olix0r olix0r requested a review from hawkw May 8, 2020 01:53
Member Author

olix0r commented May 8, 2020

I took another pass at this, simplifying Dispatch into a plain async fn.

idle_timeout: Option<Duration>,
) -> (
Buffer<Req, S::Future>,
impl std::future::Future<Output = ()> + Send + 'static,
Contributor


nit: import Future at the top instead of using the full path here?

S::Response: Send + 'static,
S::Future: Send + 'static,
{
use futures::future;
Contributor


Similar comment: this could live with the other use statements at the top.

Comment on lines +37 to +40
let idle = move || match idle_timeout {
Some(t) => future::Either::Left(dispatch::idle(t)),
None => future::Either::Right(future::pending()),
};
Contributor


Any reason this is a closure rather than being passed directly to dispatch::run? If passed directly, it wouldn't need to be evaluated each iteration.

Contributor


Oh, is this so idle can be repeatedly polled in the loop? This would move the value if we passed in idle: I instead of idle: Fn() -> I.

Member Author


Right, this is a factory for idles, so they can be reset every time we start waiting for a request.
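The factory pattern described here can be illustrated with a minimal sketch using stand-in types (IdleTimeout and wait_each_iteration are hypothetical names, not the linkerd2-buffer API): a future is consumed when awaited, so taking a single idle: I by value would allow only one wait, while a Fn() -> I factory resets the idle timeout on each iteration of the dispatch loop.

```rust
// Hypothetical stand-in for an idle-timeout future; like a future,
// it is consumed (moved) when waited on.
struct IdleTimeout {
    polls_left: u32,
}

impl IdleTimeout {
    // Stand-in for awaiting the timeout; takes `self` by value, the way
    // `.await` consumes a future.
    fn wait(self) -> bool {
        self.polls_left == 0
    }
}

// Taking `idle: IdleTimeout` by value would move it on the first wait.
// A factory lets each iteration start a brand-new timeout, so the idle
// clock is reset every time we begin waiting for a request.
fn wait_each_iteration(make_idle: impl Fn() -> IdleTimeout, iterations: u32) -> u32 {
    let mut fresh_timeouts = 0;
    for _ in 0..iterations {
        let idle = make_idle(); // a fresh timeout per wait
        let _ = idle.wait();
        fresh_timeouts += 1;
    }
    fresh_timeouts
}
```

This is the same reason future::Either is built inside the closure in the snippet above: each call produces a new, unstarted idle future.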

@olix0r olix0r merged commit fc320c7 into master-tokio-0.2 May 11, 2020
@olix0r olix0r deleted the ver/idle-buffer-0.2 branch May 11, 2020 23:53
hawkw added a commit that referenced this pull request Jun 10, 2020
When `linkerd2-buffer` was updated to `std::future` in PR #505, the
behaviour of the buffer was changed subtly. The previous implementation
of the buffer's `Dispatch` task was _poll-based_; it implemented its
logic in an implementation of `Future::poll` with the following
behavior:

1. Call `poll_ready` on the underlying service, returning `NotReady` if
   it is not ready.
2. Broadcast readiness to senders.
3. Call `poll_next` on the channel of requests. If a request is
   received, dispatch it to the service. If no request is ready, return
   `NotReady` (yield).
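The steps above can be sketched as follows (a simplified synchronous model; MockService and the VecDeque "channel" are stand-ins, not the real linkerd2-buffer types). The key property is that every call to poll_dispatch restarts at step 1, re-driving the service to readiness:

```rust
use std::collections::VecDeque;
use std::task::Poll;

// Stand-in for the buffered service.
struct MockService {
    ready: bool,
    calls: Vec<u32>,
}

impl MockService {
    fn poll_ready(&mut self) -> Poll<()> {
        if self.ready { Poll::Ready(()) } else { Poll::Pending }
    }
    fn call(&mut self, req: u32) {
        self.calls.push(req);
    }
}

// Models the old poll-based `Future::poll` for Dispatch: on every
// invocation it begins again at step 1.
fn poll_dispatch(svc: &mut MockService, rx: &mut VecDeque<u32>) -> Poll<()> {
    loop {
        // 1. Drive the service to readiness; yield if it is not ready.
        if svc.poll_ready().is_pending() {
            return Poll::Pending;
        }
        // 2. (Readiness would be broadcast to senders here.)
        // 3. Poll the request channel; dispatch, or yield if empty.
        match rx.pop_front() {
            Some(req) => svc.call(req),
            None => return Poll::Pending, // woken later, resumes at step 1
        }
    }
}
```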

Since this was an implementation of the `poll` function, if we yield due
to the request channel being empty, when we are woken again by the next
request, we resume _at the beginning of the `poll` function_.

The new implementation, however, was written using async/await syntax.
Async/await generates a state machine which, when woken after yielding
at an await point, resumes _from the same await point it yielded at_.
This means that if the new implementation yields because the request
channel is empty, when it is woken by a request, it will **not** drive
the service to readiness before sending that request. Instead, the
previously acquired readiness from before the task yielded is consumed
by that request.
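This resumption behavior can be demonstrated with a small, self-contained example (a hedged sketch, not the buffer code; YieldOnce and dispatch_once are illustrative names). The counter shows that the statement before the await point runs only once, even though the future is polled twice:

```rust
use std::cell::Cell;
use std::future::Future;
use std::pin::Pin;
use std::task::{Context, Poll, RawWaker, RawWakerVTable, Waker};

/// A future that is Pending on its first poll and Ready on the second.
struct YieldOnce(bool);

impl Future for YieldOnce {
    type Output = ();
    fn poll(mut self: Pin<&mut Self>, _cx: &mut Context<'_>) -> Poll<()> {
        if self.0 {
            Poll::Ready(())
        } else {
            self.0 = true;
            Poll::Pending
        }
    }
}

/// `counter` records which statements run: +1 models "drive the service
/// to readiness", +10 models "dispatch the request".
async fn dispatch_once(counter: &Cell<u32>) {
    counter.set(counter.get() + 1); // runs once, before the yield
    YieldOnce(false).await; // yields; the next poll resumes *here*...
    // ...so readiness acquired before the yield is reused, stale or not.
    counter.set(counter.get() + 10);
}

/// Minimal no-op waker so the future can be polled without a runtime.
fn noop_waker() -> Waker {
    const VTABLE: RawWakerVTable = RawWakerVTable::new(|_| RAW, |_| {}, |_| {}, |_| {});
    const RAW: RawWaker = RawWaker::new(std::ptr::null(), &VTABLE);
    unsafe { Waker::from_raw(RAW) }
}
```

Polling this future twice leaves the counter at 11, not 12: the "poll_ready" step before the await is never re-run after the yield.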

This behavior is totally fine with regards to the `tower-service`
readiness contract. All the contract requires is that a call to
`poll_ready` must return `Ready` before each call to `call`. It doesn't
matter if there was a long period of time in between `poll_ready` and
`call`, as long as the readiness was not consumed by another `call`.

However, it is **not** fine from the perspective of the load balancer.
The load balancer relies on `poll_ready` to drive updates from service
discovery. This means that if a long period of time passes between when
the balancer becomes ready and when it is called, it may have a stale
service discovery state. Therefore, this change in behavior broke a
large number of the proxy's integration tests that expect changes to
service discovery state to be reflected in a timely manner.

This commit fixes this issue by updating the new `dispatch::run`
implementation to drive the service to readiness immediately before
dispatching a request. Once the service is driven to readiness
initially, we advertise that it is ready, and call `try_recv` on the
request channel. If there is a request already in the channel, we can
consume the existing readiness. Otherwise, if there is not a request
immediately available, and we have to wait on the channel, we will drive
the service to readiness again before calling it.

This ensures that service discovery changes are reflected for the next
request after they occur, rather than for the request _after_ that
request.
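The fix described above can be sketched like this (a synchronous model with stand-in types; DiscoveryService, drive_ready, and the two VecDeques are illustrative, not the actual dispatch::run code). Readiness acquired before waiting is consumed only by a request that was already queued; after any wait, readiness is re-driven before the call, so discovery updates are applied first:

```rust
use std::collections::VecDeque;

// Stand-in for the balancer: each drive_ready applies pending service
// discovery updates, modeled as a version bump.
struct DiscoveryService {
    version: u64,
    served: Vec<(u32, u64)>, // (request, discovery version it saw)
}

impl DiscoveryService {
    fn drive_ready(&mut self) {
        self.version += 1;
    }
    fn call(&mut self, req: u32) {
        self.served.push((req, self.version));
    }
}

// `queue` models requests already in the channel (try_recv hits);
// `incoming` models requests that arrive only after waiting.
fn run(svc: &mut DiscoveryService, queue: &mut VecDeque<u32>, incoming: &mut VecDeque<u32>) {
    loop {
        // Drive the service to readiness before dispatching.
        svc.drive_ready();
        // try_recv: a request already queued may consume this readiness.
        if let Some(req) = queue.pop_front() {
            svc.call(req);
            continue;
        }
        // Otherwise wait for the next request...
        match incoming.pop_front() {
            Some(req) => {
                // ...and re-drive readiness before calling, so discovery
                // changes that arrived while waiting are applied first.
                svc.drive_ready();
                svc.call(req);
            }
            None => return, // channel closed: shut down
        }
    }
}
```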

Signed-off-by: Eliza Weisman <[email protected]>
olix0r added a commit that referenced this pull request Jun 11, 2020
Signed-off-by: Eliza Weisman <[email protected]>
Co-authored-by: Oliver Gould <[email protected]>