
Conversation

@hawkw hawkw commented Nov 15, 2022

This branch implements header-based routing using the new client policy
API added in linkerd/linkerd2-proxy-api#165.

Overview

In particular, this branch does the following:

Client policy discovery:

  • Add a new linkerd-client-policy crate with internal types
    representing the client policies and code for converting protobuf
    policy messages to those representations
  • Add a new policy module in linkerd-app-outbound with an API client
  • Add client policy discovery to the outbound proxy's push_discover
    stack.

This means that profile discovery and client policy discovery:

  • Are stored in the same cache, keyed by original destination IP address
    (a rough sketch of this shared cache follows this list).
  • Currently always occur for all destinations, although the client policy
    result is dropped when no profile is resolved or if the profile is a
    direct endpoint.
  • Occur in roughly the same place in the proxy, and don't require
    propagating original destination addresses to new parts of the outbound
    stack.
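
For illustration only, here is a minimal sketch of the shape of that shared, per-original-destination cache. The `DiscoveryCache`, `Discovery`, `Profile`, and `ClientPolicy` types and the stubbed lookups below are hypothetical stand-ins, not the proxy's actual discovery code:

```rust
use std::collections::HashMap;
use std::net::SocketAddr;
use std::sync::{Arc, Mutex};

// Hypothetical stand-ins for the real discovery result types.
#[derive(Clone, Debug)]
struct Profile;
#[derive(Clone, Debug)]
struct ClientPolicy;

// Both discovery results live in one entry, keyed by the original
// destination address.
#[derive(Clone, Debug)]
struct Discovery {
    profile: Option<Profile>,
    policy: Option<ClientPolicy>,
}

#[derive(Clone, Default)]
struct DiscoveryCache {
    inner: Arc<Mutex<HashMap<SocketAddr, Discovery>>>,
}

impl DiscoveryCache {
    fn get_or_discover(&self, orig_dst: SocketAddr) -> Discovery {
        let mut cache = self.inner.lock().unwrap();
        cache
            .entry(orig_dst)
            .or_insert_with(|| Discovery {
                // Both lookups always occur; the policy result is dropped
                // later if no profile resolves or if the profile is a
                // direct endpoint.
                profile: lookup_profile(orig_dst),
                policy: lookup_policy(orig_dst),
            })
            .clone()
    }
}

// Stubbed lookups standing in for the profile and policy API clients.
fn lookup_profile(_orig_dst: SocketAddr) -> Option<Profile> {
    Some(Profile)
}

fn lookup_policy(_orig_dst: SocketAddr) -> Option<ClientPolicy> {
    Some(ClientPolicy)
}

fn main() {
    let cache = DiscoveryCache::default();
    let dst: SocketAddr = "10.0.0.1:8080".parse().unwrap();
    // The second call hits the cache rather than re-running discovery.
    let first = cache.get_or_discover(dst);
    let second = cache.get_or_discover(dst);
    println!("{:?} / {:?}", first, second);
}
```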

Client policy HTTPRoute matching:

  • Add a router implementation in linkerd-client-policy for matching
    requests against the HTTPRoutes defined in a client policy (a
    simplified sketch of header matching follows this list)
  • Add a new stack in the outbound proxy's push_http_logical stack that
    is used instead of the current Service Profile-based HTTP logical
    stack when a client policy is present
  • Add the HTTPRoute router to the client policy stack
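
To give a concrete (if simplified) picture of the matching step, here is a self-contained sketch of header-based route selection. This is not the actual `linkerd-client-policy` matcher, which implements the full HTTPRoute match semantics; the `HeaderMatch` and `RouteRule` types are invented for the example, and it uses the `http` crate for request types:

```rust
use http::Request;

// Invented types for this example; the real client-policy types model the
// full HTTPRoute match spec (paths, methods, query params, etc.).
#[derive(Debug)]
struct HeaderMatch {
    name: &'static str,
    value: &'static str,
}

#[derive(Debug)]
struct RouteRule {
    headers: Vec<HeaderMatch>,
    backend: &'static str,
}

/// Returns the first rule whose header matches are all satisfied by the
/// request. A rule with no header matches matches every request.
fn find_route<'r, B>(rules: &'r [RouteRule], req: &Request<B>) -> Option<&'r RouteRule> {
    rules.iter().find(|rule| {
        rule.headers.iter().all(|m| {
            req.headers()
                .get(m.name)
                .map(|v| v.as_bytes() == m.value.as_bytes())
                .unwrap_or(false)
        })
    })
}

fn main() {
    let rules = vec![
        RouteRule {
            headers: vec![HeaderMatch { name: "x-canary", value: "true" }],
            backend: "web-canary",
        },
        // Default rule: no header matches, so it matches everything.
        RouteRule { headers: vec![], backend: "web" },
    ];

    let req = Request::builder()
        .uri("http://web.ns.svc.cluster.local/")
        .header("x-canary", "true")
        .body(())
        .unwrap();

    let route = find_route(&rules, &req).expect("some rule should match");
    println!("request routed to backend {:?}", route.backend);
}
```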

Client policy traffic splitting:

  • Move the traffic split implementation from linkerd-service-profiles
    into linkerd-client-policy, and update linkerd-service-profiles to
    depend on the implementation in linkerd-client-policy.
  • Separate the traffic split middleware into fixed and dynamic
    (updated from discovery) traffic splits, because updates to the set of
    routes in a client policy will be handled by the policy router.
  • Build fixed traffic splits for each set of backends defined in an
    HTTPRoute rule in the client policy stack (a rough sketch of such a
    split follows this list).
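
As a rough illustration of what a fixed split over an HTTPRoute rule's backends amounts to, here is a minimal sketch. The `Backend` type is invented for the example, the real implementation is a middleware layer rather than a free function, and it assumes the `rand` crate:

```rust
use rand::Rng;

// Invented stand-in for a client-policy backend: a logical name plus the
// weight from the HTTPRoute rule's backendRefs.
#[derive(Debug)]
struct Backend {
    addr: &'static str,
    weight: u32,
}

/// Picks a backend at random, proportionally to its weight. A "fixed"
/// split is built once from a rule's backends; a "dynamic" split would
/// instead rebuild this set whenever discovery pushes an update.
fn pick<'a, R: Rng>(backends: &'a [Backend], rng: &mut R) -> Option<&'a Backend> {
    let total: u32 = backends.iter().map(|b| b.weight).sum();
    if total == 0 {
        return None;
    }
    let mut roll = rng.gen_range(0..total);
    backends.iter().find(|b| {
        if roll < b.weight {
            true
        } else {
            roll -= b.weight;
            false
        }
    })
}

fn main() {
    let backends = [
        Backend { addr: "web-v1.ns.svc.cluster.local:8080", weight: 90 },
        Backend { addr: "web-v2.ns.svc.cluster.local:8080", weight: 10 },
    ];
    let mut rng = rand::thread_rng();
    for _ in 0..5 {
        let backend = pick(&backends, &mut rng).expect("weights are nonzero");
        println!("-> {}", backend.addr);
    }
}
```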

Not Currently Implemented

  • backendRefs that refer to a direct endpoint rather than a logical
    name are currently ignored
  • integration tests for client policy have yet to be written
  • only traffic splitting is currently implemented (no other policies
    can be set)
  • per-route metrics (open question)
  • client policy and Service Profile API lookups currently run
    sequentially, but could be parallelized (with some rewriting of the
    current profile discovery code)

adleong commented Nov 16, 2022

> The API response for server policies includes metadata for both the
> entire server policy and for the individual routes in that policy.
> On the other hand, the client policy API response only includes
> metadata for HTTP routes. Presumably we will want to add metadata for
> the whole policy message as well?

It looks like in the "inbound" API, there is metadata for each authorization and each httproute. This makes sense because they correspond to k8s resources (e.g. to AuthorizationPolicy and HttpRoute). For "outbound", we attach HttpRoute resources to Services directly. We could have a top level metadata field which describes the Service resource, but this doesn't seem super useful to me.

hawkw commented Nov 16, 2022

> We could have a top level metadata field which describes the Service resource, but this doesn't seem super useful to me.

Hmm, yeah, I suppose the service would already be recorded in other labels...

adleong commented Nov 28, 2022

Food for thought:

If I understand correctly, in the proxy we do a profile lookup in order to get a logical addr (i.e. a Service DNS name), but then throw that away and instead do a client policy lookup using the orig dst. The controller then uses that target IP address to look up, in a Service index, the Service name with that IP. This is redundant.

Would it make sense to move the client policy lookup earlier in the proxy: before the lookup of the logical addr? I think this helps us move toward a world where the service profile API isn't used at all when a client policy exists.

hawkw added a commit that referenced this pull request Dec 6, 2022
Depends on #1992

This branch adds a very rough implementation of traffic splitting based
on client policy backends.

Client policy traffic splitting is implemented by moving the existing
traffic split implementation into the `linkerd-client-policy` crate, and
changing it to operate on the `Backend` type. The `Target` structs stored
in a ServiceProfile are replaced with the `Backend` type, so that the
same traffic split layer can work with a list of backends provided by
either a client policy or a ServiceProfile. The `Logical` target type
has new impls of `Param<Vec<Backend>>` and `Param<BackendStream>` that
choose whether to return backends from client policy discovery or from
the ServiceProfile, based on whether a client policy was discovered for
that logical destination.

This could use some polish, and some cases aren't yet implemented
(client policy backends may be either a named address _or_ a direct
endpoint address, which is not supported by ServiceProfiles, so we'll
need to add additional switching code to handle that case). However,
this should be a decent working steel thread.
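
To illustrate the selection logic described above, here is a minimal, self-contained sketch of the `Param<Vec<Backend>>` idea. The `Param` trait, `Logical`, and `Backend` below are simplified local definitions rather than the actual linkerd types, and the `Param<BackendStream>` half is omitted:

```rust
// Simplified local stand-ins for the real linkerd types.
trait Param<T> {
    fn param(&self) -> T;
}

#[derive(Clone, Debug)]
struct Backend(&'static str);

#[derive(Clone)]
struct Logical {
    // Backends from client policy discovery, if a policy was resolved.
    policy_backends: Option<Vec<Backend>>,
    // Backends derived from the ServiceProfile's traffic split targets.
    profile_backends: Vec<Backend>,
}

impl Param<Vec<Backend>> for Logical {
    fn param(&self) -> Vec<Backend> {
        // Prefer the client policy's backends when a policy was discovered
        // for this logical destination; otherwise fall back to the profile.
        self.policy_backends
            .clone()
            .unwrap_or_else(|| self.profile_backends.clone())
    }
}

fn main() {
    let logical = Logical {
        policy_backends: Some(vec![Backend("web-v2")]),
        profile_backends: vec![Backend("web-v1")],
    };
    let backends: Vec<Backend> = logical.param();
    println!("splitting over {:?}", backends);
}
```
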
@hawkw hawkw changed the title from "rough draft of client policy plumbing" to "add header-based routing using the outbound policy API" Dec 6, 2022
hawkw added 22 commits December 6, 2022 13:02
Signed-off-by: Eliza Weisman <[email protected]>
these are served on separate ports
This branch adds a rough draft of a router for selecting client policy
routes. Right now, once a route is selected, it just gets logged as a
demo that route matching works. A follow-up PR will implement HTTPRoute
traffic splitting once a route is matched.

Depends on #2021

hawkw commented Dec 7, 2022

Updated the PR description to reflect the current state of this branch.

Comment on lines 43 to 47
impl<T, N> NewService<T> for NewServiceRouter<N>
where
    T: Param<Option<Receiver>> + Clone,
    N: NewService<(Option<RoutePolicy>, T)> + Clone,
{
Member

Do we really need to handle these as optional? We should use the stack to enforce that these are present; and all requests must have routes that go through this service, so I'd rather not promulgate optionality to the inner stack.

Member

I know this is different from how it's handled in service profiles; but I think we want to drive towards a state where there is a route description on all requests.

Contributor Author

The route is no longer optional as of fc32de7; the policy rx is still an Option, but I could change this to operate on a separate target type (there's already a TODO about this)...
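
For illustration, the difference under discussion boils down to the bound on the inner stack. The `NewService` trait, `RoutePolicy`, and `Inner` below are simplified local stand-ins, not the real linkerd-stack definitions:

```rust
// Simplified local stand-in for the stack trait involved.
trait NewService<T> {
    type Service;
    fn new_service(&self, target: T) -> Self::Service;
}

#[derive(Clone, Debug)]
struct RoutePolicy;

// Before: the inner stack's target carries an Option, so every inner layer
// has to account for the "no route matched" case.
fn build_with_optional_route<N, T>(inner: &N, route: Option<RoutePolicy>, target: T) -> N::Service
where
    N: NewService<(Option<RoutePolicy>, T)>,
{
    inner.new_service((route, target))
}

// After: the router only builds an inner service once a route has matched,
// so optionality never reaches the inner stack.
fn build_with_route<N, T>(inner: &N, route: RoutePolicy, target: T) -> N::Service
where
    N: NewService<(RoutePolicy, T)>,
{
    inner.new_service((route, target))
}

// A trivial inner stack used to exercise the sketch.
struct Inner;

impl<T: std::fmt::Debug> NewService<(RoutePolicy, T)> for Inner {
    type Service = String;
    fn new_service(&self, (route, target): (RoutePolicy, T)) -> String {
        format!("service for {:?} / {:?}", route, target)
    }
}

fn main() {
    let svc = build_with_route(&Inner, RoutePolicy, "logical-target");
    println!("{}", svc);
}
```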

@hawkw hawkw force-pushed the eliza/client-policy-api branch from f70e578 to 7255fcd December 9, 2022 23:49
hawkw added 2 commits December 9, 2022 15:50
This reverts commit b2b0bd1. That was a
work-in-progress change that I didn't mean to commit, my bad!
Comment on lines 178 to 184
// for now, the client-policy route stack is just a fixed traffic split
let policy = concrete
    .check_new_service::<(ConcreteAddr, Logical), _>()
    .push_map_target(
        |(concrete, PolicyRoute { route: _, logical }): (ConcreteAddr, PolicyRoute)| {
            (concrete, logical)
        },
    )
    .push(policy::split::layer())
@olix0r olix0r Dec 12, 2022

I may be wrong, but my reading of this is that the policy split creates an inner stack for every backend of every route. Does this mean that multiple routes to a single backend service do not share load balancers/connections? Or where are the load balancers cached if not here?

Contributor Author

Yes, that's correct; we should probably add a caching layer here so that each concrete address's client stack is shared across multiple routes that share that backend.

When we add more forms of per-route policy, though (e.g. different retry policies, HTTPRoute filters, and per-route metrics), we'll want to ensure that the portions of that stack that may differ based on which route is used are before the client stack cache.
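
To sketch the caching idea being discussed (hypothetical types only; the proxy would presumably reuse its existing stack-caching middleware rather than a hand-rolled map like this):

```rust
use std::collections::HashMap;
use std::sync::{Arc, Mutex};

// Hypothetical stand-ins: a concrete (resolved) address and the client
// stack built for it (in the proxy this would be a load balancer).
type ConcreteAddr = String;

#[derive(Clone)]
struct Client {
    addr: ConcreteAddr,
}

/// Shares one client per concrete address, so two routes splitting to the
/// same backend reuse the same balancer/connections. Per-route policy
/// (retries, filters, metrics) would be layered outside this cache so it
/// is not shared across routes.
#[derive(Clone, Default)]
struct ClientCache {
    inner: Arc<Mutex<HashMap<ConcreteAddr, Client>>>,
}

impl ClientCache {
    fn get_or_build(&self, addr: &str) -> Client {
        let mut cache = self.inner.lock().unwrap();
        cache
            .entry(addr.to_string())
            .or_insert_with(|| Client { addr: addr.to_string() })
            .clone()
    }
}

fn main() {
    let cache = ClientCache::default();
    // Two different routes that both send traffic to the same backend
    // end up sharing one cached client.
    let a = cache.get_or_build("web.ns.svc.cluster.local:8080");
    let b = cache.get_or_build("web.ns.svc.cluster.local:8080");
    assert_eq!(a.addr, b.addr);
    println!("both routes share a client for {}", a.addr);
}
```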

Comment on lines +128 to +154
// Any per-route policy has to go between the split layer and
// the concrete stack cache, since we _don't_ want to share
// client policy across routes that have the same backend!
.push_map_target(
    |(concrete, PolicyRoute { route: _, logical }): (ConcreteAddr, PolicyRoute)| {
        (concrete, logical)
    },
)
.push(policy::split::layer())
.push_on_service(
    svc::layers()
        .push(svc::layer::mk(svc::SpawnReady::new))
        .push(
            rt.metrics
                .proxy
                .stack
                .layer(stack_labels("http", "HTTPRoute")),
        )
        .push(svc::FailFast::layer("HTTPRoute", dispatch_timeout))
        .push_spawn_buffer(buffer_capacity),
)
.check_new_clone::<PolicyRoute>()
.push_map_target(|(route, logical)| {
    tracing::debug!(?route);
    PolicyRoute { route, logical }
})
.push(policy::http::NewServiceRouter::layer())
Member

If the concrete stack is cached, can we omit the cache/buffer around the split so that every route gets its own split?

If so, perhaps it makes sense to pull it all into one layer that is Logical-outside and Concrete-inside... We'll need to consider how/where per-route functionality is added for things like retries/timeouts.

@hawkw hawkw closed this Feb 22, 2023
@olix0r olix0r deleted the eliza/client-policy-api branch March 7, 2023 21:49