Skip to content

Commit b98af7d

Browse files
authored
update new trait solver docs (rust-lang#1802)
* rewrite requirements/invariants * add some more info about the trait solver * CI * review
1 parent b7e20c4 commit b98af7d

File tree

4 files changed

+212
-78
lines changed

4 files changed

+212
-78
lines changed

src/SUMMARY.md

+1
Original file line numberDiff line numberDiff line change
@@ -124,6 +124,7 @@
124124
- [Goals and clauses](./traits/goals-and-clauses.md)
125125
- [Canonical queries](./traits/canonical-queries.md)
126126
- [Next-gen trait solving](./solve/trait-solving.md)
127+
- [Invariants of the type system](./solve/invariants.md)
127128
- [The solver](./solve/the-solver.md)
128129
- [Canonicalization](./solve/canonicalization.md)
129130
- [Coinduction](./solve/coinduction.md)

src/solve/invariants.md

+145
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,145 @@
1+
# Invariants of the type system
2+
3+
FIXME: This file talks about invariants of the type system as a whole, not only the solver
4+
5+
There are a lot of invariants - things the type system guarantees to be true at all times -
6+
which are desirable or expected from other languages and type systems. Unfortunately, quite
7+
a few of them do not hold in Rust right now. This is either a fundamental to its design or
8+
caused by bugs and something that may change in the future.
9+
10+
It is important to know about the things you can assume while working on - and with - the
11+
type system, so here's an incomplete and inofficial list of invariants of
12+
the core type system:
13+
14+
- ✅: this invariant mostly holds, with some weird exceptions, you can rely on it outside
15+
of these cases
16+
- ❌: this invariant does not hold, either due to bugs or by design, you must not rely on
17+
it for soundness or have to be incredibly careful when doing so
18+
19+
### `wf(X)` implies `wf(normalize(X))`
20+
21+
If a type containing aliases is well-formed, it should also be
22+
well-formed after normalizing said aliases. We rely on this as
23+
otherwise we would have to re-check for well-formedness for these
24+
types.
25+
26+
This is unfortunately broken for `<fndef as FnOnce<..>>::Output` due to implied bounds,
27+
resulting in [#114936].
28+
29+
### applying inference results from a goal does not change its result ❌
30+
31+
TODO: this invariant is formulated in a weird way and needs to be elaborated.
32+
Pretty much: I would like this check to only fail if there's a solver bug:
33+
https://github.com/rust-lang/rust/blob/2ffeb4636b4ae376f716dc4378a7efb37632dc2d/compiler/rustc_trait_selection/src/solve/eval_ctxt.rs#L391-L407
34+
35+
If we prove some goal/equate types/whatever, apply the resulting inference constraints,
36+
and then redo the original action, the result should be the same.
37+
38+
This unfortunately does not hold - at least in the new solver - due to a few annoying reasons.
39+
40+
### The trait solver has to be *locally sound*
41+
42+
This means that we must never return *success* for goals for which no `impl` exists. That would
43+
mean we assume a trait is implemented even though it is not, which is very likely to result in
44+
actual unsoundness. When using `where`-bounds to prove a goal, the `impl` will be provided by the
45+
user of the item.
46+
47+
This invariant only holds if we check region constraints. As we do not check region constraints
48+
during implicit negative overlap check in coherence, this invariant is broken there. As this check
49+
relies on *completeness* of the trait solver, it is not able to use the current region constraints
50+
check - `InferCtxt::resolve_regions` - as its handling of type outlives goals is incomplete.
51+
52+
### Normalization of semantically equal aliases in empty environments results in a unique type ✅
53+
54+
Normalization for alias types/consts has to have a unique result. Otherwise we can easily
55+
implement transmute in safe code. Given the following function, we have to make sure that
56+
the input and output types always get normalized to the same concrete type.
57+
58+
```rust
59+
fn foo<T: Trait>(
60+
x: <T as Trait>::Assoc
61+
) -> <T as Trait>::Assoc {
62+
x
63+
}
64+
```
65+
66+
Many of the currently known unsound issues end up relying on this invariant being broken.
67+
It is however very difficult to imagine a sound type system without this invariant, so
68+
the issue is that the invariant is broken, not that we incorrectly rely on it.
69+
70+
### Generic goals and their instantiations have the same result ✅
71+
72+
Pretty much: If we successfully typecheck a generic function concrete instantiations
73+
of that function should also typeck. We should not get errors post-monomorphization.
74+
We can however get overflow errors at that point.
75+
76+
TODO: example for overflow error post-monomorphization
77+
78+
This invariant is relied on to allow the normalization of generic aliases. Breaking
79+
it can easily result in unsoundness, e.g. [#57893](https://github.com/rust-lang/rust/issues/57893)
80+
81+
### Trait goals in empty environments are proven by a unique impl ✅
82+
83+
If a trait goal holds with an empty environment, there should be a unique `impl`,
84+
either user-defined or builtin, which is used to prove that goal. This is
85+
necessary to select a unique method. It
86+
87+
We do however break this invariant in few cases, some of which are due to bugs,
88+
some by design:
89+
- *marker traits* are allowed to overlap as they do not have associated items
90+
- *specialization* allows specializing impls to overlap with their parent
91+
- the builtin trait object trait implementation can overlap with a user-defined impl:
92+
[#57893]
93+
94+
### The type system is complete ❌
95+
96+
The type system is not complete, it often adds unnecessary inference constraints, and errors
97+
even though the goal could hold.
98+
99+
- method selection
100+
- opaque type inference
101+
- handling type outlives constraints
102+
- preferring `ParamEnv` candidates over `Impl` candidates during candidate selection
103+
in the trait solver
104+
105+
#### The type system is complete during the implicit negative overlap check in coherence ✅
106+
107+
During the implicit negative overlap check in coherence we must never return *error* for
108+
goals which can be proven. This would allow for overlapping impls with potentially different
109+
associated items, breaking a bunch of other invariants.
110+
111+
This invariant is currently broken in many different ways while actually something we rely on.
112+
We have to be careful as it is quite easy to break:
113+
- generalization of aliases
114+
- generalization during subtyping binders (luckily not exploitable in coherence)
115+
116+
### Trait solving must be (free) lifetime agnostic ✅
117+
118+
Trait solving during codegen should have the same result as during typeck. As we erase
119+
all free regions during codegen we must not rely on them during typeck. A noteworthy example
120+
is special behavior for `'static`.
121+
122+
We also have to be careful with relying on equality of regions in the trait solver.
123+
This is fine for codegen, as we treat all erased regions as equal. We can however
124+
lose equality information from HIR to MIR typeck.
125+
126+
The new solver "uniquifies regions" during canonicalization, canonicalizing `u32: Trait<'x, 'x>`
127+
as `exists<'0, '1> u32: Trait<'0, '1>`, to make it harder to rely on this property.
128+
129+
### Removing ambiguity makes strictly more things compile ❌
130+
131+
Ideally we *should* not rely on ambiguity for things to compile.
132+
Not doing that will cause future improvements to be breaking changes.
133+
134+
Due to *incompleteness* this is not the case and improving inference can result in inference
135+
changes, breaking existing projects.
136+
137+
### Semantic equality implies structural equality ✅
138+
139+
Two types being equal in the type system must mean that they have the
140+
same `TypeId` after instantiating their generic parameters with concrete
141+
arguments. This currently does not hold: [#97156].
142+
143+
[#57893]: https://github.com/rust-lang/rust/issues/57893
144+
[#97156]: https://github.com/rust-lang/rust/issues/97156
145+
[#114936]: https://github.com/rust-lang/rust/issues/114936

src/solve/the-solver.md

+66-7
Original file line numberDiff line numberDiff line change
@@ -6,12 +6,71 @@ approach.
66

77
[chalk]: https://rust-lang.github.io/chalk/book/recursive.html
88

9-
The basic structure of the solver is a pure function
10-
`fn evaluate_goal(goal: Goal<'tcx>) -> Response`.
11-
While the actual solver is not fully pure to deal with overflow and cycles, we are
12-
going to defer that for now.
9+
## A rough walkthrough
1310

14-
To deal with inference variables and to improve caching, we use
15-
[canonicalization](./canonicalization.md).
11+
The entry-point of the solver is `InferCtxtEvalExt::evaluate_root_goal`. This
12+
function sets up the root `EvalCtxt` and then calls `EvalCtxt::evaluate_goal`,
13+
to actually enter the trait solver.
1614

17-
TODO: write the remaining code for this as well.
15+
`EvalCtxt::evaluate_goal` handles [canonicalization](./canonicalization.md), caching,
16+
overflow, and solver cycles. Once that is done, it creates a nested `EvalCtxt` with a
17+
separate local `InferCtxt` and calls `EvalCtxt::compute_goal`, which is responsible for the
18+
'actual solver behavior'. We match on the `PredicateKind`, delegating to a separate function
19+
for each one.
20+
21+
For trait goals, such a `Vec<T>: Clone`, `EvalCtxt::compute_trait_goal` has
22+
to collect all the possible ways this goal can be proven via
23+
`EvalCtxt::assemble_and_evaluate_candidates`. Each candidate is handled in
24+
a separate "probe", to not leak inference constraints to the other candidates.
25+
We then try to merge the assembled candidates via `EvalCtxt::merge_candidates`.
26+
27+
28+
## Important concepts and design pattern
29+
30+
### `EvalCtxt::add_goal`
31+
32+
To prove nested goals, we don't directly call `EvalCtxt::compute_goal`, but instead
33+
add the goal to the `EvalCtxt` with `EvalCtxt::all_goal`. We then prove all nested
34+
goals together in either `EvalCtxt::try_evaluate_added_goals` or
35+
`EvalCtxt::evaluate_added_goals_and_make_canonical_response`. This allows us to handle
36+
inference constraints from later goals.
37+
38+
E.g. if we have both `?x: Debug` and `(): ConstrainToU8<?x>` as nested goals,
39+
then proving `?x: Debug` is initially ambiguous, but after proving `(): ConstrainToU8<?x>`
40+
we constrain `?x` to `u8` and proving `u8: Debug` succeeds.
41+
42+
### Matching on `TyKind`
43+
44+
We lazily normalize types in the solver, so we always have to assume that any types
45+
and constants are potentially unnormalized. This means that matching on `TyKind` can easily
46+
be incorrect.
47+
48+
We handle normalization in two different ways. When proving `Trait` goals when normalizing
49+
associated types, we separately assemble candidates depending on whether they structurally
50+
match the self type. Candidates which match on the self type are handled in
51+
`EvalCtxt::assemble_candidates_via_self_ty` which recurses via
52+
`EvalCtxt::assemble_candidates_after_normalizing_self_ty`, which normalizes the self type
53+
by one level. In all other cases we have to match on a `TyKind` we first use
54+
`EvalCtxt::try_normalize_ty` to normalize the type as much as possible.
55+
56+
### Higher ranked goals
57+
58+
In case the goal is higher-ranked, e.g. `for<'a> F: FnOnce(&'a ())`, `EvalCtxt::compute_goal`
59+
eagerly instantiates `'a` with a placeholder and then recursively proves
60+
`F: FnOnce(&'!a ())` as a nested goal.
61+
62+
### Dealing with choice
63+
64+
Some goals can be proven in multiple ways. In these cases we try each option in
65+
a separate "probe" and then attempt to merge the resulting responses by using
66+
`EvalCtxt::try_merge_responses`. If merging the responses fails, we use
67+
`EvalCtxt::flounder` instead, returning ambiguity. For some goals, we try
68+
incompletely prefer some choices over others in case `EvalCtxt::try_merge_responses`
69+
fails.
70+
71+
## Learning more
72+
73+
The solver should be fairly self-contained. I hope that the above information provides a
74+
good foundation when looking at the code itself. Please reach out on zulip if you get stuck
75+
while doing so or there are some quirks and design decisions which were unclear and deserve
76+
better comments or should be mentioned here.

src/solve/trait-solving.md

-71
Original file line numberDiff line numberDiff line change
@@ -39,77 +39,6 @@ which does not have any nested goals. Therefore `Vec<T>: Clone` holds.
3939
The trait solver can either return success, ambiguity or an error as a [`CanonicalResponse`].
4040
For success and ambiguity it also returns constraints inference and region constraints.
4141

42-
## Requirements
43-
44-
Before we dive into the new solver lets first take the time to go through all of our requirements
45-
on the trait system. We can then use these to guide our design later on.
46-
47-
TODO: elaborate on these rules and get more precise about their meaning.
48-
Also add issues where each of these rules have been broken in the past
49-
(or still are).
50-
51-
### 1. The trait solver has to be *sound*
52-
53-
This means that we must never return *success* for goals for which no `impl` exists. That would
54-
simply be unsound by assuming a trait is implemented even though it is not. When using predicates
55-
from the `where`-bounds, the `impl` will be proved by the user of the item.
56-
57-
### 2. If type checker solves generic goal concrete instantiations of that goal have the same result
58-
59-
Pretty much: If we successfully typecheck a generic function concrete instantiations
60-
of that function should also typeck. We should not get errors post-monomorphization.
61-
We can however get overflow as in the following snippet:
62-
63-
```rust
64-
fn foo<T: Trait>(x: )
65-
```
66-
67-
### 3. Trait goals in empty environments are proven by a unique impl
68-
69-
If a trait goal holds with an empty environment, there is a unique `impl`,
70-
either user-defined or builtin, which is used to prove that goal.
71-
72-
This is necessary for codegen to select a unique method.
73-
An exception here are *marker traits* which are allowed to overlap.
74-
75-
### 4. Normalization in empty environments results in a unique type
76-
77-
Normalization for alias types/consts has a unique result. Otherwise we can easily implement
78-
transmute in safe code. Given the following function, we have to make sure that the input and
79-
output types always get normalized to the same concrete type.
80-
```rust
81-
fn foo<T: Trait>(
82-
x: <T as Trait>::Assoc
83-
) -> <T as Trait>::Assoc {
84-
x
85-
}
86-
```
87-
88-
### 5. During coherence trait solving has to be complete
89-
90-
During coherence we never return *error* for goals which can be proven. This allows overlapping
91-
impls which would break rule 3.
92-
93-
### 6. Trait solving must be (free) lifetime agnostic
94-
95-
Trait solving during codegen should have the same result as during typeck. As we erase
96-
all free regions during codegen we must not rely on them during typeck. A noteworthy example
97-
is special behavior for `'static`.
98-
99-
We also have to be careful with relying on equality of regions in the trait solver.
100-
This is fine for codegen, as we treat all erased regions as equal. We can however
101-
lose equality information from HIR to MIR typeck.
102-
103-
### 7. Removing ambiguity makes strictly more things compile
104-
105-
We *should* not rely on ambiguity for things to compile.
106-
Not doing that will cause future improvements to be breaking changes.
107-
108-
### 8. semantic equality implies structural equality
109-
110-
Two types being equal in the type system must mean that they have the same `TypeId`.
111-
112-
11342
[solve]: https://doc.rust-lang.org/nightly/nightly-rustc/rustc_trait_selection/solve/index.html
11443
[`Goal`]: https://doc.rust-lang.org/nightly/nightly-rustc/rustc_trait_selection/traits/solve/struct.Goal.html
11544
[`Predicate`]: https://doc.rust-lang.org/nightly/nightly-rustc/rustc_middle/ty/struct.Predicate.html

0 commit comments

Comments
 (0)