[ty] Improve completion ranking based on origin and exact match by BurntSushi · Pull Request #22460 · astral-sh/ruff

BurntSushi · 2026-01-08T16:07:42Z

This PR does some refactoring to completions and improves ranking based
on the origin of the symbol and whether there is an exact match.

This addresses a lot of the feedback around preferring, e.g., symbols
from typing over symbols from other stdlib modules. This also
establishes a more general preference ordering for symbols based on
their origin: current module, project modules, third party modules and
then stdlib modules.

Finally, we give very high ranking to symbols that exactly match what
the end user has typed.

Reviewers are encouraged to read this PR commit-by-commit.

Fixes astral-sh/ty#1274

BurntSushi · 2026-01-08T16:09:18Z

cc @RasmusNygren in case you want to review here. In particular the refactoring in the earlier commits here.

astral-sh-bot · 2026-01-08T16:09:49Z

Diagnostic diff on typing conformance tests

No changes detected when running ty on typing conformance tests ✅

astral-sh-bot · 2026-01-08T16:10:39Z

`mypy_primer` results

Changes were detected when running on open source projects

Tanjun (https://github.com/FasterSpeeding/Tanjun)
- tanjun/dependencies/data.py:347:12: error[invalid-return-type] Return type does not match returned value: expected `_T@cached_inject`, found `_T@cached_inject | Coroutine[Any, Any, _T@cached_inject | Coroutine[Any, Any, _T@cached_inject]]`
+ tanjun/dependencies/data.py:347:12: error[invalid-return-type] Return type does not match returned value: expected `_T@cached_inject`, found `Coroutine[Any, Any, _T@cached_inject | Coroutine[Any, Any, _T@cached_inject]] | _T@cached_inject`

prefect (https://github.com/PrefectHQ/prefect)
- src/prefect/deployments/runner.py:795:70: warning[possibly-missing-attribute] Attribute `__name__` may be missing on object of type `Unknown | (((...) -> Any) & ((*args: object, **kwargs: object) -> object))`
+ src/prefect/deployments/runner.py:795:70: warning[possibly-missing-attribute] Attribute `__name__` may be missing on object of type `Unknown | ((...) -> Any)`
+ src/prefect/flow_engine.py:812:32: error[invalid-await] `Unknown | R@FlowRunEngine | Coroutine[Any, Any, R@FlowRunEngine]` is not awaitable
+ src/prefect/flow_engine.py:1401:24: error[invalid-await] `Unknown | R@AsyncFlowRunEngine | Coroutine[Any, Any, R@AsyncFlowRunEngine]` is not awaitable
+ src/prefect/flow_engine.py:1482:43: error[invalid-argument-type] Argument to function `next` is incorrect: Expected `SupportsNext[Unknown]`, found `Unknown | R@run_generator_flow_sync`
+ src/prefect/flow_engine.py:1490:21: warning[possibly-missing-attribute] Attribute `throw` may be missing on object of type `Unknown | R@run_generator_flow_sync`
+ src/prefect/flow_engine.py:1524:44: warning[possibly-missing-attribute] Attribute `__anext__` may be missing on object of type `Unknown | R@run_generator_flow_async`
+ src/prefect/flow_engine.py:1531:25: warning[possibly-missing-attribute] Attribute `throw` may be missing on object of type `Unknown | R@run_generator_flow_async`
- src/prefect/flows.py:286:34: error[unresolved-attribute] Object of type `((**P@Flow) -> R@Flow) & ((*args: object, **kwargs: object) -> object)` has no attribute `__name__`
+ src/prefect/flows.py:286:34: error[unresolved-attribute] Object of type `(**P@Flow) -> R@Flow` has no attribute `__name__`
- src/prefect/flows.py:404:68: error[unresolved-attribute] Object of type `((**P@Flow) -> R@Flow) & ((*args: object, **kwargs: object) -> object)` has no attribute `__name__`
+ src/prefect/flows.py:404:68: error[unresolved-attribute] Object of type `(**P@Flow) -> R@Flow` has no attribute `__name__`
- src/prefect/flows.py:1750:53: warning[unused-ignore-comment] Unused blanket `type: ignore` directive
- Found 5363 diagnostics
+ Found 5368 diagnostics

static-frame (https://github.com/static-frame/static-frame)
- static_frame/core/index.py:580:16: error[invalid-return-type] Return type does not match returned value: expected `InterGetItemLocReduces[TVContainer_co@loc, TVDtype@Index]`, found `InterGetItemLocReduces[Any | Bottom[Series[Any, Any]], TVDtype@Index]`
+ static_frame/core/index.py:580:16: error[invalid-return-type] Return type does not match returned value: expected `InterGetItemLocReduces[TVContainer_co@loc, TVDtype@Index]`, found `InterGetItemLocReduces[Bottom[Series[Any, Any]] | Any, TVDtype@Index]`
+ static_frame/core/series.py:772:16: error[invalid-return-type] Return type does not match returned value: expected `InterGetItemILocReduces[Series[Any, Any], TVDtype@Series]`, found `InterGetItemILocReduces[Series[Any, Any] | Bottom[Index[Any]] | TypeBlocks | ... omitted 6 union elements, TVDtype@Series]`
- static_frame/core/series.py:4072:16: error[invalid-return-type] Return type does not match returned value: expected `InterGetItemILocReduces[SeriesHE[Any, Any], TVDtype@SeriesHE]`, found `InterGetItemILocReduces[Bottom[Series[Any, Any]] | ndarray[Never, Never] | TypeBlocks | ... omitted 7 union elements, TVDtype@SeriesHE]`
+ static_frame/core/series.py:4072:16: error[invalid-return-type] Return type does not match returned value: expected `InterGetItemILocReduces[SeriesHE[Any, Any], TVDtype@SeriesHE]`, found `InterGetItemILocReduces[Bottom[Series[Any, Any]] | Bottom[Index[Any]] | TypeBlocks | ... omitted 7 union elements, TVDtype@SeriesHE]`
- Found 1837 diagnostics
+ Found 1838 diagnostics

pandas-stubs (https://github.com/pandas-dev/pandas-stubs)
+ pandas-stubs/_typing.pyi:1232:16: warning[unused-ignore-comment] Unused blanket `type: ignore` directive
- Found 5160 diagnostics
+ Found 5161 diagnostics

No memory usage changes detected ✅

MichaReiser

Nice!

crates/ty_ide/src/completion.rs

MichaReiser · 2026-01-09T08:39:13Z

crates/ty_ide/src/completion.rs

    pub documentation: Option<Docstring>,
+    /// Information used to sort this completion relative to others
+    /// in the same collection.
+    relevance: Relevance,


Should we make the fields on Completion private and expose getter methods instead to ensure a Completion is never modified after computing its relevance?

That's kinda my inclination personally, yeah. Even aside from this new Relevance thing being attached to Completion.

I'll try this out in a follow-up PR.

MichaReiser · 2026-01-09T08:47:39Z

crates/ty_ide/src/completion.rs

    /// sequence of completions.
    fn into_completions(mut self) -> Vec<Completion<'db>> {
-        self.items.sort_by(|c1, c2| self.context.compare(c1, c2));
+        self.items.sort_by(rank);


Could Completion import PartialOrd and Ord now that Relevance is stored on the Completion?

It could, but I at least for now chose not to do that. See the rank function docs:

/// Return an ordering relating the two completions. /// /// A `Ordering::Less` is returned when `c1` should be ranked above /// `c2`. A `Ordering::Greater` is returned when `c1` should be ranked /// below `c2`. In other words, a standard ascending sort used with /// this comparison routine will yields the "best ranked" completions /// first. /// /// Note that this could have been implemented via `Eq` and `Ord` /// impls on `Completion`, but is instead a separate function to avoid /// conflating relevance ranking with identity. fn rank(c1: &Completion<'_>, c2: &Completion<'_>) -> Ordering { (&c1.relevance, &c1.name).cmp(&(&c2.relevance, &c2.name)) }

(I think it's possible we do ultimately do this.)

crates/ty_ide/src/all_symbols.rs

crates/ty_completion_eval/truth/typing-gets-priority/main.py

MichaReiser · 2026-01-09T08:57:23Z

crates/ty_ide/src/completion.rs

+    Namespace,
+    /// Symbols defined somewhere in a dependency, direct or
+    /// indirect.
+    ThirdParty,


Is it intentional that we prioritize third-party dependencies over stdlib?

That was my thinking yeah. With the idea being that third party dependencies are "more specific" to the user's context.

But this may very well be overruled by the ubiquity and frequency of use for stdlib packages.

Another factor here is that it's all third party code, not just direct dependencies. So it's likelier to be noisier.

Yeah, my thinking here is: if something's available in the standard library, I probably want to use that over any third-party equivalent.

I'm also surprised that this doesn't regress the typing use case, or import os as they are now both less specific than a similarly named module in any third-party library.

For typing, a lot of the conflicts come from other standard library modules.

The reason why I went this route was this evaluation task:

# This test was originally written to # check that a third party dependency # gets priority over stdlib. But it was # not clearly the right choice[1]. # # 2026-01-09: We stuck with it for now, # but it seems likely that we'll want # to regress this task in favor of # another. # # [1]: https://github.com/astral-sh/ruff/pull/22460#discussion_r2676343225 fullma<CURSOR: regex.fullmatch>

Where pyproject.toml contains the regex dependency. In this context, you probably want fullmatch from regex over the stdlib re module. I assume this applies to other third party packages that provide a drop-in replacement for stdlib modules. But maybe this case isn't more important than the ones you point out.

I added the comment above in response here. I think you're ultimately going to be right here, but I'd like to iterate on this in response to user feedback.

crates/ty_ide/src/completion.rs

I went through astral-sh/ty#1274 and tried to extract what I could into eval tasks. Some of the suggestions from that issue have already been done, but most haven't. This captures the status quo.

I think "relevance" captures the meaning of this type more precisely.

I want to be able to attach extra data to each `Completion`, but not burden callers with the need to construct it. This commit helps get us to that point by requiring callers to use a `CompletionBuilder` for construction instead of a `Completion` itself. I think this will also help in the future if it proves to be the case that we can improve performance by delaying work until we actually build a `Completion`, which might only happen if we know we won't throw it out. But we aren't quite there yet. This also lets us tighten things up a little bit and makes completion construction less noisy. The downside is that callers no longer need to consider "every" completion field. There should not be any behavior changes here.

This moves the information we want to use to rank completions into `Completion` itself. (This was the primary motivation for creating a `CompletionBuilder` in the first place.) The principal advantage here is that we now only need to compute the relevance information for each completion exactly once. Previously, we were computing it on every comparison, which might end up doing redundant work for a not insignifcant number of completions. The relevance information is also specifically constructed from the builder so that, in the future, we might choose to short-circuit construction if we know we'll never send it back to the client (e.g., its ranking is worse than the lowest ranked completion). But we don't implement that optimization yet.

This addresses a number of minor annoyances where users really want imports from `typing` most of the time. For example: astral-sh/ty#1274 (comment)

Previously, we would only sort by name and file path (unless the symbol refers to an entire module). This led to somewhat confusing ordering, since the file path is absolute and can be anything. Sorting by module name after symbol name gives a more predictable ordering in common practice. This is mostly just an improvement for debugging purposes, i.e., when looking at the output of `all_symbols` directly. This mostly shouldn't impact completion ordering since completions do their own ranking.

I apparently don't know how to use my own API. Previously, we would skip, e.g., `.venv`, but still descend into it. This was annoying in practice because I sometimes have an environment in one of the truth task directories. The eval command should ignore that entirely, but it ended up choking on it without properly ignoring hidden files and directories.

astral-sh-bot · 2026-01-09T14:09:56Z

`ruff-ecosystem` results

Linter (stable)

✅ ecosystem check detected no linter changes.

Linter (preview)

✅ ecosystem check detected no linter changes.

Formatter (stable)

✅ ecosystem check detected no format changes.

Formatter (preview)

✅ ecosystem check detected no format changes.

BurntSushi · 2026-01-09T14:20:46Z

I checked out the mypy_primer differences, but they look like flakes. When I re-run it, it still reports differences, but different differences from the previous run. Since there's no conspicuous change in this PR that should lead to such differences, I'm assuming they are flakes.

This commit adds two new tests. One checks that a symbol in the current project gets priority over a symbol in the standard library. Another checks that a symbol in a third party dependency gets priority over a symbol in the standard library. We don't get either of these right today. Note that these comparisons are done ceteris paribus. A symbol from the standard library could still be ranked above a symbol elsewhere. (Although I believe currently this is somewhat rare.)

Part of this was already done, but it was half-assed. We now look at the search path that a symbol came from and centralize a symbol's origin classification. The preference ordering here is maybe not the right one, but we can iterate as users give us feedback. Note also that the preference ordering based on the origin is pretty low in the relevance sorting. This means that other more specific criteria will and can override this. This results in some nice improvements to our evaluation tasks.

This is now strictly redundant with the `typing-gets-priority` task.

Aside from ruling out "impossible" completions in the current context, we give completions with an exact match the highest possible ranking.

RasmusNygren

LGTM!

BurntSushi requested review from AlexWaygood, Gankra, MichaReiser, carljm, dcreager and sharkdp as code owners January 8, 2026 16:07

BurntSushi removed request for Gankra, carljm, dcreager and sharkdp January 8, 2026 16:08

BurntSushi added server Related to the LSP server ty Multi-file analysis & type inference labels Jan 8, 2026

BurntSushi force-pushed the ag/ranking-improvements branch 2 times, most recently from 464d31e to 6ff8447 Compare January 8, 2026 19:08

MichaReiser approved these changes Jan 9, 2026

View reviewed changes

BurntSushi added 7 commits January 9, 2026 08:58

[ty] Add evaluation tasks for a few cases

3a3f143

I went through astral-sh/ty#1274 and tried to extract what I could into eval tasks. Some of the suggestions from that issue have already been done, but most haven't. This captures the status quo.

[ty] Rename internal completion Rank type to Relevance

77e41aa

I think "relevance" captures the meaning of this type more precisely.

[ty] Rank unimported symbols from typing higher

eb6e649

This addresses a number of minor annoyances where users really want imports from `typing` most of the time. For example: astral-sh/ty#1274 (comment)

BurntSushi force-pushed the ag/ranking-improvements branch from 6ff8447 to d9d8596 Compare January 9, 2026 13:58

BurntSushi added 2 commits January 9, 2026 09:42

[ty] Remove type-var-typing-over-ast evaluation task

5457a8f

This is now strictly redundant with the `typing-gets-priority` task.

[ty] Give exact completion matches very high ranking

15cc74e

Aside from ruling out "impossible" completions in the current context, we give completions with an exact match the highest possible ranking.

BurntSushi force-pushed the ag/ranking-improvements branch from d9d8596 to 15cc74e Compare January 9, 2026 14:42

BurntSushi merged commit 91d24eb into main Jan 9, 2026
48 checks passed

BurntSushi deleted the ag/ranking-improvements branch January 9, 2026 14:55

RasmusNygren reviewed Jan 9, 2026

View reviewed changes

YilunAllenChen mentioned this pull request Mar 1, 2026

Feature Request: Completion Item Ranking astral-sh/ty#2927

Open

Conversation

BurntSushi commented Jan 8, 2026

Uh oh!

BurntSushi commented Jan 8, 2026

Uh oh!

astral-sh-bot bot commented Jan 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Diagnostic diff on typing conformance tests

Uh oh!

astral-sh-bot bot commented Jan 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

mypy_primer results

Uh oh!

MichaReiser left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

MichaReiser Jan 9, 2026

Choose a reason for hiding this comment

Uh oh!

BurntSushi Jan 9, 2026

Choose a reason for hiding this comment

Uh oh!

MichaReiser Jan 9, 2026

Choose a reason for hiding this comment

Uh oh!

BurntSushi Jan 9, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

MichaReiser Jan 9, 2026

Choose a reason for hiding this comment

Uh oh!

BurntSushi Jan 9, 2026

Choose a reason for hiding this comment

Uh oh!

MichaReiser Jan 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

BurntSushi Jan 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

astral-sh-bot bot commented Jan 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

ruff-ecosystem results

Linter (stable)

Linter (preview)

Formatter (stable)

Formatter (preview)

Uh oh!

BurntSushi commented Jan 9, 2026

Uh oh!

Uh oh!

RasmusNygren left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

astral-sh-bot bot commented Jan 8, 2026 •

edited

Loading

astral-sh-bot bot commented Jan 8, 2026 •

edited

Loading

`mypy_primer` results

MichaReiser Jan 9, 2026 •

edited

Loading

BurntSushi Jan 9, 2026 •

edited

Loading

astral-sh-bot bot commented Jan 9, 2026 •

edited

Loading

`ruff-ecosystem` results