[ty] remove any_over_type by carljm · Pull Request #19099 · astral-sh/ruff

carljm · 2025-07-02T18:43:35Z

Summary

Remove the recursive type walk any_over_type and its sole current use in deciding whether to emit a redundant-cast diagnostic; instead, just skip the redundant-cast diagnostic if we are casting Unknown or Todo as a top-level type. This removes another recursive type walk (and its associated recursive-type complications).

The ecosystem report suggests the number of false positives this currently adds is quite manageable, and the few that it does add can mostly be removed by targeting a few cases where we currently create Todo types. While suppressing false positives from Todo types is useful in the short term and worth doing when it's easy, I don't think it should drive significant architecture decisions or extra runtime work.

If we do decide (even though it currently seems to have little impact on the ecosystem report) that it's important for gradual-guarantee reasons to do deep suppression of redundant-cast when nested Unknown types are involved (I am not sure how strong the case for this is, since the use of cast in the first place suggests a typed codebase), I think the better approach to this would be a strategy parameter to is_equivalent_to that could cause Unknown to not be considered equivalent to Unknown or Any, rather than a separate recursive type walk.

Test Plan

CI

github-actions · 2025-07-02T18:47:00Z

`mypy_primer` results

Changes were detected when running on open source projects

Expression (https://github.com/cognitedata/Expression)
-     memo fields = ~49MB
+     memo fields = ~54MB

aioredis (https://github.com/aio-libs/aioredis)
-     memo fields = ~54MB
+     memo fields = ~60MB

werkzeug (https://github.com/pallets/werkzeug)
-     memo fields = ~129MB
+     memo fields = ~142MB

hydra-zen (https://github.com/mit-ll-responsible-ai/hydra-zen)
+ warning[redundant-cast] src/hydra_zen/_utils/coerce.py:107:16: Value is already of type `_T`
+ warning[redundant-cast] src/hydra_zen/_utils/coerce.py:178:16: Value is already of type `_T`
- Found 599 diagnostics
+ Found 601 diagnostics
- TOTAL MEMORY USAGE: ~80MB
+ TOTAL MEMORY USAGE: ~88MB

discord.py (https://github.com/Rapptz/discord.py)
- TOTAL MEMORY USAGE: ~228MB
+ TOTAL MEMORY USAGE: ~251MB

kopf (https://github.com/nolar/kopf)
+ warning[redundant-cast] kopf/_cogs/configs/diffbase.py:52:19: Value is already of type `dict[Any, Any]`
- Found 130 diagnostics
+ Found 131 diagnostics

mkosi (https://github.com/systemd/mkosi)
-     memo fields = ~106MB
+     memo fields = ~97MB

bandersnatch (https://github.com/pypa/bandersnatch)
-     memo fields = ~66MB
+     memo fields = ~72MB

tornado (https://github.com/tornadoweb/tornado)
- TOTAL MEMORY USAGE: ~171MB
+ TOTAL MEMORY USAGE: ~156MB

mkdocs (https://github.com/mkdocs/mkdocs)
- TOTAL MEMORY USAGE: ~117MB
+ TOTAL MEMORY USAGE: ~129MB

streamlit (https://github.com/streamlit/streamlit)
+ warning[redundant-cast] lib/streamlit/runtime/state/common.py:71:5: Value is already of type `tuple[@Todo(Inference of subscript on special form), ...]`
- Found 3304 diagnostics
+ Found 3305 diagnostics

meson (https://github.com/mesonbuild/meson)
+ warning[redundant-cast] mesonbuild/options.py:705:35: Value is already of type `list[@Todo(Inference of subscript on special form)]`
+ warning[redundant-cast] mesonbuild/options.py:772:35: Value is already of type `list[@Todo(Inference of subscript on special form)]`
- Found 930 diagnostics
+ Found 932 diagnostics

prefect (https://github.com/PrefectHQ/prefect)
+ warning[redundant-cast] src/prefect/server/orchestration/core_policy.py:68:16: Value is already of type `list[@Todo(unsupported nested subscript in type[X])]`
+ warning[redundant-cast] src/prefect/server/orchestration/core_policy.py:106:16: Value is already of type `list[@Todo(unsupported nested subscript in type[X])]`
+ warning[redundant-cast] src/prefect/server/orchestration/core_policy.py:146:16: Value is already of type `list[@Todo(unsupported nested subscript in type[X])]`
+ warning[redundant-cast] src/prefect/server/orchestration/core_policy.py:184:16: Value is already of type `list[@Todo(unsupported nested subscript in type[X])]`
+ warning[redundant-cast] src/prefect/server/orchestration/core_policy.py:218:16: Value is already of type `list[@Todo(unsupported nested subscript in type[X])]`
+ warning[redundant-cast] src/prefect/server/orchestration/global_policy.py:69:16: Value is already of type `list[@Todo(unsupported nested subscript in type[X])]`
+ warning[redundant-cast] src/prefect/server/orchestration/global_policy.py:102:16: Value is already of type `list[@Todo(unsupported nested subscript in type[X])]`
- Found 3851 diagnostics
+ Found 3858 diagnostics

codspeed-hq · 2025-07-02T18:53:21Z

CodSpeed WallTime Performance Report

Merging #19099 will not alter performance

_{Comparing cjm/noanyover (ca0e578) with main (0660188)}

Summary

✅ 8 untouched benchmarks

AlexWaygood

I love the reduction in code complexity (especially relative to #19094!) but I do think this pretty clearly breaks the gradual guarantee for something like this?

from unresolvable_module import Foo, Bar

def f(x: list[Foo]):
    y = cast(list[Bar], x)

it seems just like somewhat random luck that we don't have anything in the mypy_primer corpus that triggers a false-positive redundant-cast diagnostic on that as a result of this change :-) I'd be curious to see what this idea looks like; it seems viable:

I think the better approach to this would be a strategy parameter to is_equivalent_to that could cause Unknown to not be considered equivalent to Unknown or Any, rather than a separate recursive type walk.

I also don't think this gets us away in the long term from having to find a way to recursively walk types. For example, we've discussed for a while that a feature many people want is a way to know what percentage of types across a single run of ty are inferred as Any/Unknown. Mypy's version of this feature also tells users what percentage of types are "partially Any/Unknown"; it would be a shame not to be able to match that capability.

AlexWaygood · 2025-07-03T12:40:33Z

I'd be curious to see what this idea looks like; it seems viable:

I think the better approach to this would be a strategy parameter to is_equivalent_to that could cause Unknown to not be considered equivalent to Unknown or Any, rather than a separate recursive type walk.

But on second thought, I don't think this approach works for protocols that have unknown fields? Equivalence for protocols just normalizes both the l.h.s. and the r.h.s., then compares for identity; this has the advantage that once recursive protocols are "solved" in Type::normalized(), it means we don't have to worry about them in equivalence checks between protocols:

ruff/crates/ty_python_semantic/src/types/instance.rs

Lines 259 to 261 in fc43d3c

    
               pub(super) fn is_equivalent_to(self, db: &'db dyn Db, other: Self) -> bool { 
        
                   self.normalized(db) == other.normalized(db) 
        
               }

But this means that these two protocols will normalize to the same type, because Unknown/Todo both normalize to Any, so you'd get a false-positive redundant-cast diagnostic if you tried to cast from one to the other. Unknown.is_equivalent_to(Any) would never be called:

from unresolved_module import Foo
from typing import Any, Protocol

class Bar(Protocol):
    x: Foo

class Baz(Protocol):
    x: Any

carljm · 2025-07-03T16:43:10Z

Ok, I'm convinced, both that we may have other needs for this in future, and that doing this cast check in is_equivalent_to won't work in all cases.

carljm added the ty Multi-file analysis & type inference label Jul 2, 2025

carljm mentioned this pull request Jul 2, 2025

[ty] Rewrite Type::any_over_type using a new generalised TypeVisitor trait #19094

Merged

[ty] remove any_over_type

ca0e578

carljm force-pushed the cjm/noanyover branch from 0bc623c to ca0e578 Compare July 2, 2025 19:06

carljm changed the title ~~[experiment] remove any_over_type~~ [ty] remove any_over_type Jul 2, 2025

carljm marked this pull request as ready for review July 2, 2025 19:07

carljm requested review from AlexWaygood, dcreager and sharkdp as code owners July 2, 2025 19:07

AlexWaygood reviewed Jul 3, 2025

View reviewed changes

carljm closed this Jul 3, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

[ty] remove any_over_type#19099

[ty] remove any_over_type#19099
carljm wants to merge 1 commit intomainfrom
cjm/noanyover

carljm commented Jul 2, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Jul 2, 2025 •

edited

Loading

Uh oh!

codspeed-hq bot commented Jul 2, 2025 •

edited

Loading

Uh oh!

AlexWaygood left a comment

Uh oh!

AlexWaygood commented Jul 3, 2025

Uh oh!

carljm commented Jul 3, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Comments

Conversation

carljm commented Jul 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Test Plan

Uh oh!

github-actions bot commented Jul 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

mypy_primer results

Uh oh!

codspeed-hq bot commented Jul 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

CodSpeed WallTime Performance Report

Merging #19099 will not alter performance

Summary

Uh oh!

AlexWaygood left a comment

Choose a reason for hiding this comment

Uh oh!

AlexWaygood commented Jul 3, 2025

Uh oh!

carljm commented Jul 3, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

carljm commented Jul 2, 2025 •

edited

Loading

github-actions bot commented Jul 2, 2025 •

edited

Loading

`mypy_primer` results

codspeed-hq bot commented Jul 2, 2025 •

edited

Loading