[ty] Support `dataclass_transform` as a function call by charliermarsh · Pull Request #22378 · astral-sh/ruff

charliermarsh · 2026-01-04T23:14:59Z

Summary

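Support calling a `dataclass_transform`-decorated function directly, e.g. `B = my_dataclass(A)`, instead of only using it as a decorator.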

Closes astral-sh/ty#2319.
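
For illustration, the two call styles look roughly like this at runtime. This is a minimal sketch, not code from the PR: `my_dataclass`, `Point`, and `Size` are hypothetical names, and the body simply delegates to the stdlib `dataclasses.dataclass`.

```python
import dataclasses
from typing import TypeVar

try:
    from typing import dataclass_transform  # Python 3.11+
except ImportError:
    from typing_extensions import dataclass_transform

T = TypeVar("T")


@dataclass_transform()
def my_dataclass(cls: type[T]) -> type[T]:
    # Hypothetical dataclass-like decorator that defers to the stdlib.
    return dataclasses.dataclass(cls)


# Decorator syntax.
@my_dataclass
class Point:
    x: int
    y: int


# Plain function call on an existing class.
class Size:
    width: int
    height: int


SizeData = my_dataclass(Size)

print(Point(1, 2))
print(SizeData(3, 4))
```

Both forms produce a class with a synthesized `__init__` at runtime; the PR is about having ty model the second form as well.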

astral-sh-bot · 2026-01-04T23:16:37Z

Diagnostic diff on typing conformance tests

No changes detected when running ty on typing conformance tests ✅

astral-sh-bot · 2026-01-04T23:20:17Z

`mypy_primer` results

Changes were detected when running on open source projects

attrs (https://github.com/python-attrs/attrs)
- tests/test_annotations.py:644:13: error[invalid-assignment] Object of type `<decorator produced by dataclass-like function>` is not assignable to `<class 'C'>`
+ tests/test_annotations.py:644:13: error[invalid-assignment] Object of type `<class 'tests.test_annotations.TestAnnotations.<locals of function 'test_init_type_hints_fake_module'>.C @ tests/test_annotations.py:640'>` is not assignable to `<class 'tests.test_annotations.TestAnnotations.<locals of function 'test_init_type_hints_fake_module'>.C @ tests/test_annotations.py:640'>`
- tests/test_make.py:620:13: error[invalid-assignment] Object of type `<decorator produced by dataclass-like function>` is not assignable to `<class 'C'>`
+ tests/test_make.py:620:13: error[invalid-assignment] Object of type `<class 'tests.test_make.TestAttributes.<locals of function 'test_adds_all_by_default'>.C @ tests/test_make.py:615'>` is not assignable to `<class 'tests.test_make.TestAttributes.<locals of function 'test_adds_all_by_default'>.C @ tests/test_make.py:615'>`
+ tests/test_make.py:675:13: error[invalid-assignment] Object of type `<class 'tests.test_make.TestAttributes.<locals of function 'test_respects_init_attrs_init'>.C @ tests/test_make.py:672'>` is not assignable to `<class 'tests.test_make.TestAttributes.<locals of function 'test_respects_init_attrs_init'>.C @ tests/test_make.py:672'>`
+ tests/test_make.py:2872:17: error[invalid-assignment] Object of type `type[tests.test_make.TestAutoDetect.<locals of function 'test_total_ordering'>.C @ tests/test_make.py:2858] | type[tests.test_make.TestAutoDetect.<locals of function 'test_total_ordering'>.C @ tests/test_make.py:2858]` is not assignable to `<class 'tests.test_make.TestAutoDetect.<locals of function 'test_total_ordering'>.C @ tests/test_make.py:2858'>`
+ tests/test_make.py:2882:16: error[unsupported-operator] Operator `<` is not supported between two objects of type `C | @Todo`
+ tests/test_make.py:2887:16: error[unsupported-operator] Operator `>` is not supported between two objects of type `C | @Todo`
+ tests/test_slots.py:217:10: error[invalid-argument-type] Argument to bound method `__init__` is incorrect: Expected `tests.test_slots.<locals of function 'test_nonslots_these'>.SimpleOrdinaryClass @ tests/test_slots.py:193`, found `tests.test_slots.<locals of function 'test_nonslots_these'>.SimpleOrdinaryClass @ tests/test_slots.py:193`
+ tests/test_slots.py:222:9: error[unresolved-attribute] Unresolved attribute `t` on type `SimpleOrdinaryClass`.
+ tests/test_slots.py:224:17: error[invalid-argument-type] Argument to bound method `method` is incorrect: Expected `tests.test_slots.<locals of function 'test_nonslots_these'>.SimpleOrdinaryClass @ tests/test_slots.py:193`, found `tests.test_slots.<locals of function 'test_nonslots_these'>.SimpleOrdinaryClass @ tests/test_slots.py:193`
+ tests/test_slots.py:225:27: error[invalid-argument-type] Argument to bound method `classmethod` is incorrect: Expected `type[tests.test_slots.<locals of function 'test_nonslots_these'>.SimpleOrdinaryClass @ tests/test_slots.py:193]`, found `type[tests.test_slots.<locals of function 'test_nonslots_these'>.SimpleOrdinaryClass @ tests/test_slots.py:193]`
+ tests/test_slots.py:228:50: error[unresolved-attribute] Class `SimpleOrdinaryClass` has no attribute `__slots__`
+ tests/test_slots.py:230:10: error[invalid-argument-type] Argument to bound method `__init__` is incorrect: Expected `tests.test_slots.<locals of function 'test_nonslots_these'>.SimpleOrdinaryClass @ tests/test_slots.py:193`, found `tests.test_slots.<locals of function 'test_nonslots_these'>.SimpleOrdinaryClass @ tests/test_slots.py:193`
+ tests/test_slots.py:232:11: error[invalid-argument-type] Argument to bound method `__init__` is incorrect: Expected `tests.test_slots.<locals of function 'test_nonslots_these'>.SimpleOrdinaryClass @ tests/test_slots.py:193`, found `tests.test_slots.<locals of function 'test_nonslots_these'>.SimpleOrdinaryClass @ tests/test_slots.py:193`
- Found 616 diagnostics
+ Found 627 diagnostics

Tanjun (https://github.com/FasterSpeeding/Tanjun)
- tanjun/dependencies/data.py:347:12: error[invalid-return-type] Return type does not match returned value: expected `_T@cached_inject`, found `_T@cached_inject | Coroutine[Any, Any, _T@cached_inject | Coroutine[Any, Any, _T@cached_inject]]`
+ tanjun/dependencies/data.py:347:12: error[invalid-return-type] Return type does not match returned value: expected `_T@cached_inject`, found `Coroutine[Any, Any, _T@cached_inject | Coroutine[Any, Any, _T@cached_inject]] | _T@cached_inject`

static-frame (https://github.com/static-frame/static-frame)
- static_frame/core/bus.py:671:16: error[invalid-return-type] Return type does not match returned value: expected `InterGetItemLocReduces[Bus[Any], object_]`, found `InterGetItemLocReduces[Bus[Any] | Bottom[Series[Any, Any]] | ndarray[Never, Never] | ... omitted 6 union elements, object_]`
+ static_frame/core/bus.py:671:16: error[invalid-return-type] Return type does not match returned value: expected `InterGetItemLocReduces[Bus[Any], object_]`, found `InterGetItemLocReduces[Bus[Any] | Bottom[Index[Any]] | Bottom[Series[Any, Any]] | ... omitted 6 union elements, object_]`
- static_frame/core/bus.py:675:16: error[invalid-return-type] Return type does not match returned value: expected `InterGetItemILocReduces[Bus[Any], object_]`, found `InterGetItemILocReduces[Bus[Any] | Bottom[Index[Any]] | TypeBlocks | ... omitted 6 union elements, object_ | Self@iloc]`
+ static_frame/core/bus.py:675:16: error[invalid-return-type] Return type does not match returned value: expected `InterGetItemILocReduces[Bus[Any], object_]`, found `InterGetItemILocReduces[Bus[Any] | ndarray[Never, Never] | TypeBlocks | ... omitted 6 union elements, object_ | Self@iloc]`
- static_frame/core/series.py:772:16: error[invalid-return-type] Return type does not match returned value: expected `InterGetItemILocReduces[Series[Any, Any], TVDtype@Series]`, found `InterGetItemILocReduces[Series[Any, Any] | Bottom[Index[Any]] | TypeBlocks | ... omitted 6 union elements, TVDtype@Series]`
- static_frame/core/series.py:4072:16: error[invalid-return-type] Return type does not match returned value: expected `InterGetItemILocReduces[SeriesHE[Any, Any], TVDtype@SeriesHE]`, found `InterGetItemILocReduces[Bottom[Series[Any, Any]] | Bottom[Index[Any]] | TypeBlocks | ... omitted 7 union elements, TVDtype@SeriesHE]`
+ static_frame/core/series.py:4072:16: error[invalid-return-type] Return type does not match returned value: expected `InterGetItemILocReduces[SeriesHE[Any, Any], TVDtype@SeriesHE]`, found `InterGetItemILocReduces[Bottom[Series[Any, Any]] | ndarray[Never, Never] | TypeBlocks | ... omitted 7 union elements, TVDtype@SeriesHE]`
- Found 1840 diagnostics
+ Found 1839 diagnostics

No memory usage changes detected ✅

charliermarsh · 2026-01-05T00:46:34Z

(The hydra-zen diagnostic needs work.)

charliermarsh · 2026-01-05T01:26:07Z

crates/ty_python_semantic/src/types/call/bind.rs

+
+                                // If the return type is class-like and the first argument is a
+                                // class, return it with dataclass params applied. Otherwise,
+                                // return a `DataclassDecorator` for application to a class later.


I think this is wrong. It's intended to address cases like:

    @dataclass_transform()
    def hydrated_dataclass(target: type, *, frozen: bool = False) -> Callable[[type[T]], type[T]]:
        def decorator(cls: type[T]) -> type[T]:
            return cls
        return decorator

    @hydrated_dataclass(SomeConfig, frozen=True)
    class MyConfig:
        pass

If we make this change without this declared return type handling, then `hydrated_dataclass(SomeConfig, frozen=True)` returns `type[SomeConfig]` with dataclass params, and we apply that as a decorator.

This is an unfortunate ambiguity baked directly into the `dataclass_transform` spec. It doesn't clarify how we are supposed to tell whether the `@dataclass_transform` decorated function is a decorator or a decorator factory (yet it clearly specifies that both should be supported). I think the spec's assumption is that it doesn't matter, because only decorator-syntax usage (with `@`) is supported, and in that case you can tell how it is used (are there parentheses or not). But we are trying to do something more sophisticated here, and I think it means we have to resort to some kind of heuristic.

But I'm not sure the heuristic should depend on the annotated return type like this; there's no real requirement to annotate your `dataclass_transform` decorator at all. I think a better heuristic might be based solely on the number, kind, and type of arguments. If only one positional argument is given and it's a class type, assume we should decorate that class. Otherwise, assume we are returning the real decorator.

(Okay this is very comforting to hear, haha.)

I think that wouldn't work for this case, since both functions take a single class as their positional argument:

    from typing_extensions import dataclass_transform

    @dataclass_transform()
    def hydrated_dataclass[T](target: type[T], *, frozen: bool = False):
        def decorator[U](cls: type[U]) -> type[U]:
            return cls
        return decorator

For now, I combined the two heuristics.
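
As a rough model of what a combined check could look like, here is a runtime Python sketch. It is not ty's implementation (that lives in Rust in `bind.rs` and works on inferred types); `returns_class_like` and `is_direct_application` are hypothetical helpers, and requiring both conditions at once is an assumption based on the code comment quoted above.

```python
import inspect
from typing import Callable, TypeVar, get_origin

T = TypeVar("T")


def returns_class_like(func: Callable[..., object]) -> bool:
    """Return True if the declared return annotation is `type` or `type[...]`."""
    annotation = inspect.signature(func).return_annotation
    return annotation is type or get_origin(annotation) is type


def is_direct_application(func: Callable[..., object], args: tuple[object, ...]) -> bool:
    """Guess whether `func(*args)` decorates `args[0]` directly (assumed rule)."""
    # Argument-shape heuristic: exactly one positional argument, and it is a class.
    single_class_argument = len(args) == 1 and isinstance(args[0], type)
    # Return-annotation heuristic: the function is declared to return a class.
    return single_class_argument and returns_class_like(func)


def plain_decorator(cls: type[T]) -> type[T]:
    return cls


def annotated_factory(target: type, *, frozen: bool = False) -> Callable[[type[T]], type[T]]:
    def decorator(cls: type[T]) -> type[T]:
        return cls
    return decorator


def unannotated_factory(target: type, *, frozen: bool = False):
    def decorator(cls: type[T]) -> type[T]:
        return cls
    return decorator


class Target:
    pass


print(is_direct_application(plain_decorator, (Target,)))
print(is_direct_application(annotated_factory, (Target,)))
print(is_direct_application(unannotated_factory, (Target,)))
```

Under this model an unannotated decorator that is applied directly would also be treated as a factory, which is the trade-off of leaning on the return annotation.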

carljm

This looks pretty good! We are definitely being more ambitious than other type checkers here (and more ambitious than the spec requires), by supporting non-decorator-syntax usages of dataclass_transform at all. Our initial implementation of dataclass_transform kind of set us on this path, by implementing the logic generally in function-call-binding and having a "DataclassTransformer" type, rather than doing everything as a special case in decorator application. And this PR seems pretty good, so I don't see much harm in continuing on this path -- it's cool if we can support non-decorator-syntax usage like this.

A few comments inline.

carljm · 2026-01-10T00:46:02Z

crates/ty_python_semantic/resources/mdtest/dataclasses/dataclass_transform.md

+### Passing a specialized generic class
+
+When calling a `@dataclass_transform()` decorated function with a specialized generic class, the
+specialization should be preserved.
+
+```py
+from typing_extensions import dataclass_transform
+
+@dataclass_transform()
+def my_dataclass[T](cls: type[T]) -> type[T]:
+    return cls
+
+class A[T]:
+    x: T
+
+B = my_dataclass(A[int])
+
+reveal_type(B)  # revealed: <class 'A[int]'>
+
+B(1)
+```


I'm impressed that you support this, but I'm curious where it came up? Did you see this in the ecosystem?

This won't work at runtime with stdlib dataclass -- it expects to get a class object, not a typing.GenericAlias. But I suppose there could be third-party dataclass-transforms that are built to handle GenericAlias at runtime?

I honestly can't remember -- I may have asked Claude to add something to verify that we preserve specialization after seeing handling for it in the code?!
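
For context on the runtime side, a subscripted generic class is an alias object rather than a class. A small sketch (the class `A` mirrors the test above, but uses `Generic[T]` so that it also runs before Python 3.12):

```python
from typing import Generic, TypeVar, get_args, get_origin

T = TypeVar("T")


class A(Generic[T]):
    x: T


alias = A[int]

# The unsubscripted name is a class; the subscripted form is not.
print(isinstance(A, type))
print(isinstance(alias, type))

# The alias still records which class and type arguments it came from.
print(get_origin(alias) is A)
print(get_args(alias))
```

A third-party transform that wanted to accept `A[int]` at runtime would have to unwrap the alias along these lines; whether any do is not something this thread establishes.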

carljm · 2026-01-10T00:50:06Z

crates/ty_python_semantic/resources/mdtest/dataclasses/dataclass_transform.md

+class Target:
+    pass
+
+decorator = hydrated_dataclass(Target)


What about then using this like `MyClass = decorator(SomeClass)`?

Doesn't seem to work, even on this branch:

    class M1:
        x: int

    M2 = decorator(M1)
    reveal_type(M2)  # reveals `type[M1] & Any`

And I'm not sure where the `& Any` comes from?

(As noted above, I don't think any other type checker supports this non-decorator use of a dataclass transform at all, so I don't know that we need to support this edge case, just curious why it doesn't work when the base case above does.)
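
For reference, the two-step form is what decorator syntax with arguments desugars to at runtime. A minimal sketch, assuming a hypothetical `hydrated_dataclass` that just forwards `frozen` to the stdlib `dataclasses.dataclass` (the real hydra-zen function does more than this):

```python
import dataclasses
from typing import Callable, TypeVar

try:
    from typing import dataclass_transform  # Python 3.11+
except ImportError:
    from typing_extensions import dataclass_transform

T = TypeVar("T")


@dataclass_transform()
def hydrated_dataclass(target: type, *, frozen: bool = False) -> Callable[[type[T]], type[T]]:
    # Hypothetical factory: `target` is ignored here, only `frozen` is forwarded.
    def decorator(cls: type[T]) -> type[T]:
        return dataclasses.dataclass(frozen=frozen)(cls)
    return decorator


class Target:
    pass


# Decorator syntax: the factory call runs first, then its result is applied.
@hydrated_dataclass(Target, frozen=True)
class Decorated:
    x: int


# The same thing spelled out in two steps.
decorator = hydrated_dataclass(Target, frozen=True)


class M1:
    x: int


M2 = decorator(M1)

print(M2 is M1)
print(Decorated(x=1) == Decorated(x=1))
print(M2(x=1).x)
```

At runtime `M2` is `M1` with dataclass behavior applied, so ideally a checker that handles the decorator form would treat the two-step form the same way.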

carljm · 2026-01-10T00:53:53Z

crates/ty_python_semantic/resources/mdtest/dataclasses/dataclass_transform.md

+from typing_extensions import dataclass_transform
+
+@dataclass_transform()
+def my_dataclass[T](cls: type[T]) -> type[T]:
+    return cls
+
+class A:
+    x: int
+
+B = my_dataclass(A)
+
+reveal_type(B)  # revealed: <class 'A'>
+
+B(1)


Doesn't seem like any other type checkers support this, but it's cool that we can.

carljm · 2026-01-10T01:22:32Z

crates/ty_python_semantic/src/types/function.rs

-        let (implementation, overloads) = self.overloads_and_implementation(db);
-        overloads.into_iter().chain(implementation.iter().copied())
+    ) -> impl DoubleEndedIterator<Item = OverloadLiteral<'db>> + 'db {
+        let (overloads, implementation) = self.overloads_and_implementation(db);


lol, oops

I guess no other caller cared about the ordering here?

Looking at the usage in `infer_class_definition`, I think we might need a `.rev()` call there also, to make the behavior match the comment?

carljm · 2026-01-10T01:25:40Z

crates/ty_python_semantic/src/types/call/bind.rs

+
+                                // If the return type is class-like and the first argument is a
+                                // class, return it with dataclass params applied. Otherwise,
+                                // return a `DataclassDecorator` for application to a class later.


charliermarsh added the bug (Something isn't working) and ty (Multi-file analysis & type inference) labels Jan 4, 2026

charliermarsh force-pushed the charlie/class-x branch from e016ff5 to 7f675dd Compare January 4, 2026 23:17

charliermarsh force-pushed the charlie/class-x branch from 7f675dd to bace8c4 Compare January 4, 2026 23:29

charliermarsh commented Jan 5, 2026

charliermarsh marked this pull request as ready for review January 5, 2026 01:28

charliermarsh requested review from AlexWaygood, carljm, dcreager and sharkdp as code owners January 5, 2026 01:28

charliermarsh force-pushed the charlie/class-x branch from bd84250 to c5a897a Compare January 9, 2026 20:42

carljm approved these changes Jan 10, 2026

carljm reviewed Jan 10, 2026

crates/ty_python_semantic/src/types/call/bind.rs (outdated)

charliermarsh added 3 commits January 9, 2026 23:04

59c0416 [ty] Support dataclass_transform as a function call
5d99782 Try to avoid factory...
fb99ce4 Review feedback

charliermarsh force-pushed the charlie/class-x branch from c5a897a to fb99ce4 Compare January 10, 2026 04:45

charliermarsh merged commit 046c5a4 into main Jan 10, 2026
49 checks passed

charliermarsh deleted the charlie/class-x branch January 10, 2026 13:45
