Drop excess capacity from Suites during parsing by MichaReiser · Pull Request #25368 · astral-sh/ruff

MichaReiser · 2026-05-24T06:48:38Z

Summary

Shrink Suite vectors to drop the excess capacity during parsing.

I excluded other nodes for now to get a better sense of the performance and memory impact.

I'm a bit conflicted on this. This is a pretty huge memory improvement for ty, but there are a few linter benchmarks that regress by 1-2% because some vectors need to be copied to smaller allocations, which hurts performance. For the linter and formatter, it's also not important to shrink the vectors, because the AST is never stored for long. But this is different for ty where we cache the AST.

I'm curious to hear what others think on this. I could also try to reduce the places where we call shrink_to_fit, e.g., only for statements?

codspeed-hq · 2026-05-24T06:57:35Z

Merging this PR will improve performance by 5.67%

⚠️

Different runtime environments detected

Some benchmarks with significant performance changes were compared across different runtime environments,
which may affect the accuracy of the results.

Open the report in CodSpeed to investigate

⚡ 23 improved benchmarks
✅ 102 untouched benchmarks

Performance Changes

	Mode	Benchmark	`BASE`	`HEAD`	Efficiency
⚡	Memory	`ty_check_file[cold]`	21.7 MB	20.8 MB	+4.32%
⚡	Memory	`DateType`	25.4 MB	24.3 MB	+4.78%
⚡	Memory	`ty_micro[many_string_assignments]`	16.2 MB	15.5 MB	+4.77%
⚡	Memory	`ty_micro[pandas_tdd]`	44.2 MB	42 MB	+5.29%
⚡	Memory	`ty_micro[very_large_tuple]`	14.9 MB	14.1 MB	+5.12%
⚡	Memory	`ty_micro[large_isinstance_narrowing]`	16.2 MB	15.4 MB	+4.88%
⚡	Memory	`ty_micro[many_enum_members]`	17.5 MB	16.7 MB	+4.36%
⚡	Memory	`ty_micro[literal_match_fallthrough]`	13.5 MB	12.8 MB	+4.94%
⚡	Memory	`ty_micro[vararg_parameter_type_accumulation]`	13.1 MB	12.5 MB	+5%
⚡	Memory	`ty_micro[many_tuple_assignments]`	14.3 MB	13.6 MB	+4.59%
⚡	Memory	`ty_micro[complex_constrained_attributes_2]`	15.1 MB	14.4 MB	+5.05%
⚡	Memory	`ty_micro[literal_equality_fallthrough_guarded_any]`	15.7 MB	15.1 MB	+4.21%
⚡	Memory	`ty_micro[gradual_vararg_call]`	14.7 MB	13.9 MB	+5.19%
⚡	Memory	`ty_micro[complex_constrained_attributes_3]`	16.6 MB	15.9 MB	+4.63%
⚡	Memory	`ty_micro[complex_constrained_attributes_1]`	15.1 MB	14.4 MB	+5.05%
⚡	Memory	`ty_micro[many_enum_members_2]`	16.3 MB	15.5 MB	+4.79%
⚡	Memory	`ty_micro[many_protocol_members_mismatch]`	21.1 MB	20.3 MB	+4.21%
⚡	Memory	`ty_micro[many_tuple_assignments]`	15.6 MB	14.9 MB	+4.87%
⚡	Memory	`parser[pydantic/types.py]`	403.2 KB	368.5 KB	+9.41%
⚡	Memory	`parser[large/dataset.py]`	862.2 KB	816.8 KB	+5.56%
...	...	...	...	...	...

ℹ️ Only the first 20 benchmarks are displayed. Go to the app to view all benchmarks.

Tip

Curious why this is faster? Comment @codspeedbot explain why this is faster on this PR, or directly use the CodSpeed MCP with your agent.

_{Comparing micha/parser-shrink-to-fit (3ade505) with main (572e4b5)}

astral-sh-bot · 2026-05-24T06:59:07Z

`ruff-ecosystem` results

Linter (stable)

✅ ecosystem check detected no linter changes.

Linter (preview)

✅ ecosystem check detected no linter changes.

Formatter (stable)

✅ ecosystem check detected no format changes.

Formatter (preview)

✅ ecosystem check detected no format changes.

astral-sh-bot · 2026-05-24T15:26:47Z

Typing conformance results

No changes detected ✅

Current numbers

The percentage of diagnostics emitted that were expected errors held steady at 91.94%. The percentage of expected errors that received a diagnostic held steady at 87.09%. The number of fully passing files held steady at 92/134.

astral-sh-bot · 2026-05-24T15:27:30Z

Memory usage report

Summary

Project	Old	New	Diff	Outcome
flake8	45.94MB	44.10MB	-4.02% (1.85MB)	⬇️
trio	112.21MB	109.53MB	-2.39% (2.68MB)	⬇️
sphinx	264.77MB	261.54MB	-1.22% (3.22MB)	⬇️
prefect	714.89MB	711.43MB	-0.48% (3.45MB)	⬇️

Significant changes

Click to expand detailed breakdown

flake8

Name	Old	New	Diff	Outcome
`parsed_module`	15.55MB	13.71MB	-11.87% (1.85MB)	⬇️

trio

Name	Old	New	Diff	Outcome
`parsed_module`	23.70MB	21.02MB	-11.31% (2.68MB)	⬇️

sphinx

Name	Old	New	Diff	Outcome
`parsed_module`	28.84MB	25.61MB	-11.18% (3.22MB)	⬇️

prefect

Name	Old	New	Diff	Outcome
`parsed_module`	30.35MB	26.89MB	-11.38% (3.45MB)	⬇️

astral-sh-bot · 2026-05-24T15:28:59Z

`ecosystem-analyzer` results

No diagnostic changes detected ✅

Full report with detailed diff (timing results)

dhruvmanila

I'm supportive of this. Related to the linter regression, possible alternatives could be to introduce a ty-specific API in the parser crate or have a new field in the ParseOptions but I'm not a fan of either options given that they'll be part of the public API.

charliermarsh · 2026-05-26T10:31:44Z

IMO the memory gains here are clearly worth a 1-2% linter regression.

MichaReiser · 2026-05-29T18:07:57Z

I scoped this PR down to only cover statements to get a better sense for the performance trade-off when adding the same treatment to some other nodes.

I also tried a few different techniques to mitigate the performance regression with very limited success:

Use a pool of Suite buffers that ruff parses into. This allows us to have a few vecs with large capacity. The downside is that it's always necessary to copy all elements (minus module level where I did not apply this optimization). This was maybe marginally faster, but not worth the complexity.
Special case a body with a single statement because they are more common than I thought. This was maybe marginally faster, but not worth the complexity
Use a SmallVec of size 8

In short, Vec is pretty fast

MichaReiser closed this May 24, 2026

MichaReiser reopened this May 24, 2026

Base automatically changed from micha/thin-vec-stmt to main May 24, 2026 15:23

MichaReiser closed this May 24, 2026

MichaReiser reopened this May 24, 2026

MichaReiser marked this pull request as ready for review May 24, 2026 15:37

MichaReiser requested a review from dhruvmanila as a code owner May 24, 2026 15:37

astral-sh-bot Bot assigned ntBre May 24, 2026

astral-sh-bot Bot requested a review from ntBre May 24, 2026 15:37

MichaReiser force-pushed the micha/parser-shrink-to-fit branch from 605800b to 974d9d7 Compare May 24, 2026 15:39

dhruvmanila added the parser Related to the parser label May 25, 2026

dhruvmanila approved these changes May 26, 2026

View reviewed changes

MichaReiser changed the title ~~Drop excess capacity from Vecs when parsing~~ Drop excess capacity from Vecs in parser May 26, 2026

MichaReiser force-pushed the micha/parser-shrink-to-fit branch 4 times, most recently from cd1b259 to 67a08d4 Compare May 28, 2026 09:47

MichaReiser marked this pull request as draft May 28, 2026 10:04

Shrink statement suites after parsing

60d29d6

MichaReiser force-pushed the micha/parser-shrink-to-fit branch from bb45a9d to 60d29d6 Compare May 29, 2026 12:22

MichaReiser changed the title ~~Drop excess capacity from Vecs in parser~~ Drop excess capacity from Suite's during parsing May 29, 2026

MichaReiser added 2 commits May 29, 2026 14:39

Preallocate singleton inline statement suites

69cd975

Reuse statement buffers when parsing suites

adf9d39

MichaReiser changed the title ~~Drop excess capacity from Suite's during parsing~~ Drop excess capacity from Suites during parsing May 29, 2026

MichaReiser added 4 commits May 29, 2026 16:08

Avoid resizing singleton block suites

bea8ca0

Build block suites using inline SmallVec storage

7fa74a7

Reduce inline block suite storage to four statements

ad6e2e0

Inline block suite parsing

f004c81

Revert block suite SmallVec experiment

3ade505

MichaReiser marked this pull request as ready for review May 29, 2026 18:11

MichaReiser enabled auto-merge (squash) May 29, 2026 18:11

MichaReiser merged commit 1518f1d into main May 29, 2026
57 of 58 checks passed

MichaReiser deleted the micha/parser-shrink-to-fit branch May 29, 2026 18:17

dhruvmanila added the performance Potential performance improvement label Jun 1, 2026

BrewTestBot mentioned this pull request Jun 4, 2026

ruff 0.15.16 Homebrew/homebrew-core#286308

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Drop excess capacity from Suites during parsing#25368

Drop excess capacity from Suites during parsing#25368
MichaReiser merged 8 commits into
mainfrom
micha/parser-shrink-to-fit

MichaReiser commented May 24, 2026 •

edited

Loading

Uh oh!

codspeed-hq Bot commented May 24, 2026 •

edited

Loading

Uh oh!

astral-sh-bot Bot commented May 24, 2026 •

edited

Loading

Uh oh!

astral-sh-bot Bot commented May 24, 2026 •

edited

Loading

Uh oh!

astral-sh-bot Bot commented May 24, 2026 •

edited

Loading

flake8

trio

sphinx

prefect

Uh oh!

astral-sh-bot Bot commented May 24, 2026 •

edited

Loading

Uh oh!

dhruvmanila left a comment

Uh oh!

charliermarsh commented May 26, 2026

Uh oh!

MichaReiser commented May 29, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

MichaReiser commented May 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Uh oh!

codspeed-hq Bot commented May 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Merging this PR will improve performance by 5.67%

Performance Changes

Uh oh!

astral-sh-bot Bot commented May 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

ruff-ecosystem results

Linter (stable)

Linter (preview)

Formatter (stable)

Formatter (preview)

Uh oh!

astral-sh-bot Bot commented May 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Typing conformance results

No changes detected ✅

Uh oh!

astral-sh-bot Bot commented May 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Memory usage report

Summary

Significant changes

flake8

trio

sphinx

prefect

Uh oh!

astral-sh-bot Bot commented May 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

ecosystem-analyzer results

Uh oh!

dhruvmanila left a comment

Choose a reason for hiding this comment

Uh oh!

charliermarsh commented May 26, 2026

Uh oh!

MichaReiser commented May 29, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

MichaReiser commented May 24, 2026 •

edited

Loading

codspeed-hq Bot commented May 24, 2026 •

edited

Loading

astral-sh-bot Bot commented May 24, 2026 •

edited

Loading

`ruff-ecosystem` results

astral-sh-bot Bot commented May 24, 2026 •

edited

Loading

astral-sh-bot Bot commented May 24, 2026 •

edited

Loading

astral-sh-bot Bot commented May 24, 2026 •

edited

Loading

`ecosystem-analyzer` results