BestFitting: Reduce allocations by MichaReiser · Pull Request #7037 · astral-sh/ruff

MichaReiser · 2023-09-01T07:51:36Z

Summary

This PR aims to reduce the allocations necessary for best_fitting.

Today:

One allocation for interning the object that should be formatted with different variants (outside of best fitting, avoids formatting the same content multiple times)
One allocation for the Vec that stores all variants
One allocation per variant that stores the format elements for that variant

This PR removes the allocations per variant and instead writes all variants into a single buffer stored on BestFitting.
The downside of this is that resolving the most-flat or most-expanded variants now requires a search for the StartBestFittingEntry or EndBestFittingEntry . I don't expect this to be significant because best-fitting entries tend to be small (<16 entries).

Performance

This gives us a 2% improvement for most files, except the large/dataset.py. I need to dig deeper to understand why only large data set is regressing. This is especially surprising because I would have expected the performance to improve the most for large files that make heavy use of best fitting.

Test Plan

cargo test

MichaReiser · 2023-09-01T07:51:47Z

Current dependencies on/for this PR:

main
- PR Introduce Token element #7048
  - PR Memoize text width #6552
    - PR BestFitting: Reduce allocations #7037 👈

This comment was auto-generated by Graphite.

codspeed-hq · 2023-09-01T08:02:23Z

CodSpeed Performance Report

Merging #7037 will degrade performances by 2.05%

_{Comparing best-fitting-reduce-allocations (37c62ff) with memoize-text-width (6a31ef4)}

Summary

🔥 3 improvements
❌ 1 regressions
✅ 16 untouched benchmarks

⚠️ Please fix the performance issues or acknowledge them on CodSpeed.

Benchmarks breakdown

	Benchmark	`memoize-text-width`	`best-fitting-reduce-allocations`	Change
❌	`formatter[large/dataset.py]`	58.5 ms	59.7 ms	-2.05%
🔥	`formatter[pydantic/types.py]`	20.2 ms	19.7 ms	+2.46%
🔥	`formatter[numpy/ctypeslib.py]`	11 ms	10.7 ms	+2.45%
🔥	`formatter[unicode/pypinyin.py]`	3.7 ms	3.6 ms	+2.69%

MichaReiser · 2023-09-01T08:17:17Z

crates/ruff_formatter/src/builders.rs

        let variants = self.variants.items();

-        let mut formatted_variants = Vec::with_capacity(variants.len());
+        let mut buffer = VecBuffer::with_capacity(variants.len() * 8, f.state_mut());


It would be nice to remove the need for this buffer too but this complicates things a bit because finding the boundaries of a variant suddenly need to account for nested best fitting elements:

StartBestFittingEntry (Outer) .... StartBestFittingEntry (Inner) ... EndBestFittingEntry ... EndBestFittingEntry

The nesting is probably rare because best fitting is almost always used together with Interned, meaning that the nested best fitting most likely ends up being inside of the interned vec, but it remains possible.

MichaReiser · 2023-09-07T17:18:04Z

Isn't as promising as I thought, or even regressing and it introduces additional complexity.

MichaReiser force-pushed the best-fitting-reduce-allocations branch from a6a6c1a to 243a81b Compare September 1, 2023 08:11

MichaReiser commented Sep 1, 2023

View reviewed changes

MichaReiser force-pushed the best-fitting-reduce-allocations branch from 243a81b to b51d602 Compare September 1, 2023 08:27

MichaReiser added the formatter Related to the formatter label Sep 1, 2023

MichaReiser force-pushed the best-fitting-reduce-allocations branch 2 times, most recently from ddc19b0 to 5f148a7 Compare September 1, 2023 08:54

MichaReiser changed the base branch from main to token-element September 1, 2023 17:17

MichaReiser force-pushed the best-fitting-reduce-allocations branch from 5f148a7 to 0e5caac Compare September 1, 2023 17:17

MichaReiser mentioned this pull request Sep 1, 2023

Introduce Token element #7048

Merged

MichaReiser added the performance Potential performance improvement label Sep 1, 2023

MichaReiser force-pushed the best-fitting-reduce-allocations branch from 0e5caac to 092c9a2 Compare September 1, 2023 17:42

MichaReiser force-pushed the token-element branch from ce2b166 to e1e047f Compare September 1, 2023 17:51

MichaReiser force-pushed the best-fitting-reduce-allocations branch from 092c9a2 to 770e2b4 Compare September 1, 2023 18:21

MichaReiser mentioned this pull request Sep 1, 2023

Memoize text width #6552

Merged

MichaReiser force-pushed the token-element branch from fc0d6aa to 409f8f9 Compare September 2, 2023 07:47

Base automatically changed from token-element to main September 2, 2023 08:05

MichaReiser added 2 commits September 2, 2023 10:06

Memoize text width

6a31ef4

BestFitting: Reduce allocations

37c62ff

MichaReiser changed the base branch from main to memoize-text-width September 2, 2023 08:33

MichaReiser force-pushed the best-fitting-reduce-allocations branch from 770e2b4 to 37c62ff Compare September 2, 2023 08:33

MichaReiser force-pushed the memoize-text-width branch from 6a31ef4 to e09159a Compare September 6, 2023 06:59

Base automatically changed from memoize-text-width to main September 6, 2023 07:10

MichaReiser closed this Sep 7, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

BestFitting: Reduce allocations#7037

BestFitting: Reduce allocations#7037
MichaReiser wants to merge 2 commits intomainfrom
best-fitting-reduce-allocations

MichaReiser commented Sep 1, 2023 •

edited

Loading

Uh oh!

MichaReiser commented Sep 1, 2023 •

edited

Loading

Uh oh!

codspeed-hq bot commented Sep 1, 2023 •

edited

Loading

Uh oh!

MichaReiser Sep 1, 2023

Uh oh!

MichaReiser commented Sep 7, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Comments

Conversation

MichaReiser commented Sep 1, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Performance

Test Plan

Uh oh!

MichaReiser commented Sep 1, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codspeed-hq bot commented Sep 1, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

CodSpeed Performance Report

Merging #7037 will degrade performances by 2.05%

Summary

Benchmarks breakdown

Uh oh!

MichaReiser Sep 1, 2023

Choose a reason for hiding this comment

Uh oh!

MichaReiser commented Sep 7, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

MichaReiser commented Sep 1, 2023 •

edited

Loading

MichaReiser commented Sep 1, 2023 •

edited

Loading

codspeed-hq bot commented Sep 1, 2023 •

edited

Loading