Add basic red knot benchmark by MichaReiser · Pull Request #13026 · astral-sh/ruff

MichaReiser · 2024-08-21T09:09:12Z

Summary

This PR adds a basic benchmark runner for red knot. These benchmarks are different from codspeed in that they benchmark
entire projects. They are not micro-benchmarks.

The benchmarks aren't representive! Don't draw any conclusion from them today.
Red Knot only implements a tiny tiny subset of Mypy's and Pyright's functionality.

This work is inspired by mypy primer but it serves
a different purpose. We may want to add Red Knot to mypy primer when we have a more complete feature set (and a published binary)

Test Plan

Ran the benchmarks for each project and verified the diagnostics I saw:

uv run benchmark -p black --mypy --verbose

I'm a bit surprised that I e.g. see an error for Black. I would appreciate it if someone could verify that the diagnostics are reasonable.

MichaReiser · 2024-08-21T09:10:35Z

scripts/knot_benchmark/src/benchmark/cases.py

+    INCREMENTAL = "incremental"
+    """Incremental check between two commits."""


Not sure if we should remove this one again. I don't have an incremental benchmark yet but the idea would be to switch between two revisions.

github-actions · 2024-08-21T09:26:34Z

`ruff-ecosystem` results

Linter (stable)

✅ ecosystem check detected no linter changes.

Linter (preview)

✅ ecosystem check detected no linter changes.

AlexWaygood

Some comments from reading through the code

crates/red_knot_python_semantic/src/types.rs

scripts/knot_benchmark/README.md

scripts/knot_benchmark/pyproject.toml

scripts/knot_benchmark/src/benchmark/__init__.py

scripts/knot_benchmark/src/benchmark/cases.py

scripts/knot_benchmark/src/benchmark/projects.py

scripts/knot_benchmark/README.md

AlexWaygood

Basically this all looks good to me in terms of structure, nothing sticks out

AlexWaygood · 2024-08-22T13:39:25Z

scripts/knot_benchmark/src/benchmark/cases.py

+            "--venvpath",
+            str(
+                venv.path.parent
+            ),  # This is not the path to the venv folder, but the folder that contains the venv...


wow, bizarre. Do we also need to specify --venv=venv along with this argument, so that it knows which subdirectory inside the --venv-path directory the venv is in? https://github.com/microsoft/pyright/blob/main/docs/import-resolution.md#configuring-your-python-environment

Yes we do. But there's no venv CLI option. So we have to create a config

for project in projects: with tempfile.TemporaryDirectory() as cwd: cwd = Path(cwd) project.clone(cwd) venv = Venv.create(cwd) venv.install(project.dependencies) # Set the `venv` config for pyright. Pyright only respects the `--venvpath` # CLI option when `venv` is set in the configuration... 🤷‍♂️ with open(cwd / "pyrightconfig.json", "w") as f: f.write(json.dumps(dict(venv=venv.name)))

pyright's config setup is so odd...

scripts/knot_benchmark/src/benchmark/run.py

scripts/knot_benchmark/src/benchmark/projects.py

Co-authored-by: Alex Waygood <[email protected]>

hauntsaninja · 2024-08-23T06:59:41Z

scripts/knot_benchmark/src/benchmark/cases.py

+            "pip",
+            "install",
+            "--quiet",
+            *dependencies,


Worth adding --exclude-newer? While you pin the commit, you don't pin the deps

MichaReiser commented Aug 21, 2024

View reviewed changes

MichaReiser force-pushed the knot-mypy-pyright-benchmarks branch from 5694376 to 8da745b Compare August 21, 2024 09:12

MichaReiser added the ty Multi-file analysis & type inference label Aug 21, 2024

AlexWaygood reviewed Aug 21, 2024

View reviewed changes

scripts/knot_benchmark/README.md Show resolved Hide resolved

MichaReiser added 2 commits August 22, 2024 15:33

Add basic red knot benchmarks

35dc6f0

Add isort, checkout specific revision

28debb8

AlexWaygood reviewed Aug 22, 2024

View reviewed changes

scripts/knot_benchmark/src/benchmark/run.py Show resolved Hide resolved

MichaReiser force-pushed the knot-mypy-pyright-benchmarks branch from 8da745b to 9ccaa8a Compare August 22, 2024 14:18

AlexWaygood approved these changes Aug 22, 2024

View reviewed changes

scripts/knot_benchmark/src/benchmark/projects.py Outdated Show resolved Hide resolved

Address review comments

ee2e088

MichaReiser force-pushed the knot-mypy-pyright-benchmarks branch from 9ccaa8a to ee2e088 Compare August 22, 2024 14:31

MichaReiser marked this pull request as ready for review August 22, 2024 14:32

AlexWaygood reviewed Aug 22, 2024

View reviewed changes

scripts/knot_benchmark/src/benchmark/projects.py Outdated Show resolved Hide resolved

Update scripts/knot_benchmark/src/benchmark/projects.py

83fdf60

Co-authored-by: Alex Waygood <[email protected]>

MichaReiser merged commit 4f6accb into main Aug 23, 2024

MichaReiser deleted the knot-mypy-pyright-benchmarks branch August 23, 2024 06:22

hauntsaninja reviewed Aug 23, 2024

View reviewed changes

MichaReiser mentioned this pull request Sep 3, 2024

Fix virtual environment details in knot_benchmark #13228

Merged

		INCREMENTAL = "incremental"
		"""Incremental check between two commits."""

Comments

Conversation

MichaReiser commented Aug 21, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Test Plan

Uh oh!

MichaReiser Aug 21, 2024

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Aug 21, 2024

ruff-ecosystem results

Linter (stable)

Linter (preview)

Uh oh!

AlexWaygood left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

AlexWaygood left a comment

Choose a reason for hiding this comment

Uh oh!

AlexWaygood Aug 22, 2024

Choose a reason for hiding this comment

Uh oh!

MichaReiser Aug 22, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

AlexWaygood Aug 22, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

hauntsaninja Aug 23, 2024

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

MichaReiser commented Aug 21, 2024 •

edited

Loading

`ruff-ecosystem` results

MichaReiser Aug 22, 2024 •

edited

Loading