Parallel GPU buffer writes by aevyrie · Pull Request #22314 · bevyengine/bevy

aevyrie · 2025-12-30T05:22:13Z

Objective

After a series of optimizations making render and postupdate more parallel, write_batched_instance_buffers was regularly one of the largest spans with very low thread use, sitting at 4ms in a 14ms frame. This makes it an ideal target to improve throughput. Note that the screenshot doesn't include some visibility system optimizations.

Solution

Spawn tasks for writing buffers to the GPU. This is especially helpful for current_input_buffer and previous_input_buffer, which take about the same time and are the longest buffer writes - moving these to tasks effectively halves the time spent in the system.

In the 250k bevymark_3d stress test, this saves 1.7ms in the system and 2.8ms in frame time.
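As a rough illustration of the idea (not the code in this PR), the sketch below writes two independent buffers on separate threads so that the two longest writes overlap instead of running back to back. It assumes plain `std::thread::scope` as a stand-in for the engine's task pool, and `CpuBuffer`, `write_bytes`, and `write_all_parallel` are hypothetical names invented for this example; serializing into a `Vec<u8>` stands in for the actual GPU buffer write.

```rust
use std::thread;

/// Hypothetical stand-in for a CPU-side buffer that has to be serialized
/// before upload. This is not Bevy's actual buffer type.
struct CpuBuffer {
    label: &'static str,
    values: Vec<u32>,
}

impl CpuBuffer {
    /// Serialize the values to little-endian bytes. In this sketch this
    /// plays the role of the (comparatively slow) GPU buffer write.
    fn write_bytes(&self) -> Vec<u8> {
        let mut out = Vec::with_capacity(self.values.len() * 4);
        for v in &self.values {
            out.extend_from_slice(&v.to_le_bytes());
        }
        out
    }
}

/// Write every buffer on its own scoped thread and return the results
/// in the same order as the input slice.
fn write_all_parallel(buffers: &[CpuBuffer]) -> Vec<(&'static str, Vec<u8>)> {
    thread::scope(|scope| {
        // Spawn one task per buffer; the writes are independent of each other.
        let handles: Vec<_> = buffers
            .iter()
            .map(|buffer| scope.spawn(move || (buffer.label, buffer.write_bytes())))
            .collect();
        // Join in spawn order so the output order matches the input order.
        handles
            .into_iter()
            .map(|handle| handle.join().expect("buffer write panicked"))
            .collect()
    })
}

fn main() {
    let buffers = vec![
        CpuBuffer { label: "current_input_buffer", values: (0..1_000_000).collect() },
        CpuBuffer { label: "previous_input_buffer", values: (0..1_000_000).rev().collect() },
    ];
    for (label, bytes) in write_all_parallel(&buffers) {
        println!("{label}: {} bytes", bytes.len());
    }
}
```

When two writes take about the same time, as described for current_input_buffer and previous_input_buffer, running them concurrently brings the wall-clock cost close to that of a single write, which is where the roughly halved system time comes from.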

[Screenshots: frametime; system]

Testing

cargo rer bevymark_3d --features=debug,trace_tracy -- --benchmark --waves 250 --per-wave 1000

crates/bevy_render/src/batching/gpu_preprocessing.rs

kfc35

LGTM for correctness

Co-authored-by: Kevin Chen <[email protected]>

aevyrie · 2026-01-02T22:49:09Z

Revisited benchmarks on latest main, and the improvements are still reproducible.

The bottleneck is mesh collection, which is improved in #22297.

Write buffers in parallel

0940eca

alice-i-cecile added the A-Rendering, C-Performance, and S-Needs-Review labels Dec 30, 2025

github-project-automation bot added this to Rendering (Old) Dec 30, 2025

james7132 self-requested a review December 30, 2025 19:37

james7132 approved these changes Dec 30, 2025

kfc35 reviewed Jan 2, 2026

crates/bevy_render/src/batching/gpu_preprocessing.rs (outdated, resolved)

kfc35 approved these changes Jan 2, 2026

aevyrie and others added 2 commits January 2, 2026 14:20

Update crates/bevy_render/src/batching/gpu_preprocessing.rs

1b64fbb

Co-authored-by: Kevin Chen <[email protected]>

Merge branch 'main' into parallel-gpu-buffer-writes

de14d13

james7132 added the S-Ready-For-Final-Review label and removed the S-Needs-Review label Jan 5, 2026

alice-i-cecile added this to the 0.18 milestone Jan 5, 2026

alice-i-cecile added this pull request to the merge queue Jan 5, 2026

Merged via the queue into bevyengine:main with commit 5066b03 Jan 5, 2026
40 checks passed

github-project-automation bot moved this to Done in Rendering (Old) Jan 5, 2026

alice-i-cecile merged 3 commits into bevyengine:main from aevyrie:parallel-gpu-buffer-writes

4 participants
