Cluster small table/archetype into single Task in parallel iteration #12846

re0312 · 2024-04-02T12:33:50Z

Objective

Fixes Cluster small archetypes in parallel iteration #7303
Bevy spawns a large number of tasks in parallel iteration when a query matches one large storage and many small storages, which significantly increases scheduling overhead.

Solution

Collect small storages into a single task.
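
As a rough illustration of the idea, and not the code from this PR, the planning step can be sketched as follows. Everything here is hypothetical: the `Task` enum, the `plan_tasks` function, and the use of plain `usize` indices in place of Bevy's table/archetype ids. It assumes a storage counts as "small" when it holds fewer than `batch_size` entities.

```rust
/// One unit of work handed to the task pool (hypothetical type for this sketch).
#[derive(Debug, PartialEq)]
enum Task {
    /// Rows `start..end` of a single large storage.
    Range { storage: usize, start: usize, end: usize },
    /// Several small storages iterated back to back inside one task.
    Cluster(Vec<usize>),
}

/// Plans tasks for storages whose entity counts are listed in `counts`
/// (index = storage id). Large storages are split into ranges of at most
/// `batch_size` rows; small storages are queued and emitted together once
/// the queued entity count reaches `batch_size`.
fn plan_tasks(counts: &[usize], batch_size: usize) -> Vec<Task> {
    assert!(batch_size > 0, "batch_size must be non-zero");
    let mut tasks = Vec::new();
    let mut queue: Vec<usize> = Vec::new();
    let mut queued_entities = 0;

    for (storage, &count) in counts.iter().enumerate() {
        if count == 0 {
            continue; // empty storages need no task at all
        }
        if count >= batch_size {
            // Large storage: one task per batch-sized range.
            for start in (0..count).step_by(batch_size) {
                let end = (start + batch_size).min(count);
                tasks.push(Task::Range { storage, start, end });
            }
        } else {
            // Small storage: defer it until enough work has accumulated.
            queue.push(storage);
            queued_entities += count;
            if queued_entities >= batch_size {
                tasks.push(Task::Cluster(std::mem::take(&mut queue)));
                queued_entities = 0;
            }
        }
    }
    // Flush whatever is left so no storage is skipped.
    if !queue.is_empty() {
        tasks.push(Task::Cluster(queue));
    }
    tasks
}
```

With one storage of 100 entities, several storages of 2 to 5 entities, and a batch size of 10, this plan yields ten range tasks for the large storage and only a couple of cluster tasks for the small ones, rather than one task per small storage.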

alice-i-cecile · 2024-04-02T13:23:06Z

Looks sensible, but I'd really like to see benchmarks on the impact of this change.

re0312 · 2024-04-02T15:23:12Z

> Looks sensible, but I'd really like to see benchmarks on the impact of this change.

Added a benchmark for fragmented par_iter.

There is a slight loss for par_iter without fragmentation, but a huge gain when the number of fragments exceeds the available threads.

I also tested many_foxes (which is not a favorable case for this PR's changes); it still showed a slight performance improvement. The most significant gains came from check_visibility.

james7132

I'm a bit concerned what this will do with hierarchical parallel queries (i.e. transform propagation), where the top level parallel query has only a few values.

Otherwise looks good! Just one bit about allocation.

james7132 · 2024-04-03T01:32:52Z

crates/bevy_ecs/src/query/state.rs

-                    let table = &tables[table_id];
-                    if table.is_empty() {
-                        continue;
+            let mut batch_queue = vec![];


I'm not the biggest fan of needing to allocate a growable vec repeatedly. Could you try using arrayvec with a reasonable max queue size? (128 seems more than reasonable.)

benchmark using arrayvec
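
To make the suggestion concrete, here is a minimal sketch of a fixed-capacity queue that avoids a heap allocation per iteration. It uses only the standard library: `FixedQueue` is a hypothetical stand-in for an `arrayvec`-style container, and `cluster_ids` is an illustrative helper, not the code merged in this PR. The one behavioral difference from a growable vec is that the queue must also be flushed when it fills up, not only when enough entities have accumulated.

```rust
/// Fixed-capacity queue of storage ids stored inline (no heap allocation).
/// Hypothetical std-only stand-in for an `arrayvec`-style container.
struct FixedQueue<const CAP: usize> {
    items: [usize; CAP],
    len: usize,
}

impl<const CAP: usize> FixedQueue<CAP> {
    fn new() -> Self {
        Self { items: [0; CAP], len: 0 }
    }
    fn is_full(&self) -> bool {
        self.len == CAP
    }
    fn is_empty(&self) -> bool {
        self.len == 0
    }
    /// Pushes `id`, handing it back as `Err` if the queue is already full.
    fn try_push(&mut self, id: usize) -> Result<(), usize> {
        if self.is_full() {
            return Err(id);
        }
        self.items[self.len] = id;
        self.len += 1;
        Ok(())
    }
    fn as_slice(&self) -> &[usize] {
        &self.items[..self.len]
    }
    fn clear(&mut self) {
        self.len = 0;
    }
}

/// Groups small storages (fewer than `batch_size` entities) into clusters.
/// A cluster is emitted when enough entities are queued OR the queue is full.
/// The returned `Vec`s exist only so the result can be inspected here; a real
/// scheduler would hand each flushed queue to a task instead.
fn cluster_ids<const CAP: usize>(counts: &[usize], batch_size: usize) -> Vec<Vec<usize>> {
    assert!(CAP > 0, "queue capacity must be non-zero");
    let mut queue = FixedQueue::<CAP>::new();
    let mut queued_entities = 0;
    let mut clusters = Vec::new();

    for (id, &count) in counts.iter().enumerate() {
        if count == 0 || count >= batch_size {
            continue; // empty or large storages are handled elsewhere
        }
        // Cannot fail: the queue is flushed as soon as it becomes full.
        queue.try_push(id).expect("queue is flushed before it fills");
        queued_entities += count;
        if queued_entities >= batch_size || queue.is_full() {
            clusters.push(queue.as_slice().to_vec());
            queue.clear();
            queued_entities = 0;
        }
    }
    if !queue.is_empty() {
        clusters.push(queue.as_slice().to_vec());
    }
    clusters
}
```

For example, `cluster_ids::<128>(&counts, batch_size)` mirrors the capacity of 128 floated in the review; a smaller capacity simply produces more, smaller clusters.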

crates/bevy_ecs/Cargo.toml

crates/bevy_ecs/src/query/state.rs

re0312 added 4 commits April 2, 2024 20:13

submit

81efcb2

cleanup

b5116be

typo

2e9bb48

fix cli

fbe0021

pablo-lua added the A-ECS (Entities, components, systems, and events) and C-Performance (A change motivated by improving speed, memory usage or compile times) labels Apr 2, 2024

alice-i-cecile requested a review from james7132 April 2, 2024 13:22

alice-i-cecile added the S-Needs-Benchmarking (This set of changes needs performance benchmarking to double-check that they help) label Apr 2, 2024

re0312 added 2 commits April 2, 2024 22:43

bench

6e79e47

new

81a9b3d

alice-i-cecile added this to the 0.14 milestone Apr 2, 2024

alice-i-cecile removed the S-Needs-Benchmarking (This set of changes needs performance benchmarking to double-check that they help) label Apr 2, 2024

clean up

a0785fa

james7132 reviewed Apr 3, 2024

arrayvec

260fa53

james7132 approved these changes Apr 3, 2024

crates/bevy_ecs/Cargo.toml (outdated)

dep

38e9b0c

SkiFire13 reviewed Apr 3, 2024

crates/bevy_ecs/src/query/state.rs

reset

5f9a80a

james7132 requested a review from SkiFire13 April 4, 2024 04:19

SkiFire13 approved these changes Apr 4, 2024

james7132 added this pull request to the merge queue Apr 4, 2024

Merged via the queue into bevyengine:main with commit 4ca8cf5 Apr 4, 2024
31 checks passed
