Incorrect calculation of the total number of estimated / real rows in some parallel plans #604

yhuelf · 2023-07-19T09:49:49Z

In this plan, the inner side of the join is executed in full for each worker and the leader. This means that every process must have a private copy of the hash. Therefore, it is inappropriate to multiply the number of rows by "loops" in this case (nodes 5 and 6).

See here for further details : https://www.postgresql.org/docs/current/parallel-plans.html#PARALLEL-JOINS

yhuelf · 2023-07-19T10:10:23Z

Compare with the parallel hash join for the same query.

The only difference with before is a RESET enable_parallel_hash;

https://explain.dalibo.com/plan/f11gg33e19adf0dh

yhuelf · 2023-07-19T10:33:00Z

Same problem with a merge join, of course, as per the documentation

https://explain.dalibo.com/plan/56a23c086073a315

MatteoGioioso · 2023-08-10T05:27:21Z

Noticed the same, when workers are present the rows in the plan are the average returned per worker despite the number of loops.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Incorrect calculation of the total number of estimated / real rows in some parallel plans #604

Incorrect calculation of the total number of estimated / real rows in some parallel plans #604

yhuelf commented Jul 19, 2023 •

edited

Loading

yhuelf commented Jul 19, 2023

yhuelf commented Jul 19, 2023 •

edited

Loading

MatteoGioioso commented Aug 10, 2023

Incorrect calculation of the total number of estimated / real rows in some parallel plans #604

Incorrect calculation of the total number of estimated / real rows in some parallel plans #604

Comments

yhuelf commented Jul 19, 2023 • edited Loading

yhuelf commented Jul 19, 2023

yhuelf commented Jul 19, 2023 • edited Loading

MatteoGioioso commented Aug 10, 2023

yhuelf commented Jul 19, 2023 •

edited

Loading

yhuelf commented Jul 19, 2023 •

edited

Loading