Core: Drop ParallelIterable's queue low water mark #10978

findepi · 2024-08-20T17:01:44Z

As part of the change in commit 7831a8d, a queue low water mark was introduced. However, it resulted in an increased number of manifests being read when planning LIMIT queries in the Trino Iceberg connector. To avoid the increased I/O, back out the change for now.

Fokko

Since this reverts to the original behavior 👍

* Core: Fix ParallelIterable memory leak where queue continues to be populated even after iterator close (apache#9402)

  (cherry picked from commit d3cb1b6)

* Core: Limit ParallelIterable memory consumption by yielding in tasks (apache#10691)

  ParallelIterable schedules 2 * WORKER_THREAD_POOL_SIZE tasks for processing input iterables. This defaults to 2 * # CPU cores. When one or some of the input iterables are considerable in size and the ParallelIterable consumer is not quick enough, this could result in unbounded allocation inside `ParallelIterator.queue`. This commit bounds the queue. When queue is full, the tasks yield and get removed from the executor. They are resumed when consumer catches up.

  (cherry picked from commit 7831a8d)

* Drop ParallelIterable's queue low water mark (apache#10978)

  As part of the change in commit 7831a8d, queue low water mark was introduced. However, it resulted in increased number of manifests being read when planning LIMIT queries in Trino Iceberg connector. To avoid increased I/O, back out the change for now.

  (cherry picked from commit bcb3281)

---------

Co-authored-by: Helt <heltman@qq.com>
Co-authored-by: Piotr Findeisen <piotr.findeisen@gmail.com>
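For readers unfamiliar with the "yield when the queue is full" idea described in the commit message above, here is a minimal single-threaded sketch. It is not Iceberg's `ParallelIterable` code: the names `YieldingQueueSketch`, `YieldingTask`, and `drain` are hypothetical, there is no executor or thread pool, and the policy of resuming the task only when the consumer finds the queue empty is an assumption chosen to keep the example deterministic.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;
import java.util.Queue;

// Hypothetical illustration only; not the actual Iceberg implementation.
final class YieldingQueueSketch {

  /** A producer task that copies items into a shared queue and yields when the queue is full. */
  static final class YieldingTask<T> {
    private final Iterator<T> input;
    private final Queue<T> queue;
    private final int maxQueueSize;

    YieldingTask(Iterator<T> input, Queue<T> queue, int maxQueueSize) {
      if (maxQueueSize < 1) {
        throw new IllegalArgumentException("maxQueueSize must be at least 1");
      }
      this.input = input;
      this.queue = queue;
      this.maxQueueSize = maxQueueSize;
    }

    /** Runs until the input is exhausted or the queue is full; returns true once the input is exhausted. */
    boolean run() {
      while (input.hasNext()) {
        if (queue.size() >= maxQueueSize) {
          // Yield: stop running but keep our position in the input, so a later run() continues from here.
          return false;
        }
        queue.add(input.next());
      }
      return true;
    }
  }

  /** Consumes the whole source through a bounded queue and returns the largest queue size observed. */
  static <T> int drain(List<T> source, int maxQueueSize, List<T> out) {
    Queue<T> queue = new ArrayDeque<>();
    YieldingTask<T> task = new YieldingTask<>(source.iterator(), queue, maxQueueSize);
    int peak = 0;
    boolean done = false;
    while (!done || !queue.isEmpty()) {
      if (queue.isEmpty()) {
        // The consumer caught up, so resume the yielded task (assumed policy for this sketch).
        done = task.run();
        peak = Math.max(peak, queue.size());
      }
      if (!queue.isEmpty()) {
        out.add(queue.poll());
      }
    }
    return peak;
  }

  public static void main(String[] args) {
    List<Integer> out = new ArrayList<>();
    int peak = drain(Arrays.asList(1, 2, 3, 4, 5, 6, 7), 3, out);
    System.out.println("peak queue size: " + peak + ", consumed: " + out);
  }
}
```

The point of the sketch is only that the queue never grows past its bound, regardless of input size. When a yielded task gets resumed is a separate policy decision, and that is the kind of choice this PR revisits by dropping the low water mark.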

github-actions bot added the core label Aug 20, 2024

findepi changed the title ~~Drop ParallelIterable's queue low water mark~~ Core: Drop ParallelIterable's queue low water mark Aug 20, 2024

findepi mentioned this pull request Aug 20, 2024

[1.6] Core: Drop ParallelIterable's queue low water mark #10979

Merged

Fokko approved these changes Aug 21, 2024

nastra approved these changes Aug 21, 2024

findepi merged commit bcb3281 into apache:main Aug 21, 2024
46 checks passed

findepi deleted the findepi/drop-paralleliterable-s-queue-low-water-mark-1fd7b3 branch August 21, 2024 22:05
