Currently, when Presto runs an INSERT or CREATE TABLE AS into a partitioned table, each node can write to all partitions.
This works well when the number of partitions is reasonable, but it does not scale when loading data into a large number of partitions.
Typically, such a query fails with "Too many open files". After bumping the file limits, one can run into other issues (too many threads, out of memory, etc.).
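To make the failure mode concrete, here is a minimal sketch of such a load (the catalog, schema, table, and column names are made up); with the current behavior, every worker may hold an open writer for every distinct ds value, so the number of open files grows roughly as nodes × partitions:

```sql
-- Hypothetical Hive-style table partitioned by a high-cardinality column.
-- Today every worker node can receive rows for every ds value, so each node
-- keeps one open writer per partition it sees.
CREATE TABLE hive.warehouse.events_by_day
WITH (partitioned_by = ARRAY['ds'])
AS
SELECT event_id, payload, ds
FROM hive.staging.raw_events;   -- thousands of distinct ds values
```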
We should repartition data so that each partition is written by one node.
At first, this can be an opt-in feature (or opt-out? repartitioning is a safer bet).
Later, we can perhaps choose smartly at planning time whether or not to repartition the data.

This issue replaces #579
Repartitioning is not safer. If there is only one or a few partitions, the query will be many times slower. At FB, we never implemented this mode because we worried users would blindly set it, causing their queries to run for hours and never finish. Writing a single partition was the common case.
Either mode will be completely wrong for certain types of queries. It definitely needs to be a session property. I also think the current behavior (no repartitioning) should be the default, as running fast and then failing explicitly is better than silently running very slowly.
We had ideas around doing it adaptively at runtime, but that’s a big project and it was never a priority.
A heuristic that might work is to always disable repartitioning when all the partition keys are constant.
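For example (reusing the hypothetical table from above), in the insert below the partition key is a literal constant, so every row belongs to the same partition and hashing on ds would funnel the entire write through a single node:

```sql
-- Illustrative only: ds is constant for the whole query, so repartitioned
-- writes would send every row to one writer instead of spreading the work.
INSERT INTO hive.warehouse.events_by_day
SELECT event_id, payload, '2020-01-01' AS ds
FROM hive.staging.raw_events
WHERE ds = '2020-01-01';
```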
I will create a PR that allows connectors to specify a preferred insert partitioning (#2358). There will be a toggle that enables the engine to use such partitioning. In the future, we could use the CBO to choose the preferred insert partitioning.
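A rough sketch of how the toggle could be used from a session (the property name below is only an assumption for illustration; the actual name will be defined by the PR):

```sql
-- Assumed property name, shown only to illustrate the opt-in toggle.
SET SESSION use_preferred_write_partitioning = true;

INSERT INTO hive.warehouse.events_by_day
SELECT event_id, payload, ds
FROM hive.staging.raw_events;
```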