
[native] Add caching of parsed Types #21325

Merged: 1 commit into prestodb:master from the cache_type_conversion branch on Nov 13, 2023.

Conversation

kevinwilfong (Contributor):

We've seen cases of queries that spend a large amount of time just parsing types when converting the Presto plan to Velox. This appears to be because the converter re-parses the same large Row types that are used across many field accesses.

Adding a cache scoped to a single request shows a substantial decrease in the time the conversion takes.

Notably, this helps with the timeouts we're seeing on calls from the coordinator to create tasks on the Workers.
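
As a rough illustration of the approach (a sketch written for this description, not the PR's exact code; parseTypeSignature and the member layout below are assumptions), a per-request parser can memoize parsed type signatures so a large Row type that shows up in many field accesses is only parsed once:

#include <string>
#include <unordered_map>

#include "velox/type/Type.h"

namespace velox = facebook::velox;

// Assumed existing entry point that performs the expensive (Antlr-based) parse.
velox::TypePtr parseTypeSignature(const std::string& text);

class TypeParser {
 public:
  // Returns the Velox type for a Presto type signature, reusing a previously
  // parsed result when the same signature string is seen again.
  velox::TypePtr parse(const std::string& text) const {
    auto it = cache_.find(text);
    if (it != cache_.end()) {
      return it->second;
    }
    auto type = parseTypeSignature(text);
    cache_.emplace(text, type);
    return type;
  }

 private:
  // mutable so parse() can stay const while it populates the cache; the
  // parser (and hence the cache) lives only for the duration of one request.
  mutable std::unordered_map<std::string, velox::TypePtr> cache_;
};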

kevinwilfong requested a review from a team as a code owner on November 6, 2023 23:21
kevinwilfong marked this pull request as a draft on November 6, 2023 23:21
kevinwilfong force-pushed the cache_type_conversion branch 2 times, most recently from 2d8fc42 to 17b39fa, on November 7, 2023 21:31
kevinwilfong marked this pull request as ready for review on November 8, 2023 19:11
xiaoxmeng (Contributor) left a comment:


@kevinwilfong nice catch. Thanks for the optimization!

@@ -61,6 +67,7 @@ class VeloxExprConverter {
const protocol::CallExpression& pexpr) const;

velox::memory::MemoryPool* pool_;
xiaoxmeng (Contributor):
Mark pool_ and typeParser_ as consts? Thanks!
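
The suggested change would look something like the following (typeParser_'s declared type is not shown in this hunk; TypeParser* is an assumption for illustration):

velox::memory::MemoryPool* const pool_;
TypeParser* const typeParser_;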

@@ -35,7 +37,7 @@ class VeloxQueryPlanConverterBase {
explicit VeloxQueryPlanConverterBase(
xiaoxmeng (Contributor):
NYC: drop explicit as the ctor takes more than one input? Thanks!

@@ -218,6 +220,7 @@ class VeloxQueryPlanConverterBase {
velox::memory::MemoryPool* pool_;
xiaoxmeng (Contributor):
NYC: mark pool_ and queryCtx_ as consts?

velox::memory::MemoryPool* const pool_;
velox::core::QueryCtx* const queryCtx_;

xiaoxmeng merged commit d1c5d83 into prestodb:master on Nov 13, 2023. All 59 checks passed.
majetideepak (Collaborator) commented on Nov 15, 2023:

@kevinwilfong I am adding Presto type parser support using Flex/Bison in Velox (facebookincubator/velox#7568). The end goal is to replace Antlr with it and remove a dependency.
I will add support for caching.
Is there a benchmark to evaluate the performance?

velox::TypePtr parse(const std::string& text) const;

private:
mutable std::unordered_map<std::string, velox::TypePtr> cache_;
majetideepak (Collaborator):
Any reason not to use the SimpleLRUCache from Velox?
We use that to cache file handles
https://github.com/facebookincubator/velox/blob/main/velox/connectors/hive/FileHandle.h#L62

majetideepak (Collaborator):
I am worried that without a bound, the cache might grow too big in a production system.
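
For illustration, a size-bounded variant could look like the sketch below. This is a generic LRU written for this discussion, not the actual SimpleLRUCache API from Velox; the class name and capacity handling are assumptions:

#include <cstddef>
#include <list>
#include <string>
#include <unordered_map>
#include <utility>

#include "velox/type/Type.h"

namespace velox = facebook::velox;

// A small LRU-bounded map from type signature strings to parsed types.
class BoundedTypeCache {
 public:
  explicit BoundedTypeCache(std::size_t maxEntries) : maxEntries_(maxEntries) {}

  // Returns the cached type, or nullptr if the signature has not been seen.
  velox::TypePtr get(const std::string& key) {
    auto it = map_.find(key);
    if (it == map_.end()) {
      return nullptr;
    }
    // Move the key to the front of the recency list.
    lru_.splice(lru_.begin(), lru_, it->second.second);
    return it->second.first;
  }

  void put(const std::string& key, velox::TypePtr type) {
    auto it = map_.find(key);
    if (it != map_.end()) {
      it->second.first = std::move(type);
      lru_.splice(lru_.begin(), lru_, it->second.second);
      return;
    }
    lru_.push_front(key);
    map_.emplace(key, std::make_pair(std::move(type), lru_.begin()));
    if (map_.size() > maxEntries_) {
      // Evict the least recently used entry to keep the cache bounded.
      map_.erase(lru_.back());
      lru_.pop_back();
    }
  }

 private:
  const std::size_t maxEntries_;
  std::list<std::string> lru_;
  std::unordered_map<
      std::string,
      std::pair<velox::TypePtr, std::list<std::string>::iterator>>
      map_;
};

With a bound like this (or with SimpleLRUCache), the cache cannot grow without limit even if a long-lived process sees many distinct type signatures.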
