Merge from master #1

madhurkumar · 2017-06-27T20:32:29Z

No description provided.

Make BlockBuilderStatus nullable in various block builders to reduce memory usage for functions that aggregate columns into complex types, e.g., array_agg, map_agg, histogram, and min_by. In theory, the change can reduce memory utilization by up to 43%, which is the ratio of the size of BlockBuilderStatus to the size of BlockBuilder. In practice, we may observe memory savings of around 25-30% on large-scale data.

Small-scale benchmark for array_agg on double type:

before: sql_double_array_agg :: 33.493 cpu ms :: 3.13MB peak memory
after:  sql_double_array_agg :: 33.414 cpu ms :: 2.33MB peak memory

Large-scale benchmark for array_agg on double type:

rows    memory (before)    memory (after)    save
20 M    4.79 GB            3.50 GB           27%
30 M    7.53 GB            5.64 GB           25%
40 M    9.86 GB            7.03 GB           28%
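
The "save" figures in the large-scale benchmark can be recomputed from the two memory columns as (before - after) / before. The sketch below does only that arithmetic; the class and method names are made up for illustration and are not part of the benchmark code.

```java
// Illustrative arithmetic only; not part of the Presto benchmark code.
final class MemorySavingsSketch {
    // Relative saving in percent, given peak memory before and after the change.
    static double savingsPercent(double before, double after) {
        return (before - after) / before * 100.0;
    }

    public static void main(String[] args) {
        // {memory before, memory after} in GB, copied from the large-scale benchmark.
        double[][] rows = {{4.79, 3.50}, {7.53, 5.64}, {9.86, 7.03}};
        for (double[] row : rows) {
            System.out.printf("%.2f GB -> %.2f GB: %.1f%%%n",
                    row[0], row[1], savingsPercent(row[0], row[1]));
        }
    }
}
```

The recomputed values are roughly 26.9%, 25.1%, and 28.7%, which agree with the reported figures to within one percentage point.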

nonReserved cannot contain recursive rules.

The mechanism for handling non-reserved keywords relies on being able to intercept the parsing operation and replace the nonReserved entry in the parse tree with an IDENT token. Modifying the tree in this manner breaks when dealing with intermediate nodes, presumably due to how enter/exit rule notification is woven into the traversal/building of the tree.

Make Util.pruneInputs return Optional<Set<Symbol>>, rather than Optional<List<Symbol>>, because for most nodes we're thinking about sets of used symbols, rather than the ordering of the output symbol list. In two specific cases, ValueNode and TableScanNode, retain the previous ordering explicitly, rather than losing it in the set, to avoid making superfluous changes to the node during optimization. The PruneJoinColumns rule was already retaining the order of its outputs, and this patch does not affect that behavior.
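
A minimal sketch of the "retain the previous ordering" idea: filter the existing ordered output list against the set of used symbols instead of rebuilding the list from the set. Symbols are represented as plain strings here, and the class and method names are hypothetical, not Presto's.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;

// Hypothetical helper; symbols are modeled as strings for illustration.
final class RetainOrderSketch {
    // Keeps the elements of `outputs` that appear in `used`, in their original order.
    static List<String> filterPreservingOrder(List<String> outputs, Set<String> used) {
        List<String> kept = new ArrayList<>();
        for (String symbol : outputs) {
            if (used.contains(symbol)) {
                kept.add(symbol);
            }
        }
        return kept;
    }
}
```

Because the result is derived from the original list, a node whose outputs are all still used gets back the same list in the same order, which avoids a superfluous change to the node.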

After executing a schema-altering query such as CREATE TABLE, the DataStax driver waits for a certain period for "schema agreement" before refreshing the schema. The default timeout is 10 seconds, which might be too low for schema changes to propagate to all nodes.

Elon Azoulay and others added 30 commits June 6, 2017 10:00

Fix jmx export issue in resource group configuration manager

e098136

If the current and new values are the same, do not set the JMX export; otherwise an exception is thrown.
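
A minimal sketch of that guard, assuming a hypothetical holder class with a boolean export flag. The commit message does not say why the exception is thrown or what the real class looks like, so everything below is illustrative.

```java
// Hypothetical stand-in for a resource group; not Presto's actual class.
final class JmxExportGuardSketch {
    private boolean jmxExport;
    private int changes; // counts how often the export state actually changed

    // Only touch the export state when the value differs from the current one.
    void setJmxExportIfChanged(boolean newValue) {
        if (jmxExport == newValue) {
            return; // same value: skip the setter that would otherwise throw
        }
        jmxExport = newValue;
        changes++;
    }

    boolean isJmxExport() {
        return jmxExport;
    }

    int getChanges() {
        return changes;
    }
}
```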

Change UpdateablePriorityQueue to implement Iterable

de9f348

Expose elements in the queue via an iterator.
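
A simplified sketch of exposing queue elements through Iterable. It wraps java.util.PriorityQueue rather than reproducing UpdateablePriorityQueue, whose internals are not shown here; the class name is hypothetical.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.Iterator;
import java.util.PriorityQueue;

// Hypothetical simplified queue; only illustrates implementing Iterable.
final class IterablePriorityQueueSketch<E> implements Iterable<E> {
    private final PriorityQueue<E> queue;

    IterablePriorityQueueSketch(Comparator<? super E> comparator) {
        this.queue = new PriorityQueue<>(comparator);
    }

    void add(E element) {
        queue.add(element);
    }

    E poll() {
        return queue.poll();
    }

    @Override
    public Iterator<E> iterator() {
        // Iterate over a snapshot so callers cannot remove elements through the iterator.
        // java.util.PriorityQueue does not guarantee priority order during iteration.
        return new ArrayList<>(queue).iterator();
    }
}
```

Returning a snapshot is a design choice of this sketch: it keeps the queue's invariants out of reach of callers at the cost of one copy per iteration.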

Add query queued and running time limits to resource groups

19f8ee1

Add two new properties to resource groups: queuedTimeLimit and runningTimeLimit.

Cap result of sequence function to 10000 entries

f2543bb

Add repeat function

d804fd0

Allow repeating an element a given number of times. The result is an array.
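
The semantics can be sketched in plain Java as follows. This is not the function's actual implementation, and rejecting a negative count is an assumption of the sketch rather than documented behavior.

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative only; not the engine's implementation of repeat.
final class RepeatSketch {
    // Returns a list containing `element` exactly `count` times.
    static <T> List<T> repeat(T element, int count) {
        if (count < 0) {
            // Assumption: negative counts are rejected.
            throw new IllegalArgumentException("count must be non-negative");
        }
        List<T> result = new ArrayList<>(count);
        for (int i = 0; i < count; i++) {
            result.add(element);
        }
        return result;
    }
}
```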

Include object overhead in big array sizeOf calculations

c6a12f7

Include object overhead in grouped state getEstimatedSize

bb2bf99

Remove unused field

bd64363

Add array aggregation benchmark

84f1b01

Add array aggregation benchmark to measure the performance for double-typed array aggregation. Extend the ability of AbstractOperatorBenchmark to expose peak memory usage.

Remove deprecated methods in AggregationNode

5611a18

Support generating interface

5c8cc82

Remove extra test cases

ec5ac90

The last three test cases in TestTransformExistsApplyToScalarApply#testDoesNotFire differ little, so there is little point in keeping them all. This commit removes two of them.

Make possible to filter not supported optimizers in tests

091b9a4

Introduce LateralJoinNode

588a01d

Use LateralJoin for scalar subqueries

ab72f8b

Check that ApplyNode is used only for non scalar subqueries

40785a0

LateralJoin is preferred for scalar subqueries.

Throw exception instead of returning wrong value

baa7878

Returning only input properties for apply node is incorrect as it does not consider subquery properties.

Filter AddLocalExchanges when testing subqueries logical plan

592f41f

AddLocalExchanges uses PropertyDerivation classes, which do not support subquery-related plan nodes such as ApplyNode.

Extract BasePlanTest#assertPlan

1e2bb6c

Use expression method instead of hardcoding AST tree

97cb912

Format TestTransformCorrelatedScalarAggregationToJoin

d6bf984

Fix ORDER BY with LIMIT expressions with same canonicals

762a0db

wkkk#

Use static import for checkArgument

07c98a7

Support GROUPING() in legacy ORDER BY

ca013b2

Use Hive Text instead of Java String in GenericHiveRecordCursor

e4dfc8e

Fix parsing of normal form tokens

4b2612d

nonReserved cannot contain recursive rules.

Remove legacy Hive connectors

1fbd226

Remove ResourceGroupId.fromSegmentedName

7936d29

Resource group names must be allowed to contain '.' for backward compatibility, so '.' cannot serve as the delimiter for parsing a '.'-separated segmented name.
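
The ambiguity can be shown with a small sketch. The helper names below are hypothetical and do not reproduce ResourceGroupId; they only show that joining segments with '.' cannot be reversed once a segment itself contains '.'.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical helpers illustrating why '.' is an unsafe delimiter.
final class SegmentedNameSketch {
    // Joins name segments with '.' as the separator.
    static String join(List<String> segments) {
        return String.join(".", segments);
    }

    // Naively splits a joined name back into segments on every '.'.
    static List<String> split(String name) {
        return Arrays.asList(name.split("\\."));
    }
}
```

For example, the segment lists ["global", "team.a"] and ["global", "team", "a"] both join to "global.team.a", so splitting that string cannot tell them apart.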

ArturGajowy and others added 29 commits June 23, 2017 07:38

Migrate TransformUncorrelatedInPredicateSubqueryToSemiJoin to Rule

36d1afe

Clean up RemoveUnreferencedScalarLateralNodes

15d1d00

Migrate RemoveUnreferencedScalarLateralNodes to Rule

410c63c

Use older Travis image

001c0e5

The Travis builds hang on TestHiveTableStatistics with the new images. https://blog.travis-ci.com/2017-06-21-trusty-updates-2017-Q2-launch

Check position count in Block and Page assertions

859a02e

Add configuration option to disable RCFile write validation

d243baf

Enable new RCFile without validation by default

3ca2819

Use OkHttp for CLI and JDBC driver

62a8af4

Allow configuring JDBC HTTP client per connection

d46cf82

Simplify assertions in TestPrestoDriverUri

c4370c9

Improve properties/URL parameter handling

defabcb

Add support for SSL in JDBC driver

322c57d

Extract base class for LDAP JDBC tests

f7d2ddb

Add SSL LDAP product tests for Presto JDBC driver

5a7d9d1

Add SOCKS proxy support for JDBC driver

f978ab4

Add HTTP proxy support for JDBC driver and CLI

ce76411

Add Kerberos support for JDBC driver

431f870

Fix statement resource handling of zero size pages

7df7b6e

If page.getSizeInBytes() is zero, the page may not be returned to the client. The only known case when this happens is if the page contains only SliceArrayBlocks and all values are null.

Fix formatting in QueryRewriter

94d34ef

Fix NPE in TestMemoryManager

9302693

Retry when retrieving metadata in Cassandra tests

920fc48

Retry to account for the fact that the schema metadata may take some time to refresh. This is similar to the retry mechanism that was added to the assertContains() method. Hopefully this will fix the intermittent test failures.
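
A generic sketch of such a retry loop. The helper name, the attempt count, and the sleep interval are assumptions made for illustration; the actual test code is not shown on this page.

```java
import java.util.Optional;
import java.util.function.Supplier;

// Hypothetical retry helper; not the code used in the Cassandra tests.
final class MetadataRetrySketch {
    // Calls `fetch` until it yields a value or `maxAttempts` is exhausted.
    static <T> T retryUntilPresent(Supplier<Optional<T>> fetch, int maxAttempts, long sleepMillis) {
        if (maxAttempts < 1) {
            throw new IllegalArgumentException("maxAttempts must be at least 1");
        }
        for (int attempt = 1; attempt <= maxAttempts; attempt++) {
            Optional<T> value = fetch.get();
            if (value.isPresent()) {
                return value.get();
            }
            if (attempt < maxAttempts) {
                try {
                    Thread.sleep(sleepMillis); // give the metadata time to refresh
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                    throw new IllegalStateException("interrupted while waiting for metadata", e);
                }
            }
        }
        throw new IllegalStateException("metadata not available after " + maxAttempts + " attempts");
    }
}
```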

Fix message for Hive partitions dropped during execution

14cf725

The Hive connector lists partition names during planning, then fetches metadata separately while generating splits. The latter can fail if the partition is dropped in between the two metastore calls.
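
A minimal sketch of the race described above: partition names captured at planning time are checked against the metadata fetched later, and a missing entry produces a descriptive error. The class name, method name, and message text are hypothetical, not the connector's.

```java
import java.util.List;
import java.util.Map;

// Hypothetical check; the real connector code and message are not shown here.
final class DroppedPartitionSketch {
    // Fails with a clear message if a planned partition is missing from the fetched metadata.
    static void verifyAllPresent(List<String> plannedNames, Map<String, ?> fetchedByName) {
        for (String name : plannedNames) {
            if (!fetchedByName.containsKey(name)) {
                throw new IllegalStateException(
                        "Partition no longer exists (dropped after planning?): " + name);
            }
        }
    }
}
```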

Remove redundant verify check

afe0c11

Create a user friendly error message for Raptor

5f0a575

Detect the MySQL error ER_TRANS_CACHE_FULL and generate an additional error message.
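
One way to detect this error over JDBC is to compare the vendor error code of a SQLException with 1197, the MySQL server code for ER_TRANS_CACHE_FULL. The sketch below assumes that approach; the class name and the additional message text are hypothetical and are not Raptor's.

```java
import java.sql.SQLException;

// Hypothetical detector; the message wording is illustrative only.
final class TransCacheFullSketch {
    // MySQL server error code for ER_TRANS_CACHE_FULL.
    static final int ER_TRANS_CACHE_FULL = 1197;

    // Returns true if the throwable or any of its causes is a SQLException with that code.
    static boolean isTransCacheFull(Throwable throwable) {
        for (Throwable t = throwable; t != null; t = t.getCause()) {
            if (t instanceof SQLException
                    && ((SQLException) t).getErrorCode() == ER_TRANS_CACHE_FULL) {
                return true;
            }
        }
        return false;
    }

    // Appends a hint to the original message when the error is detected.
    static String describe(Throwable throwable) {
        String message = String.valueOf(throwable.getMessage());
        if (isTransCacheFull(throwable)) {
            return message + " (the transaction exceeded the MySQL binary log cache size)";
        }
        return message;
    }
}
```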

Build instruction with maven wrapper

3f4a79d

Fix PipelineContext fullyBlocked status check

d4c4ad3

Add blocked drivers to StatementStats and StageStats

d518b75

madhurkumar merged commit dcc11c9 into madhurkumar:master Jun 27, 2017
