Added ability to generate unique paths for tables #2

sshkvar · 2020-10-22T14:46:37Z

Also, add an integration test to TestMemorySmoke

Some connector might have partial support for DELETE statement.

sendUpdate for the next request can be triggered after currentRequest future is done but before UpdateResponseHandler callback execution. This can overwrite value of currentRequestStartNanos before the callback has recorded the stats for the completed request. The next request may also miss the value of sendPlan set in the callback execution in this scenario and send the plan an extra time.

- inline field - remove obvious comments - limit access to field - remove explicit default constructor

Fixes trinodb#8268 The problem was caused by multiple rows having the same (writeId, bucket, rowId). In order to fix this it is necessary to ensure unique row IDs across writers. To achieve it different writers will have separated id ranges in the split assigned to them

Co-authored-by: Ashhar Hasan <hashhar_dev@outlook.com>

SPI cannot use Guava's toImmutableList

Add option to configure many SystemSessionPropertiesProviders in the SessionPropertyManager

Not all connector tests run within containers so any operation that deletes data from the TPCH tables can change state of the testing infrastructure leading to hard to diagnose failures. So we create temporary tables to verify if deletes are supported or not.

Hoists "any row" replication and nullBlock.isNull logic out of the PartitionedOutputOperator.PagePartitioner inner loop logic

Trino will cast (if possible) storage table columns if they don't match materialized view column types.

Before the change, Trino was not able to read from an Iceberg v1 table in which a partitioning field was removed via Spark. In v1 tables, removed partitioning fields are replaced with `void` transformation to keep field ordinal numbers unchanged, and `void` transformation was not supported.

In v1 tables, removed partitioning fields are replaced with `void` transformation to keep field ordinal numbers unchanged, and `void` transformation was not supported when writing. This commit adds support for writing with `void` transformation and, implicitly, for creating tables with such transformation.

Some databases are case-insensitive (MySQL, SQL Server) while others sort textual types differently compared to Trino (PostgreSQL). For such databases pushdown of aggregation functions when the grouping set includes a textual type can lead to incorrect results. So we prevent aggregation pushdown for such cases. We also prevent pushdown for functions whose results depend on sort order (min/max) when the input is a textual type.

They are currently not supported. Previously, Iceberg would use `HiveMetastoreModule` and thus inherited metastore configuration from Hive connector. `IcebergMetastoreModule` existed for validation purposes. Besides thrift and file, Iceberg inherited 'support' for glue (which actually didn't work at runtime) and alluxio metastores (which was not intentional). The commit copies `HiveMetastoreModule` into `IcebergMetastoreModule` so that the latter class defines metastore configuration for Iceberg. The Glue metastore option is currently not supported, but will be added in the future. The Alluxio metastore option is dropped.

We started to hitting trinodb#8719 without any explicit change on our side (the world has changed). Disabling oauth2 to not to distract other areas of development. While, in the same time running investigation what has changed and how to mitigate it.

Before the change, it was not intuitive that `CharValueWriter` is suitable for `varbinary` data. The new name better describes what the writer is doing, and that it doesn't assume slice's contents to represent characters.

It should have been deleted in ced0253.

In order to prevent introducing regressions a test that counts accesses to file system is added. The test differentiate between different files and accesses types. Additioanlly this pr removes queryId from information provided by TrackingFileIoProvider as it is no longer needed

…table

raunaqmorarka and others added 30 commits July 1, 2021 21:43

Handle concurrent table drop in TableCommentSystemTable

922aac8

Add current_groups function

4922d66

Add 359 release notes

de922fd

[maven-release-plugin] prepare release 359

1609dfb

[maven-release-plugin] prepare for next development iteration

851edb1

Move Mergeable interface to SPI

d159c6c

Allow exposing custom metrics from the connector

38305b7

Implement Count and Histogram metrics in trino-plugin-toolkit

60f66d8

Also, add an integration test to TestMemorySmoke

Extract complex delete tests to separate test methods

3013849

Some connector might have partial support for DELETE statement.

Support metadata DELETE in JDBC connectors

825e6d8

Add missing method call delegation to HiveMaterializedViewMetadata

cf45f7b

Add missing jol-core SPI dependency

5cacd61

Minor cleanup in connectors

cbea02b

- inline field - remove obvious comments - limit access to field - remove explicit default constructor

Extract utility TypeDeserializerModule

da6aa6f

Fix comment

5ff27ed

Introduce SystemTableProvider

a9148a9

Add insert_batch_size to JDBC metadata session properties

70e849c

Co-authored-by: Ashhar Hasan <hashhar_dev@outlook.com>

Replace toList with toUnmodifiableList() collector

c7adc54

SPI cannot use Guava's toImmutableList

Remove redundant code

90c22f6

Inject SessionPropertyManager in DispatchManager

9d090a5

Add SystemSessionPropertiesProvider

b5217f0

Add option to configure many SystemSessionPropertiesProviders in the SessionPropertyManager

Simplify lambda to method reference

baa7082

Fix README for Trino Verifier

a35080a

Close systemMemoryContext in PartitionedOutputOperator#close()

17cf25f

Implement mayHaveNull() for Dictionary, RLE, and GroupById Blocks

bc1d5ba

Simplify PagePartitioner inner loop conditional branching

6bbe9f1

Hoists "any row" replication and nullBlock.isNull logic out of the PartitionedOutputOperator.PagePartitioner inner loop logic

Support materialized view storage table column casting

12fb514

Trino will cast (if possible) storage table columns if they don't match materialized view column types.

lukasz-stec and others added 27 commits July 30, 2021 10:49

Rename PagePartitioner to DefaultPagePartitioner

6c5a101

Modularize PagePartitioner

6058b29

Add .run directory to .gitignore

09fd03a

Test optimized Parquet writer in Hive product tests

6f65dbe

Add 360 release notes

685fe6d

[maven-release-plugin] prepare release 360

125ce0a

[maven-release-plugin] prepare for next development iteration

f5fd024

Don't use raw Metric type outside of Metrics class

1f896ea

Update ConnectorMetadata#listTables contract

61077e6

Update naming in Iceberg Trino/Spark compatibility test

adb45f3

Bind container ports to host ports by default in product tests

c3cd438

Update error-prone to 2.8.0

480a621

Provide more information to connectors to control aggregation pushdown

9e3a1db

Iceberg: add new fields in files system table

cf0ae11

Remove redundant test resource from TestIcebergPlugin

e927b4c

Disable oauth2 product tests

ca55aac

We started to hitting trinodb#8719 without any explicit change on our side (the world has changed). Disabling oauth2 to not to distract other areas of development. While, in the same time running investigation what has changed and how to mitigate it.

Convert to Iceberg value strictly

8fbd7fa

Rename varchar/varbinary Parquet writer

4312479

Before the change, it was not intuitive that `CharValueWriter` is suitable for `varbinary` data. The new name better describes what the writer is doing, and that it doesn't assume slice's contents to represent characters.

Correct type used to get varbinary value

89b40ce

Fix typo in test constant name

f18a0a5

Move teardown next to setup method

d1571c0

Remove left over file

e99b53b

It should have been deleted in ced0253.

sshkvar force-pushed the issue-5632 branch from 4e7941a to 3599db5 Compare August 3, 2021 11:05

rebase: Added ability to have unique table location for each iceberg …

0a30f79

…table

sshkvar force-pushed the issue-5632 branch from 3599db5 to 0a30f79 Compare August 3, 2021 12:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added ability to generate unique paths for tables #2

Added ability to generate unique paths for tables #2

sshkvar commented Oct 22, 2020

Added ability to generate unique paths for tables #2

Are you sure you want to change the base?

Added ability to generate unique paths for tables #2

Conversation

sshkvar commented Oct 22, 2020