Skip to content

Commit

Permalink
Document the Iceberg split weight property
Browse files Browse the repository at this point in the history
Cherry-pick of trinodb/trino#12744

Co-authored-by: Alex Jo <jo.alex2144@gmail.com>
  • Loading branch information
2 people authored and highker committed Jul 1, 2022
1 parent 31979f7 commit 19397a1
Showing 1 changed file with 24 additions and 12 deletions.
36 changes: 24 additions & 12 deletions presto-docs/src/main/sphinx/connector/iceberg.rst
Original file line number Diff line number Diff line change
Expand Up @@ -29,25 +29,28 @@ Configuration Properties

The following configuration properties are available:

====================================== ====================================================
Property Name Description
====================================== ====================================================
``hive.metastore.uri`` The URI(s) of the Hive metastore.
========================================= =====================================================
Property Name Description
========================================= =====================================================
``hive.metastore.uri`` The URI(s) of the Hive metastore.

``iceberg.file-format`` The storage file format for Iceberg tables.
``iceberg.file-format`` The storage file format for Iceberg tables.

``iceberg.compression-codec`` The compression codec to use when writing files.
``iceberg.compression-codec`` The compression codec to use when writing files.

``iceberg.catalog.type`` The catalog type for Iceberg tables.
``iceberg.catalog.type`` The catalog type for Iceberg tables.

``iceberg.catalog.warehouse`` The catalog warehouse root path for Iceberg tables.
``iceberg.catalog.warehouse`` The catalog warehouse root path for Iceberg tables.

``iceberg.catalog.cached-catalog-num`` The number of Iceberg catalogs to cache.
``iceberg.catalog.cached-catalog-num`` The number of Iceberg catalogs to cache.

``iceberg.hadoop.config.resources`` The path(s) for Hadoop configuration resources.
``iceberg.hadoop.config.resources`` The path(s) for Hadoop configuration resources.

``iceberg.max-partitions-per-writer`` The maximum number of partitions handled per writer.
====================================== ====================================================
``iceberg.max-partitions-per-writer`` The maximum number of partitions handled per writer.

``iceberg.minimum-assigned-split-weight`` A decimal value in the range (0, 1] used as a minimum
for weights assigned to each split.
========================================= =====================================================

``hive.metastore.uri``
^^^^^^^^^^^^^^^^^^^^^^
Expand Down Expand Up @@ -114,6 +117,15 @@ The Maximum number of partitions handled per writer.

The default is 100.

``iceberg.minimum-assigned-split-weight``
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

A decimal value in the range (0, 1] used as a minimum for weights assigned to each split.
A low value may improve performance on tables with small files. A higher value may improve
performance for queries with highly skewed aggregations or joins.

The default is 0.05.

Nessie catalog
^^^^^^^^^^^^^^

Expand Down

0 comments on commit 19397a1

Please sign in to comment.