Commit
Merge branch 'branch-22.08' of github.com:NVIDIA/spark-rapids into ad…
…d-regex-choice-to-fuzzer
Showing 204 changed files with 15,088 additions and 2,198 deletions.
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,5 +1,6 @@ | ||
com/nvidia/spark/rapids/*/RapidsShuffleManager* | ||
com/nvidia/spark/rapids/AvroProvider.class | ||
com/nvidia/spark/rapids/HiveProvider.class | ||
com/nvidia/spark/rapids/iceberg/IcebergProvider.class | ||
org/apache/spark/sql/rapids/shims/*/ProxyRapidsShuffleInternalManager* | ||
spark-*-info.properties |
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,62 @@ | ||
--- | ||
layout: page | ||
title: Apache Iceberg Support | ||
parent: Additional Functionality | ||
nav_order: 7 | ||
--- | ||
|
||
# Apache Iceberg Support | ||
|
||
The RAPIDS Accelerator for Apache Spark provides limited support for Apache Iceberg tables. | ||
This document details the Apache Iceberg features that are supported. | ||
|
||
## Apache Iceberg Versions | ||
|
||
The RAPIDS Accelerator supports Apache Iceberg 0.13.x. Earlier versions of Apache Iceberg are | ||
not supported. | ||
|
||
## Reading Tables | ||
|
||
### Metadata Queries | ||
|
||
Reads of Apache Iceberg metadata tables, such as the `history`, `snapshots`, and other metadata | ||
tables associated with a table, are not GPU-accelerated. The CPU continues to process these | ||
metadata-level queries. | ||
|
||
### Row-level Delete and Update Support | ||
|
||
Apache Iceberg supports row-level deletions and updates. Tables configured with | ||
`write.delete.mode=merge-on-read` are not supported. | ||
|
||
### Schema Evolution | ||
|
||
Columns that are added or removed at the top level of the table schema are supported. Columns | ||
that are added or removed within struct columns are not supported. | ||
|
||
### Data Formats | ||
|
||
Apache Iceberg can store data in various formats. The sections below detail the level of support | ||
for each underlying data format. | ||
|
||
#### Parquet | ||
|
||
Data stored in Parquet is supported with the same limitations that apply when loading data from raw | ||
Parquet files. See the [Input/Output](../supported_ops.md#inputoutput) documentation for details. | ||
The following compression codecs applied to the Parquet data are supported: | ||
- gzip (Apache Iceberg default) | ||
- snappy | ||
- uncompressed | ||
- zstd | ||
|
||
#### ORC | ||
|
||
The RAPIDS Accelerator does not support Apache Iceberg tables using the ORC data format. | ||
|
||
#### Avro | ||
|
||
The RAPIDS Accelerator does not support Apache Iceberg tables using the Avro data format. | ||
|
||
## Writing Tables | ||
|
||
The RAPIDS Accelerator for Apache Spark does not accelerate Apache Iceberg writes. Writes | ||
to Iceberg tables will be processed by the CPU. |
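
The read-side rules in the document above can be restated as a small checklist. The following Python sketch is not part of this commit and is not how the RAPIDS Accelerator decides what to accelerate; `gpu_read_blockers` is a hypothetical helper that only encodes the documented limits (Iceberg 0.13.x, Parquet data, the four listed codecs, no `merge-on-read` deletes). It assumes the standard Iceberg table properties `write.format.default` and `write.parquet.compression-codec`, and it assumes those properties reflect the files already in the table, which may not hold for tables whose settings changed over time. Struct-level schema changes are not checked.

```python
# Hypothetical checklist helper: restates the documented support rules only.
# It does not call Spark, Iceberg, or the RAPIDS Accelerator.

SUPPORTED_PARQUET_CODECS = {"gzip", "snappy", "uncompressed", "zstd"}


def gpu_read_blockers(iceberg_version, table_properties):
    """Return reasons, per the document, that a table read may stay on the CPU."""
    props = {key: str(value).lower() for key, value in table_properties.items()}
    blockers = []

    # Only 0.13.x is documented as supported.
    if not iceberg_version.startswith("0.13."):
        blockers.append(f"Iceberg {iceberg_version} is not documented as supported")

    # Parquet is the only supported data format; ORC and Avro are not.
    data_format = props.get("write.format.default", "parquet")
    if data_format != "parquet":
        blockers.append(f"data format '{data_format}' is not supported")
    else:
        # The document lists gzip as the Apache Iceberg default codec.
        codec = props.get("write.parquet.compression-codec", "gzip")
        if codec not in SUPPORTED_PARQUET_CODECS:
            blockers.append(f"Parquet codec '{codec}' is not listed as supported")

    # Tables configured for merge-on-read deletes are not supported.
    if props.get("write.delete.mode", "copy-on-write") == "merge-on-read":
        blockers.append("write.delete.mode=merge-on-read is not supported")

    return blockers


if __name__ == "__main__":
    example = {"write.format.default": "orc", "write.delete.mode": "merge-on-read"}
    for reason in gpu_read_blockers("0.13.1", example):
        print(reason)
```

An empty list means none of the documented restrictions applies to the given properties; it does not guarantee that a query will run on the GPU.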