Skip to content
This repository has been archived by the owner on Dec 20, 2018. It is now read-only.

spark-avro ignores the compression option in DataFrameWriter #259

Open
juarezr opened this issue Nov 30, 2017 · 0 comments
Open

spark-avro ignores the compression option in DataFrameWriter #259

juarezr opened this issue Nov 30, 2017 · 0 comments

Comments

@juarezr
Copy link

juarezr commented Nov 30, 2017

Documentation says that should set compression in SparkConf:

spark.conf.set("spark.sql.avro.compression.codec", "deflate")

But others formats like parquet allows setting it in DataFrameWriter options:

DataFrameWriter writer = rowDataset.write()
.format("com.databricks.spark.avro")
.option("compression","snappy")
.save(path);

For consistency, spark-avro could also respect this setting.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant