From 41d7e9ea7f14b38dab441e099b4e48570489c19c Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?Viktor=20Sz=C3=A9pe?= Date: Tue, 6 Aug 2024 22:02:54 +0200 Subject: [PATCH] Improve parquet.md (#1172) --- docs/components/libs/parquet.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/components/libs/parquet.md b/docs/components/libs/parquet.md index e9437e71b..505375f90 100644 --- a/docs/components/libs/parquet.md +++ b/docs/components/libs/parquet.md @@ -281,7 +281,7 @@ A Much better approach is to reduce the row group size to something closer to 1M what your default page size should be - like for example 100 or 500 (that obviously depends on your data) This way you will keep memory usage low, and you will be able to paginate over big files without any issues. -But it will take a bit longer to write into those files since writter will need to flush and calculate staticists +But it will take a little longer to write to these files, because writer has to flush and calculate statistic more frequently. Unfortunately, there is no one size fits all solution here.