diff --git a/bad_data/README.md b/bad_data/README.md
index 472865b..baafde6 100644
--- a/bad_data/README.md
+++ b/bad_data/README.md
@@ -21,4 +21,8 @@ These are files used for reproducing various bugs that have been reported.
 * PARQUET-1481.parquet: tests a case where a schema Thrift value has been
-  corrupted
+  corrupted.
+* bad-dict-page-header.parquet: tests a case where the number of values
+  stored in dictionary page header is negative.
+* bad-levels.parquet: tests a case where a page has insufficient repetition
+  levels.
diff --git a/data/bad-dict-page-header.parquet b/data/bad-dict-page-header.parquet
new file mode 100755
index 0000000..7d14d5e
Binary files /dev/null and b/data/bad-dict-page-header.parquet differ
diff --git a/data/bad-levels.parquet b/data/bad-levels.parquet
new file mode 100644
index 0000000..110b783
Binary files /dev/null and b/data/bad-levels.parquet differ