Skip to content

Commit

Permalink
adding some bad parquet files (#58)
Browse files Browse the repository at this point in the history
* adding some bad parquet files

* move to bad-data

* fmt?

---------

Co-authored-by: jp0317 <zjpzlz@gmail.com>
Co-authored-by: mwish <maplewish117@gmail.com>
  • Loading branch information
3 people authored Aug 15, 2024
1 parent 89ec47e commit cb7a967
Show file tree
Hide file tree
Showing 3 changed files with 5 additions and 1 deletion.
Binary file added bad_data/ARROW-RS-GH-6229-DICTHEADER.parquet
Binary file not shown.
Binary file added bad_data/ARROW-RS-GH-6229-LEVELS.parquet
Binary file not shown.
6 changes: 5 additions & 1 deletion bad_data/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -21,7 +21,11 @@
These are files used for reproducing various bugs that have been reported.

* PARQUET-1481.parquet: tests a case where a schema Thrift value has been
corrupted
corrupted.
* ARROW-RS-GH-6229-DICTHEADER.parquet: tests a case where the number of values
stored in dictionary page header is negative.
* ARROW-RS-GH-6229-LEVELS.parquet: tests a case where a page has insufficient
repetition levels.
* ARROW-GH-41321.parquet: test case of https://github.com/apache/arrow/issues/41321
where decoded rep / def levels is less than num_values in page_header.
* ARROW-GH-41317.parquet: test case of https://github.com/apache/arrow/issues/41317
Expand Down

0 comments on commit cb7a967

Please sign in to comment.