add_files
raises KeyError
if parquet file doe not have column stats
#1353
Milestone
add_files
raises KeyError
if parquet file doe not have column stats
#1353
Apache Iceberg version
0.8.0 (latest release)
Please describe the bug 🐞
Using the NYC taxi data set found here, if I follow the standard way of creating catalog, and table, but instead of doing
append
, I doadd_files
:I get a
KeyError
:This is because since this parquet file does not have columns level stats sets, in the source code, it goes into the else block here
So col_aggs and null_value_counts is not updated, but invalidate_col is update. So when the del command is run here, the KeyError is thrown.
As discussed on slack, @kevinjqliu proposed to switch
del col_aggs[field_id]
withcol_aggs.pop(field_id, None)
.I will be raising a PR soon.
The text was updated successfully, but these errors were encountered: