Parquet should encode RowSets more natively #5262
Labels
2023_unscheduled
core
Core development tasks
feature request
New feature or request
parquet
Related to the Parquet integration
query engine
Milestone
Right now, we encode and decode
RowSets
usingRowSetCodec
, delegating toExternalizableRowSetUtils
. We should consider whether we can replace this with an array-formatted column of longs (following the same strategy for representing individual row keys and row key ranges as we currently employ) and a newColumnTypeInfo.SpecialType
. This might produce comparable compression while allowing the column data to at least make some sense to external tools.Note that we should still preserve
RowSetCodec
for backwards-compatibility purposes.The text was updated successfully, but these errors were encountered: