You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The current implementation of project and project_as_struct prune a schema down to a subset of its columns. However, they do not prune columns beyond the first level. Consider a checkpoint:
The current projection would only allow you to project onto txn, add, remove, metaData, or protocol columns. We cannot project onto a nested column like metaData.id. So if we wanted to inspect or filter on a nested column like metaData.id, we could only project it to metaData. In this case, we'd be forced to fetch all the leaf columns of metaData. This is because parquet flattens logical types and only stores the leaf columns.
The text was updated successfully, but these errors were encountered:
The current implementation of
project
andproject_as_struct
prune a schema down to a subset of its columns. However, they do not prune columns beyond the first level. Consider a checkpoint:The current projection would only allow you to project onto
txn
,add
,remove
,metaData
, orprotocol
columns. We cannot project onto a nested column likemetaData.id
. So if we wanted to inspect or filter on a nested column likemetaData.id
, we could only project it tometaData
. In this case, we'd be forced to fetch all the leaf columns of metaData. This is because parquet flattens logical types and only stores the leaf columns.The text was updated successfully, but these errors were encountered: