-
Notifications
You must be signed in to change notification settings - Fork 2.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Spec disagrees with implementation on field names #8684
Comments
Thanks for raising this Jan. I made a first attempt to fix this a while ago: #5338 For context, this has been changed with v2, because then there were both data and delete files. Since Iceberg does the lookup by field-id, this should not impose an issue, but it is good to get it fixed. |
@Fokko What is the intended future of your PR? If I read the discussion in there correctly, then the spec is correct, as the same field is also used to track delete files, so Do I get the gist of the discussion correctly? If so, do I understand correctly that we all agree that this should be fixed in the implementation, it was just not merged, yet? (I'm trying to gauge in which direction this will go, so I can adapt my implementation to either also write |
@JFinis You're in the right direction.
It is on my list for today. |
Apache Iceberg version
1.3.1 (latest release)
Query engine
Spark
Please describe the bug 🐞
The spec calls the following fields as follows:
504 added_files_count
505 existing_files_count
506 deleted_files_count
The implementation calls them as follows:
(The
_data
is in the implementation, but not in the spec)And hence the produced iceberg use these field names, which deviate from the names in the spec. Either the spec or the implementation should be updated.
The text was updated successfully, but these errors were encountered: