-
Notifications
You must be signed in to change notification settings - Fork 9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
cif_core: _publ_author.name data item is insufficient as a unique key to the PUBL_AUTHOR category #103
Comments
I agree with the proposal to add |
Update - cif core had no opinion, tried cif-developers. |
Update - cif-developers didn't seem much perturbed either way. So here are some updated definitions which I plan to run by the Core DMG. I have written them to replace
|
Note that audit_author would also be updated in the same way. |
Identifiers and links added in c70b1e8. |
I tried validating the existing CIF files against the new version of the dictionary and ran into a bit of a problem. Since the newly added Some possible solutions:
The ordinal data item would be useful either way, so I will file a separate issue for its inclusion if this approach is not accepted.
However, both of these approaches have the drawback of no longer requiring the @jamesrhester , what is your opinion on this? It would be nice to resolve this prior to the release of the new version of the dictionary which formally introduces the |
This looks to be the identical situation to In terms of validation, I think there is value in raising a validation issue (perhaps in some sort of strict mode?) as eventually we would like software to move to using the new data names. Nevertheless, if dREL were added to the Ideally, dREL would have a function, named something like |
You seem to have identified the best approach. Adding a dREL evaluation code would silence the validator in a semantically sound way. A |
The
PUBL_AUTHOR
category is defined as having the_publ_author.name
as its primary key. Clearly, situations can arise where several authors having the same name co-write a paper (the COD currently contains about ~40 such entries);. What is more, no combination of the data items currently defined in the category seem to satisfy the conditions of the primary key -- all items seem to describe properties that can match for both authors (i.e. the address) or be simply be undefined (i.e. ORCID).One of the possible solutions would be to create a new data item (i.e.
_publ_author.ordinal
) that takes the author order into account as is done in theCITATION_AUTHOR
category (using the_citation_author.ordinal
data item). The author order by itself could act as a sufficient primary key, however, this change would render most of the CIFs produced so far invalid. Using both the author order and the name as a composite key might be a more reasonable change since it would require the addition of ordinance only to those loops that contain the duplicate author names.The text was updated successfully, but these errors were encountered: