-
Notifications
You must be signed in to change notification settings - Fork 493
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Are dataset and file metadata records already sent to EZID/DataCite being updated? #5060
Comments
For now - Document the metadata is sent and when Different issue - Resending metadata to PID provider when the schema is modified (Julian to create a new issue) |
|
@sekmiller and everybody:
This does seem like a real issue. Since this (#5060) is related to DOI metadata, and it's already in dev., should we take a look at this problem as well, while we're at it? |
A quick investigation: I checked a couple of brand-new DOIs that have been minted w/ DataCite since the upgrade yesterday - and they appear to have separate creatorName entries; for example:
So my first guess was that this was only a problem with the DOIs migrated from EZID; but then the example Martin provided, above, is ALSO brand-new; minted yesterday. |
There was a typo in the doc the curl command for the api is: |
Regarding @landreev comment: "So this means, I'm guessing, that those 3 authors are actually stored as the single author name in our database (??)", I'm not sure why I never looked into this until now, but Leonid's hunch was right. Many of the datasets in the collection at https://dataverse.harvard.edu/dataverse/MIT-PSFC have multiple authors added in one field. This problem's been documented at #4035, though I suppose the name of the issue should be broadened. |
Are metadata records already sent to EZID/DataCite being updated as Dataverse adds metadata in the DataCite schema?
I'm assuming (so it's possible I'm wrong :) that in some cases it's not being updated because:
However, this OAI-PMH record from EZID includes all of the authors added to this dataset's second version (the first version had just one author), so it looks like some information from the DataCite schema's required fields, like the creator field, are getting updated, and some, like the resourceType, are not. (Will this affect Dataverse's ability to update the resourceType displayed in DataCite Fabrica (#5086)?)
Could the issue be on DataCite's and EZID's ends (maybe with the way they're updating metadata they make available over OAI-PMH)? Or with how DataCite produces the DataCite XML we can download for each work on DataCite Search?
It's important that the existing metadata records that EZID and DataCite have (and make available over OAI-PMH) are updated as Dataverse continues to improve the amount of metadata it sends to these data hubs, which redistribute this metadata and rely on some of it, like the relatedIdentifier metadata, to help generate citation metrics (for the Make Data Count work, #4821, which will be less effective if the metadata that DataCite has for old datasets doesn't include related identifier metadata).
The text was updated successfully, but these errors were encountered: