Semantic Metadata API can break when labels are changed (e.g. Contact -> Point of Contact) #8590

pdurbin · 2022-04-08T14:48:38Z

When we merged pull request #8454 that changed a variety of metadata field labels in the citation block (e.g. "Contact" became "Point of Contact"), we learned that it resulted in a backward-incompatible change to the Semantic Metadata API.

Specifically, testSemanticMetadataAPIs in DatasetsIT started failing.

The goal is for label changes to not affect the API. That is, pull requests against the citation block or other blocks where you are only changing the human-readable label ("Contact" vs. "Point of Contact") should not result in a breaking change to the API.

@qqmyers has provided a nice write up of the situation at #8533 (comment) and discussed all this during a recent tech hours.

I don't want to speak for everyone but I'm ok with a breaking change to the Semantic Metadata API in order to make it more tolerant of future label changes. One of the options we discussed is switching from the human readable label (Contact) to the machine readable one (datasetContact). That way, if "Contact" becomes "Point of Contact" in the future, the API continues to work the same way. So when creating a dataset, we'd change part of the JSON, perhaps like below.

old/current human readable

  "https://dataverse.org/schema/citation/Contact": {
    "https://dataverse.org/schema/citation/datasetContact#E-mail": "finch@mailinator.com",
    "https://dataverse.org/schema/citation/datasetContact#Name": "Finch, Fiona"
  }

new/proposed machine readable

  "https://dataverse.org/schema/citation/datasetContact": {
    "https://dataverse.org/schema/citation/datasetContactEmail": "finch@mailinator.com",
    "https://dataverse.org/schema/citation/datasetContactName": "Finch, Fiona"
  }

These machine readable names come from the tsv:

cat citation.tsv | cut -f2 | grep -i contact
datasetContact
datasetContactName
datasetContactAffiliation
datasetContactEmail

These are just ideas. Mostly I'm just trying to capture the problem. I'm not trying to specify the exact solution.

One more thing I'd feel remiss without saying... at some point we should decide that it's time for the Semantic Metadata API to graduate from the Developer Guide to the API Guide. No rush on this. Whenever we're ready.

The text was updated successfully, but these errors were encountered:

pdurbin mentioned this issue Apr 8, 2022

8127 citation field improvements #8454

Merged

qqmyers mentioned this issue Apr 9, 2022

IQSS/8533 Update Internal Semantic Mappings #8592

Merged

kcondon closed this as completed in #8592 May 13, 2022

mreekie added the pm.sprint.2022_05_11 label May 16, 2022

pdurbin added a commit to ErykKul/dataverse that referenced this issue May 18, 2022

rename SQL scripts, clarify docs IQSS#7492 IQSS#8533 IQSS#8590 IQSS#8592

ea342ab

pdurbin added this to the 5.11 milestone Jun 2, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Semantic Metadata API can break when labels are changed (e.g. Contact -> Point of Contact) #8590

Semantic Metadata API can break when labels are changed (e.g. Contact -> Point of Contact) #8590

pdurbin commented Apr 8, 2022

Semantic Metadata API can break when labels are changed (e.g. Contact -> Point of Contact) #8590

Semantic Metadata API can break when labels are changed (e.g. Contact -> Point of Contact) #8590

Comments

pdurbin commented Apr 8, 2022

old/current human readable

new/proposed machine readable