-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
TG2-VALIDATION_OCCURRENCEID_STANDARD #23
Comments
Comment by Paul Morris (@chicoreus) migrated from spreadsheet: |
@ArthurChapman two typos in 'uninique' |
Thanks Joanna - fixed |
Splitting bdqffdq:Information Elements into "Information Elements ActedUpon" and "Information Elements Consulted" |
Description doesn't conform to the rest of the test. There is no source authority. Specification needs further work as well. |
RDF Guide asserts that identifiers "SHOULD be globally unique, referentially consistent, and persistent" And /'if those identifiers are to be used to identify subject resources in RDF, they MUST also be in the form of an IRI." This test can't readily assess the first three criteria (globally unique, referenially consistent (always returns the same object), and persistent), thought it can assess the later. Better is probably to cite recommendation R1 from the guid applicability statement https://github.com/tdwg/guid-as/blob/master/guid/tdwg_guid_applicability_statement.pdf, recommending a PURL, LSID, UUID, DOI, other Handle, or http URI. These have patterns that could be detected by this test. |
However Darwin Core doesn't require it to be globally unique, so probably needs further looking at From DwC: An identifier for the dwc:Occurrence (as opposed to a particular digital record of the dwc:Occurrence). In the absence of a persistent global unique identifier, construct one from a combination of identifiers in the record that will most closely make the dwc:occurrenceID globally unique. |
@ArthurChapman the "persistent global unique identifier," or "combination of identifiers in the record" are untenable to either write a specification for a test or to implement a test. This one we would be much better off adopting recommendation R1 from the guid applicability statement. That is tenable to write a specification for and to write code to evaluate. This is very similar to the problem we faced in #121 and #212 |
Thanks @chicoreus. I agree the GUID Applicability Statement is a fair base (Source Authority?), but what about the complexity of implementing #115? |
Thank you Lee for flagging this in your email. I have mixed feelings on this test. |
The recommendations for the term "Recommended best practice is to use a persistent, globally unique identifier" cannot be tested. I would put this into the "DO NOT IMPLEMENT" category. |
I agree with @tucotuco suggestion. i.e. DO NOT IMPLEMENT |
I mostly concur. The broad scope of the recommended best practice in the term itself is not practical to implement. However, this is essentially the same test as VALIDATION_SCIENTIFICNAMEID_COMPLETE #212 (and supplementary #121) but performed on a different darwin core term with a different definition. Since we are supporting that test, in a form that is implementable, I suggest we frame a more precisely parallel test, VALIDATION_OCCURRENCEID_COMPLETE that tests for conformance to recommendation R1 from the guid applicability statement https://github.com/tdwg/guid-as/blob/master/guid/tdwg_guid_applicability_statement.pdf, recommending a PURL, LSID, UUID, DOI, other Handle, or http URI. These have detectable patterns that could be validated by this test. Like #212, this would be somewhat aspirational (and might go in supplemental rather than core). If we don't we need to be very explicit about why we put a do not implement on #23 and have #212 in core (and #121 in supplementary). The difference might be that scientific name id is expected to point to someone else's identifier, and occurrence id is very likely to be minted locally by the database of record for the occurrence. But, we need to be very explicit about why one and not the other., or we need a paralell test VALIDATION_OCCURRENCEID_COMPLETE based on the RDF guide. |
This one would be VALIDATION_OCCURRENCEID_COMPLETE, proposed, but not specified is VALIDATION_OCCURRENCEID_STANDARD that would operate on the dwciri term. |
Perhaps I misunderstand, but there is no dwciri: equivalent of dwc: Id terms. Instead, identification of a subject in RDF is recommended to be accomplished with the rdf:about attribute of the rdf:Description element. |
You are right. No need for a VALIDATION_OCCURRENCEID_STANDARD test. |
The text was updated successfully, but these errors were encountered: