-
Notifications
You must be signed in to change notification settings - Fork 70
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Implement changes to "normative document" discussed at DwC MG meeting #264
Conversation
In the record level terms section, replace dcterms:language with dc:language. Use the former dcterms: examples for string values with dc:language. Move dcterms:language to the UseWithIRI section and change its recommended value to an IRI from the LOC ISO 639-2 scheme (to maintain consistency with Audubon Core usage. NOTE: the value of organized_in for dc:language is now different from the other DCMI terms in the record-level section. Will that break the QRG build script?
Move dcterms:type below the recommended terms and change status to deprecated. Change its notes to indicate that dc:type should be used for strings and that the RDF guide recommends rdf:type instead of dcterms:type when providing an IRI value. Replace dcterms:type in the record-level terms with dc:type and carry over the examples that were previously in dcterms:type to it. NOTE: the terms below the recommended terms are not in alphabetical order still.
Note: I did not change the obsolete terms, which have always had camelCase labels. Also, the IRI terms that have non-IRI analogs have " (IRI)" appended to their labels to distinguish them.
Added the missing legacy superproperty dwc:accordingTo and made the status deprecated.
In the previous term_versions.csv file, the value in the replaces column indicated what DCMI said the term replaced. However, in the complete history table, we use this column to indicate what this term version replaces with respect to previous TDWG use. For example, in an Executive Decision, dcterms:license was to be used in lieu of dcterms:rights. I've changed the table to reflect this replacement, rather than indicating how http://dublincore.org/usage/terms/history/#rightsHolder-002 was caused by DCMI to replace http://dublincore.org/usage/terms/history/#rightsHolder-001 . There are several other examples of replacements caused by TDWG actions and these have replaced the record of DCMI actions. If there was no TDWG-sanctioned replacement, the column was changed to empthy.
I forgot several technical changes related to the |
dcterms:rights was originally in Darwin Core, but was replaced by dcterms:license in Executive Decision http://rs.tdwg.org/dwc/terms/history/decisions/#Decision-2014-11-06_17 . It has been added as part of the historical record. It is listed as the "replaces" value for dcterms:license term and needs to be there.
Just to let you know where I am on this: I've run the 2017 and 2018 changes through the build script and am now comparing the diffs of the term_version.csv file in this pull request with the generated one. I'm determining whether the difference is something that I missed in this branch's version or if it is a problem with the source data in the rs.tdwg.org that the build script is using. There are a bunch of camelCase/prose differences I need to reconcile, but I think I'm close to getting it right with the build script. |
The DCMI History page http://dublincore.org/usage/terms/history/ uses a version model that differs from what we are using in TDWG. They use the term '"issued" for what we call "current terms", whereas we only use "issued" for term versions. Thus their issued date is what we call the created date and their modified date for the current term is what we call the issued date of the version. These dates have been changed to match the TDWG model. Also, version 003 of dcterms:modified has "Date Modified" as the label, rather than "Modified"
The comment "Access Rights may include information regarding access or restrictions based on privacy, security, or other policies." is found in the comments for dcterms:accessRights, not as part of the definition.
Added a replaces value showing that the deprecated dwctype:Location term was replaced by dcterms:Location
@baskaufs indeed, the QRG build script now spits out the English labels (like Can you also generate a term_versions_localName.csv with that |
@peterdesmet Yes, that is correct. I can easily add the term_localname column as the second column right after iri. And yes, I'm going through every diff that I see and either changing the source rs.tdwg.org data (if I think it's wrong) or the draft term_versions.csv (if I think it is wrong). I will keep going until there are no diffs and then consider the build script to be fully working. So you can operate on the assumption that there will be a term_localname column - I will add it as the last thing. |
@tucotuco One of the major diffs that I have is that the abcdEquivalence values have been removed from all of the obsolete terms (curatorial, digr.org terms, etc.) in term_versions.csv . I still have those values in rs.tdwg.org. I could remove them from the rs.tdwg.org source files, but that seems like a bad idea since it would be throwing away historical information. I'm not sure why it hurts to have that information in the term_versions.csv file since nobody will probably ever care or look at it and it has no effect on building the QRG. Let me know what you want me to do. I'm going to have to stop working on this for a while, so I'll just wait until you give me an answer on this before proceeding. |
I don't know how those equivalents were lost, truthfully. It surely was not
intended. So if you can add them back again, that would be great.
…On Mon, Jul 27, 2020 at 7:57 AM Steve Baskauf ***@***.***> wrote:
@tucotuco <https://github.com/tucotuco> One of the major diffs that I
have is that the abcdEquivalence values have been removed from all of the
obsolete terms (curatorial, digr.org terms, etc.) in term_versions.csv .
I still have those values in rs.tdwg.org. I could remove them from the
rs.tdwg.org source files, but that seems like a bad idea since it would
be throwing away historical information. I'm not sure why it hurts to have
that information in the term_versions.csv file since nobody will probably
ever care or look at it and it has no effect on building the QRG. Let me
know what you want me to do. I'm going to have to stop working on this for
a while, so I'll just wait until you give me an answer on this before
proceeding.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#264 (comment)>, or
unsubscribe
<https://github.com/notifications/unsubscribe-auth/AADQ725IRPQEZEWYW6CY52TR5VMRZANCNFSM4PHXK43Q>
.
|
This all looks good to me. Is it OK to merge? |
When @tucotuco agrees that the series of commits are OK, the first box can be ticked and I will go ahead with merging tdwg/rs.tdwg.org#38, which I think is the only thing blocking #256 . The next steps are for @peterdesmet to decide what changes need to be made in the QRG build script. First, whether to make it work without the |
@tucotuco I think there are still some boxes to tick before merging |
The script that builds |
@baskaufs, you can:
Once that is done, I will revert the build script to work from that file again and we can merge. |
Changed the path of output so that the current term_versions.csv file is replaced by the output. Changed the name of the script itself to generate_term_versions.py . Added the list of terms used to indicate the order of terms in the Quick Reference Guide.
@peterdesmet I have renamed the script as you suggested. I also added a source file that I had forgotten ( I think that when you can verify that the built QRG is OK we would be ready to merge. |
Checking off the box for "modify the QRG build script to use the extra column in the generated_normative_document.csv file so that it makes the labels appear on the QRG as desired. Once that is done, it can replace the term_versions.csv file and this branch can be merged with the master." because it was done in 5aa48c8 |
Checking off the box for "move the script that builds the term_versions.csv file from https://github.com/baskaufs/msc/blob/master/dwc_workflow/generate_normative_csv.ipynb to some appropriate place in the DwC repo. There are a few path changes that will have to be made in the script Jupyter notebook. Once I merge the changes I've made to the rs.tdwg.org repo, the value of github_baseUri will need to be changed to the master branch. At the end of the script, the save file path will need to be changed from generated_normative_document.csv to term_versions.csv with an appropriate relative path so that the file will end up in the right place relative to where the script is stashed." because it was done in 665ca08 |
In b16d893 I have moved the Jupyter notebook that generates the list of terms document to the Todo:
|
@tucotuco @peterdesmet The link to the updated list of terms document is here. I believe that it now includes all of the changes @tucotuco made and those terms are versioned using today's date. |
Hi @baskaufs, the The
|
checkbox 1: Yes, only checkbox 2: John might know the history of checkbox 3: Not sure what the problem is here. Are you talking about the links in the index? The page source shows them as pointing to local fragment identifiers: and they link to other places in the document as designed: Am I misunderstanding something? checkbox 4: I'm not seeing this problem either. In the page source, the
I tried putting the HTML in Oxygen to validate it, but there are a lot of problems caused by the backticks mixed with Jekyll's automatic hyperlinking. So it's hard to identify problems with the HTML until that is fixed. checkbox 5: I would like to fix this, but I'm running out of time to work on this project now. So if it's "good enough" for now, I guess I would be OK with fixing it later. I'll create an issue, assign it to me and try to get to it before too much time goes by. |
Checkbox 3: no misunderstanding, links seem ok |
Just an FYI. dwc:DwCType is correct. It was a term parallel to dc:type, but for the basisOfRecord vocabulary. All history. |
This branch includes a number of changes that were discussed during and after recent DwC Maintenance Group meeting. It needs to be checked to see what effects the changes have on the Quick Reference Guide. The most obvious thing is that the labels were reverted from camelCase to English prose. This change will definitely have an effect on the QRG unless the script is changed to use the term local names. So it should not be merged until that particular issue is resolved.
The changes involving
dcterms:language
anddcterms:type
should also be looked at carefully. The intent is thatdc:language
should replacedcterms:language
in the record-level terms section,dc:type
should replacedcterms:type
in the record-level terms section,dcterms:language
should move into the UseWithIRI section, anddcterms:type
should disappear from the QRG (since the DwC RDF Guide says thatrdf:type
should be used instead of it in the case where one desires to indicate type by an IRI).If these changes are appropriate, I'll use it as the standard to check whether build script 3 will faithfully generate the file necessary to build the QRG. That can be done before a merge as long as I know that this is ultimately the form we want the "normative document" to take.
This pull request is related to the following issues:
dc:/dcterms:language
andtype
dwc:accordingTo
term history (deprecated)