Skip to content

Latest commit

 

History

History
32 lines (18 loc) · 3.58 KB

README.md

File metadata and controls

32 lines (18 loc) · 3.58 KB

Welcome to the Vocabularies Group

(Oficially the Best practices for development of vocabularies of values Task Group)

Our goals, in short:

  1. Preparation of a Scoping Document.

  2. Development of a common repository for TDWG vocabularies-of-values.

  3. Development of a current best practice for building of TDWG vocabularies.

  4. Building of at least one exemplary vocabulary.

  5. Collection and assessment of already existing vocabularies across the community. . . . 🔧 In progress 🐛

  6. Identification of domain-specific groups that may be involved in the preparation of vocabularies.

  7. In-depth evaluation of the current state of data shared through aggregators in relation to the use of controlled values. . . . 🔧 In progress 🐛

  8. Preparation of a list of vocabularies needed for terms of the Darwin Core standard.

For more details, you can refer to the Workplan.

How you can contribute

🔸 Add to the list of existing vocabularies any vocabulary that you may know exists.

🔸 If you want to but are not yet a member of the Task Group, send us an email, we will welcome you in the discussions.

Motivation

Biodiversity data are increasingly being shared from myriad sources. While the Darwin Core standard defines a set of terms under which data are organized and shared, it does not refer to the actual values used to describe the content of each field. More often than not, distinct sources utilize diverse criteria to populate the fields. While this has allowed data publication broadly, when it comes to data usage, it becomes apparent that such heterogeneity hinders discoverability and use of the data. One way to reduce this variability and improve data use is to provide standardized vocabularies for the community to use. Vocabularies exist for some terms, but are constrained to specific groups or disciplines. Furthermore, there is no standard format defined for the creation of biodiversity data vocabularies for the values captured under Darwin Core terms (hereafter vocabularies-of-values), and no recommended environment in which to do so. This TG aims to create such a framework under the TDWG umbrella, with the ultimate objective of building a corpus of vocabularies of standard values for terms. We identify four target audiences who could potentially benefit from the outcomes of this TG: 1) Data producers (i.e., data collectors) who could capture data using the controlled vocabularies through pick lists and could impart valuable information more efficiently; 2) Data custodians (e.g., staff in museum collections) who could manage, provide and use data more efficiently; 3) Data aggregators who could use the vocabularies to provide infrastructure for data filtering; 4) Data users for whom more effective filtering would represent improvement in data fitness for use. Currently, other initiatives concerning biodiversity data vocabularies-of-values are scattered, and no other TDWG task group is addressing the issue of the structure of such standards. As this TG is directly related with data quality and use, the Data Quality Interest Group seems the appropriate environment for this work.