-
Notifications
You must be signed in to change notification settings - Fork 493
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Spike: Investigate how Dataverse stakeholders and users need to collect and use funder metadata #4859
Comments
While updating the crosswalk I saw that when you use an Atom entry (XML) to create a dataset, funder metadata is mapped to the Contributor field, with Contributor Type being set to "Funder". So if we removed Contributor Type "Funder," the mapping done when an Atom entry is used to create a dataset would need to change so that funding info is mapped to the "Funding Information" field. |
@pdurbin @jggautier There seems to be a question of where this should go. This issue is in the deliverable backlog but under 1.5.1. I'm not an expert here. Is it better addressed here or under NIH OTA 1.2.1 ? |
I think this should be worked on as part of any effort to improve how Dataverse collects and exports funding metadata about datasets, which I think is a goal of NIH OTA 1.2.1 so I think it should be addressed in 1.2.1. |
No strong opinion. It could be worked on under either. |
Thanks @pdurbin. I updated the original comment. |
I'm talking with depositors in the Harvard repository who've used both fields in the same dataset most often, in order to learn why. I spoke with a manager from the WorldFish repository. They've used both fields only because when they create datasets and include funder names in the metadata, they often use a different platform instead of entering metadata in the Dataverse deposit form. And in the platform's deposit page, the metadata field for funder names, called "Donor", is mapped to Dataverse's "Contributor" field and given the "Funder" contributor type. We spoke about how they should update their platform so that they use the Funder Information field instead, and we'll schedule another meeting, hopefully one including developers of their platform, to review the changes being made to the Funder Information field (#9150). They said it's fine if we move the funder names in their datasets' Contributor field to the Funder Information field. This week I'll be meeting with the manager of another collection that most often adds funding metadata to their datasets to learn why they've used both fields. |
I reviewed the metadata I collected from most known Dataverse installations in October 2022 (https://doi.org/10.7910/DVN/DCDKZQ, version 12) to learn which datasets have values in both fields. While Harvard Dataverse had the most of these kinds of datasets at the time (250), other Dataverse installations also have datasets with funder names in both fields. Here's a CSV file listing the datasets, which installation they're published in, and the funder names entered in both fields: I've emailed the Dataverse Google Group to try to learn, from as many Dataverse installations as possible, why both fields were used and what we should consider when moving the funder names in the Contributor fields to the Funding Information fields. See this GitHub issue's original post, which I've been updating with a more detailed proposal. |
All datasets in UVa Dataverse have "Funder" information in the grantNumber block. Examples: BUT have made this block "displayoncreate" = TRUE AND in hopes to make what goes in that field clearer (to US), I have changed the title and the description
|
I just realized that this was removed from a Dataverse_Funded_Deliverables list last month, but I'm not sure what that means. @mreekie could you write about what that means? I'm wondering if this will be addressed as part of efforts, such as #9150, to improve the quality of funding metadata in Dataverse repositories. I think it should; trying not to let it fall through the cracks. |
I will follow up with @mreekie to ascertain where this issue should be moved (e.g., back in to Global Backlog related to NIH deliverable) |
A couple other brand new funding-related issues, spawed from #9150 (just closed) I believe: |
sizing:
|
|
Just an update about the meeting @cmbz mentioned. This Friday @cmbz, @scolapasta and I will be talking about this while planning for how Dataverse should follow a set of metadata recommendations from NIH GREI. |
2023/10/16
|
Just an update:
|
More updates:
|
More updates: While the CGIAR and Borealis folks are discussing, I emailed Steve McEachern to share what I learned from a review of funder metadata in ADA's repository and wrote that I'd share in this GitHub issue. To reiterate and expand on the great points Steve made in June:
When looking at ADA's and other installation's funder metadata, I also noticed and at least want to acknowledge these things:
|
This GitHub issue is in Dataverse SODHA's "Santa's watching" list, so I emailed those folks to learn about their interests in this issue. Our regular contacts for this installation, Benjamin Peuch and Youssef Ouahalou, are no longer working on this installation, so I emailed the installation's general email address as Youssef suggested. As with Borealis and CGIAR, I'm waiting to hear back from them and I'll follow up on the first week of January after the winter break. |
Just an update on progress so far. Most of the discussion happening in the GitHub issue at #10196 involves user goals we'll need to consider when we're thinking about a redesign of how Dataverse collects and distributes funding metadata. I'm also trying to find time to chat with @stevenmce about what he wrote about ADA's needs and goals. |
2024/07/10
|
Yes definitely! I'm going to close this issue, since the discovery research that the community was doing last winter effectively ended in April when the UX WG started planning and executing the design sprint (being tracked in IQSS/dataverse-pm#127 and GitHub issues listed in that issue). We've been using what we've learned in this spike GitHub issue and in #10196 as we plan for how to evaluate the success of a redesign of the Citation metadata block and use of the external controlled vocabulary functionality and as we consider different design ideas for addressing the goals driving the work recorded in this GitHub issue - improving the experience of adding funding metadata and of making it easier to find datasets by funders, in part by resolving the issues caused by having two places on the dataset deposit form where people can record who funded the deposit and keeping in mind what we learned from @stevenmce about the value of thinking of a funder as a kind of contributor. More broadly, I hope that the design sprint idea we're working on can help us more effectively research, like what was done for this spike issue, by timeboxing research and setting other expectations for how much resources we'll need from stakeholders who are vital to our shared understanding of goals. |
In Dataverse 4.x Citation metadatablock, in the Contributor metadata field, there's a "Funder" contributor type:
The "Funder" type comes from DataCite's list of contributor types, added in their 3.x schema. I think we should remove the contributor type "Funder" because:
It's a duplicate of Dataverse's "Funding Information Agency" field....
...and it's probably confusing depositors.
This complicates metadata exporting and makes it harder to find data based on who funded the research. For example, if we send funding metadata to DataCite, it won't except metadata that includes "Funder" as a Contributor Type. Newer versions of the DataCite standard don't include a "Funder" contributor type. (It was deprecated when a FundingReference property was added, so that more information about funding could be included in subproperties of FundingReference.)
Definition of done:
The text was updated successfully, but these errors were encountered: