This repo contains work in progress on creating Mannheim Data Bibliography (or datagraphy), i.e., a registry of metadata of all data, created or collected by the employees of the University of Mannheim.
- FAIRness (Findability, Accessibility, Interoperability, and Reusability) of data at the University of Mannheim,
- a single point of access to metadata of data, created or collected by the employees of the University of Mannheim (MADATA),
- performance evaluation using metrics for data sharing, data reuse and data citation in the University of Mannheim,
- improving culture of data sharing, data reuse and data citation in the University of Mannheim.
Mannheim Data Bibliography MADATA contains only a part of all (meta)data published by the employees of the University of Mannheim. All other data, created or collected by employees of the University of Mannheim, are stored somewhere else.
We want to:
- collect metadata of
data
, created or collected by employees of University of Mannheim, - update the data bibliography regularly,
- store the collected metadata in MADATA,
- evaluate metrics for data sharing, data reuse and data citation,
- create data dashboard.
What can data
mean?
- dataset,
- software (code, script, package),
- executable notebook (Jupyter notebook),
- data management plan,
- software management plan,
- workflow,
- model,
- figure,
- table,
- image,
- video,
- text,
- interview,
- project (e.g., https://doi.org/10.3886/E124902V2),
- reproducibility (or replication) package.
Controlled vocabulary for resource types of da|ra
- Audiovisual
- Collection
- DataPaper
- Dataset
- Event
- Image
- InteractiveResource
- Model
- PhysicalObject
- Service
- Software
- Sound
- Text
- Workflow
- Other
The resource types for DataCite DOIs:
- Audiovisual
- Book
- BookChapter
- Collection
- ComputationalNotebook
- ConferencePaper
- ConferenceProceeding
- DataPaper
- Dataset
- Dissertation
- Event
- Image
- InteractiveResource
- Journal
- JournalArticle
- Model
- OutputManagementPlan
- PeerReview
- PhysicalObject
- Preprint
- Report
- Service
- Software
- Sound
- Standard
- Text
- Workflow
- Other
Scope of work and plan:
- Extract dataset mentions, "Data availability statements" and "Supplemented Materials" from publications at MADOC, i.e., Mannheim University bibliography and publication server:
- From full texts in PDF-files at MADOC
- From external online versions of publications (using DOIs from MADOC)
- Add metadata for data resources (e.g., databases, digital editions, etc.) hosted at uni-mannheim.de to MADATA
Plan:
- Harvesting metadata of data from data repositories and metadata portals
- Searching for "University of Mannheim" or "Mannheim University" or "Universität Mannheim" ⏳
- Searching for names of employees of University of Mannheim ⏳
Repositories and portals:
-
External data repositories with links to search queries "University of Mannheim":
- Zenodo
- GESIS Search
- Dataverse
- Figshare
- GitHub and GitLab
- CESSDA Data Catalogue
- repositories described in subject-specific publication policies
-
Metadata portals with links to search queries "University of Mannheim":
See folder ./metadata/
.