This repository hosts the INCLUDE LinkML Model, a data model designed to support collaboration and data integration in Down syndrome research. It uses LinkML to harmonize data representation, describe metadata, and define relationships between entities. This structured approach supports data exchange, interoperability, and scalability across research initiatives.
To work with the INCLUDE Model, follow these steps:
- Clone the Repository: Clone this repository to your local machine.
- Install Poetry: If not already installed, install Poetry as a system dependency. Refer to Poetry Documentation for installation instructions.
- Install Dependencies: Run the following command to install project dependencies using Poetry:
    poetry install
After running poetry install, you can proceed with further setup or usage instructions specific to your project.
The repository structure is organized as follows:
src/linkml: Contains the YAML files defining the INCLUDE Model.
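To check which schema files are present, a minimal Python sketch could walk that directory. It assumes the schema files end in `.yaml` or `.yml` and that the script runs from the repository root. `list_schema_files` is a hypothetical helper written for illustration, not part of this repository.

```python
from pathlib import Path


def list_schema_files(schema_dir="src/linkml"):
    """Return the YAML files found under schema_dir, sorted by path.

    Hypothetical helper: the extensions checked here are an assumption
    about how the schema files are named.
    """
    root = Path(schema_dir)
    if not root.is_dir():
        # Nothing to list if the directory is missing (e.g. wrong working directory).
        return []
    files = [
        p for p in root.rglob("*")
        if p.is_file() and p.suffix in {".yaml", ".yml"}
    ]
    return sorted(files)


if __name__ == "__main__":
    for path in list_schema_files():
        print(path)
```

An empty result usually means the script was started outside the repository root.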
- Updated the Assay component to include the latest Virtual Biorepository model.
- Updated the Study component and added new Dataset and DatasetManifest components to support the new Study/Dataset pages in the Data Hub.
- Added over 25 new slots.
- Added 5 new enumerations: EnumClinicalDataSourceType, EnumDataCategory, EnumGuidType, EnumParticipantLifespanStage, EnumResearchDomain.
- Validation: Streamlines data cleaning and validation through a command-line interface (CLI) that lets users specify the data type and file path. The CLI reads, cleans, and validates data against the LinkML-defined models. For more details, use:
    validate-data --help
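When scripting around the CLI, it can help to confirm that the command is available before calling it. The sketch below is one way to do that from Python, assuming `validate-data` is on the PATH (for example, inside the Poetry environment). It only invokes the `--help` option mentioned above. `show_cli_help` is a hypothetical helper, not part of this repository.

```python
import shutil
import subprocess


def show_cli_help(command="validate-data"):
    """Run `<command> --help` and return its output.

    Returns None when the command cannot be found on PATH.
    Hypothetical helper for illustration only.
    """
    executable = shutil.which(command)
    if executable is None:
        return None
    result = subprocess.run(
        [executable, "--help"],
        capture_output=True,
        text=True,
        check=False,  # let the caller inspect the text even on a non-zero exit
    )
    # Most CLIs print help to stdout; fall back to stderr if stdout is empty.
    return result.stdout or result.stderr


if __name__ == "__main__":
    help_text = show_cli_help()
    if help_text is None:
        print("validate-data was not found on PATH")
    else:
        print(help_text)
```

If the command is not found, activating the Poetry environment created by `poetry install` is a reasonable first thing to check.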