Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Docs/fair #376

Merged
merged 1 commit into from
Sep 28, 2021
Merged
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
17 changes: 17 additions & 0 deletions docs/data_life_cycle.rst
Original file line number Diff line number Diff line change
Expand Up @@ -31,3 +31,20 @@ III) Stores
Using the streamlined data set collections from (II), we built a computationally efficient data interface for machine learning on such large distributed data set collection, thus improving **Usage** (step 5):
Specifically, this interface is optimised for out-of-core observation-centric indexing in scenarios that are typical to machine learning on single-cell data.
Read more in our guide to data stores :ref:`distributed_data_rst`.

FAIR data
=========

FAIR_ data is a set of data management guidelines that are designed to improve data reuse and automated access
(see also the original publication of FAIR_ for more details).
The key data management topics addressed by FAIR_ are findability, accessibility, interoperability and reusability.
Single-cell data sets are usually public and also adhere to varying degrees to FAIR_ principles.
We designed sfaira so that it improves FAIR_ attributes of published data sets beyond their state at publication.
Specifically, sfaira:

- improves **findability** of data sets by serving data sets through complex meta data query.
- improves **accessibility** of data sets by serving streamlined data sets.
- improves **interoperability** of data sets by streamlining data using versioned meta data ontologies.
- improves **reusability** of data sets by allowing for iterative improvements of meta data annotation and by shipping usage critical meta data.

.. _FAIR: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4792175/