Skip to content

Commit

Permalink
added FAIR data description into docs (#376)
Browse files Browse the repository at this point in the history
  • Loading branch information
davidsebfischer authored Sep 28, 2021
1 parent 6d0eb83 commit 533bfbc
Showing 1 changed file with 17 additions and 0 deletions.
17 changes: 17 additions & 0 deletions docs/data_life_cycle.rst
Original file line number Diff line number Diff line change
Expand Up @@ -31,3 +31,20 @@ III) Stores
Using the streamlined data set collections from (II), we built a computationally efficient data interface for machine learning on such large distributed data set collection, thus improving **Usage** (step 5):
Specifically, this interface is optimised for out-of-core observation-centric indexing in scenarios that are typical to machine learning on single-cell data.
Read more in our guide to data stores :ref:`distributed_data_rst`.

FAIR data
=========

FAIR_ data is a set of data management guidelines that are designed to improve data reuse and automated access
(see also the original publication of FAIR_ for more details).
The key data management topics addressed by FAIR_ are findability, accessibility, interoperability and reusability.
Single-cell data sets are usually public and also adhere to varying degrees to FAIR_ principles.
We designed sfaira so that it improves FAIR_ attributes of published data sets beyond their state at publication.
Specifically, sfaira:

- improves **findability** of data sets by serving data sets through complex meta data query.
- improves **accessibility** of data sets by serving streamlined data sets.
- improves **interoperability** of data sets by streamlining data using versioned meta data ontologies.
- improves **reusability** of data sets by allowing for iterative improvements of meta data annotation and by shipping usage critical meta data.

.. _FAIR: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4792175/

0 comments on commit 533bfbc

Please sign in to comment.