Skip to content

Latest commit

 

History

History
19 lines (13 loc) · 908 Bytes

README.md

File metadata and controls

19 lines (13 loc) · 908 Bytes

datascripts

Collection of scripts / notebooks to reliably select and curate datasets

Current sources

  • data: Some definition data used to query other datasets. These files are mostly static because they don't change often, but they are versioned just in case.
  • human-kinases: Aggregates different sources that list the complete human kinome and provides a curated CSV file with Uniprot IDs and their origin.
  • kinases-in-chembl: Maps human kinases UniProt IDs (from human-kinases) to ChEMBL target IDs.
  • kinase-bioactivities-in-chembl: Retrieve and curate activities (IC50, Ki, Kd) of small compounds against human kinases in ChEMBL (requires local SQLite copy).

Installation

Create a conda environment using the included environment file:

conda env create -f devtools/environment.yml
conda activate kinodata