Skip to content

A curated list for preprocessing, cleaning, mapping and analyzing UK Biobank data.

Notifications You must be signed in to change notification settings

spiros/ukbiobank-resources

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

10 Commits
ย 
ย 

Repository files navigation

UK Biobank Resources

A curated list for preprocessing, cleaning, mapping and analyzing UK Biobank data.


Data schema

  • Tofu - Tofu is a Python tool for generating synthetic UK Biobank data.
  • ukbschemas - Use R to generate a database containing the UK Biobank data schemas.
  • Biobank Read - Python-based tools for the extraction, cleaning and pre-processing.
  • FUNPACK - Python library for pre-processing of UK BioBank data.
  • phemap - Python functions to map between ICD-10 terms and PheCodes.
  • ukb_download_and_prep_template - Template for common processing operations.
  • ukbiobank-loaders - a collection of Python tools to load data into Apache Parquet files developed by BenevolentAI.

Analytical tools


Information resources

Phenotypes

  • Primary care EHR biomarkers -- machine-readable versions (CSV files) of electronic health record phenotyping algorithms for 31 commonly-measured biomakers.

Ops

  • Docker utils - Docker image for UK Biobank utilities: ukbunpack, ukbfetch, ukblink, ukbgene, ukbmd5, ukbconv

Other resources

  • WikiMedMap mapping of phenotype strings from questionnaires in the UK Biobank and from Mendelian diseases in Online Mendelian Inheritance in Man (OMIM) database to eight vocabularies: International Classification of Diseases, Ninth Revision (ICD-9), ICD-10, ICD-O, Medical Subject Headings (MeSH), OMIM, Disease Database, and MedlinePlus.

  • ukbb-srmed mapping of self-reported medication entries to the Anatomical Therapeutic Chemical (ATC) classification system and in the British National Formulary (BNF) coding systems.


Contributing

Your contributions are always welcome!

Please submit a pull request or create an issue to add a new package, library or software to the list.

License

CC0

To the extent possible under law, Spiros Denaxas has waived all copyright and related or neighboring rights to this work.

About

A curated list for preprocessing, cleaning, mapping and analyzing UK Biobank data.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published