Reorganized About, added installation in resources, changes in powerpoint
apca committed Nov 11, 2024
1 parent 671375a commit 8c37655
Showing 3 changed files with 22 additions and 14 deletions.
32 changes: 18 additions & 14 deletions about.md
@@ -1,19 +1,5 @@
# About

## [Data Science platform](https://www.biosustain.dtu.dk/technologies/informatics/data-science-platform)

Data science has become essential in both academia and industry for accelerating insight into generated data. As a strategy to integrate high-level analytics, we created a centralized Data Science platform (DSP) that provides support to our researchers while promoting standardized data and data processes.

The DSP team aims to make data science more accessible and inclusive at DTU Biosustain and across the DTU Community. The platform follows a data-centric approach that focuses on data infrastructure, processes, and outputs as ongoing, evolving products rather than one-time projects. Each data product is designed as a multidisciplinary collaboration that spans the entire data lifecycle, pursues standardization and automation, and keeps data usage in mind.

The DSP is based on four pillars:
– Support

– Education

– Innovation

– Tooling

## The course
At the Data Science platform we are integrating with the research data management team and building pipelines for our researchers at DTU Biosustain. To that end, we are building and deploying scalable, portable, and reproducible bioinformatics workflows in Azure using Nextflow. Nextflow is a workflow orchestration tool designed for developing and executing reproducible and scalable data analysis pipelines. Its workflow scripting language is based on Groovy, allowing for easy integration with existing software and tools. Nextflow supports parallelization and can run on multiple computing environments, including local systems, high-performance clusters, and cloud platforms. Nextflow has a strong community in the bioinformatics field and is supported by the nf-core project, which provides a large repository of pre-built, community-maintained bioinformatics pipelines that are optimized for Nextflow. Because of these characteristics, it has been our workflow language of choice, and we would like to teach it to our fellows at DTU Biosustain.
@@ -41,3 +27,21 @@ In this course you will learn:
- nf-core community (113 standardized bioinformatics workflows)

- Resources to keep training

## [Data Science platform](https://www.biosustain.dtu.dk/technologies/informatics/data-science-platform)

Data science has become essential in both academia and industry for accelerating insight into generated data. As a strategy to integrate high-level analytics, we created a centralized Data Science platform (DSP) that provides support to our researchers while promoting standardized data and data processes.

The DSP team aims to make data science more accessible and inclusive at DTU Biosustain and across the DTU Community. The platform follows a data-centric approach that focuses on data infrastructure, processes, and outputs as ongoing, evolving products rather than one-time projects. Each data product is designed as a multidisciplinary collaboration that spans the entire data lifecycle, pursues standardization and automation, and keeps data usage in mind.

The DSP is based on four pillars:

– Support: assisting our research fellows with Statistics, Programming, Data analytics, and Machine learning

– Education: co-organizing Data club with DTU Bioengineering and organizing Data Science workshops

– Innovation: introducing researchers to new computational biology methods and technologies

– Tooling: implementing open-source standard tools

You can contact us at [Data Science platform email](mailto:datascience@biosustain.dtu.dk).
Binary file modified course_contents/IntroNextflowFundamentals.pptx
Binary file not shown.
4 changes: 4 additions & 0 deletions resources.md
@@ -2,6 +2,10 @@

Here we share some resources to keep learning the Nextflow language and nf-core pipelines, and to stay up to date with Nextflow events. We have gone through some of these resources ourselves to learn or to get inspiration for preparing these course contents. They are also useful for occasionally refreshing your knowledge or learning something new.

## Local installation

Please visit <https://training.nextflow.io/envsetup/02_local/>

## Training material

Nextflow training portal (different levels): <https://training.nextflow.io/>