rna-data-products

A Python and CWL pipeline for concatenating HuBMAP RNA-seq [Salmon] data into data products per organ and one large RNA-seq [Salmon] data product.

Pipeline steps

Create a UUIDs TSV file with all UUIDs and HuBMAP IDs of public processed data wanted for the run.
With the UUIDs TSV, create a data directory of all H5ADs needed for the run.
Make an AWS access key id and a secret access key to upload the files to S3 bucket.
Annotate and concatenate a raw data product and a processed data product.
Upload the UMAP and data product metadata to VM

Requirements

Check the list of python packages in docker/requirements.txt

How to run

Step 1

python3 make_uuids_tsv.py [tissue_type]

Step 2

python3 make_directory.py /hive/hubmap/data/ [uuids_file] [tissue_type]

Step 3

cwltool pipeline.cwl --[data_directory] --[uuids_file] --[tissue_type] --[access_key_id] --[secret_access_key]

Step 4

python3 upload_to_ec2.py [umap_png] [data_product_metadata] [ssh_key]

Name		Name	Last commit message	Last commit date
Latest commit History 274 Commits
bin		bin
data		data
docker		docker
steps		steps
LICENSE		LICENSE
README.md		README.md
docker_images.txt		docker_images.txt
install_R_packages.R		install_R_packages.R
make_directory.py		make_directory.py
make_uuids_tsv.py		make_uuids_tsv.py
pipeline.cwl		pipeline.cwl
upload_to_ec2.py		upload_to_ec2.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

rna-data-products

Pipeline steps

Requirements

How to run

Step 1

Step 2

Step 3

Step 4

About

Releases

Packages

Contributors 3

Languages

License

hubmapconsortium/rna-data-products

Folders and files

Latest commit

History

Repository files navigation

rna-data-products

Pipeline steps

Requirements

How to run

Step 1

Step 2

Step 3

Step 4

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages