Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Filter mouse reads from PDX samples #407

Closed
ssamberkar opened this issue Apr 11, 2020 · 3 comments
Closed

Filter mouse reads from PDX samples #407

ssamberkar opened this issue Apr 11, 2020 · 3 comments
Milestone

Comments

@ssamberkar
Copy link

Hi,

I have some mouse PDX samples for which I need to run a standard DGE workflow later. While the standard nf-core rnaseq is more than enough for the purpose, for my use-case it is missing a mouse reads filtering step.
@drpatelh - suggested a Kraken based solution.

I was wondering if someone can suggest a process with BBmap (https://bioinformaticsonline.com/pages/view/35033/bbsplit-read-binning-tool-for-metagenomes-and-contaminated-libraries)

Thanks.

@apeltzer
Copy link
Member

Out of personal experience, I'd also rather jump on the kraken2 train.

Both because it's more widely supported, used by more people and therefore subsequently also more tested. I also doubt that there will be a problem getting always updated kraken2 DBs / bioconda scripts, whereas all of the bb* tools are from a single developer who might just stop development at some point.

@drpatelh
Copy link
Member

FIxed in #700

@drpatelh
Copy link
Member

Docs will be here after v3.4 of the pipeline has been released.

So to summarise you provide the pipeline with --bbsplit_fasta_list of the format:

mm10,/path/to/mm10.fa
ecoli,/path/to/ecoli.fa
sarscov2,/path/to/sarscov2.fa

This means you can use custom names to name your contaminant genomes and the reads relative to the main reference will always be called *primary*fastq.gz.

The BBSplit index will have to be built at least once with this pipeline and then can be re-used for future runs so it doesn't have to be re-built over and over again.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants