Filter mouse reads from PDX samples #407

ssamberkar · 2020-04-11T12:33:07Z

Hi,

I have some mouse PDX samples for which I need to run a standard DGE workflow later. While the standard nf-core rnaseq pipeline is more than enough for that purpose, it is missing a mouse read filtering step for my use case.
@drpatelh suggested a Kraken-based solution.

I was wondering if someone could suggest a process using BBSplit from BBMap instead (https://bioinformaticsonline.com/pages/view/35033/bbsplit-read-binning-tool-for-metagenomes-and-contaminated-libraries).

Thanks.

apeltzer · 2020-04-11T15:16:50Z

From personal experience, I'd also rather jump on the kraken2 train.

It's more widely supported and used by more people, and therefore also better tested. I also doubt there will be a problem getting up-to-date kraken2 DBs / bioconda scripts, whereas all of the bb* tools are from a single developer who might just stop development at some point.

drpatelh · 2021-09-29T08:32:07Z

Fixed in #700

drpatelh · 2021-09-29T08:35:41Z

Docs will be here after v3.4 of the pipeline has been released.

So to summarise, you provide the pipeline with --bbsplit_fasta_list, pointing to a file with one name,path entry per line in the format:

mm10,/path/to/mm10.fa
ecoli,/path/to/ecoli.fa
sarscov2,/path/to/sarscov2.fa

This means you can use custom names for your contaminant genomes, and the reads assigned to the main reference will always be called *primary*fastq.gz.
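For anyone who wants to sanity-check such a list before launching a run, here is a minimal sketch in Python. It only assumes the two-column name,path layout shown above; the function name `parse_bbsplit_fasta_list` and the specific checks (exactly two fields, no empty values, no duplicate names) are my own illustration and are not part of the pipeline.

```python
import csv
import io


def parse_bbsplit_fasta_list(text):
    """Parse 'name,path' lines into a dict (hypothetical helper, not pipeline code)."""
    genomes = {}
    for row_number, row in enumerate(csv.reader(io.StringIO(text)), start=1):
        # Skip blank lines or lines containing only whitespace.
        if not row or all(not field.strip() for field in row):
            continue
        if len(row) != 2:
            raise ValueError(f"row {row_number}: expected 'name,path', got {row!r}")
        name, path = (field.strip() for field in row)
        if not name or not path:
            raise ValueError(f"row {row_number}: empty name or path in {row!r}")
        if name in genomes:
            raise ValueError(f"row {row_number}: duplicate genome name {name!r}")
        genomes[name] = path
    return genomes


# The example list from this comment, with placeholder paths kept as-is.
example = """mm10,/path/to/mm10.fa
ecoli,/path/to/ecoli.fa
sarscov2,/path/to/sarscov2.fa
"""

genomes = parse_bbsplit_fasta_list(example)
for name, path in genomes.items():
    print(f"{name}\t{path}")
```

The sketch does not check that the FASTA files exist, since the paths here are placeholders; adding an `os.path.isfile` check would be a natural extension for real files.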

The BBSplit index will have to be built at least once with this pipeline and then can be re-used for future runs so it doesn't have to be re-built over and over again.

apeltzer added the enhancement label Apr 11, 2020

drpatelh added this to the 3.4 milestone Sep 22, 2021

drpatelh mentioned this issue Sep 24, 2021

Add BBSplit for the removal of genomic contaminants #700

Merged

drpatelh closed this as completed Sep 29, 2021
