Skip to content
This repository has been archived by the owner on Aug 23, 2022. It is now read-only.

Determine whether reads are from PBAT automatically #33

Open
andrewdavidsmith opened this issue Aug 3, 2018 · 3 comments
Open

Determine whether reads are from PBAT automatically #33

andrewdavidsmith opened this issue Aug 3, 2018 · 3 comments

Comments

@andrewdavidsmith
Copy link
Contributor

This should be easy, based on analyzing the nucleotide composition of a collection of reads at the start of the file (e.g. 100k reads).

@bdecato
Copy link
Member

bdecato commented Aug 3, 2018

I think you're right that we can detect it computationally, but is that true in all cases?

Bismark has an explicit --pbat option and fails silently if you try mapping PBAT reads without it. What if we implement the nucleotide analysis function and print a warning if the composition doesn't match up with the flag?

@mengzhou
Copy link
Member

mengzhou commented Aug 5, 2018

I have another question: do we still need the AG_WILDCARD option for mapping single-end? The reads are the same for single-end PBAT.

@mengzhou
Copy link
Member

GSE86903
This dataset has mixed WGBS and PBAT protocols. Specifically, BS-seq_H9_hEpiLC_d4_1 and BS-seq_H9_hEpiLC_d4_2 are WGBS, while others are PBAT. Should be a good one to test this.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants