-
Notifications
You must be signed in to change notification settings - Fork 172
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Interpreting result files and selecting files for visualization and DAR analysis #413
Comments
Not one of the developers but I can try helping out. First of all, you have to run After running I am not sure if there is any kind of 'best practice' validation step post-pipeline run, but I personally just trust the Cromwell std err output, so if I see the line By the way, you can find the visualization of the workflow in an svg file generated by As for which IDR-processed output files (narrowPeaks) to use for your downstream analysis, that's something you might need to decide yourself depending on how many biological replicates you did, the nature of the experiment/study, etc. Might be worth discussing with your supervisor. You can also glean what 'overlap', Hope this helps at least. |
Thank you for the detailed explanation. It's always a pleasure read answers like this one =) |
Sorry to reopen this issue, but I was still wondering which file I should use for DAR analysis. Should I use the narrowPeak file in the peak/ folder? Or should I use Overlap Peaks / IDR Peaks? Or are they just to check the reproducibility of the peaks? @sufyazi Thank you! |
I think the narrowPeak files in the peak folder should be fine to use, though I personally just merge the overlap and IDR peak files and use these instead in my analysis. They are just more stringent peaks. But it might depend on what you want to do/see in your analysis so your mileage may vary and you might be fine just using the less stringent, background-filtered (bfilt) peak set file. |
Dear developers,
Firstly, I would like to express my appreciation for the detailed ATAC-seq pipeline that you developed. As a PhD student, I find this tool extremely helpful in analyzing ATAC-seq data. However, as someone who is still not 100% familiarized with all the terms and concepts in this field, I have a few questions that I would appreciate your help with.
Firstly, I am struggling to interpret some of the result files. Is there a tutorial or documentation that can explain each result file in more detail?
In addition, I am also unsure about which files are the final ones for visualization purposes, such as bigWigs or bam files, and which ones are ideal for posterior analysis using csaw. I would greatly appreciate it if you could provide some guidance on this matter.
Lastly, I recently ran the example subset data test ENCSR356KRQ_subsampled to check if the pipeline is running without any issues. Could you provide an updated html as a positive control to validate that everything worked fine? Or is there an output message that confirms that everything went okay? This is my QC file.
qc.html.txt
Best,
Rafael
The text was updated successfully, but these errors were encountered: