Skip to content
This repository has been archived by the owner on Jan 27, 2020. It is now read-only.

Update of Ascat and INSTALL_BIANCA.md #628

Merged
merged 45 commits into from
Feb 1, 2019
Merged

Conversation

malinlarsson
Copy link
Collaborator

Update of the Ascat process:
In bin/run_ascat.r, in the call to ascat.runAscat() the parameter gamma is set to 1. Gamma was previously not set, which means that the default value 0.55 was used. Gamma =1 is recommended when running Ascat on NGS data (please see https://www.crick.ac.uk/research/labs/peter-van-loo/software). The documentation of Ascat is updated accordingly.

The documentation of INSTALL_BIANCA.md was updated with some information that I learned during my own installation.

PR checklist

  • This comment contains a description of changes (with reason)
  • If you've fixed a bug or added code that should be tested, add tests!
  • Ensure the test suite passes (./scripts/test.sh -p docker -t ALL).
  • Documentation in docs is updated
  • CHANGELOG.md is updated
  • README.md is updated

Learn more about contributing: https://github.com/SciLifeLab/Sarek/blob/master/.github/CONTRIBUTING.md

Malin Larsson and others added 25 commits June 1, 2017 14:37
This is the first merge after renaming the upstream branch to Sarek.
Lots of changes compared with my previous version.
…better setting when LogR and BAF measurements are based on NGS data
Incorporates recent changes in the documentation of the master project.
Added a specific example of how to run Ascat on GRCh37 on Bianca
Added info about the --containerPaths flag
…le xxxx.cnv.txt that contain the predicted CNV regions. The format of xxxx.cnv.txt is the same, but the change is that all segments are now written out, not only the ones with copy number != 2. This will make it easier to compare the figure xxxx.ASCATprofile with the textfile xxxx.cnv.txt. Now all regions in the figure are included in the text file. I have also updated the documentation to reflect these changes.
small format update
@maxulysse
Copy link
Member

Thanks a lot for the PR.

Have you consider updating to a more recent ASCAT version?
If which cases would we want to change this gamma parameter?

@malinlarsson
Copy link
Collaborator Author

malinlarsson commented Aug 30, 2018 via email

@maxulysse
Copy link
Member

Thanks a lot, that would be marvelous.

Did you manage to make some test data so I can set up some test for ASCAT in Sarek?

@malinlarsson
Copy link
Collaborator Author

malinlarsson commented Aug 30, 2018 via email

@malinlarsson
Copy link
Collaborator Author

Another update included in this PR (forgot to mention it above):
the output file from Ascat containing all the CNV regions (sampled.cnvs.txt) has been updated. It now includes all segments across the genome, also those with normal copy number (i.e. where nMajor = 1 and nMinor =1). In the previous version the segments with normal copy number were excluded, but I think that is confusing since the native output from Ascat includes all segments.

@malinlarsson
Copy link
Collaborator Author

malinlarsson commented Aug 30, 2018 via email

@maxulysse
Copy link
Member

I can check what is there. How long time can the test-runs take?

Typically less than 40 min for a test.
So in our case, it would be from bams to ascat results, as we can't cut it in more pieces.

But I didn't write out specifically that all segments are included since this is the "default output format" from Ascat.

But since this is now the default output format, it could be good to update the docs accordingly and specify that as well, no?

@malinlarsson
Copy link
Collaborator Author

malinlarsson commented Aug 30, 2018 via email

@maxulysse
Copy link
Member

maxulysse commented Aug 30, 2018

OK. I have to check if that is possible. Perhaps if we generate a really “shallow” bam file for the full genome

You don't have to do that with this PR, don't worry too much about it.

Yes, I can ad a note on this.

That would be perfect.

More detailed description of the tumor.cnvs.txt file to explain that all segments are included in the file, also those with normal copy number.
@malinlarsson
Copy link
Collaborator Author

malinlarsson commented Aug 30, 2018 via email

@maxulysse maxulysse changed the base branch from master to dev January 17, 2019 10:06
@maxulysse
Copy link
Member

@malinlarsson
I changed the base for the PR from master to dev, as we're now doing that for Sarek.
I also fixed some merging issues.
If you're OK with all the changes, I'll merge it as soon as I have updated iGenomes as well.
Thanks a lot for the PR

@malinlarsson
Copy link
Collaborator Author

This PR now includes GC correction for Ascat. The process has been documented in docs/ASCAT.md, including how the GC correction files were generated.

Still to do: Update to Ascat 2.5.2, but can we do that in a separate PR?

@maxulysse maxulysse mentioned this pull request Jan 31, 2019
4 tasks
@maxulysse maxulysse merged commit e94cd48 into SciLifeLab:dev Feb 1, 2019
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants