-
Notifications
You must be signed in to change notification settings - Fork 70
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Is the pipeline idempotent? #201
Comments
Also |
Also missing random seed in |
The pipeline's Wham-related irreproducible output is related to using a random number generator with a floating seed in the Wham source code. The steps we took to fix this issue are:
|
There is a small discrepancy between multiple executions of
GATKSVPipelineSingleSample
on the same input with the same parameters. The discrepancy is observed in the following files:It seems the discrepancy originates from the
Whamg workflow
. The workflow useswhamg
(whamg source code
), which seems to have some randomness involved (zeeev/wham#51).I ran the
GATKSVPipelineSingleSample
workflow multiple times without call-caching (i.e., setting"read_from_cache" : false
and"write_to_cache" : false
in the options files), and ransvtest vcf
on the final VCFs of the separate executions comparing all to a common baseline. If the pipeline was idempotent, I would expect the same difference/similarity to the baseline in all the executions, however, the executions slightly differ from each other compared to the baseline. Some of the output metrics ofsvtest vcf
comparing a test run with a given UUID vs. the baseline are as follows.The text was updated successfully, but these errors were encountered: