Constructing pan-genome #1252
Replies: 6 comments 1 reply
-
Minigraph doesn't work well at high divergences. Near the beginning of the log, you should be able to see the mash distances of all your genomes to the reference, and it will even give you a warning if any seem too high. Are you able to share this part of your log? |
Beta Was this translation helpful? Give feedback.
-
Yes, some wheat genomes have higher mash distances from the reference. In this case, can we adjust the parameters to add these genomes to the pan-genome? |
Beta Was this translation helpful? Give feedback.
-
Yeah, there's supposed to be a warning for distances > 0.02 -- strange that it's not in your log. But anyway, You'd have to cut down your inputs to only genomes <0.02 from the reference, or you can make a tree (ex with
|
Beta Was this translation helpful? Give feedback.
-
Thanks for your reply and suggestions! |
Beta Was this translation helpful? Give feedback.
-
Sorry to bother you again, but I have another question about pan-genome construction. I hope you can help me figure it out: Does minigraph-cactus support gradual increase? For instance, by initially building a pan-genome 'Pn' using n sequences, and subsequently adding a new sequence labeled 'x' to extend the pan-genome from 'Pn' to 'Pn+1', instead of starting the construction of the pan-genome from scratch with these n+1 sequences. |
Beta Was this translation helpful? Give feedback.
-
No. You can add genomes in minigraph but not minigraph-cactus. |
Beta Was this translation helpful? Give feedback.
-
Dear Developer.
I am currently using Minigraph-Cactus to perform a wheat pan-genome build on 24 wheat genomes, sequences from specific regions on the same chromosome. However, in the result file (gfa), I find that only part of the wheat genome is included on path, not all of it.
I suspect this may be due to the high level of divergence between the different samples. Although the value of the
minIdentity
parameter has been set to0.5
in the cactus_progressive_config.xml configuration file, it did not achieve the results I was expecting. Therefore, I would like to ask for advice on how to modify the parameter in the configuration file to better handle the issue of divergence between samples and ensure that all chromosomes of the wheat genome are correctly included in the pan-genome, thus enabling a complete pan-genome construction for all samples.Thank you for your time and support.
Best regards.
Beta Was this translation helpful? Give feedback.
All reactions