Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Use NCBI reference sequence instead of pangolin #682

Open
EmilyHaag opened this issue Jul 25, 2023 · 1 comment
Open

Use NCBI reference sequence instead of pangolin #682

EmilyHaag opened this issue Jul 25, 2023 · 1 comment
Labels

Comments

@EmilyHaag
Copy link
Collaborator

User suggestion from help email dated 7/21/23:

User suggests switching from pangolin reference data to NCBI sequence, as other resources use the NCBI sequence (BV-BRC, Covariant.org, Nextclade, and ViralZone).

"Between the pangolin and NCBI sequences, there are C8782T and T28144C. The former variant is silent, but the latter encodes ORF8:L84S, which means that the resources cited above and outbreak.info do not show the same data for ORF8 variants. Ultimately, outbreak.info cites ORF8:S84L for most variants, but other resources do not (except for some in ViralZone, which I took from outbreak.info without noticing the discrepancy; these will be corrected soon).
Would it be possible for you to swap the reference sequence so that all resources show the same variant positions for ORF8? This would eliminate the misleading ORF8:S84L shown in most of your variants and harmonize the data."

@EmilyHaag EmilyHaag added the data label Jul 25, 2023
@emmahodcroft
Copy link

Thanks for creating the issue!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants