Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Export column - c_hgvs_compat #877

Open
davmlaw opened this issue Aug 28, 2023 · 12 comments
Open

Export column - c_hgvs_compat #877

davmlaw opened this issue Aug 28, 2023 · 12 comments

Comments

@davmlaw
Copy link
Contributor

davmlaw commented Aug 28, 2023

https://app.testiny.io/VG/testcases/tcf/81/tc/173/edit

  • Change c_hgvs_full to chgvs_compat
  • Change MVLs to use that
  • Convert inv to delins
  • Also convert LRG to NM_ and Ensembl to RefSeq? (or link/raise another issue for that)
@davmlaw
Copy link
Contributor Author

davmlaw commented Sep 13, 2023

We should be able to turn it from INV to delisn pretty easily in Biocommons code:

Out[81]: VariantCoordinate(chrom='1', start=1637084, end=1637084, ref='TCCCAG', alt='CTGGGA')
In [86]: hgvs_variant = matcher.variant_coordinate_to_hgvs_variant(vc)
In [87]: hgvs_variant
Out[87]: NC_000001.10:g.1637084_1637089inv

In [88]: hgvs_variant._sequence_variant
Out[88]: SequenceVariant(ac=NC_000001.10, type=g, posedit=1637084_1637089inv, gene=None)

In [89]: sv = hgvs_variant._sequence_variant

In [90]: sv.posedit.edit.ref
Out[90]: 'TCCCAG'

In [91]: ref = sv.posedit.edit.ref

In [92]: sv.posedit.edit = NARefAlt(ref=ref, alt=reverse_complement(ref))

In [93]: sv
Out[93]: SequenceVariant(ac=NC_000001.10, type=g, posedit=1637084_1637089delinsCTGGGA, gene=None)

In [94]: str(sv)
Out[94]: 'NC_000001.10:g.1637084_1637089delinsCTGGGA'

@EmmaTudini
Copy link
Contributor

@davmlaw Has this been done? Is it ready for testing? We'll likely need this before the prod deploy

@EmmaTudini EmmaTudini added the HGVS label Oct 5, 2023
@davmlaw
Copy link
Contributor Author

davmlaw commented Oct 5, 2023

No this is still in progress

@davmlaw davmlaw changed the title Export column - chgvs_compat Export column - c_hgvs_compat Oct 27, 2023
@davmlaw
Copy link
Contributor Author

davmlaw commented Oct 27, 2023

Request for Ensembl / LRG is done in https://github.com/SACGF/variantgrid_private/issues/2541

@davmlaw davmlaw self-assigned this May 17, 2024
@davmlaw
Copy link
Contributor Author

davmlaw commented May 20, 2024

Integration is export only, so just need to get MVL to pull this column rather than the old one

davmlaw added a commit that referenced this issue May 20, 2024
@davmlaw
Copy link
Contributor Author

davmlaw commented May 20, 2024

Needs Shariant test

Tesimage.png
t by creating classification against eg NM_000127.3(EXT1):c.1315_1320inv (can just use random coords to make an inv)

To to classification -> export -> Select MVL - ensure that column comes out without an inv (eg c.1315_1320delTCACGTinsACGTGA)

@TheMadBug
Copy link
Member

To confirm, I believe this is only relevant when we switch to biocommons and then refresh all the imported allele infos, as pyhgvs does not resolve things to inv.

As we are looking to switch to biocommons after the next deploy (all Shariant environments still have HGVS_DEFAULT_METHOD="pyhvs")

@davmlaw can you confirm that this is only relevant for biocommons - I found no inv's in the MVL currently.

@davmlaw
Copy link
Contributor Author

davmlaw commented May 21, 2024

Yes, only biocommons has "inv"

@davmlaw
Copy link
Contributor Author

davmlaw commented Jun 5, 2024

Tested vg test (make sure to classify against a RefSeq transcript so it appears in MVL)

MVL c_nomen came out as c.1315_1320delTCACGTinsACGTGA

TheMadBug added a commit that referenced this issue Jun 18, 2024
* Panal App VUS/GUS relationship download

* Panal App VUS/GUS relationship download

* Providing methods to find more useful relationships between term and gene symbol for report

* issue #2647 - liftover refactor

* issue #2647 - start of bcftools liftover

* Issue #2647 - BCFTools liftover

* Issue #2647 - Remove vestigial NCBI remap traces

* Issue #2647 - enable in vg test

* Stop warning that doesn't apply here

* Issue #2647 - be able to create variant for long ones...

* This occasionally failed - execution continued after asking to change window

* issue #1054 - make bcftools liftover part of standard ANNOTATION settings

* issue #1054 - misc liftover issues

* Issue #1052 - analysis template version

* Make VEP have version in path

* Shariant test config - enable BCFTools liftover

* Upload initially shows error wrongly in pipeline race condition

* issue #818 - don't uniq on preprocess anymore

* issue #1056 - vcf_clean_and_filter convert contig headers

* issue #1057 - VEP deployment change to explicit version

* vg test config

* issue #980 - karyomapping use symbol instead of gene

* Renamed counts to Counts(germline only) on overlaps page (#1046)

* Fix issue where alleleOriginToggle was being called with undefined

* WIP fixing conda dependencies

* Update of conda environment (works again in conda)

* Add new evidence key for somatic testing

* Show better quick clinical significance values (for somatic and germline)

* Fix recently introduced bug in suggested terms for an allele

* Allow link for condition matching to appear on a classification even when not in edit mode

* issue #1059 - all variants node

* issue #1059 - all variants node

* Adjust node counts a little

* issue #1043 - export VCF

* Tidy up server status page removing redundant data (now go to Overall Status)

* Fix the evidence key values

* Get c.HGVS showing properly on variant details page again

* issue #758 - configure quick links via settings

* More comments in conda file

* Move condition text match from classification.html to JSON to keep things in sync
More requirements on when you will be linked to condition resolution

* Style fixes for new liftover

* Put liftover date on form

* Move overall data to the splash server status page

* Add Clinical Trails gov to quick links

* Add liftover to settings menu if variants menu disabled

* Update shariant prod settings to use VEP v108

* Make liftover page side menu change based on settings

* Better wording for changing clinical contexts within a discordance

* Tighten up the handling of embedded card or modal (used for triage but affected other modals by mistake)

* Better safety around condition text relationships

* For view metrics use heading of "Users" not "User"

* Add overall stats to liftover pages

* Slight formatting on liftover note

* Rework the internals of the view user activity reports

* More styling on liftover runs page

* More styling on liftover pages

* Tiny column alignment issue on liftover

* linting

* Fix exclusion of blank searches in metrics

* Subtle change of header wording for activity report

* Improve the formatting of missing IDs when making a batch for ClinVar

* Proper int to string handling for missing IDs

* issue #758 - quick links via settings

* Reworking of Lab Differences using new zippable functionality for ExportRow

* Put the allele URL into the lab compare (I think it was there before)

* Enable SEARCH_HGVS_GENE_SYMBOL_USE_MANE for shariant test

* Also try enabling SEARCH_HGVS_GENE_SYMBOL

* Fix bug where condition URL was calculated before seeing if we had condition text

* Enable extra variant annotations for testing

* Attempt at parsing OncogenicityClassification for pulling in ClinVar records

* issue #1193 - analysis audit log

* issue #1193 - analysis audit log

* issue #1193 - analysis audit log

* issue #1193 - analysis audit log

* issue #1193 - analysis audit log

* variantgrid_private#1193 - Audit log for analysis

* Get audit log name right in requirements

* issue #1060  - variant details page, create allele for all variants not just shorty clingen ones

* Sometimes can't reload analysis #1061

* issue #1193 - disable audit log for template copies etc

* issue #1193 - Audit log for analysis - click to expand and see JSON

* issue #1193 - Don't audit node cloning (was causing tests to fail)

* temp fix for #1053 - will work out data fix later

* Better categorising of ClinVar's Somatic/Germline records

* issue #1053 - cohort genotype versions - make common cohort version match cohort.version

* linting - format whitespace

* linting - remove unused imports

* Unused file

* linting - f string w/o interpolation

* linting - use generator

* Increase the ClinVar parser version to purge old cache

* Add django-audit-log to conda environment

* ClinVar REcords More work on parsing condition

* issue #1053 - missing import, make sure we retrieve only 1 cgc per cohort

* issue #877 - Export column - c_hgvs_compat

* Add MeSH as a non-local ontology set

* Ability to render multiple conditions against a single ClinVar record

* Don't abbreviate ref/alt in VCF

* Fix bug with filtering out non-human clinvar records

* Count homo sapiens, homo-sapiens, homosapiens the same as human for ClinVar records

* A script to close manually raised flags

* Raise classification change flag for both Classification and Somatic Clinical Significance

* Filter based on creator of flag

* Added prefixed clinvar batch export CB_ and clinvar export CE_ search functionality (#1066)

* issue #3604 - COSMIC search

* Update changelog

* Tidy up code for close flags, allow it to re-open flags

* Bug fixes for the condition checking code

* panalapp work

* PanelApp Compare: Fixes to stop infinite recurssion, correct method signatures.

* #3591 panel app export

---------

Co-authored-by: TheMadBug <jimmy.andrews@gmail.com>
Co-authored-by: Dave Lawrence <davmlaw@gmail.com>
@TheMadBug
Copy link
Member

I've updated the resolved info admin screen to be able to search on c.HGVS/c.HGVS compatible as well as show if there's a different in the column.

So we can now search for "inv" https://test.shariant.org.au/admin/classification/resolvedvariantinfo/?q=inv and see that the cononical c.HGVS has inv, but the compatible version is del/ins

@davmlaw
Copy link
Contributor Author

davmlaw commented Jan 29, 2025

@TheMadBug - I did a test case on this, it was only when I finished that I saw you had already done it.

Should we have the convention that the test case is in the top description?

I did mine in Classification/Download/MVL - https://app.testiny.io/VG/testcases/tcf/89/tc/184/edit while yours is in Variant

I think yours is "code centric" - I can see why you put it there because of the models. While I was trying to arrange it from a user perspective as the place it matters is the export. We should work out which way we want to put things?

@TheMadBug
Copy link
Member

Oops, I had put the test case at the end of the first comment, not at the end of the description where I normally do it.
That said, going forward, I'm happy to put test case links at the top of descriptions.

I've also added the label Has Test Case to relevant projects and renamed the variantgrid label has_testiny_test_case to Has Test Case for consistency.

So I have tried to reduced redundancy as well as make comprehensive test cases of functionality. A good example is showing the ACMG/Horak codes summaries, they appear both in the classification listing and the CSV download.
https://app.testiny.io/VG/testcases/tcf/88/tc/181/edit I have 5 test steps - sort order, not showing neutral strengths, showing override strengths, point based vs non-point based criteria.

In the test cases that cover the CSV download and the classification listing, I have done a subset of that testing, but reference the comprehensive test plan.
Lastly the whole process caused me to make the admin screen more useful be able to identiy the relevant data.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants