Skip to content

Commit

Permalink
Handle scRNA biotype; Add AR to uniprot isoforms
Browse files Browse the repository at this point in the history
  • Loading branch information
ckandoth committed Jul 20, 2020
1 parent cd51d65 commit e1f5744
Show file tree
Hide file tree
Showing 3 changed files with 6 additions and 3 deletions.
1 change: 1 addition & 0 deletions data/isoform_overrides_uniprot
Original file line number Diff line number Diff line change
Expand Up @@ -874,6 +874,7 @@ ENST00000297988 AQP7 NM_001170.1 CCDS6541.1
ENST00000219660 AQP8 NM_001169.2 CCDS10626.1
ENST00000219919 AQP9 NM_020980.3 CCDS10165.1
ENST00000156471 AQR NM_014691.2 CCDS42013.1
ENST00000374690 AR NM_000044.3 CCDS14387.1
ENST00000377045 ARAF NM_001654.4 CCDS35232.1
ENST00000393609 ARAP1 NM_001040118.2 CCDS41687.1
ENST00000303965 ARAP2 NM_015230.3 CCDS3441.1
Expand Down
1 change: 1 addition & 0 deletions data/isoform_overrides_uniprot_from_biomart_91
Original file line number Diff line number Diff line change
Expand Up @@ -874,6 +874,7 @@ ENST00000297988 AQP7 NM_001170.1 CCDS6541.1
ENST00000219660 AQP8 NM_001169.2 CCDS10626.1
ENST00000219919 AQP9 NM_020980.3 CCDS10165.1
ENST00000156471 AQR NM_014691.2 CCDS42013.1
ENST00000374690 AR NM_000044.3 CCDS14387.1
ENST00000377045 ARAF NM_001654.4 CCDS35232.1
ENST00000393609 ARAP1 NM_001040118.2 CCDS41687.1
ENST00000303965 ARAP2 NM_015230.3 CCDS3441.1
Expand Down
7 changes: 4 additions & 3 deletions vcf2maf.pl
Original file line number Diff line number Diff line change
Expand Up @@ -29,7 +29,7 @@
Lys K Met M Phe F Pro P Ser S Thr T Trp W Tyr Y Val V Xxx X Ter * );

# Prioritize Sequence Ontology terms in order of severity, as estimated by Ensembl:
# http://useast.ensembl.org/info/genome/variation/predicted_data.html#consequences
# https://ensembl.org/info/genome/variation/prediction/predicted_data.html
sub GetEffectPriority {
my ( $effect ) = @_;
$effect = '' unless( defined $effect );
Expand Down Expand Up @@ -93,7 +93,7 @@ sub GetEffectPriority {
}

# Prioritize the transcript biotypes that variants are annotated to, based on disease significance:
# All possible biotypes are defined here: http://www.gencodegenes.org/gencode_biotypes.html
# All possible biotypes are defined here: https://www.gencodegenes.org/pages/biotypes.html
sub GetBiotypePriority {
my ( $biotype ) = @_;
$biotype = '' unless( defined $biotype );
Expand All @@ -117,8 +117,9 @@ sub GetBiotypePriority {
'sRNA' => 3, # Non-coding RNA predicted using sequences from RFAM and miRBase
'scaRNA' => 3, # Non-coding RNA predicted using sequences from RFAM and miRBase
'rRNA' => 3, # Non-coding RNA predicted using sequences from RFAM and miRBase
'scRNA' => 3, # Non-coding RNA predicted using sequences from Rfam and miRBase
'lincRNA' => 3, # Long, intervening noncoding (linc) RNAs, that can be found in evolutionarily conserved, intergenic regions
'lncRNA' => 3, # Long, noncoding (lnc) RNAs
'lncRNA' => 3, # Replaces 3prime_overlapping_ncRNA, antisense, bidirectional_promoter_lncRNA, lincRNA, macro_lncRNA, non_coding, processed_transcript, sense_intronic and sense_overlapping
'bidirectional_promoter_lncrna' => 3, # A non-coding locus that originates from within the promoter region of a protein-coding gene, with transcription proceeding in the opposite direction on the other strand
'bidirectional_promoter_lncRNA' => 3, # A non-coding locus that originates from within the promoter region of a protein-coding gene, with transcription proceeding in the opposite direction on the other strand
'known_ncrna' => 4,
Expand Down

0 comments on commit e1f5744

Please sign in to comment.