This repository has been archived by the owner on Jun 16, 2023. It is now read-only.
Updated analysis: use annotator
API in the rna-seq-expression-summary-stats
module
#122
Closed
1 task done
What analysis module should be updated and why?
The
rna-seq-expression-summary-stats
module should be updated to use theannotator
API atanalyses/long-format-table-utils/annotator/annotator-api.R
, afterWhat changes need to be made? Please provide enough detail for another participant to make the update.
Use the long-format table annotator API in an analysis module with the following steps:
OpenPedCan-analysis
or a subdirectory ofOpenPedCan-analysis
. This allows the API functionannotate_long_format_table
to locate annotation data files.source
thelong-format-table-utils/annotator/annotator-api.R
file.tibble::tbl_df
, convert the table totibble::tbl_df
withtibble::as_tibble
. After conversion, carefully check rownames, colnames, column classes (especially factors), and other properties that may affect the correctness of you code.c("Gene_symbol", "Gene_Ensembl_ID", "Disease")
are not all present in the colnames of the table to be annotated, add new columns or rename existing ones to have all these required columns.annotate_long_format_table
to add one or more of the available annotation columns, by specifying thecolumns_to_add
parameter in theannotate_long_format_table
function. Read the documentation comment of the function for usage.Following is an example usage in the
rna-seq-expression-summary-stats
module01-tpm-summary-stats.R
.What input data should be used? Which data were used in the version being updated?
data/gene-expression-rsem-tpm-collapsed.rds
data/histologies.tsv
analyses/independent-samples/results/independent-specimens.rnaseq.primary.eachcohort.tsv
When do you expect the revised analysis will be completed?
1 day.
Who will complete the updated analysis?
@logstar
The text was updated successfully, but these errors were encountered: