- Bump version for Bioc 3.19.
- Use GHCR instead of Dockerhub, update vignette accordingly.
- Increase code coverage.
- Catch case where no genes are found in the species in
map_genes()
& test case added.
- Added yeast (Saccharomyces cerevisiae) as a valid taxonomy choice.
remove_all_nas
- Can handle multiple cols.
plot_orthotree
- New arg
clades_rotate
- New func:
rotate_clades
- New arg
add_columns
- Handle both vectors and columns.
sort_rows_func
- Handle both vectors and columns.
filter_gene_df
- Avoid coercing single-col dataframe into vector.
- Flagged in #34
- Fix
test-report_orthologs
- Recognize either Gene.symbol or input_gene cols depending on when ortholog conversion was done.
- Fix
test-convert_orthologs
- Line 99 test had wrong number of cols.
map_genes_planosphere
- Add backup download strategy.
- Bump version.
convert_orthologs
- Can now take a user-supplied gene_map to use instead of gprofiler/homologene/babelgene.
- Lets user take advantage of all of the many:many ortholog mapping functions for organisms not covered in any of the databases (eg. earthworm).
- New helper function:
set_gprofiler
(to use beta version) map_genes
- Can now translate planarian gene synonyms
using subfunction:
map_genes_planosphere
.
- Can now translate planarian gene synonyms
using subfunction:
convert_orthologs
- Now returns original data when
input_species
==output
species (after performing method-specific species mapping) ANDstandardise_genes==FALSE
.
- Now returns original data when
add_synonyms
- When
syn_map
isNULL
, will use thegene_map
instead.
- When
map_orthologs_custom
- Remove
if(input_species==output_species) return(NULL)
- Remove
drop_non121
- Don't run grouped slicing when "input_gene","ortholog_gene" are the only cols.
- Remove unnecessary Suggests
map_species
get_all_orgs
: Whenspecies=NULL
, now returns an extra columns called "scientific_name_formatted".format_species_name
: New argsremove_parentheses
,remove_subspecies
,remove_subspecies_exceptions
report_orthologs
- Make much more efficient by only querying
ref_genes
once. - Added new internal func
report_orthologs_i
instead of recursion to make this easier. - Ensure that
map
andreport
get rbound separately and returned according to thereturn_report
arg.
- Make much more efficient by only querying
format_species
- Export function that was previously named
format_species_name
.
- Export function that was previously named
all_species
:- New exported function
- Originally implemented a version of this in
EWCE::list_species
, but decided to extend it and export it here.
get_silhouettes
- Previously was internal func:
gather_images
. - Now an exported function.
- Previously was internal func:
plot_orthotree
- Add "Invertebrates" to default
clades
- Add "Invertebrates" to default
- Update README to showcase more functions.
drop_non121
- New arg
symbol_only
to ONLY consider gene symbols (not ensemble IDs) when identifying non-121 orthologs. - This make a drastic difference in the number of 1:1 orthologs that get dropped!
- New arg
gather_images
- Update to use newly released
rphylopic
1.0.0 which uses the new phylopic API. - Add another
tryCatch
for when the SVG is available but not the png.
- Update to use newly released
is_human
- Add "9606" and "homo sapiens sapiens" species ID to list of options.
all_genes_babelgene
- Don't filter by support when speices is human,
because this column will always be
NA
since it's irrelevant for humans.
- Don't filter by support when speices is human,
because this column will always be
- Fix unit tests:
- report_orthologs
dMcast
- Fix
stats::pass
-->stats::na.pass
. Weirdly, only a problem on Linux. Did base R change a fundamental function name?
- Fix
- Bumped to 1.5.1 for Bioc devel 3.17
- Merged upstream devel.
- Now using
rworkflows
for GHA.- Removed Dockerfile
- Host
orthogene
data resources on Zenodo: - Upgrade TimeTree phylogeny to v5 (2022):
- 50k+ species --> 137k+ species!
- Replace
dplyr::%>%
usage with|>
- Add CITATION file
prepare_tree
:- Ignore species name case and trim "'" when filtering tree.
map_species
:- Add
trimws
step to remove flanking " " or "'".
- Add
- Remove
Matrix.utils
since it's now deprecated.- Reimplement the
dMcast
function as a new internal function withinorthogene
, since that's the only function I use fromMatrix.utils
.
- Reimplement the
- Fix GHA workflow now that r-lib/actions@master has been removed.
- Make
test-map_orthologs_babelgene
less stringent with the number of genes expected.
- Add inst/grofiler_namespace.csv.gz for documentation purposes.
create_background
:- Ensure user-supplied
bg
gets used: #22
- Ensure user-supplied
- Properly document internal data so that
devtools::document
doesn't expect them to be exported objects.
plot_orthotree
: Pass uptree_source
arg.
aggregate_mapped_genes
:- Pass up additional args from
map_genes
. - Add
map_orthologs
as a way to creategene_map
automatically, whengene_map=NULL
andinput_species!=output_species
. - Split
species
intoinput_species
andoutput_species
args. - Change
method
-->agg_method
, and usemethod
to pass tomap_orthologs
instead. - Pass up additional args from
map_orthologs
. - Add link to detailed explanation of matrix aggregation/expansion
in
many2many_rows
docs. - Automatically pick best method for many:1 or many:many mapping.
as_integers
: new arg that usesfloor
.- Rename
FUN
toagg_fun
.
- Pass up additional args from
- Add new data
gprofiler_namespace
. Used to validatetarget=
arg ingprofiler2
functions. - Upgraded
aggregate_mapped_genes
:- Can now used
gene_map
made bymap_orthologs
ormap_genes
. - Can now handle many:many relationships.
- Will automatically pick the best method to perform aggregation and/or expansion.
- Can now used
- Removed internal function
aggregate_mapped_genes_twice
- Extracted aggregation args from
non121_strategy
and placed them in their own new own (agg_fun
) since these options are no longer mutually exclusive due to many:many expansion/aggregation. - Pass up
as_DelayedArray
- Bump to v1.3.0 and R >=4.2 now that we're developing on Bioc 3.16.
- Add ISSUE_TEMPLATE.
prepare_tree
:- Add
tree_source
options: path / URL / OmaDB / UCSC / timetree
- Add
map_genes
: Fix report at completion.- Add safeguards against using aggregation when
gene_df
isn't a matrix. - Removed
DelayedMatrixStats
Import (no longer needed). - Fix all unit tests and examples after making all updates.
- Recognize sparse/dense matrix or delayedarray in
check_agg_args
.
- Make
test-map_orthologs_babelgene
less stringent with the number of expected genes.
- Increment to v2.0 now that Bioc 3.15 is the RELEASE.
map_orthologs_babelgene
- Add "Bad credentials" check for
piggyback
. - Add
use_old
as an optional arg so I can switch to more recent versions ofbabelgene::orthologs_df
if need be. - Use updated built-in
babelgene::orthologs_df
by default. - Throw error if trying to map between two non-human species.
- Filter support==NA mappings by default, not but support>=2
like
babelgene
does by default (even whenbabelgene::orthologs(min_support = 1)
). - See here for discussion of discrepancies with babelgene maintainer: igordot/babelgene#2
- Add "Bad credentials" check for
- Removed
aggregate_rows_delayedarray
as it wasn't being used and was far less efficient than the other methods anyway (which are also compatible with DelayedArray matrices anyway). * New unit tests:load_data
aggregate_mapped_genes(method='stat')
sparsity
infer_species
- Remove
source_all
as it included alibrary
call.
- Update GHA
- Fix failing benchmarking tests.
convert_orthologs(method="babelgene")
now gets gene mappings fromall_genes_babelgene
insteadbabelgene::orthologs
(which doesn't seem to work very well, despite being dedicated for this purpose).map_species
:- Avoid running this function redundantly when nested in multiple layers of other functions.
common_species_names_dict
now return "scientific_name" by default, instead of "taxonomy_id"- Match
map_species
method to whatever method is being used in the function it's wrapped within, to avoid dropping species due to naming differences. - Add "id" column (e.g. "celegans") to all org databases to enhance their searchability.
- Add
map_species_check_args
.
- Ensure proper method-specific
output_format
when passing species to other functions.
plot_orthotree
: Automated plotting of phylogenetic trees with 1:1 ortholog report annotations. Includes several subfunctions:prepare_tree
(exported): Read, prune and standardise a phylogenetic tree.gather_images
(internal): More robust way to find and import valid phylopic silhouettes. Will make PR requests torphylopic
andggimage
/ggtree
to include this functionality.
- Added unit tests for
report_orthologs
, especially whenmethod="babelgene"
. - GitHub Actions:
- Merge both GHA workflows into one, as implemented in
templateR
.
- Merge both GHA workflows into one, as implemented in
- Added citation info to README.
- Save
all_genes_babelgene
ortholog data to orthogene-specific cache instead of tempdir to avoid re-downloading every R session.
- Made GHA less dependent on hard-coded R/bioc versions.
- Now on Bioconductor release 3.14.
- Docker containers automatically built and pushed to DockerHub via GitHub Actions.
- Dockerfile provided to build and check any R package efficiently with AnVil.
- CRAN checks and Bioc checks run via GitHub Actions.
- Added documentation on using Docker container to README.
- Documentation website now automatically built via GitHub Actions.
- Code coverage tests now automatically run and uploaded via GitHub Actions.
- Replaced R-CMD GHA with bioc-check GHA.
- Added new badges.
- Adjusted vignette yamls to make resulting htmls smaller.
orthogene
now supportsDelayedArray
objects asgene_df
input.create_background
now usesall_genes
when all 3 species are the same.
- Added new function
create_background
. - Added new function
infer_species
. report_orthologs
andconvert_orthologs
can now handle cases whereinput_species
is the same asoutput_species
.- Add internal function
get_all_orgs
to easily list all organisms from different packages. - Added
all_genes
method "babelgene".
report_orthologs
no longer throws error due to not findingtar_genes
.
- Allow all messages to be suppressed in
report_orthologs
.
- License switched to GPL-3 (to be compliant with Bioc).
- New method "babelgene" added to
convert_orthologs
.
- License switched to GPL3 (>=3).
GenomeInfoDbData
now required.
orthogene
released to Bioconductor.