An R package to access data from RGB Kew’s APIs.
kewr is meant to make accessing data from one of RGB Kew easier and to provide a consistent interface their public APIs.
This package should cover:
- World Checklist of Vascular Plants
- Plants of the World Online
- International Plant Names Index
- Kew Names Matching Service
- Kew’s Tree of Life
- Kew Reconciliation Service
New sources will be added as they come up.
kewr is not on CRAN yet but you can install the latest development version from GitHub:
# install.packages("devtools")
devtools::install_github("barnabywalker/kewr")
Functions in this package all start with a prefix specifying what action you want to perform and a suffix referring to the resource.
Four of the resources (POWO, WCVP, IPNI, and ToL) are databases storing
flora, taxonomic, nomenclatural, or genetic information. These three
resources all have a search_*
and lookup_*
.
The lookup_
functions can be used to retrieve a particular record by
its unique IPNI ID:
lookup_powo("320035-2")
lookup_wcvp("320035-2")
lookup_ipni("320035-2")
IPNI contains records for authors and publications, which can also be
retrieved using the lookup_ipni
function:
lookup_ipni("20885-1", type="author")
lookup_ipni("987-2", type="publication")
The ToL uses its own ID system. These IDs can be found by first searching the database.
lookup_tol("2717")
All four of these databases can be searched as well:
search_powo("Poa annua")
search_wcvp("Poa annua")
search_ipni("Poa annua")
search_tol("Poa annua")
And all, except the ToL, use filters and keywords for more advanced searches:
search_powo(list(genus="Poa", distribution="Madagascar"),
filters=c("accepted", "species"))
search_wcvp(list(genus="Poa"), filters=c("accepted", "species"))
search_ipni(list(genus="Poa", published=1920),
filters=c("species"))
The number of search results returned are determined by the limit
keyword:
search_powo(list(genus="Poa"), limit=20)
search_wcvp(list(genus="Poa"), limit=20)
search_ipni(list(genus="Poa"), limit=20)
search_tol("Poa", limit=20)
The next page for a set of search results can be requested using the
request_next
function:
results <- search_powo(list(genus="Poa"))
request_next(results)
Tree and gene data can be loaded directly from ToL into R.
For instance, you can load the whole Tree of Life.
load_tol()
Or a gene tree for a particular gene.
gene_info <- lookup_tol("51", type="gene")
load_tol(gene_info$tree_file_url)
Or a FASTA file for a specimen.
specimen_info <- lookup_tol("1296")
load_tol(specimen_info$fasta_file_url)
The corresponding files can also be downloaded for use later or in other programmes.
specimen_info <- lookup_tol("1296")
download_tol(specimen_info$fasta_file_url)
The whole of WCVP can be download to a directory using:
download_wcvp()
The KNMS resource is only used for matching names to records in POWO/WCVP:
match_knms(c("Poa annua", "Magnolia grandifolia", "Bulbophyllum sp."))
Single names can also be matched to IPNI using the KRS resources.
match_krs("Poa annua")
KRS is slower for matching many names, as a request needs to be made for each one. But it has the advantage of allowing more complex matching:
match_krs(list(genus="Solanum", species="sanchez-vegae", author="S.Knapp"))
Each function in this package returns an object that stores the original response as well as the content of the response parsed into a list. This is to give the user as much flexibility as possible and to make debugging things a bit easier.
But this can be hard to use, so all the results objects can be tidied as
a tibble
:
results <- search_powo("Poa annua")
tidy(results)
You can get information about how to cite kewr
by using:
citation("kewr")
You can also get the citation to use for each data service using the different results objects:
r <- search_wcvp("Poa")
kew_citation(r)