Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Taxonomic levels Genus names at Family rank #13

Open
microsud opened this issue Oct 16, 2023 · 1 comment
Open

Taxonomic levels Genus names at Family rank #13

microsud opened this issue Oct 16, 2023 · 1 comment

Comments

@microsud
Copy link

Dear Authors,
Congrats on this huge task and thanks for creating this resource.
When exploring the data at family level, I noticed an issue regarding taxonomy that was highlighted couple of year back in the dada2 silva database. I am not sure if the source database used in this study is from this repository but there seems to be the same issue.
mikemc/dada2-reference-databases#1

# BiocManager::install('seandavi/MicroBioMap')
library(MicroBioMap)
library(mia)
library(dplyr)
cpd <- getCompendium()
saveRDS(cpd, 'MicroBioMapDataTSE.rds')

vcpd <- mia::subsetByPrevalentFeatures(cpd, 
                                       detection=0.0001, 
                                       prevalence=0.01,
                                       as_relative = TRUE)

dplyr::count(as.data.frame(rowData(vcpd)), family)

examples of genus level names at family level are Anaerococcus, Ezakiella, Finegoldia, Parvimonas, Peptoniphilus etc

Best wishes,
Sudarshan

@rabdill
Copy link
Collaborator

rabdill commented Jan 8, 2024

Hi Sudarshan, thank you for bringing this to our attention. We did use SILVA v138.1 in the original pipeline, but you're right that we hadn't accounted for the additional "bad taxa." We've made a new data release (v1.0.1) that fixes these by putting the existing names at the proper level. We filled in the gaps with "(unclassified)" rather than filling in additional taxonomic information that in some cases conflicted with other SILVA info. It's available on Zenodo and will be integrated into the next package release.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants