potential data biases #9
What features, tools or functionality would be most helpful in probing your datasets and assessing for biases?

Missing data in trait databases is a persistent problem affecting analyses. The most common approach is to delete cases with missing values, but this can introduce additional biases, reduce the statistical power of analyses, and affect model selection and inference. In these cases imputation might be more appropriate, and a number of approaches have been suggested that make use of both relationships between traits and taxonomic relationships.

proposed tools

taxonomic biases:

imputation: Currently exploring use of … Cross-validated imputation error can also be used to assess the contribution of individual traits to overall imputation error.

Model/trait selection:
Your input is needed! Feel free to leave suggestions on formalising such a process, useful tools and approaches, or get in touch if you have an idea for a feature to add.
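As a rough starting point for the cross-validated imputation error idea, here is a minimal Python sketch of how a per-trait error could be computed. Everything in it is an assumption for illustration: the function names `column_mean_impute` and `cv_imputation_error` are hypothetical, the trait matrix is assumed to be a numeric NumPy array with `NaN` for missing cells, and the column-mean imputer is only a stand-in for whichever imputation method ends up being used here.

```python
import numpy as np


def column_mean_impute(X):
    """Stand-in imputer (assumption): fill NaNs with the column mean of observed values."""
    filled = X.copy()
    col_means = np.nanmean(X, axis=0)
    rows, cols = np.where(np.isnan(filled))
    filled[rows, cols] = col_means[cols]
    return filled


def cv_imputation_error(X, impute=column_mean_impute, n_folds=5, seed=0):
    """Per-trait RMSE obtained by hiding observed cells fold by fold and re-imputing them.

    X is a (species x traits) float array with NaN for missing values.
    """
    rng = np.random.default_rng(seed)
    obs_rows, obs_cols = np.where(~np.isnan(X))
    # Assign every observed cell to one of n_folds roughly equal folds.
    fold = rng.permutation(obs_rows.size) % n_folds

    sq_err = np.zeros(X.shape[1])
    counts = np.zeros(X.shape[1])
    for k in range(n_folds):
        held = fold == k
        r, c = obs_rows[held], obs_cols[held]
        masked = X.copy()
        masked[r, c] = np.nan          # hide the held-out observed cells
        imputed = impute(masked)       # impute them from the remaining data
        diff = imputed[r, c] - X[r, c]
        np.add.at(sq_err, c, diff ** 2)  # accumulate squared error per trait
        np.add.at(counts, c, 1)
    # Traits with no observed values at all will produce NaN here.
    return np.sqrt(sq_err / counts)
```

Calling `cv_imputation_error(X)` returns one error value per trait column, so traits that contribute most to the overall imputation error can be ranked. Traits measured on different scales would need to be standardised first for the values to be comparable, and any other imputer with the same array-in, array-out shape can be passed via `impute`.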
Cool. I'd also add these resources:

Lots of these are for newsrooms, but I thought they might be useful for everything.
Thanks for these! Going to also add them to #10, as a lot refers to basic data quality checks.
An interesting added feature could be functionality for exploring potential data biases of analytical datasets.