-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Implementing test statistics based on bootstrapnet confidence intervals #35
Comments
Hi Dan, Thanks for opening the issue here. To my understanding, you want to compare the preserve plots in 2021 (n=32) to preserve plots from 2016 (n=32). If you manage to get network metrics for each plot (I understand that you have a network for each plot) then you can try a t-test or a permutation test to check for significant differences between the two distributions of network metrics (each of n = 32, one for 2021, and the other for 2016). I can give you some R code for running a permutation test applicable to two distributions if you need that (I can post it in a comment below). Note that, the non-parametric approach (permutation test) is a bit more conservative (doesn't detect a difference that "easily" as a t-test), but doesn't require the assumption of normality as the t-test. One risk in your design is pseudoreplication. And that, in your case, is indicated by how close/far the plots are. For example, you or other experts in ecology would consider that a distance of 100 m (not sure what your distance is, so I am just picking a random number) between the plots is big enough to not have pseudoreplication? For example, is 100 m a big enough distance so that the insect species from one plot do not create interactions with the plants from another plot? In other words, so that you do not risk sampling the same species again because of the short distance? This applies to the plants as well. You need to be far enough between plots so that you sample the entire diversity of that ecosystem/landscape which you analyze or compare with another (or across time as in your case). Your sample of plots needs to be as representative as possible, capturing the natural diversity without the risk of being too dense spatially. If is too dense spatially, then you need to pool together plots and that will unavoidably mean losing from the potential sample size (n = 32 can become n = 10 or so). I do not know exactly what an optimal distance is, for example, some of my colleagues who have more ecological expertise told me that bumblebees can fly up to 5 km during a pollination day. I think you can make a nice discussion about this in your analysis nevertheless. If the pseudoreplication is indeed an issue (most likely yes), then you need to pool all those 32 networks in one big network for each time frame, one for 2021 and one for 2016 as you did in the figures above (I presume). Those graphs show that you reach saturation, that is, you sampled enough so that you can capture the network diversity of each period. If the confidence intervals overlap (case of LL - lower level species, the plants), then you do not have a significant difference, which indicates that the plant community didn't change much across time. The opposite applies for the insect species (higher trophic level), where you have significant differences when looking at "robustness". You might want to check other network and species level metrics from the bipartite package. Hope that what I wrote here makes some sense. Feel free to continue with your thoughts as I also like to understand better these ecological aspects. Best, |
Thank you Valentin, your commentary is very helpful! I would appreciate the code for permutation tests in the case that my data neither fit assumptions of normality nor are improved by transformation. If not for this exact application, it could prove useful for other analyses within my project. The spatial scale is a concern as my study site is only approximately 100m x 175m. I was primarily expecting pseudoreplication with this design but I am doing a review of the literature to find some indications that using my plots as true replicates could be appropriate. If my question was examining microsite heterogeneity and how that influences plant-pollinator interactions I could see an argument supported by previous work. For my question and purposes I don't think that is an appropriate application, but please let me if you come across any work that is contrary to this assumption. |
Here is an example with R code regarding the permutation test - link. I do think that pseudoreplication is an issue at such a small scale and very well pointed that with the scale one changes the question. |
Hi @danielsandacz , I am curious how you progressed with the analysis. I want to collect some study cases where Best, |
Hi @Valentinitnelav/Bootstrapnet,
I successfully used bootstrapnet to compare network and species level
indices between three different networks. The networks represent three
combinations of study year and management type, which I have labelled the
following: 2016 Preserve, 2021 Preserve, and 2021 Neighbor. When comparing
species level indices, I only reported the results from one flowering
species as I am particularly interested in *Cirsium pitcheri* and how its
role differed over time and management history because 1) it is a federally
threatened plant and 2) a core species. I'm attaching two plots (one
network level and one species level) as examples.
To confirm the bootstrapnet analysis, I used the nullmodel function in
bipartite to generate the same null distributions and test for differences
between the networks using Kruskall-Wallis tests and then pairwise Wilcoxon
tests. In addition, I used econullnetr to generate null distributions and
test for how the observed metric differed from the null. I am happy to
share my completed thesis in the spring if that would be of more help!
…On Tue, Feb 22, 2022 at 3:37 AM Valentin Ștefan ***@***.***> wrote:
Hi @danielsandacz <https://github.com/danielsandacz> ,
I am curious how you progressed with the analysis. I want to collect some
study cases where bootstrapnet package was used.
Best,
Valentin
—
Reply to this email directly, view it on GitHub
<#35 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AV7HTHTFKCF6X34HGFOPELDU4NKNVANCNFSM5FUMD4WQ>
.
Triage notifications on the go with GitHub Mobile for iOS
<https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675>
or Android
<https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>.
You are receiving this because you were mentioned.Message ID:
***@***.***>
--
Daniel Sandacz (he/him/his)
Northwestern University Class '22
Master of Science in Plant Biology and Conservation
|
@danielsandacz , sounds cool! When you are done with your thesis, or whenever you have the time would be great if you can write about your analysis as a study case of using If you want, you can write the study case in the form of a reproducible RMarkdown document. You could also post that in RPubs as I did for the permutation example above. Is a convenient way of sharing analyses as compiled html documents. I leave this issue open because in our group we still think about an analysis pipeline based on the bootstrapped values. You might want to check also the work of our colleague, Elena, where we tried to tackle this. Motivans Švara, Elena, et al. "Effects of different types of low‐intensity management on plant‐pollinator interactions in Estonian grasslands." Ecology and evolution 11.23 (2021): 16909-16926. Link |
The example below is code for comparing robustness between network plots in a nature preserve and network plots on adjacent land of neighbors. In this situation the confidence intervals provide evidence that the neighbor plots are more robust than the preserve plots, particularly when considering the higher level (pollinators). I am curious if I could take this a step further and implement a test statistic based on the confidence intervals generated or another type of statistical test. Sample size is a limitation for the specific example I provide, but I am interested in any recommendations based on the example provided or future analysis where I compare the preserve plots in 2021 (n=32) to preserve plots from 2016 (n=32).
Where "preserve" (n=32) and "neighbor" (n=8) are matrices of plant-pollinator interactions across different plots
The text was updated successfully, but these errors were encountered: