-
Notifications
You must be signed in to change notification settings - Fork 2
/
Copy pathCONTENT.txt
executable file
·111 lines (73 loc) · 5.89 KB
/
CONTENT.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
### What happens if there is no difference between groups
present pictures of the simulations from the numerator and denominator if the null (no mean difference) and the alternative (mean difference) are true
- draw these directly from the F and shifted F distributions
- same for denominator
Provide also some intuition about why this seemingly is enough - with infer based permutation + large sample distribution plot
# ANOVA table and components
Show the different components (see Davison)
and how they are stored - show how these relate to the calculations.
# Example of Baumann for pretest 1, including graphics (raincloud plots?)
# Discuss APA recommandations for reporting
### APA recommandations for reporting the test statistic
### Graphical representation of groups, etc.
### Example 1: psychology paper
### Example 2: teaching to read
### Assumptions (redux) and a mathematical formulation
### Estimation is based on (least squares - sample averages here)
### Balanced samples: why are they ubiquitous and benefits
### I cried power!
Balanced samples are the best allocation when the variance is the same for each group. Show this with the two-sample t-test.
carData::Baumann
> This study investigated the effectiveness of explicit instruction in think aloud as a means to promote elementary students' comprehension monitoring abilities. Sixty-six fourth-grade students were randomly assigned to one of three experimental groups: (a) a Think-Aloud (TA) group, in which students were taught various comprehension monitoring strategies for reading stories (e.g., self-questioning, prediction, retelling, rereading) through the medium of thinking aloud; (b) a Directed Reading-Thinking Activity (DRTA) group, in which students were taught a predict-verify strategy for reading and responding to stories; or (c) a Directed Reading Activity (DRA) group, an instructed control, in which students engaged in a noninteractive, guided reading of stories. The primary quantitative analyses involved two planned orthogonal contrasts—effect of instruction (TA + DRTA vs. 2 x DRA) and intensity of instruction (TA vs. DRTA)—for three whole-sample dependent measures: (a) an error detection test, (b) a comprehension monitoring questionnaire, and (c) a modified cloze test. Results of effect of instruction contrasts revealed that TA and DRTA students were more skillful at comprehension monitoring than DRA students (TA + DRTA>DRA for all three measures). Results of intensity of instruction contrasts indicated that although TA-trained students had greater awareness of comprehension monitoring abilities (TA>DRTA for the questionnaire), DRTA students' performance equaled (TA = DRTA for the cloze test) or exceeded (TA<DRTA for the error detection test) that of the TA students. Qualitative data from student interviews, however, revealed that TA students both reported and demonstrated using a greater depth and breadth of comprehension monitoring abilities than either DRTA or DRA students. It was concluded that both TA and DRTA strategies are effective for enhancing elementary students' comprehension monitoring abilities but that additional research is needed to determine the relative effectiveness of TA and DRTA approaches.
Example of application + citation
Thinking outside the box:
- potential confounders?
- to extrapolate to the population, we potentially need to reweigth using demographic information
- blocking
The global null hypothesis of the one-way analysis of variance (ANOVA) problem is $\mathscr{H}_0: \mu_1 = \mu_2 = \mu_3$, where $\mu_i$ $(i=1, \ldots, 3)$ is the expectation of group $i$.
Homogeneity
Null hypothesis and comparing means of multiple groups
Assumptions
Plot your data
Data compression: what you need to report?
EXPERT TIPS: visualisation tools, histograms and line charts for ordered factors.
Reminder: must be handling the confounding via randomization or blocking (explicitly)
Where do we get our power from? Decomposition of the sum of squares TOTAL = BETWEEN + WITHIN
Anova table
What happens under the null
What happens under the alternative: some illustrations with simulated data
Potential ways of assessing the null hypothesis
- what assumptions are we making in the process?
- where do we draw our power from
- Alternative test statistics: Welch and permutation (and their validity)
- Power considerations: choosing the sample size
- estimate of effect size, type I error, number of levels (subsample size), variability
- Careful with observed power (using the observed effect size in the calculation)
- Practical significance versus statistical significance
- Thinking outside the box: how to allocate sample sizes when we do not have equal variance
-What about unbalanced experiments? An example of what can go WRONG?
- The experiment is not properly conducted; see S 2.4.6 in Lawson (2014)
- lurking variables: effective teaching methods, confounded with instructor/class time/voluntary participation
-> independence of errors to unit
-> measurements are not normal (or continuous)
- need experimental units and large enough sample sizes
- Example with variance increasing together with the level of the response
## Contrasts and pairwise tests
Reparametrizing the model
Statistical fallacy: data dredging
Multiple comparisons and family-wise error rates
Preplanned experiments: why penalie large studies
How much data do you need to reliably estimate
Methods for controlling multiplicity
Thinking outside the box: no free lunch.
library(tidyverse)
ng <- 5
k <- 15
n <- ng*k
set.seed(123456)
fakesamp <- data.frame(response = rnorm(n = n, sd = 2) + rep(rgamma(n = ng, scale = 2, shape = 10), each = k),
group = rep(LETTERS[1:ng], each = k))
centeredfakesamp <- fakesamp %>% group_by(group) %>% mutate(response = as.numeric(scale(response, scale = FALSE)))
g1 <- ggplot(data = fakesamp, mapping = aes(x = group, y = response, col = group)) +
geom_point() + theme(legend.position = "none")
g1