Matthew McBee
21 October, 2016
Constraints, incentives, and normative practices:
Data peeking is interim data analysis.
Reasons for data peeking:
End Study Early if:
Continue Study If:
The universe exists in two states:
Data peeking is harmless (relative to NHST) if seeing the intermin result does not, in any way, change your future behavior.
\[ \textbf{Is this realistic?} \]
\[ \textbf{Really?} \]
Optional stopping is the practice of iteratively adding subjects and updating statistical hypothesis tests.
Iterate until p-value becomes “decisive.”
(Alternatively: iterate until p<.05).
If sample sizes for interim analyses can be prespecified:
Pocock bounds find equal critical values for all interim tests.
O'Brian-Fleming bounds set a higher critical value for the early tests than the later ones.
Both solutions require equal \( n \) between interim analyses.
GroupSeq can do this. \[ \begin{aligned} \\ \\ \\ p(\theta|y) & & \propto & & p(y|\theta) & & p(\theta) \\ \\ (\textit{posterior} & & & & (\textit{likelihood}) & & (\textit{prior}) \\ \end{aligned} \]
(8 heads in 10 coin flips)
(80 heads in 100 coin flips)
(8 heads in 10 coin flips)
(800 heads in 1,000 coin flips)
(4 heads in 5 coin flips)
Given two hypotheses \( H_0 \) and \( H_1 \), and some data \( x \).
\[ \begin{aligned} \frac{p(H_0|x)}{p(H_1|x)} = & & \frac{p(x|H_0)}{p(x|H_1)} & & \frac{p(H_0)}{p(H_1)} \\ & & & & \\ \textit{Posterior odds} & & \textit{Bayes Factor} & & \textit{Prior odds} \\ \end{aligned} \]
The Bayes Factor describes how relative probability of \( H_1 \) versus \( H_0 \) should be updated given the data.
\( H_1 \) and \( H_0 \) are represented by competing priors.
Usually the prior for \( H_0 \) is strongly concentrated at zero while prior \( H_1 \) is spread over the parameter space.
JASP. Free software with SPSS-like point-and-click interface. Download from jasp-stats.org.
R package BayesFactor.
Bayes Factors can only be computed for relatively simple models such as t-tests, ANOVA, and linear regression.
The specific nature of these priors matters for computing the Bayes Factor. However, there are certain defauls that have been show to work well in most circumstances (Morey & Rouder, 2015).
BayesFactor package has three selectable priors for \( H_1 \).BayesFactor default but is user-adjustable.Jeffreys (1961)
| BF | Interpretation |
|---|---|
| 0 to 5 | barely worth mentioning |
| 5 to 10 | substantial |
| 10 to 15 | strong |
| 15 to 20 | very strong |
| >20 | decisive |
Kass & Raftery (1995)
| BF | Interpretation |
|---|---|
| 0 to 2 | not worth more than a bare mention |
| 2 to 6 | positive |
| 6 to 10 | strong |
| >10 | very strong |
\( \delta=0 \), stop collecting data when p < .05 or n = 200
\( \delta=0 \), stop collecting data when BF > 10 or n = 200
\( \delta=0 \), stop collecting data when p < .05 or n = 200
\( \delta=0 \), stop collecting data when BF > 10 or n = 200
\( \delta=0 \), stop collecting data when p < .05 or n = 200
\( \delta=0 \), stop collecting data when BF > 10 or n = 200
\( \delta=.2 \), stop collecting data when p < .05 or n = 200
\( \delta=.2 \), stop collecting data when BF > 10 or n = 200
Etz, A., Gronau, Q. F., Dablander, F., Eldsbrunner, P. A., & Baribault, B. (2016). How to become a Bayesian in eight easy steps: An annotated reading list. Preprint posted on the Open Science Framework. https://osf.io/cpvfk/
JASP Team (2016). JASP (Version 0.7.5.6) [Computer software]
Kass R. E. & Raftery, A. E. (1995). Bayes Factors. Journal of the American Statistical Association, 90(430), 773-795.
Lakens, D. (2014). Performing high-powered studies efficiently with sequential analysis. Social Science Research Network. doi: http://dx.doi.org/10.2139/ssrn.2333729