This up-to-date document is available at https://rpubs.com/sherloconan/1208326
| Source | Sum of Squares | Degrees of Freedom | Mean Square | \(F\)-Ratio |
|---|---|---|---|---|
| Conditions | \(\textit{SS}_{\ \text{between}}\) | \(a-1\) | \(\textit{MS}_{\ \text{between}}\) | \(\frac{ \textit{MS}_{\ \text{between}}}{\textit{MS}_{\ \text{within}}}\) |
| Error | \(\textit{SS}_{\ \text{within}}\) | \(N-a\) | \(\textit{MS}_{\ \text{within}}\) | |
| Total | \(\textit{SS}_{\ \text{total}}\) | \(N-1\) | | |
The condition (between-groups) sum of squares is \(\textit{SS}_{\ \text{between}}=\sum_{i=1}^a n_i\left(\bar{Y}_{i\centerdot}-\bar{Y}_{\centerdot\centerdot}\right)^2=\sum_{i=1}^a n_i\bar{Y}_{i\centerdot}^2-N\bar{Y}_{\centerdot\centerdot}^2\)
The error (within-groups) sum of squares is \(\textit{SS}_{\ \text{within}}=\sum_{i=1}^a\sum_{j=1}^{n_i}\left(Y_{ij}-\bar{Y}_{i\centerdot}\right)^2\)
The total sum of squares is \(\textit{SS}_{\ \text{total}}=\sum_{i=1}^a\sum_{j=1}^{n_i}\left(Y_{ij}-\bar{Y}_{\centerdot\centerdot}\right)^2=\sum_{i=1}^a\sum_{j=1}^{n_i} Y_{ij}^2-N\bar{Y}_{\centerdot\centerdot}^2\)
The total number of observations is \(N=\sum_{i=1}^a n_i\)
The grand mean is \(\bar{Y}_{\centerdot\centerdot}=\frac{1}{N}\sum_{i=1}^a\sum_{j=1}^{n_i} Y_{ij}\)
The condition means are \(\bar{Y}_{i\centerdot}=\frac{1}{n_i}\sum_{j=1}^{n_i} Y_{ij}\)
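As a quick check on these identities, the following sketch computes the three sums of squares by hand for the PlantGrowth data analyzed later in this document (the names Y, G, Yi, SSb, and SSw are ad hoc).
Y <- PlantGrowth$weight; G <- PlantGrowth$group
Ybar <- mean(Y)                            # grand mean
Yi <- tapply(Y, G, mean); ni <- table(G)   # condition means and sizes
SSb <- sum(ni * (Yi - Ybar)^2)             # SS between
SSw <- sum((Y - Yi[G])^2)                  # SS within; Yi[G] indexes the group means by factor level code
c(SSb, SSw, SSb + SSw, sum((Y - Ybar)^2))  # SS between + SS within = SS total; ≈ 3.77, 10.49, 14.26, 14.26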
Exploratory hypothesis testing: \(\mathcal{H_0}:\mu_1=\dotsb=\mu_a\) versus \(\mathcal{H_1}:\) not all means are equal, assuming fixed effects.
\(\hspace{12.43em}\mathcal{H_0}:\sigma_t^2=0\) versus \(\mathcal{H_1}:\sigma_t^2\neq0\), assuming random effects.
Reparametrization \(\mu_i=\mu+t_i=\mu+\sigma_\epsilon d_i\) with \(\sum_{i=1}^a d_i=0\)
\[\begin{align} \tag{1} \mathcal{M}_1:\ Y_{ij}&=\mu+\sigma_\epsilon d_i+\epsilon_{ij}\\ \text{versus}\quad\mathcal{M}_0:\ Y_{ij}&=\mu+\epsilon_{ij},\\ \qquad\qquad&\epsilon_{ij}\overset{\text{i.i.d.}}{\sim}\mathcal{N}(0,\sigma_\epsilon^2)\quad\text{ for condition } i=1,\dotsb,a;\text{ (unbalanced) subject } j=1,\dotsb,n_i \end{align}\]
\[\begin{equation} \tag{2} \pi(\mu,\sigma_\epsilon^2)\propto 1/\ \sigma_\epsilon^{2} \end{equation}\]
\[\begin{equation} \tag{3} d_i^\star\mid g\overset{\text{i.i.d.}}{\sim}\mathcal{N}(0,g) \end{equation}\]
\[\begin{equation} \tag{4} g\sim\text{Scale-inv-}\chi^2(1,h^2) \end{equation}\]
\[\begin{equation} \tag{5} (d_1^\star,\dotsb,d_{a-1}^\star)=(d_1,\dotsb,d_a)\cdot\mathbf{Q} \end{equation}\]
\[\begin{equation} \tag{6} \mathbf{I}_a-\frac{1}{a}\mathbf{J}_a=\mathbf{Q}\cdot\mathbf{Q}^\top \end{equation}\]
By default, \(h=0.5\) for the fixed effects (in this case) and \(h=1\) for the random effects in Eq. 4.
\(\mathbf{Q}\) is an \(a\times(a-1)\) matrix whose columns are the \(a-1\) unit-length eigenvectors corresponding to the nonzero eigenvalues of the left-hand side of Eq. 6. For example, when \(a=3\), the projected standardized effects are
\[ (d_1^\star,\ d_2^\star)=(d_1,\ d_2,\ d_3)\cdot \begin{pmatrix} \frac{\sqrt{6}}{3} & 0 \\ -\frac{\sqrt{6}}{6} & \frac{\sqrt{2}}{2} \\ -\frac{\sqrt{6}}{6} & -\frac{\sqrt{2}}{2} \end{pmatrix} \]
In the other direction (given \(d_1+d_2+d_3=0\)),
\[ (d_1,\ d_2,\ d_3)^\top= \begin{pmatrix} \frac{\sqrt{6}}{3} & 0 \\ -\frac{\sqrt{6}}{6} & \frac{\sqrt{2}}{2} \\ -\frac{\sqrt{6}}{6} & -\frac{\sqrt{2}}{2} \end{pmatrix}\cdot(d_1^\star,\ d_2^\star)^\top \]
Read more: https://rpubs.com/sherloconan/1070217.
computeQ <- function(a) {
  #' Implement Eq. 6
  S <- diag(a) - matrix(1, nrow=a, ncol=a) / a
  e <- qr(S)
  Q <- qr.Q(e) %*% diag(sign(diag(qr.R(e))))
  Q[, which(abs(e$qraux) > 1e-3), drop=F]
}
computeQ(3)
## [,1] [,2]
## [1,] 0.8164966 3.732125e-17
## [2,] -0.4082483 7.071068e-01
## [3,] -0.4082483 -7.071068e-01
The Jeffreys–Zellner–Siow (JZS) Bayes factor for the Bayesian balanced one-way (between-subjects) analysis of variance (ANOVA) is computed by integrating across one dimension in Eq. 7 (corrigendum; Morey et al., 2011, p. 374). \(n_i=n\) for all \(i\).
\[\begin{equation} \tag{7} \textit{JZS-BF}_{10}=(2\pi)^{-\frac{1}{2}}h\cdot\int_0^\infty(1+ng)^{\frac{a(n-1)}{2}}\left(1+\frac{ng}{1+\frac{a-1}{a(n-1)}\cdot F}\right)^{-\frac{an-1}{2}}g^{-\frac{3}{2}}e^{-\frac{h^2}{2g}}\text{d}g, \end{equation}\]
JZS2BF10 <- function(h=0.5, n, a, F_ratio) {
  #' Input -
  #'   h: the prior scale on the variability of fixed effects, i.e.,
  #'      `rscale="medium"` or `rscale=1/sqrt(2)` in BayesFactor::ttestBF, or
  #'      `rscaleFixed="medium"` or `rscaleFixed=0.5` in BayesFactor::anovaBF
  #'   n: number of observations per balanced group
  #'   a: number of groups
  #'   F_ratio: F-ratio
  #' Output - the JZS Bayes factor for balanced one-way ANOVA claimed in Eq. 7
  coef <- (2*pi)^(-0.5) * h
  integrand <- function(g) {
    K <- 1 + n*g / (1 + F_ratio * (a-1)/(a*n-a))
    (1+n*g)^((a*n-a)/2) * K^((1-a*n)/2) * g^(-1.5) * exp(-h^2/(2*g))
  }
  unname(coef * integrate(integrand, lower=0, upper=Inf)$value)
}
test <- aov(weight ~ group, PlantGrowth) # one-way ANOVA with three balanced groups, n = 10
F_ratio <- summary(test)[[1]]["group", "F value"] # F-ratio
## https://forum.cogsci.nl/discussion/9448/verifying-jzs-bayes-factor-for-one-way-anova
JZS2BF10(n=10, a=3, F_ratio=F_ratio) # verifying Eq. 7
## [1] 3.896995
BayesFactor::anovaBF(weight ~ group, PlantGrowth, progress=F) # testing fixed effects
## Bayes factor analysis
## --------------
## [1] group : 3.896995 ±0.01%
##
## Against denominator:
## Intercept only
## ---
## Bayes factor type: BFlinearModel, JZS
Fixed or Random Effects
BayesFactor::anovaBF(weight ~ group, PlantGrowth, whichRandom="group", progress=F) # testing random effects
## Bayes factor analysis
## --------------
## [1] group : 3.060886 ±0%
##
## Against denominator:
## Intercept only
## ---
## Bayes factor type: BFlinearModel, JZS
Fixed effects - the treatment levels are assumed to be constant across all trials
Random effects - the treatment levels are a subset sampled from the entire population of treatments
fit1_fixed <- lm(weight ~ group, PlantGrowth)
fit0_fixed <- lm(weight ~ 1, PlantGrowth)
anova(fit1_fixed, fit0_fixed) # testing fixed effects
## Analysis of Variance Table
##
## Model 1: weight ~ group
## Model 2: weight ~ 1
## Res.Df RSS Df Sum of Sq F Pr(>F)
## 1 27 10.492
## 2 29 14.258 -2 -3.7663 4.8461 0.01591 *
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
summary(test)
## Df Sum Sq Mean Sq F value Pr(>F)
## group 2 3.766 1.8832 4.846 0.0159 *
## Residuals 27 10.492 0.3886
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
fit1_rand <- nlme::lme(weight ~ 1, PlantGrowth, ~1 | group, method="ML")
fit0_rand <- nlme::gls(weight ~ 1, PlantGrowth, method="ML") # if `REML`, same as fit0_fixed
anova(fit1_rand, fit0_rand) # testing random effects
## Model df AIC BIC logLik Test L.Ratio p-value
## fit1_rand 1 3 66.29798 70.50157 -30.14899
## fit0_rand 2 2 66.82084 69.62323 -31.41042 1 vs 2 2.522865 0.1122
Approximating Bayes Factors from Bayesian Information Criterion (BIC)
The BIC for model \(i\) with the number of free parameters \(k_i\) and the sample size \(N\) is \(\text{BIC}(\mathcal{M}_i)=-2\ln{\mathcal{L}_i}+k_i\ln{N}\quad\) for \(N\gg k_i\),
where \(\mathcal{L}_i=p(\boldsymbol{y}\mid\hat{\boldsymbol{\theta}}_i,\mathcal{M}_i)\) is the maximum likelihood.
Use the second-order Taylor approximation of the log-marginal likelihood of each model.
\[\begin{equation} \tag{8} \mathit{BF}_{10}\approx\exp\left\{\frac{\text{BIC}(\mathcal{M}_0)-\text{BIC}(\mathcal{M}_1)}{2}\right\} \end{equation}\]
Derivation in Wagenmakers (2007, Appendix B).
## https://github.com/tomfaulkenberry/anovaBFcalc/blob/master/R/bf_bic.R
df1 <- 2; df2 <- 27; N <- df1 + df2 + 1 # total number of observations
sqrt(N^df1 * (1 + F_ratio * df1 / df2)^(-N)) # BF₀₁
## [1] 0.3012838
null <- aov(weight ~ 1, PlantGrowth) # null model fit
exp((BIC(test) - BIC(null)) / 2) # same BF₀₁
## [1] 0.3012838
\[\begin{align} \text{BIC}(\mathcal{M}_1)-\text{BIC}(\mathcal{M}_0)&=-2\ln{\mathcal{L}_1}+k_1\ln{N}+2\ln{\mathcal{L}_0}-k_0\ln{N} \\ &=-2(\ln{\mathcal{L}_1}-\ln{\mathcal{L}_0})+(k_1-k_0)\ln{N} \\ &=-\Lambda+(k_1-k_0)\ln{N} \\ \\ \mathit{BF}_{01}&\approx\exp\left\{\frac{\text{BIC}(\mathcal{M}_1)-\text{BIC}(\mathcal{M}_0)}{2}\right\} \\ &=\exp\left\{\frac{-\Lambda+(k_1-k_0)\ln{N}}{2}\right\} \\ &=N^{\frac{k_1-k_0}{2}}\!\cdot\exp\left\{-\frac{\Lambda}{2}\right\} \tag{$8^\prime$} \end{align}\]
\(\mathcal{M}_1:\quad Y_{ij}=\beta_0+\mathbf{x}_{i}^\top\boldsymbol{\theta}+\epsilon_{ij},\qquad\epsilon_{ij}\overset{\text{i.i.d.}}{\sim}\mathcal{N}(0,\sigma_\epsilon^2)\quad\text{ for condition } i=1,\dotsb,a;\text{ (unbalanced) subject } j=1,\dotsb,n_i\)
The Wald test statistic is \(\ W=(\hat{\boldsymbol{\theta}}-\boldsymbol{\theta}_0)^\top\cdot I(\hat{\boldsymbol{\theta}})\cdot(\hat{\boldsymbol{\theta}}-\boldsymbol{\theta}_0)\mathrel{\dot\sim}\chi_{q}^2\ \) under \(\mathcal{H}_0\), where \(q=a-1\) is the number of parameters in \(\boldsymbol{\theta}\).
The likelihood-ratio test statistic is \(\
\Lambda:=-2(\ln{\mathcal{L}_0}-\ln{\mathcal{L}_1})\mathrel{\dot\sim}\chi_{k_1-k_0}^2\
\) under \(\mathcal{H}_0\).
In some contexts, \(\lambda_{\text{LR}}\) refers to the
likelihood-ratio test statistic, while \(\Lambda\) represents the likelihood ratio
\(\mathcal{L}_0/\mathcal{L}_1\in[0,1]\).
If we plug in \(\Lambda\) instead of \(W\), the JAB becomes identical to the BIC approximation.
Otherwise, the JAB is asymptotically equivalent to the BIC approximation since \(\lvert W-\Lambda\rvert\overset{p}{\to}0\) under \(\mathcal{H}_0\), as \(N\to\infty\).
\(k_1-k_0=a-1\) in a one-way ANOVA with \(a\) conditions. \(\Lambda\equiv N\cdot\ln{\left(1+\frac{a-1}{N-a}\cdot F\right)}=(a-1)\cdot\ln{\left(1+\frac{F}{(N-a)\ /\ (a-1)}\right)^{N\ /\ (a-1)}}\to(a-1)\cdot F\quad\) as \(N\to\infty\).
null <- aov(weight ~ 1, PlantGrowth)
LRS <- -2 * (logLik(null) - logLik(test))
derivFS <- 30 * log(1 + F_ratio * (3-1) / (30-3))
WS <- t(coef(test)[-1]) %*% solve(vcov(test)[-1,-1]) %*% coef(test)[-1]
cat("The likelihood-ratio test statistic is", LRS, "and", derivFS, "(based on F(2,27))",
"\nThe Wald test statistic is", WS, "and", (3-1)*F_ratio, "(based on F(2,27))")
## The likelihood-ratio test statistic is 9.2018 and 9.2018 (based on F(2,27))
## The Wald test statistic is 9.692176 and 9.692176 (based on F(2,27))
Test on a Single Parameter
The test statistic in a Wald test is \(W=\left(\frac{\hat{\theta}-\theta_0}{\text{se}(\hat{\theta})}\right)^2\sim\chi_1^2\) under \(\mathcal{H}_0\), where \(\hat{\theta}\) is the maximum likelihood estimate, and \(\text{se}(\hat{\theta})\) is the standard error.
Wagenmakers (2022, Eq. 6) reviewed the Jeffreys-style approximate Bayes factor (JAB) and proposed a piecewise approximation (WAB) in Eq. 10 that converts a \(p\)-value and an effective sample size \(N\) into a Bayes factor.
The complication with JAB is that the approximation is valid only when \(\text{se}(\hat{\theta})\ll\sigma_g\), where \(\sigma_g\) is the scale of the prior distribution \(g(\theta)\). With very small sample sizes this assumption does not hold, and Eq. 9 is biased against \(\mathcal{H}_0\).
\[\begin{equation} \tag{9} \textit{JAB}_{01}=A\cdot\sqrt{N}\exp\left\{-\frac{1}{2}W\right\} \end{equation}\]
A normal unit-information prior for \(\theta\) yields \(A=1\). Based on the Jeffreys prior, the general approximation specifies \(A=\sqrt{\pi/2}\approx1.253\).
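As a small illustration of Eq. 9 (a sketch under an assumed one-sample setup, not an example from Wagenmakers), the squared \(t\) statistic plays the role of \(W\) for \(\mathcal{H}_0:\mu=0\):
set.seed(277)
y <- rnorm(25, mean=0.4)             # assumed toy data, N = 25
W <- unname(t.test(y)$statistic)^2   # squared t statistic as the Wald statistic
sqrt(25) * exp(-W / 2)               # JAB01 with A = 1 (normal unit-information prior)
sqrt(pi/2) * sqrt(25) * exp(-W / 2)  # JAB01 with A = sqrt(pi/2) (Jeffreys prior)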
Wagenmakers’s piecewise approximation is
\[ \textit{WAB}_{01}\approx \begin{cases} \tag{10} \text{$\ p^{\frac{1}{4}}\sqrt{N}$,} & \text{$p>0.5$} \\ \\ \text{$\ \sqrt{pN}$,} & \text{$0.1<p\leqslant0.5$ (simpler)} \\ \\ \text{$\ \frac{4}{3}p^{\frac{2}{3}}\sqrt{N}$,} & \text{$0.1<p\leqslant0.5$ (more precise)} \\ \\ \text{$\ 3p\sqrt{N}$,} & \text{$p\leqslant0.1$} \end{cases} \]
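A direct transcription of Eq. 10 into R might look as follows (a minimal sketch; the function name WAB01 and the PlantGrowth usage are ours):
WAB01 <- function(p, N, precise=TRUE) {
  #' piecewise approximation in Eq. 10 from a p-value and an effective sample size N
  if (p > 0.5) {
    p^(1/4) * sqrt(N)
  } else if (p > 0.1) {
    if (precise) 4/3 * p^(2/3) * sqrt(N) else sqrt(p * N)  # the two middle cases
  } else {
    3 * p * sqrt(N)
  }
}
WAB01(p=0.0159, N=30)  # e.g., the PlantGrowth p-value with N = 30 gives WAB01 ≈ 0.26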
Asymptotics 101
PDF \(f_Y(y_i;\boldsymbol{\theta})\), joint likelihood \(L(\boldsymbol{\theta};\boldsymbol{y})\), log likelihood \(l(\boldsymbol{\theta};\boldsymbol{y})\), score \(l^\prime(\boldsymbol{\theta};\boldsymbol{y})\)
While \(f(y_i\mid\boldsymbol{\theta})\) and \(f(y_i;\boldsymbol{\theta})\) can represent the same mathematical relationship, the notation signals different interpretations and emphases in statistical analysis.
The expected Fisher information matrix, evaluated at \(\boldsymbol{\theta}_0\), is \(\left[\ \mathcal{I}(\boldsymbol{\theta}_0)\ \right]_{ij}:=\mathbb{E}\left[\left(\frac{\partial}{\partial\theta_i}\ln{f(Y;\boldsymbol{\theta})}\right)\left(\frac{\partial}{\partial\theta_j}\ln{f(Y;\boldsymbol{\theta})}\right)\ \bigg|\ \boldsymbol{\theta}_0\right]\).
Under certain regularity conditions, it is easier to compute \(\left[\ \mathcal{I}(\boldsymbol{\theta}_0)\ \right]_{ij}=-\mathbb{E}\left[\left(\frac{\partial^2}{\partial\theta_i\partial\theta_j}\ln{f(Y;\boldsymbol{\theta})}\right)\ \bigg|\ \boldsymbol{\theta}_0\right]\).
The observed Fisher information matrix is denoted as \(\left[\ I(\hat{\boldsymbol{\theta}})\ \right]_{ij}\).
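For the Gaussian linear models used throughout this document, replacing \(\sigma_\epsilon^2\) with the residual-variance estimate gives the familiar identity \(I(\hat{\boldsymbol{\theta}})=\mathbf{X}^\top\mathbf{X}/\hat{\sigma}_\epsilon^2\) for the coefficients, which is exactly the inverse of vcov(). A quick check on the PlantGrowth fit (the name fitW is ad hoc):
fitW <- lm(weight ~ group, PlantGrowth)
X <- model.matrix(fitW)
all.equal(crossprod(X) / summary(fitW)$sigma^2,  # X'X / sigma2hat
          solve(vcov(fitW)), check.attributes=FALSE)
## [1] TRUE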
We obtained unexpected results from the BIC approximation-based methods in a one-way ANOVA, specifically when the number of groups is relatively large (e.g., \(a=5\)).
In the analysis below, the \(p\)-value is around 0.04 and the anovaBF function returns a value around 1, yet the BIC approximation yields an extreme value of about 143 in favor of the null hypothesis.
n <- 30 # number of observations per balanced group
a <- 5 # number of groups
m1 <- 0 # population mean of the first group (suppose the lowest)
delta <- 0.3 # evenly-spaced population mean difference, m2 - m1 = m3 - m2 = ···
set.seed(277)
data <- data.frame("resp" = rnorm(n * a, m1 + delta * (1:a - 1), 1),
"grp" = rep(paste0("grp", 1:a), n), stringsAsFactors=T)
summary(fit51 <- aov(resp ~ grp, data)) # full model fit
## Df Sum Sq Mean Sq F value Pr(>F)
## grp 4 10.03 2.5078 2.53 0.0431 *
## Residuals 145 143.74 0.9913
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
FVal <- summary(fit51)[[1]]["grp", "F value"] # F-ratio
1 / BayesFactor::anovaBF(resp ~ grp, data, progress=F) # JZS-BF₀₁
## Bayes factor analysis
## --------------
## [1] Intercept only : 1.13705 ±0%
##
## Against denominator:
## resp ~ grp
## ---
## Bayes factor type: BFlinearModel, JZS
# BIC approximation
fit50 <- aov(resp ~ 1, data) # null model fit
exp((BIC(fit51) - BIC(fit50)) / 2) # BIC-BF₀₁
## [1] 142.8294
# Jeffreys approximate Bayes factor (Λ → (a-1)⋅F)
(n*a)^((a-1)/2) * exp(-0.5 * (a-1) * FVal) # original JAB₀₁
## [1] 142.8158
sqrt(n*a) * exp(-0.5 * (a-1) * FVal * ((n*a)^(1/(a-1))-1) / (n*a)^(1/(a-1))) # modified JAB₀₁
## [1] 0.330016
\(\mathcal{H}_0:\boldsymbol{\theta}=\boldsymbol{\theta}_0\) versus \(\mathcal{H}_1:\boldsymbol{\theta}\neq\boldsymbol{\theta}_0\)
To reduce the extreme bias towards the null hypothesis, we assume the prior \(\boldsymbol{\theta}\sim\boldsymbol{\mathcal{N}}_q\!\left(\hat{\boldsymbol{\theta}},\ N^{1/q}\cdot I^{-1}(\hat{\boldsymbol{\theta}})\right)\), which, for \(q=1\), corresponds to the normal unit-information prior. For \(q>1\), the prior becomes more informative, placing more weight on the maximum likelihood estimate.
Faulkenberry (2021) introduced the Pearson Bayes factor for the one-way between-subjects ANOVA and expressed it in an analytic form using the ANOVA \(F\)-statistic and its degrees of freedom, assuming the Pearson type VI distribution (a variant of the beta prime distribution) for the ratio of variance components \(g=\sigma_t^2\ /\ \sigma_\epsilon^2\quad\) (Wei, Nathoo, & Masson, 2023, p. 1763).
\[\begin{equation} \tag{11} \pi(g)=\frac{n\lambda\cdot(ng)^{\lambda\beta+\lambda-1}\cdot\left(1+(ng)^\lambda\right)^{-\zeta-\beta-2}}{\text{B}(\zeta+1,\ \beta+1)}, \end{equation}\]
where \(\text{B}(\zeta+1,\ \beta+1)\equiv\text{B}(\beta+1,\ \zeta+1)\) is the beta function with \(\zeta,\beta>-1\). The standard version sets \(\lambda=1\).
Maruyama and George (2011, p. 2749) and Wang and Sun (2014, p. 5078) imposed the shape parameters \(\zeta\in[-\frac{1}{2},\ 0]\) and \(\beta=\frac{N-a}{2}-\zeta-2\) to further simplify the prior specification. \(N=an\).
The resulting Pearson Bayes factor is given in Eq. 12, where \(\Gamma(\cdot)\) is the gamma function. Despite their different \(g\)-prior assumptions (cf. Eq. 4), the Pearson Bayes factor matches the JZS Bayes factor, especially when \(\zeta=0\) (Faulkenberry, 2021; but see also https://osf.io/7q6yk).
\[\begin{align} \tag{12} \textit{P-BF}_{10}&=\frac{\Gamma\left(\frac{a+1}{2}+\zeta\right)\cdot\Gamma\left(\frac{N-a}{2}\right)}{\Gamma\left(\frac{N-1}{2}\right)\cdot\Gamma(1+\zeta)}\left(\frac{N-a}{N-a+(a-1)\cdot F}\right)^{-\frac{N-a}{2}+1+\zeta} \\ \\ &=\frac{\Gamma\left(\frac{\textit{df}_{\ \text{between}}}{2}+1+\zeta\right)\cdot\Gamma\left(\frac{\textit{df}_{\text{within}}}{2}\right)}{\Gamma\left(\frac{\textit{df}_{\ \text{total}}}{2}\right)\cdot\Gamma(1+\zeta)}\left(\frac{\textit{df}_{\text{within}}}{\textit{df}_{\text{within}}+\textit{df}_{\ \text{between}}\cdot F}\right)^{-\frac{\textit{df}_{\text{within}}}{2}+1+\zeta} \end{align}\]
Corrigendum (2022): As \(\alpha\) (\(\zeta\) in Eq. 11) decreases (not increases), \(\tau\) (\(g\) in Eq. 11) becomes more dispersed and less peaked around the mode (its Fig. 2, p. 16). This places more prior mass on larger treatment effects for values of \(\alpha\) closer to \(-\frac{1}{2}\) (not 0).
dbetaprime <- function(x, n, a, zeta, beta = a * (n-1) / 2 - zeta - 2) {
  #' density of the Pearson type VI distribution in Eq. 11 with lambda = 1
  #' alternatively, the generalized beta prime distribution
  n * (n*x)^beta * (1+n*x)^(-zeta-beta-2) / beta(zeta+1, beta+1)
}
dinvchisq <- function(x, nu, h) {
  #' density of the scaled inverse chi-squared distribution
  (h^2 * nu / 2)^(nu / 2) * exp(-nu * h^2 / (2 * x)) / (gamma(nu / 2) * x^(nu / 2 + 1))
}
curve(dbetaprime(x, n=6, a=3, zeta=0.5), ylab="Density", xlab=expression(italic(g)), from=0, to=8, lwd=2)
curve(dbetaprime(x, n=6, a=3, zeta=0), add=T, col="blue", lwd=2)
curve(dbetaprime(x, n=6, a=3, zeta=-0.5), add=T, col="red", lwd=2)
curve(dinvchisq(x, nu=1, h=0.5), add=T, col="gray", lwd=2)
curve(dinvchisq(x, nu=1, h=1), add=T, col="darkgray", lty=3, lwd=2)
title(main="Correct Figure 2 in Faulkenberry (2021, p. 16)",
sub="How Varying the Shape Parameter Influences the Prior Distribution")
legend("topright", c(expression(italic(ζ)==0.5 * "," ~~~~~italic(n)==6 * "," ~~italic(a)==3),
expression(italic(ζ)==0 * "," ~~~~~~~~italic(n)==6 * "," ~~italic(a)==3),
expression(italic(ζ)==-0.5 * "," ~~italic(n)==6 * "," ~~italic(a)==3),
expression(italic(ν)==1 * "," ~~~~~~~~italic(h)==0.5),
expression(italic(ν)==1 * "," ~~~~~~~~italic(h)==1)),
col=c("black","blue","red","gray","darkgray"), lwd=3, lty=c(rep(1,4),3), bty="n")
The black, blue, and red curves represent the generalized beta prime probability density functions, which are used in the Pearson Bayes factor.
The gray and dark gray curves represent the scaled inverse chi-squared probability density functions, which are used in the JZS Bayes factor (Rouder et al., 2017, p. 310).
\(g\sim\beta'(\beta+1,\ \zeta+1,\ \color{lightgray}{\lambda=1,}\ n)\quad\implies\quad\mathbb{E}[g]=\frac{\Gamma(\beta+2)\cdot\Gamma(\zeta)}{n\cdot\Gamma(\beta+1)\cdot\Gamma(\zeta+1)}=\frac{\beta+1}{n\zeta}\ \text{ for }\zeta>0\)
\(g\sim\text{Scale-inv-}\chi^2(\nu,h^2)\quad\implies\quad\mathbb{E}[g]=\frac{\nu h^2}{\nu-2}\ \text{ for }\nu>2\)
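Both expectations can be checked numerically against the density functions defined above (a quick sketch; the parameter values match the black curve for the first check, and \(\nu\) must exceed 2 for the second mean to exist):
integrate(function(x) x * dbetaprime(x, n=6, a=3, zeta=0.5), 0, Inf)$value  # ≈ 2
(5 + 1) / (6 * 0.5)   # (β+1)/(nζ) with β = 3*5/2 - 0.5 - 2 = 5, also 2
integrate(function(x) x * dinvchisq(x, nu=3, h=1), 0, Inf)$value            # ≈ 3
3 * 1^2 / (3 - 2)     # νh²/(ν−2), also 3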
Caution: Large Sample Size
# recall the first gamma term on the denominator in Eq. 12
c(gamma((3 * 114 - 1) / 2),
gamma((3 * 115 - 1) / 2)) # suppose 342 and 345 total observations
## [1] 5.562092e+305 Inf
By applying Stirling’s formula \(\Gamma(y+z)\thicksim y^z\cdot\Gamma(y)\) as \(y\to+\infty\), the gamma ratio in Eq. 12 becomes \[\begin{equation} \frac{\Gamma\left(\frac{a(n-1)}{2}\right)}{\Gamma\left(\frac{an-1}{2}\right)}\thicksim\left(\frac{an}{2}\right)^{\frac{1-a}{2}},\quad\text{as }n\to+\infty \end{equation}\]
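In practice, the overflow can be avoided altogether by evaluating the gamma ratio on the log scale (a minimal sketch, using the same \(N=345\) and \(a=3\) as above):
exp(lgamma((345 - 3) / 2) - lgamma((345 - 1) / 2))  # Γ(171)/Γ(172) = 1/171 ≈ 0.00585, finite
(345 / 2)^((1 - 3) / 2)                             # Stirling approximation (an/2)^((1-a)/2) ≈ 0.00580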
Bayes Factors Based on the Sampling Distributions of Test Statistics
Johnson (2005) developed an approach to computing Bayes factors based on test statistics. These Bayes factors are derived by modeling the test statistic and considering its distribution under both the null and alternative hypotheses. The resulting Bayes factor is an approximation of a full Bayes factor. Additionally, priors for the parameter under test are chosen to maximize the marginal density of the data (the test statistic in this context) under the alternative hypothesis, thereby making the test statistic Bayes factor (TSBF) an upper bound on the weight of evidence against the null hypothesis.
Unlike JAB and WAB, the proposed TSBFs are unable to find evidence in favor of the null hypothesis. That is to say, the range of these measures of evidence is \(1\leqslant\textit{BF}_{10}<\infty\).
TSBFs provide the least conservative measure of evidence against the null hypothesis, yielding relatively larger values of \(\mathit{TSBF}_{10}\).
TSBFs may lack the property of coherence (\(\textit{BF}_{12}=\textit{BF}_{10}\cdot\textit{BF}_{02}\)) due to the different transformations applied to the data.
Given the unusual results we have observed for the BIC approximation and the Pearson Bayes factor in ANOVA when the number of groups is large, we include the TSBFs in our set of candidate methods and evaluate them in our simulations.
\[\mathit{TSBF}_{01}= \begin{cases}\ \tag{14} \left(\frac{1+(N-a)\ /\ (a-1)}{F+(N-a)\ /\ (a-1)}\right)^{\frac{N-1}{2}}\!\cdot F^{\frac{a-1}{2}}, & \text{if } F>1 \\ \\ \ 1, & \text{otherwise}\end{cases} \]
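As a quick check of Eq. 14 against the PlantGrowth example above (\(F\approx4.846\), \(N=30\), \(a=3\), so \((N-a)/(a-1)=13.5\)):
(((1 + 27/2) / (F_ratio + 27/2))^(29/2)) * F_ratio^(2/2)  # TSBF01 ≈ 0.16, i.e., TSBF10 ≈ 6.3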
The Savage–Dickey density ratio is a special form of the Bayes factor for nested models: the posterior density over the parameters of the alternative model, evaluated at the hypothesized value, is divided by the prior density of the same model evaluated at the same point.
\[\begin{equation} \tag{15} \textit{BF}_{01}:=\frac{p(\boldsymbol{y}\mid \mathcal{M}_0)}{p(\boldsymbol{y}\mid \mathcal{M}_1)}=\frac{\color{#0055A4}{p(\boldsymbol{\theta}=\boldsymbol{\theta}_0\mid\boldsymbol{y},\mathcal{M}_1)}}{\color{#EF4135}{p(\boldsymbol{\theta}=\boldsymbol{\theta}_0\mid \mathcal{M}_1)}} \end{equation}\]
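Before the Stan-based implementation below, here is a minimal univariate check of Eq. 15 (an assumed toy setup, not part of the original analysis): with \(Y_i\sim\mathcal{N}(\theta,\sigma^2)\), \(\sigma\) known, the prior \(\theta\sim\mathcal{N}(0,\tau^2)\) under \(\mathcal{M}_1\), and \(\mathcal{H}_0:\theta=0\), conjugacy gives the posterior in closed form, and the density ratio reproduces the brute-force marginal-likelihood ratio.
set.seed(277)
sigma <- 1; tau <- 1; N <- 20
y <- rnorm(N, 0.3, sigma)
v <- 1 / (N / sigma^2 + 1 / tau^2)  # posterior variance of theta
m <- v * sum(y) / sigma^2           # posterior mean of theta
SDdr <- dnorm(0, m, sqrt(v)) / dnorm(0, 0, tau)  # posterior over prior, both at theta = 0
marg1 <- integrate(function(t) sapply(t, function(th) prod(dnorm(y, th, sigma))) *
                   dnorm(t, 0, tau), -Inf, Inf)$value  # p(y | M1) by quadrature
c("SD"=SDdr, "BF01"=prod(dnorm(y, 0, sigma)) / marg1)  # the two agree up to numerical error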
stancode3M1 <- "
data {
  int<lower=1> N;       // total number of subjects
  int<lower=2> C;       // number of conditions
  vector[N] Y;          // responses (ready for the ragged data structure)
  // int s[C];          // condition sizes (i.e., number of subjects in each condition) [removed features]
  array[C] int s;       // condition sizes [Stan 2.26+ syntax for array declarations]
  vector[C] mVec;       // mean vector for the multivariate normal prior on theta
  matrix[C,C] covMat;   // covariance matrix for the multivariate normal prior on theta
}
parameters {
  vector[C] theta;      // combined vector for intercept and slopes
  real<lower=0> sigma;  // error standard deviation
}
model {
  // Priors
  theta ~ multi_normal(mVec, covMat);  // unit information
  target += -log(sigma);               // Jeffreys prior
  // Likelihood
  int pos = 1;
  for (i in 1:C) {
    segment(Y, pos, s[i]) ~ normal((i == 1) ? theta[1] : (theta[1] + theta[i]), sigma);
    pos = pos + s[i];
  }
}"
stanmodel3M1 <- rstan::stan_model(model_code=stancode3M1)
Suppose \(a=3\) (three conditions).
A linear regression model without an intercept is \(Y_{ij}=\mu_1\cdot D_{1j}+\mu_2\cdot
D_{2j}+\mu_3\cdot D_{3j}+\epsilon_{ij}\),
where \(D_{ij}\) are dummy variables indicating the
condition for observation \(j\). \(\quad\mathcal{H}_0:\mu_1=\mu_2=\mu_3\) is
not a point-null hypothesis.
A linear regression model with an intercept is \(Y_{ij}=\beta_0+\beta_1\cdot x_{1j}+\beta_2\cdot x_{2j}+\epsilon_{ij}\). \(\quad\mathcal{H}_0:\beta_1=\beta_2=0\).
Recall the Wald test statistic and Eq. \(8^\prime\) where \(\boldsymbol{\theta}=(\beta_1,\ \beta_2)^\top\), \(\boldsymbol{\theta}_0=(0,\ 0)^\top\), and \(q=a-1=2\).
summary(fitA <- lm(weight ~ group, PlantGrowth))
##
## Call:
## lm(formula = weight ~ group, data = PlantGrowth)
##
## Residuals:
## Min 1Q Median 3Q Max
## -1.0710 -0.4180 -0.0060 0.2627 1.3690
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 5.0320 0.1971 25.527 <2e-16 ***
## grouptrt1 -0.3710 0.2788 -1.331 0.1944
## grouptrt2 0.4940 0.2788 1.772 0.0877 .
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 0.6234 on 27 degrees of freedom
## Multiple R-squared: 0.2641, Adjusted R-squared: 0.2096
## F-statistic: 4.846 on 2 and 27 DF, p-value: 0.01591
summary(fitB <- lm(weight ~ 0 + group, PlantGrowth)) # summary(lm(weight ~ group - 1, PlantGrowth)) # same
##
## Call:
## lm(formula = weight ~ 0 + group, data = PlantGrowth)
##
## Residuals:
## Min 1Q Median 3Q Max
## -1.0710 -0.4180 -0.0060 0.2627 1.3690
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## groupctrl 5.0320 0.1971 25.53 <2e-16 ***
## grouptrt1 4.6610 0.1971 23.64 <2e-16 ***
## grouptrt2 5.5260 0.1971 28.03 <2e-16 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 0.6234 on 27 degrees of freedom
## Multiple R-squared: 0.9867, Adjusted R-squared: 0.9852
## F-statistic: 665.5 on 3 and 27 DF, p-value: < 2.2e-16
vcov(fitA) # vcov(test) # same
## (Intercept) grouptrt1 grouptrt2
## (Intercept) 0.03885959 -0.03885959 -0.03885959
## grouptrt1 -0.03885959 0.07771919 0.03885959
## grouptrt2 -0.03885959 0.03885959 0.07771919
vcov(fitB)
## groupctrl grouptrt1 grouptrt2
## groupctrl 0.03885959 0.00000000 0.00000000
## grouptrt1 0.00000000 0.03885959 0.00000000
## grouptrt2 0.00000000 0.00000000 0.03885959
When we include an intercept (resp ~ grp), R uses “treatment contrasts”, which makes the design matrix columns non-orthogonal: the intercept column overlaps each treatment dummy. That induces nonzero covariances among the estimated coefficients, so we get a non-diagonal variance-covariance matrix.
When we exclude the intercept (resp ~ 0 + grp), each level of grp becomes its own indicator variable (i.e., exactly one column is “1” for each observation, all others are “0”), which makes the design matrix columns orthogonal. That orthogonality leads to zero covariances and hence a diagonal variance-covariance matrix.
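A quick check of both claims via the cross-products of the two design matrices (a small sketch using model.matrix on the fits above):
crossprod(model.matrix(fitB))  # diagonal: 10 observations per group, columns do not overlap
crossprod(model.matrix(fitA))  # non-diagonal: the intercept column overlaps the treatment dummies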
Effect Size
The most common measure of effect size for one-way ANOVA is the observed \(\eta^2=\frac{\textit{SS}_{\ \text{between}}}{\textit{SS}_{\ \text{total}}}\).
For example, the observed \(\eta^2=.9\) indicates that 90% of the total variance is accounted for by the treatment effect.
In practice, we set the standardized range of population means \(d\) (Cohen, 1988, p. 275-280).
\[\begin{align} d&=\frac{\mu_{\text{max}}-\mu_{\text{min}}}{\sigma_\epsilon} \\ \\ f&=\frac{\sigma_m}{\sigma_\epsilon},\quad\text{where}\quad\sigma_m^2=\frac{1}{a}\sum_{i=1}^a (\mu_i-\mu)^2=\frac{1}{a}\sum_{i=1}^a t_i^2 \qquad{\color{lightgray}{\text{not to be confused with }\mathbb{E}\left[\textit{MS}_{\ \text{between}}\right]=\sigma_\epsilon^2+\frac{1}{a-1}\sum_{i=1}^a n_i t_i^2}} \\ \\ f&=\begin{cases} \text{$\ d\sqrt{\frac{1}{2a}}$,} & \text{Minimum variability: One mean at each end of $d$, the remaining $a-2$ means all at the midpoint} \\ \\ \text{$\ \frac{d}{2}\sqrt{\frac{a+1}{3(a-1)}}$,} & \text{Intermediate variability: The $a$ means equally spaced over $d$} \\ \\ \text{$\ d\frac{\sqrt{a^2-1}}{2a}$,} & \text{Maximum variability: The means all at the end points of $d\qquad$ ($a$ is odd)} \\ \text{$\ \frac{d}{2}$,} & \text{Maximum variability: The means all at the end points of $d\qquad$ ($a$ is even)} \end{cases} \\ \\ \eta^2&=\frac{\sigma_m^2}{\sigma_\epsilon^2+\sigma_m^2}=\frac{f^2}{1+f^2} \end{align}\]
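For instance, with \(a=3\) equally spaced means (intermediate variability) and a standardized range of \(d=0.5\), a short sketch of the conversions (our own illustrative numbers):
a <- 3; d <- 0.5                            # assumed values for illustration
f <- d / 2 * sqrt((a + 1) / (3 * (a - 1)))  # Cohen's f ≈ 0.204
f^2 / (1 + f^2)                             # eta² = 0.04, i.e., 4% of the total variance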
Data
simData4 <- function(n, a=3, m1=0, delta=NULL, sd=1, power=NULL) {
  #' Input -
  #'   n: number of observations per balanced group
  #'   a: number of groups
  #'   m1: population mean of the first group (suppose the lowest)
  #'   delta: population mean difference, m2 - m1 = m3 - m2 = ···
  #'          intermediate variability - the `a` means are equally spaced
  #'   sd: population error standard deviation
  #'   power: statistical power;
  #'          need to input either `delta` or `power`
  #'
  #' Dependency - pwr (v1.3-0)
  #' Output - balanced between-subjects data in the long format
  if (is.null(delta) & !is.null(power)) {
    delta <- sd * 2 * pwr::pwr.anova.test(k=a, n=n, power=power)$f * sqrt(3 / (a^2 - 1))
  } else if (is.null(delta) & is.null(power)) {
    delta <- 0 # simulation under the null
  } else if (!is.null(delta) & !is.null(power)) {stop("Bad input!")}
  # no random seed
  data.frame("resp"=stats::rnorm(n * a, m1 + delta * (1:a - 1), sd),
             "grp"=rep(paste0("grp", 1:a), n),
             stringsAsFactors=T)
}
num <- 50
BF_fix <- BF_rand <- numeric(num)
for (i in 1:num) {
  set.seed(i)
  data <- simData4(n=30, delta=0.25)[sample(1:90,70),] # unbalanced design
  BF_fix[i] <- BayesFactor::extractBF(BayesFactor::anovaBF(resp ~ grp, data, progress=F))$bf
  BF_rand[i] <- BayesFactor::extractBF(BayesFactor::anovaBF(resp ~ grp, whichRandom="grp", data, progress=F))$bf
}
plot(BF_fix, BF_rand, xlim=range(c(BF_fix, BF_rand)), ylim=range(c(BF_fix, BF_rand)),
xlab=expression(italic(BF)[10]~" for Testing Fixed Effects"),
ylab=expression(italic(BF)[10]~" for Testing Random Effects"),
main=paste0("One-Way Analysis of Variance (", num, " Runs)"))
abline(0, 1, col="gray", lty=2, lwd=1.5)
The Bayes Factors
Note: All the Bayes factors are \(\textit{BF}_{01}\).
computeBFs4 <- function(data, zeta=-0.5, ...) {
  #' Input -
  #'   data: between-subjects data in the long format whose columns must be `resp` and `grp`
  #'   zeta: the adjustment of the shape parameters for Output 6
  #'   ...: more esoteric options, such as `bgridsize`, in ks::kde
  #'
  #' Global - stanmodel3M1
  #' Dependency - BayesFactor (v0.9.12-4.7), rstan (v2.32.6), ks (v1.14.3), mvtnorm (v1.3-1)
  #' Output - a vector of the Bayes factors in favor of the null
  #'   1. Savage–Dickey density ratio using the extended normal unit-information prior
  #'   2. anovaBF function
  #'   3. BIC approximation to the Bayes factor in Eq. 8; BIC(fit) ≡ AIC(fit, k=log(N))
  #'   4. Jeffreys approximate Bayes factor using the normal unit-information prior
  #'   5. Jeffreys approximate Bayes factor using the extended normal unit-information prior
  #'   6. Pearson type VI Bayes factor based on the F-ratio and degrees of freedom in Eq. 12
  #'   7. Test statistic Bayes factor based on the F-ratio and degrees of freedom in Eq. 14
  data <- stats::na.omit(data)
  grp_sizes <- unname(table(data$grp))        # group sizes (can take in imbalance)
  N <- sum(grp_sizes)                         # total number of observations
  J <- length(grp_sizes)                      # number of groups
  fit0 <- stats::aov(resp ~ 1, data)          # null model fit
  fit1 <- stats::aov(resp ~ grp, data)        # full model fit
  pVal <- summary(fit1)[[1]]["grp", "Pr(>F)"] # p-value in one-way ANOVA
  FVal <- stats::qf(1-pVal, J-1, N-J)         # F-ratio, ["grp", "F value"]
  datalist <- list(N=N, C=J, Y=data$resp[order(data$grp, data$resp)], s=grp_sizes,
                   mVec=stats::coef(fit1), covMat=stats::vcov(fit1)*(N^(1/(J-1)))) # <==
  stanfitM1 <- rstan::sampling(stanmodel3M1, data=datalist,
                               iter=15000, warmup=5000, chains=2, seed=277, refresh=0)
  betaS <- rstan::extract(stanfitM1, pars="theta")$theta[,-1]
  dPost <- predict(ks::kde(betaS, ...), x=rep(0, J-1)) # estimated posterior density
  dPrior <- ifelse(J==2,
                   stats::dnorm(0, datalist$mVec[-1], sqrt(datalist$covMat[-1,-1])),
                   mvtnorm::dmvnorm(rep(0, J-1), datalist$mVec[-1], datalist$covMat[-1,-1]))
  SDdr01 <- dPost / dPrior # Savage–Dickey density ratio
  bf <- BayesFactor::anovaBF(resp ~ grp, data, progress=F)
  BF01 <- ifelse(BayesFactor::extractBF(bf)$error < 0.01, # control Monte Carlo errors
                 1 / BayesFactor::extractBF(bf)$bf, NA)
  BICB01 <- exp((stats::BIC(fit1) - stats::BIC(fit0)) / 2) # equiv. AIC(fit, k=log(N)); Eq. 8
  JAB01_UI <- N^((J-1)/2) * exp(-0.5 * FVal * (J-1)) # Eq. 8' & 9, unit-information prior
  JAB01_EXT <- sqrt(N) * exp(-0.5 * FVal * (J-1) * (N^(1/(J-1))-1) / (N^(1/(J-1)))) # extended
  PBF01 <- ifelse(N >= 345, # Eq. 12
                  (N/2)^((J-1)/2) * gamma(1+zeta) *
                    ((N-J) / (N-J+(J-1)*FVal))^((N-J)/2-1-zeta) / gamma((J-1)/2+1+zeta), # in case of Inf/Inf
                  gamma((N-1)/2) * gamma(1+zeta) *
                    ((N-J) / (N-J+(J-1)*FVal))^((N-J)/2-1-zeta) / (gamma((J-1)/2+1+zeta) * gamma((N-J)/2)))
  TSBF01 <- ifelse(FVal == Inf,
                   0, # in case of 0*Inf
                   (((N-J)/(J-1)+1) / ((N-J)/(J-1)+FVal))^((N-1)/2) * (FVal^((J-1)/2))) # Eq. 14
  c("SD"=SDdr01, "anovaBF"=BF01, "BIC_approx"=BICB01,
    "JAB_OG"=JAB01_UI, "JAB"=JAB01_EXT, "PBF"=PBF01, "TSBF"=ifelse(FVal>1, TSBF01, 1))
}
set.seed(277)
test4 <- simData4(n=30, a=5, delta=0.1)
round(computeBFs4(test4), 2) # <== HOW COME?
## Warning in kdde.binned.nd(H = H, deriv.order = r, bin.par = bin.par, verbose =
## verbose, : Binning grid may be too coarse for current (small) bandwidth:
## consider increasing grid size for dimensions 1, 2, 3, 4
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 266.73 9.90 2668.65 2781.71 2.75 917.72 1.00
Simulation Study
We aim to conduct several simulation studies to evaluate the accuracy of approximate objective Bayes factors in their ability to convert \(p\)-values and \(n\) reported in scientific articles to Bayesian measures of evidence.
\(n_1=n_2=n_3=n\)
\(\mu_3-\mu_2=\mu_2-\mu_1=\sigma_\epsilon\cdot\delta\)
nSim <- 1000 # number of simulation runs for each setting
nSubj <- c(10, 20, 30, 100, 500) # number of observations per balanced group
SMD <- c("Null"=0, "Small"=0.1, "Medium"=0.25, "Large"=0.4) # (standardized) mean differences, m2-m1, given sd=1
reportBF <- reportBF_avg <- reportBF_sd <- reportERR <- reportERR_avg <- reportERR_sd <-
  reportRULE <- reportRULE_H1 <- reportRULE_H0 <- reportRULE_inc <-
  setNames(vector(mode="list", length=length(nSubj) * length(SMD)), # pre-allocation
           apply(expand.grid(names(SMD), nSubj), 1, function(r) paste0("n = ", r[2], ", delta = ", r[1])))
index <- 1 # initial list index
for (n in nSubj) {
  for (delta in SMD) {
    set.seed(n+delta)
    temp <- t(replicate(nSim, computeBFs4(simData4(n, delta=delta)))) # replicate the Bayes factors
    reportBF[[index]] <- temp
    tempRULE_H1 <- temp < 1/3               # support H1
    tempRULE_H0 <- temp > 3                 # support H0
    tempRULE_inc <- temp >= 1/3 & temp <= 3 # inconclusive
    # counts of matching decisions for H1
    reportRULE_H1[[index]] <- colSums(tempRULE_H1 * tempRULE_H1[,1], na.rm=T)
    # counts of matching decisions for H0
    reportRULE_H0[[index]] <- colSums(tempRULE_H0 * tempRULE_H0[,1], na.rm=T)
    # counts of matching inconclusiveness
    reportRULE_inc[[index]] <- colSums(tempRULE_inc * tempRULE_inc[,1], na.rm=T)
    tempRULE <- 1*(temp < 1/3) - 1*(temp > 3) # 1: support H1; 0: inconclusive; -1: support H0
    reportRULE[[index]] <- colSums(tempRULE[,1] == tempRULE, na.rm=T) # counts of matching decisions
    # colMeans(); a value near 1 suggests matching results; the first column should be all 1
    temp[temp[,1]==0,1] <- NA # extremely small SD₀₁ values were rounded down to 0 <==
    reportBF_avg[[index]] <- colMeans(temp, na.rm=T)    # mean Bayes factors in favor of the null
    reportBF_sd[[index]] <- apply(na.omit(temp), 2, sd) # sample standard deviations of them
    tempERR <- 100* (temp - temp[,1]) / temp[,1] # percent errors; the first column should be all 0
    reportERR[[index]] <- tempERR # (can take in NA)
    reportERR_avg[[index]] <- colMeans(tempERR, na.rm=T)       # mean percent errors
    reportERR_sd[[index]] <- apply(na.omit(tempERR), 2, sd)    # sample standard deviations of the percent errors
    index <- index + 1
  }
} # roughly 6.9 hours of run time
reshapeW2L3 <- function(list, prob=T, lab=c("SD", "BF", "BIC", "JAB_OG", "JAB", "PBF", "TSBF")) {
  #' Input -
  #'   list: a list of size length(nSubj) * length(SMD);
  #'         reportERR (unaggregated),
  #'         reportRULE, reportRULE_H1, reportRULE_H0, or reportRULE_inc (aggregated)
  #'   prob: logical; if TRUE (default), return the proportions;
  #'         otherwise, return the original values
  #'   lab: a vector of characters representing the Bayes factor methods
  #'
  #' Global -
  #'   nSim: number of simulation runs for each setting
  #'   nSubj: number of observations per balanced group
  #'   SMD: (standardized) mean differences, m2 - m1, given sd = 1
  #'
  #' Output - unlist the report and reshape the wide into the long
  len <- length(lab)             # seven methods
  nRep <- ifelse(!prob, nSim, 1) # nSim if unaggregated; 1 if aggregated
  long <- data.frame("value"=unname(unlist(list)),
                     "method"=factor(rep(rep(1:len, each=nRep), length(list)), labels=lab),
                     "n"=rep(nSubj, each=nRep*len*length(SMD)),
                     "delta"=rep(rep(unname(SMD), each=nRep*len), length(nSubj)))
  if (prob) {
    list2 <- lapply(list, function(vec) {
      if (vec[1] == 0) {
        vec * NA
      } else {
        vec / vec[1] # convert counts to proportions
      }
    })
    list3 <- lapply(list, function(vec) rep(vec[1], length(vec))) # denominator
    long$total <- unname(unlist(list3))
    long$prop <- unname(unlist(list2))
  }
  long
}
formatting2 <- function(val) {
  #' format the value using scientific notation
  if (is.na(val)) {
    "'No entries.'"
  } else if (abs(val) >= 10) {
    paste("~italic(BF)[\"01\"] %~~%",
          gsub("e\\+0*(\\d+)", "%*%10^\\1", sprintf("%.2e", val)))
  } else if (abs(val) < 1) {
    paste("~italic(BF)[\"01\"] %~~%",
          gsub("e-0*(\\d+)", "%*%10^{-\\1}", sprintf("%.2e", val)))
  } else {
    paste("~italic(BF)[\"01\"] %~~%", sprintf("%.2f", val))
  }
}
Visualization
Note: The gold standard method, SD, is the Savage–Dickey
density ratio, which assumes the extended normal unit-information prior
\(\boldsymbol{\theta}\sim\boldsymbol{\mathcal{N}}_{a-1}\!\left(\hat{\boldsymbol{\theta}},\
N^{\frac{1}{a-1}}\cdot
I^{-1}(\hat{\boldsymbol{\theta}})\right)\), just as
JAB does. Meanwhile, JAB_OG assumes \(\boldsymbol{\theta}\sim\boldsymbol{\mathcal{N}}_{a-1}\!\left(\hat{\boldsymbol{\theta}},\
N\cdot I^{-1}(\hat{\boldsymbol{\theta}})\right)\).
The archived version 0.30.9000 did not implement the extended JAB; in that version, both SD and JAB used the original normal unit-information prior.
df4_BF <- reshapeW2L3(reportBF, prob=F)
# boxplot of the BF₀₁
for (smd in SMD) {
sub <- subset(df4_BF, delta == smd) # subset the long
print(ggplot(sub, aes(x=method, y=value)) +
geom_boxplot(alpha=0.3, na.rm=T) +
facet_wrap(.~n, scales="free_y", nrow=1,
labeller=label_bquote(cols=italic(n)==.(n))) +
labs(x="\nBayes Factor Methods", y=expression(italic("BF")["01"]),
caption="One-Way Between-Subjects Design with Three Groups") +
ggtitle(paste0(names(SMD[SMD==smd]), " Standardized Mean Difference")) +
theme_classic() +
theme(axis.text.x=element_text(angle=90, hjust=0.9)))
cat("<br><br><br><br><br>")
}
# percent errors
df4 <- reshapeW2L3(reportERR, prob=F)
index <- 0
# boxplot of the percent errors
for (smd in SMD) {
baseBF01 <- sapply(reportBF_avg[seq(1, by=length(SMD), length.out=length(nSubj)) + index],
function(x) x[1]) # baseline is the Savage–Dickey density ratio
baseBF01_lab <- sapply(baseBF01, formatting2) # scientific notation
index <- index + 1
sub <- subset(df4, method != "SD" & delta == smd) # subset the long
anno <- data.frame("label"=baseBF01_lab, "n"=unique(sub$n),
"x"=3.5, "y"=Inf) # for annotation use
print(ggplot(sub, aes(x=method, y=value)) +
geom_boxplot(alpha=0.3, na.rm=T) +
geom_text(data=anno, aes(label=label, x=x, y=y), vjust=10, parse=T, col="red") +
facet_wrap(.~n, scales="free_y", nrow=1,
labeller=label_bquote(cols=italic(n)==.(n))) +
labs(x="\nBayes Factor Methods", y="Percent Error (%)\n",
caption="One-Way Between-Subjects Design with Three Groups") +
ggtitle(paste0(names(SMD[SMD==smd]), " Standardized Mean Difference")) +
geom_hline(yintercept=0, linetype="dashed", color="gray") + # reference line 0%
theme_classic() +
theme(axis.text.x=element_text(angle=90, hjust=0.9)))
cat("<br><br><br><br><br><br><br>")
}
Note: For large samples and effect sizes, the evidence against the null hypothesis is substantial. Consequently, R has rounded the negligible Savage–Dickey density ratio \(\textit{BF}_{01}\) down to 0, leading to no available entries for calculating the percent error.
# decisions
decis4 <- rbind(reshapeW2L3(reportRULE),
reshapeW2L3(reportRULE_H1),
reshapeW2L3(reportRULE_H0),
reshapeW2L3(reportRULE_inc))
decis4$result <- factor(rep(c("Overall", "BF01 < 1/3", "BF01 > 3", "[1/3, 3]"),
each=length(levels(df4$method)) * length(nSubj) * length(SMD)),
levels=c("Overall", "BF01 < 1/3", "BF01 > 3", "[1/3, 3]"))
decis4$delta <- paste0("ES = ", decis4$delta)
decis4 <- decis4[!is.na(decis4[,"prop"]),]
if (F) {
  # compute the pointwise confidence intervals using the normal approximation
  decis4$half.len <- qnorm(1-.05/2) *
    sqrt(decis4$value * (decis4$total - decis4$value) / decis4$total^3)  # 95% CI width
  decis4$lwr <- pmax(0, decis4$value / decis4$total - decis4$half.len)  # 95% CI lower bound (pmax, not max, for elementwise truncation)
  decis4$upr <- pmin(1, decis4$value / decis4$total + decis4$half.len)  # 95% CI upper bound (pmin, not min)
} else { # Jeffreys intervals
  decis4$lwr <- qbeta(.05/2, decis4$value, decis4$total - decis4$value + 1)   # 95% CI lower bound
  decis4$upr <- qbeta(1-.05/2, decis4$value + 1, decis4$total - decis4$value) # 95% CI upper bound
  decis4$half.len <- (decis4$upr - decis4$lwr) / 2                            # 95% CI width
}
# line chart of the proportions of agreement
ggplotly(ggplot(decis4, aes(n, prop, color=method, label=value)) +
geom_line(linewidth=1.2, na.rm=T) + geom_point(size=2, na.rm=T) +
geom_ribbon(aes(ymin=lwr, ymax=upr), alpha=.2) +
scale_color_manual(values=c("black", "gray", "#E69F00", "#D55E00", "#CC79A7", "#56B4E9", "#EF4135"),
labels=c("SD", "BF", "BIC", "JAB_OG", "JAB", "PBF", "TSBF")) +
facet_grid(result~delta) +
labs(x="Sample Size per Balanced Group", y="Proportion of Agreement", color="Method") +
scale_x_continuous(breaks=c(10, 100, 500)) +
scale_y_continuous(breaks=c(0, 0.5, 1)) + # min(decis4$prop, na.rm=T)
theme_minimal()) # interactive plots
Since all the methods have analytic or easily computed solutions for \(t\)-tests and balanced one-way ANOVA, simulation studies may not be strictly necessary; we can instead compare the methods directly over a range of \(F\)-ratios, as below.
BF10 <- function(F_ratio) {
  h <- 0.5 # prior scale on the variability of fixed effects
  coef <- (2*pi)^(-0.5) * h
  integrand <- function(g) {
    K <- 1 + n*g / (1 + F_ratio * (a-1)/(a*n-a))
    (1+n*g)^((a*n-a)/2) * K^((1-a*n)/2) * g^(-1.5) * exp(-h^2/(2*g))
  }
  unname(coef * integrate(integrand, lower=0, upper=Inf)$value)
}
BIC01 <- function(F_ratio) {
  N <- a * n         # total number of observations
  df1 <- a - 1       # condition (between-groups) degrees of freedom
  df2 <- a * (n - 1) # error (within-groups) degrees of freedom
  sqrt(N^df1 * (1 + F_ratio * df1 / df2)^(-N))
}
JAB01_OG <- function(F_ratio) {
  N <- a * n # total number of observations
  N^((a-1)/2) * exp(-0.5 * F_ratio * (a-1))
}
JAB01 <- function(F_ratio) {
  N <- a * n # total number of observations
  sqrt(N) * exp(-0.5 * F_ratio * (a-1) * (N^(1/(a-1))-1) / (N^(1/(a-1)))) # extended JAB
}
PBF01 <- function(F_ratio) {
  N <- a * n   # total number of observations
  zeta <- -0.5 # adjustment of the shape parameters
  ifelse(N >= 345,
         (N/2)^((a-1)/2) * gamma(1+zeta) *
           ((N-a) / (N-a+(a-1)*F_ratio))^((N-a)/2-1-zeta) / gamma((a-1)/2+1+zeta), # in case of Inf/Inf
         gamma((N-1)/2) * gamma(1+zeta) *
           ((N-a) / (N-a+(a-1)*F_ratio))^((N-a)/2-1-zeta) / (gamma((a-1)/2+1+zeta) * gamma((N-a)/2)))
}
TSBF01 <- function(F_ratio) {
  # Johnson (2005), https://doi.org/10.1111/j.1467-9868.2005.00521.x
  df1 <- a - 1       # condition (between-groups) degrees of freedom
  df2 <- a * (n - 1) # error (within-groups) degrees of freedom
  ifelse(F_ratio > 1,
         ((df2/df1+1) / (df2/df1+F_ratio))^((df1+df2)/2) * (F_ratio^(df1/2)),
         1)
}
# CAUTION: not all `n` and `a` will cause the integrand to converge
n <- 30 # number of observations per balanced group
a <- 3 # number of groups
Fs <- qf(c(.9, .05, .01, .005), a - 1, a * (n - 1), lower.tail=F) # critical values of F
x <- seq(Fs[1], Fs[4], by=0.1) # input sequence
num <- length(x)
bf <- bic <- jab_og <- jab <- pbf <- tsbf <- numeric(num)
for (i in 1:num) {
  bf[i] <- BF10(x[i])         # JZS Bayes factor
  bic[i] <- BIC01(x[i])       # BIC approximation
  jab_og[i] <- JAB01_OG(x[i]) # Jeffreys approximate Bayes factor
  jab[i] <- JAB01(x[i])       # Extended Jeffreys approximate Bayes factor
  pbf[i] <- PBF01(x[i])       # Pearson Bayes factor
  tsbf[i] <- TSBF01(x[i])     # Test-based Bayes factor
}
plot(x, bf, ylab=expression(italic(BF)[10]), xlab=expression(italic(F) * "-Ratio"),
main="Balanced One-Way Analysis of Variance", sub=bquote(italic(n)==.(n)~~"&"~~italic(a)==.(a)))
points(x, 1/bic, col="#E69F00", pch=8)
points(x, 1/jab_og, col="#D55E00", pch=13)
points(x, 1/jab, col="#CC79A7", pch=18)
points(x, 1/pbf, col="#56B4E9", pch=17)
points(x, 1/tsbf, col="#EF4135", pch=3)
abline(h=c(1/3, 3), col="gray")
abline(v=c(Fs[2], Fs[3]), col="red")
legend("topleft", c("anovaBF", "BIC approx", "JAB_OG", "JAB", "PBF", "TSBF"),
col=c("black", "#E69F00", "#D55E00", "#CC79A7", "#56B4E9", "#EF4135"),
pch=c(1, 8, 13, 18, 17, 3), lty=0, lwd=2, bty="n")
plot(x, 1/bf, ylab=expression(italic(BF)["01"]), xlab=expression(italic(F) * "-Ratio"),
main="Balanced One-Way Analysis of Variance", sub=bquote(italic(n)==.(n)~~"&"~~italic(a)==.(a)))
points(x, bic, col="#E69F00", pch=8)
points(x, jab_og, col="#D55E00", pch=13)
points(x, jab, col="#CC79A7", pch=18)
points(x, pbf, col="#56B4E9", pch=17)
points(x, tsbf, col="#EF4135", pch=3)
abline(h=c(1/3, 3), col="gray")
abline(v=c(Fs[2], Fs[3]), col="red")
legend("topright", c("anovaBF", "BIC approx", "JAB_OG", "JAB", "PBF", "TSBF"),
col=c("black", "#E69F00", "#D55E00", "#CC79A7", "#56B4E9", "#EF4135"),
pch=c(1, 8, 13, 18, 17, 3), lty=0, lwd=2, bty="n")
Debugging
(mat <- matrix(rep(c(0, -1), 5), nrow=2))
## [,1] [,2] [,3] [,4] [,5]
## [1,] 0 0 0 0 0
## [2,] -1 -1 -1 -1 -1
(res <- mat[mat[,1] == -1, 1] == mat[mat[,1] == -1,]) # only one row ==> column vector
## [1] TRUE TRUE TRUE TRUE TRUE
colSums(res) # ERROR!
## Error in base::colSums(x, na.rm = na.rm, dims = dims, ...): 'x' must be an array of at least two dimensions
colSums(rbind(rep(F, 5), res)) # counts of matching decisions
## [1] 1 1 1 1 1
colSums((mat == -1) * (mat == -1)[,1]) # counts of matching decisions
## [1] 1 1 1 1 1
The counts of matching Bayes factors between the gold standard and approximate methods.
reportRULE
## $`n = 10, delta = Null`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 1000 922 570 568 951 542 489
##
## $`n = 10, delta = Small`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 1000 936 557 559 947 522 525
##
## $`n = 10, delta = Medium`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 1000 879 490 488 936 429 633
##
## $`n = 10, delta = Large`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 1000 833 481 482 931 387 752
##
## $`n = 20, delta = Null`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 1000 931 672 672 972 666 348
##
## $`n = 20, delta = Small`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 1000 909 631 632 965 618 402
##
## $`n = 20, delta = Medium`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 1000 878 444 455 952 421 658
##
## $`n = 20, delta = Large`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 1000 831 431 450 953 382 868
##
## $`n = 30, delta = Null`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 1000 909 724 725 969 718 284
##
## $`n = 30, delta = Small`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 1000 889 618 623 973 613 394
##
## $`n = 30, delta = Medium`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 1000 834 376 390 936 344 719
##
## $`n = 30, delta = Large`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 1000 871 491 517 957 456 940
##
## $`n = 100, delta = Null`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 1000 922 855 855 975 855 125
##
## $`n = 100, delta = Small`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 1000 808 578 580 948 577 366
##
## $`n = 100, delta = Medium`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 1000 847 484 493 968 478 905
##
## $`n = 100, delta = Large`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 1000 998 988 988 999 983 999
##
## $`n = 500, delta = Null`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 1000 937 917 917 990 917 48
##
## $`n = 500, delta = Small`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 1000 713 348 350 955 346 665
##
## $`n = 500, delta = Medium`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 1000 1000 1000 1000 1000 1000 1000
##
## $`n = 500, delta = Large`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 1000 1000 1000 1000 1000 1000 1000
reportRULE_inc
## $`n = 10, delta = Null`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 442 434 57 52 425 38 442
##
## $`n = 10, delta = Small`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 470 458 74 69 446 49 470
##
## $`n = 10, delta = Medium`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 511 501 107 98 495 70 510
##
## $`n = 10, delta = Large`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 473 472 141 128 464 91 473
##
## $`n = 20, delta = Null`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 307 262 15 15 292 12 304
##
## $`n = 20, delta = Small`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 332 281 22 22 316 14 327
##
## $`n = 20, delta = Medium`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 456 403 39 39 433 27 448
##
## $`n = 20, delta = Large`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 352 322 56 55 345 35 344
##
## $`n = 30, delta = Null`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 255 175 5 5 235 2 250
##
## $`n = 30, delta = Small`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 356 270 9 12 339 4 343
##
## $`n = 30, delta = Medium`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 434 362 25 26 408 14 405
##
## $`n = 30, delta = Large`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 235 207 14 18 225 5 213
##
## $`n = 100, delta = Null`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 138 64 0 0 123 0 117
##
## $`n = 100, delta = Small`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 320 191 0 0 296 0 253
##
## $`n = 100, delta = Medium`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 185 136 0 0 177 0 110
##
## $`n = 100, delta = Large`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 1 1 0 0 0 0 0
##
## $`n = 500, delta = Null`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 79 17 0 0 75 0 44
##
## $`n = 500, delta = Small`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 262 130 0 0 242 0 66
##
## $`n = 500, delta = Medium`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 0 0 0 0 0 0 0
##
## $`n = 500, delta = Large`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 0 0 0 0 0 0 0
reportRULE_H1
## $`n = 10, delta = Null`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 65 23 20 23 51 11 47
##
## $`n = 10, delta = Small`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 74 35 27 34 57 17 55
##
## $`n = 10, delta = Medium`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 170 76 64 71 133 40 123
##
## $`n = 10, delta = Large`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 359 203 172 186 305 128 279
##
## $`n = 20, delta = Null`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 46 24 10 10 39 7 44
##
## $`n = 20, delta = Small`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 79 39 20 21 66 15 75
##
## $`n = 20, delta = Medium`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 221 152 82 93 198 71 210
##
## $`n = 20, delta = Large`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 545 406 272 292 506 244 524
##
## $`n = 30, delta = Null`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 34 23 8 9 31 5 34
##
## $`n = 30, delta = Small`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 51 26 16 18 45 16 51
##
## $`n = 30, delta = Medium`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 323 229 108 121 291 87 314
##
## $`n = 30, delta = Large`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 730 629 442 464 697 416 727
##
## $`n = 100, delta = Null`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 8 4 1 1 8 1 8
##
## $`n = 100, delta = Small`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 113 50 11 13 91 10 113
##
## $`n = 100, delta = Medium`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 795 691 464 473 771 458 795
##
## $`n = 100, delta = Large`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 999 997 988 988 999 983 999
##
## $`n = 500, delta = Null`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 4 3 0 0 4 0 4
##
## $`n = 500, delta = Small`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 599 444 209 211 575 207 599
##
## $`n = 500, delta = Medium`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 1000 1000 1000 1000 1000 1000 1000
##
## $`n = 500, delta = Large`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 1000 1000 1000 1000 1000 1000 1000
reportRULE_H0
## $`n = 10, delta = Null`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 493 465 493 493 475 493 0
##
## $`n = 10, delta = Small`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 456 443 456 456 444 456 0
##
## $`n = 10, delta = Medium`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 319 302 319 319 308 319 0
##
## $`n = 10, delta = Large`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 168 158 168 168 162 168 0
##
## $`n = 20, delta = Null`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 647 645 647 647 641 647 0
##
## $`n = 20, delta = Small`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 589 589 589 589 583 589 0
##
## $`n = 20, delta = Medium`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 323 323 323 323 321 323 0
##
## $`n = 20, delta = Large`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 103 103 103 103 102 103 0
##
## $`n = 30, delta = Null`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 711 711 711 711 703 711 0
##
## $`n = 30, delta = Small`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 593 593 593 593 589 593 0
##
## $`n = 30, delta = Medium`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 243 243 243 243 237 243 0
##
## $`n = 30, delta = Large`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 35 35 35 35 35 35 0
##
## $`n = 100, delta = Null`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 854 854 854 854 844 854 0
##
## $`n = 100, delta = Small`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 567 567 567 567 561 567 0
##
## $`n = 100, delta = Medium`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 20 20 20 20 20 20 0
##
## $`n = 100, delta = Large`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 0 0 0 0 0 0 0
##
## $`n = 500, delta = Null`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 917 917 917 917 911 917 0
##
## $`n = 500, delta = Small`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 139 139 139 139 138 139 0
##
## $`n = 500, delta = Medium`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 0 0 0 0 0 0 0
##
## $`n = 500, delta = Large`
## SD anovaBF BIC_approx JAB_OG JAB PBF TSBF
## 0 0 0 0 0 0 0
Using a power transformation, it can be shown that \(\textit{JAB_OG}_{01}^{\ *}=\textit{JAB_OG}_{01}^{\frac{N-1}{N}}\cdot N^{\frac{a-1}{2N}}\).
\(n_1=n_2=n_3=10\)
par(mfrow=c(1, length(SMD)))
for (i in 1:length(SMD)) {
  reportBF[[i]][,5] <- reportBF[[i]][,4]^(29/30) * 30^(2/60) # total number of observations is 10 * 3
  boxplot(reportBF[[i]][,c(1,4:5)], names=c("SD", "JAB_OG", "JAB_OG*"),
          xlab="", ylab=expression(italic("BF")["01"]), main=paste0(names(SMD)[i], " SMD"), las=2)
}
We are likely to encounter issues with unreliable multivariate density estimation when \(a>3\).
A linear regression model without an intercept does not test a point-null hypothesis, but rather \(\mathcal{H}_0:\mu_1=\dotsb=\mu_a\).
The number of parameters is \(a\), not \(a-1\). Thus, \(\boldsymbol{\mu}\sim\boldsymbol{\mathcal{N}}_a\!\left(\hat{\boldsymbol{\mu}},\ N^{\frac{1}{a}}\cdot I^{-1}(\hat{\boldsymbol{\mu}})\right)\)
In general, we do not know the hypothesized population means \(\mu_1=\dotsb=\mu_a=\phi\), unless they are set in simulation studies.
Based on our data-generating function simData4,
\(\phi\) is the mean of an arithmetic
series, i.e., m1 + (a-1) × delta
/ 2.
It is also noted that the inverse of the Fisher information matrix is a diagonal matrix, due to the orthogonality of the design matrix.
Danger Zone:
Use MCMC methods (e.g., Stan's No-U-Turn sampler) to estimate parameters that are a priori independent of each other.
The joint posterior density, based on posterior samples of the parameters, is not a product of independent univariate densities of specific forms, but a multivariate density that exhibits dependency.
Now, we consider \(a=5\) (five conditions).
stancode3M2 <- "
data {
  int<lower=1> N;       // total number of subjects
  int<lower=2> C;       // number of conditions
  vector[N] Y;          // responses (ready for the ragged data structure)
  // int s[C];          // condition sizes (i.e., number of subjects in each condition) [removed features]
  array[C] int s;       // condition sizes [Stan 2.26+ syntax for array declarations]
  vector[C] mVec;       // mean vector for the multivariate normal prior on theta
  matrix[C,C] covMat;   // covariance matrix for the multivariate normal prior on theta
}
parameters {
  vector[C] theta;      // condition means
  real<lower=0> sigma;  // error standard deviation
}
model {
  // Priors
  theta ~ multi_normal(mVec, covMat);  // unit information
  target += -log(sigma);               // Jeffreys prior
  // Likelihood
  int pos = 1;
  for (i in 1:C) {
    segment(Y, pos, s[i]) ~ normal(theta[i], sigma);
    pos = pos + s[i];
  }
}"
stanmodel3M2 <- rstan::stan_model(model_code=stancode3M2)
n <- 30; a <- 5; m1 <- 0; delta <- 0.1; phi <- m1 + (a-1) * delta / 2
# set.seed(277); test4 <- simData4(n, a, m1, delta) # cf. the "2. Simulation" section
fit_a5 <- lm(resp ~ 0 + grp, test4)
datalist2 <- list(N=n*a, C=a, Y=test4$resp[order(test4$grp, test4$resp)], s=rep(n, a),
mVec=coef(fit_a5), covMat=vcov(fit_a5)*((n*a)^(1/a)))
stanfitM2 <- rstan::sampling(stanmodel3M2, data=datalist2,
iter=15000, warmup=5000, chains=2, seed=277, refresh=0)
muS <- rstan::extract(stanfitM2, pars="theta")$theta
dPost <- prod(apply(muS, 2, function(col) polspline::dlogspline(phi, polspline::logspline(col))))
dPrior <- mvtnorm::dmvnorm(rep(phi, a), datalist2$mVec, datalist2$covMat)
c("Numerator"=dPost, "Denominator"=dPrior, "BF₀₁"=dPost / dPrior) # Savage–Dickey density ratio
## Numerator Denominator BF₀₁
## 6.285143 1.899381 3.309048
alterBFs4 <- function(data, phi, zeta=-0.5) {
#' Input -
#' data: between-subjects data in the long format whose columns must be `resp` and `grp`
#' phi: hypothesized common value of the population means (known in simulation studies)
#' zeta: the adjustment of the shape parameters for Output 6
#'
#' Global - stanmodel3M2
#' Dependency - BayesFactor (v0.9.12-4.7), rstan (v2.32.6), polspline (v1.1.25), mvtnorm (v1.3-1)
#' Output - a vector of the Bayes factors in favor of the null
#' 1. Savage–Dickey density ratio using the normal unit-information prior
#' 2. anovaBF function
#' 3. BIC approximation to the Bayes factor in Eq. 8; BIC(fit) ≡ AIC(fit, k=log(N))
#' 4. Jeffreys approximate Bayes factor using the normal unit-information prior
#' 5. Jeffreys approximate Bayes factor using the extended normal unit-information prior
#' 6. Pearson type VI Bayes factor based on the F-ratio and degrees of freedom in Eq. 12
#' 7. Test statistic Bayes factor based on the F-ratio and degrees of freedom in Eq. 14
data <- stats::na.omit(data)
grp_sizes <- unname(table(data$grp)) # group sizes (can take in imbalance)
N <- sum(grp_sizes) # total number of observations
J <- length(grp_sizes) # number of groups
fit0 <- stats::aov(resp ~ 1, data) # null model fit
fit1 <- stats::aov(resp ~ grp, data) # full model fit
pVal <- summary(fit1)[[1]]["grp", "Pr(>F)"] # p-value in one-way ANOVA
FVal <- stats::qf(1-pVal, J-1, N-J) # F-ratio recovered from the p-value; equals summary(fit1)[[1]]["grp", "F value"]
fitM <- stats::lm(resp ~ 0 + grp, data) # without an intercept
datalist2 <- list(N=N, C=J, Y=data$resp[order(data$grp, data$resp)], s=grp_sizes,
mVec=stats::coef(fitM), covMat=stats::vcov(fitM)*(N^(1/J))) # <==
stanfitM2 <- rstan::sampling(stanmodel3M2, data=datalist2,
iter=15000, warmup=5000, chains=2, seed=277, refresh=0)
muS <- rstan::extract(stanfitM2, pars="theta")$theta
dPost <- prod(apply(muS, 2, function(col) polspline::dlogspline(phi, polspline::logspline(col))))
dPrior <- mvtnorm::dmvnorm(rep(phi, J), datalist2$mVec, datalist2$covMat)
SDdr01 <- dPost / dPrior # Savage–Dickey density ratio
bf <- BayesFactor::anovaBF(resp ~ grp, data, progress=F)
BF01 <- ifelse(BayesFactor::extractBF(bf)$error < 0.01, # control Monte Carlo errors
1 / BayesFactor::extractBF(bf)$bf, NA)
BICB01 <- exp((stats::BIC(fit1) - stats::BIC(fit0)) / 2) # equiv. AIC(fit, k=log(N)); Eq. 8
JAB01_UI <- N^((J-1)/2) * exp(-0.5 * FVal * (J-1)) # Eq. 8' & 9, unit-information prior
JAB01_EXT <- sqrt(N) * exp(-0.5 * FVal * (J-1) * (N^(1/(J-1))-1) / (N^(1/(J-1)))) # extended
PBF01 <- ifelse(N >= 345, # Eq. 12
(N/2)^((J-1)/2) * gamma(1+zeta) *
((N-J) / (N-J+(J-1)*FVal))^((N-J)/2-1-zeta) / gamma((J-1)/2+1+zeta), # in case of Inf/Inf
gamma((N-1)/2) * gamma(1+zeta) *
((N-J) / (N-J+(J-1)*FVal))^((N-J)/2-1-zeta) / (gamma((J-1)/2+1+zeta) * gamma((N-J)/2)))
TSBF01 <- ifelse(FVal == Inf,
0, # in case of 0*Inf
(((N-J)/(J-1)+1) / ((N-J)/(J-1)+FVal))^((N-1)/2) * (FVal^((J-1)/2))) # Eq. 14
c("SD"=SDdr01, "anovaBF"=BF01, "BIC_approx"=BICB01,
"JAB_OG"=JAB01_UI, "JAB"=JAB01_EXT, "PBF"=PBF01, "TSBF"=ifelse(FVal>1, TSBF01, 1))
}
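A hypothetical one-off call, mirroring the earlier \(a=5\) example:
# set.seed(277); alterBFs4(simData4(30, 5, 0, 0.1), phi=0 + (5-1) * 0.1 / 2)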
\(n_1=n_2=n_3=n_4=n_5=n\)
\(\mu_5-\mu_4=\mu_4-\mu_3=\mu_3-\mu_2=\mu_2-\mu_1=\sigma_\epsilon\cdot\delta\)
SMD <- c("Null"=0, "Small"=0.2, "Medium"=0.5, "Large"=0.8) / (a-1) # per-step delta so that mu_5 - mu_1 equals the named SMD
mu_H0 <- 0 + (a-1) * SMD / 2 # hypothesized population mean
reportBF <- reportBF_avg <- reportBF_sd <- reportERR <- reportERR_avg <- reportERR_sd <-
reportRULE <- reportRULE_H1 <- reportRULE_H0 <- reportRULE_inc <-
setNames(vector(mode="list", length=length(nSubj) * length(SMD)), # pre-allocation
apply(expand.grid(names(SMD), nSubj), 1, function(r) paste0("n = ", r[2], ", delta = ", r[1])))
index <- 1 # initial list index
for (n in nSubj) {
index2 <- 1 # index of `delta` in `SMD`
for (delta in SMD) {
set.seed(n+delta)
temp <- t(replicate(nSim, alterBFs4(simData4(n, a, delta=delta), mu_H0[index2])))
reportBF[[index]] <- temp
tempRULE_H1 <- temp < 1/3 # support H1
tempRULE_H0 <- temp > 3 # support H0
tempRULE_inc <- temp >= 1/3 & temp <= 3 # inconclusive
# counts of matching decisions for H1
reportRULE_H1[[index]] <- colSums(tempRULE_H1 * tempRULE_H1[,1], na.rm=T)
# counts of matching decisions for H0
reportRULE_H0[[index]] <- colSums(tempRULE_H0 * tempRULE_H0[,1], na.rm=T)
# counts of matching inconclusiveness
reportRULE_inc[[index]] <- colSums(tempRULE_inc * tempRULE_inc[,1], na.rm=T)
tempRULE <- 1*(temp < 1/3) - 1*(temp > 3) # 1: support H1; 0: inconclusive; -1: support H0
reportRULE[[index]] <- colSums(tempRULE[,1] == tempRULE, na.rm=T) # counts of matching decisions
# colMeans(); a value near 1 suggests matching results; the first column should be all 1
temp[temp[,1]==0,1] <- NA # extremely small SD₀₁ values were rounded down to 0 <==
reportBF_avg[[index]] <- colMeans(temp, na.rm=T) # mean Bayes factors in favor of the null
reportBF_sd[[index]] <- apply(na.omit(temp), 2, sd) # sample standard deviations of them
tempERR <- 100* (temp - temp[,1]) / temp[,1] # percent errors; the first column should be all 0
reportERR[[index]] <- tempERR # (can take in NA)
reportERR_avg[[index]] <- colMeans(tempERR, na.rm=T) # mean percent errors
reportERR_sd[[index]] <- apply(na.omit(tempERR), 2, sd) # sample standard deviations of the percent errors
index <- index + 1
index2 <- index2 + 1
}
} # roughly 6.3 hours of run time
# There were 50 or more warnings:
# 1: In polspline::logspline(col) : Not all models could be fitted
# 2: In polspline::logspline(col) : too much data close together
# 3: In polspline::logspline(col) : re-ran with oldlogspline
# ...
In a two-way ANOVA, testing \(\mathcal{H}_0:\alpha_i=0\) for all \(i\) versus \(\mathcal{H}_1:\) at least one \(\alpha_i\neq0\), we compare an additive model, \(\mathcal{M}_{\text{add}}\), to a model, \(\mathcal{M}_{\text{omitA}}\), that omits the factor being tested, here factor \(A\).
\[\begin{align} \mathcal{M}_{\text{full}}:\ Y_{ijk}&=\mu+\alpha_i+\beta_j+(\alpha\beta)_{ij}+\epsilon_{ijk} \\ \mathcal{M}_{\text{add}}:\ Y_{ijk}&=\mu+\alpha_i+\beta_j+\epsilon_{ijk} \\ \mathcal{M}_{\text{omitA}}:\ Y_{ijk}&=\mu+\beta_j+\epsilon_{ijk} \\ \mathcal{M}_{\text{omitB}}:\ Y_{ijk}&=\mu+\alpha_i+\epsilon_{ijk},\qquad\epsilon_{ijk}\overset{\text{i.i.d.}}{\sim}\mathcal{N}(0,\ \sigma_\epsilon^2) \\ &\qquad\text{for}\ i=1,\dotsb,I;\ j=1,\dotsb,J;\ \text{and (unbalanced)}\ k=1,\dotsb,n_{ij}\neq1 \end{align}\]
The sum-to-zero constraints require that \(\sum_{i=1}^I\alpha_i=0\), \(\sum_{j=1}^J\beta_j=0\), and \(\sum_{i=1}^I(\alpha\beta)_{ij}=\sum_{j=1}^J(\alpha\beta)_{ij}=0\).
The number of free parameters in \(\mathcal{M}_{\text{add}}\) is \(1+(I-1)+(J-1)+1=I+J\) (grand mean, both factors, and \(\sigma_\epsilon^2\)).
The number of free parameters in \(\mathcal{M}_{\text{omitA}}\) is \(1+(J-1)+1=J+1\).
Thus, the difference in free parameters is \(I-1\), the same as between the full and null models in a one-way ANOVA with \(I\) conditions.
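These counts can be read off the "df" attribute of logLik(), which includes \(\sigma_\epsilon^2\) (a sketch, assuming a data set such as test3x2 from the usage example below):
attr(logLik(aov(resp ~ A + B, test3x2)), "df") # I + J free parameters (here 3 + 2 = 5)
attr(logLik(aov(resp ~ B, test3x2)), "df") # J + 1 free parameters (here 2 + 1 = 3)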
The sample size in this case is \(N=\sum_{i=1}^I\sum_{j=1}^J n_{ij}\). If the design is balanced, then \(N=IJn\).
Fractional factorial designs are a subset of factorial designs in which only a fraction of all possible treatment combinations is run, i.e., \(n_{ij}=0\) for some \(i\) and \(j\).
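As a hypothetical illustration, a fractional data set can be produced by dropping one cell from a full factorial generated by simData2way (defined next); the grp_sizes[grp_sizes != 0] line in computeBFs2way below accommodates the empty cell:
# test_frac <- subset(simData2way(n=30, A=0.3, nA=3), !(A == "a3" & B == "b2")) # remove the (a3, b2) cell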
simData2way <- function(n, A=NULL, B=0.5, nA=2, nB=2, sd=1) {
#' Input -
#' n: number of observations per balanced condition
#' A: effect size of the first factor (of interest)
#' B: effect size of the second factor
#' nA: number of levels in the first factor
#' nB: number of levels in the second factor
#' sd: error standard deviation
#'
#' Output - two-way between-subjects data in the long format
if (nA + nB <= 2) stop("Bad input!") # reject the trivial 1 x 1 design; a one-way design (nA > 1, nB = 1) is allowed
C1 <- seq(-.5, .5, length.out=nA) # contrast coded
C2 <- seq(-.5, .5, length.out=nB)
M <- cbind(A, B) %*% t(expand.grid(C1, C2)) # cell means: a1b1, a2b1, ..., a1b2, a2b2, ...
# no interaction effect
# no random seed
data.frame("resp"=stats::rnorm(n * nA * nB, M, sd),
"A"=paste0("a", 1:nA),
"B"=rep(paste0("b", 1:nB), each=nA),
stringsAsFactors=T)
}
stancodelm <- "
data {
int<lower=1> N; // total number of subjects
int<lower=2> C; // number of conditions
vector[N] Y; // responses (ready for the ragged data structure)
// int s[C]; // condition sizes (i.e., number of subjects in each condition) [removed features]
array[C] int s; // condition sizes [Stan 2.26+ syntax for array declarations]
int<lower=1> K; // number of parameters (including an intercept)
matrix[C,K] X; // design matrix
vector[K] mVec; // mean vector for the multivariate normal prior
matrix[K,K] covMat; // covariance matrix for the multivariate normal prior
}
parameters {
vector[K] beta; // parameter vector
real<lower=0> sigma; // error standard deviation
}
model {
// Priors
beta ~ multi_normal(mVec, covMat);
target += -log(sigma); // Jeffreys prior
// Likelihood
int pos = 1;
for (i in 1:C) {
segment(Y, pos, s[i]) ~ normal(X[i,] * beta, sigma);
pos = pos + s[i];
}
}"
stanlm <- rstan::stan_model(model_code=stancodelm)
computeBFs2way <- function(data, ...) {
#' Input -
#' data: two-way between-subjects data in the long format whose columns must be `resp`, `A`, and `B`
#' ... more esoteric options, such as `bgridsize`, in ks::kde
#'
#' Global - stanlm
#' Dependency - BayesFactor (v0.9.12-4.7), rstan (v2.32.6), ks (v1.14.3), mvtnorm (v1.3-1)
#' Output - a vector of the Bayes factors in favor of the null for testing `A`
#' 1. Savage–Dickey density ratio using the extended normal unit-information prior
#' 2. lmBF function
#' 3. BIC approximation to the Bayes factor
#' 4. Jeffreys approximate Bayes factor using the normal unit-information prior
#' 5. Jeffreys approximate Bayes factor using the extended normal unit-information prior
data <- stats::na.omit(data)
grp_sizes <- unname(c(table(data$A, data$B))) # condition sizes (can take in imbalance)
grp_sizes <- grp_sizes[grp_sizes != 0] # (take in fractionality)
N <- sum(grp_sizes) # total number of observations
C <- length(grp_sizes) # number of conditions
J <- length(unique(data$A)) # number of levels in `A`
fit0 <- stats::aov(resp ~ B, data) # omitted model fit
fit1 <- stats::aov(resp ~ A + B, data) # additive model fit
pVal <- summary(fit1)[[1]]["A", "Pr(>F)"] # p-value in two-way ANOVA
FVal <- summary(fit1)[[1]]["A", "F value"] # F-ratio
fitM <- stats::lm(resp ~ A + B, data)
designMat <- stats::model.matrix(fitM) # design matrix of all replicates
# NB: for data from simData2way, rows 1:C enumerate each condition exactly once,
# in the same A-within-B order as `grp_sizes` and the sorted Y below
datalist3 <- list(N=N, C=C, Y=data$resp[order(data$B, data$A, data$resp)], s=grp_sizes,
K=ncol(designMat), X=designMat[1:C,],
mVec=stats::coef(fitM), covMat=stats::vcov(fitM)*(N^(1/(J-1)))) # <==
stanfitM3 <- rstan::sampling(stanlm, data=datalist3,
iter=15000, warmup=5000, chains=2, seed=277, refresh=0)
betaS <- rstan::extract(stanfitM3, pars="beta")$beta[,2:J]
dPost <- predict(ks::kde(betaS, ...), x=rep(0, J-1)) # estimated posterior density
dPrior <- ifelse(J==2,
stats::dnorm(0, datalist3$mVec[2], sqrt(datalist3$covMat[2,2])),
mvtnorm::dmvnorm(rep(0, J-1), datalist3$mVec[2:J], datalist3$covMat[2:J,2:J]))
SDdr01 <- dPost / dPrior # Savage–Dickey density ratio
model_add <- BayesFactor::lmBF(resp ~ A + B, data, progress=F)
model_omit <- BayesFactor::lmBF(resp ~ B, data, progress=F)
bf <- model_omit / model_add
BF01 <- ifelse(BayesFactor::extractBF(bf)$error < 0.05, # control Monte Carlo errors
BayesFactor::extractBF(bf)$bf, NA)
BICB01 <- exp((stats::BIC(fit1) - stats::BIC(fit0)) / 2)
JAB01_UI <- N^((J-1)/2) * exp(-0.5 * FVal * (J-1)) # unit-information prior
JAB01_EXT <- sqrt(N) * exp(-0.5 * FVal * (J-1) * (N^(1/(J-1))-1) / (N^(1/(J-1)))) # extended
c("SD"=SDdr01, "lmBF"=BF01, "BIC_approx"=BICB01, "JAB_OG"=JAB01_UI, "JAB"=JAB01_EXT)
}
# set.seed(277)
# test3x2 <- simData2way(n=30, A=0.3, nA=3)
# computeBFs2way(test3x2)
SMD <- c("Null"=0, "Small"=0.2, "Medium"=0.5, "Large"=0.8)
reportBF <- reportBF_avg <- reportBF_sd <- reportERR <- reportERR_avg <- reportERR_sd <-
reportRULE <- reportRULE_H1 <- reportRULE_H0 <- reportRULE_inc <-
setNames(vector(mode="list", length=length(nSubj) * length(SMD)), # pre-allocation
apply(expand.grid(names(SMD), nSubj), 1, function(r) paste0("n = ", r[2], ", delta = ", r[1])))
index <- 1 # initial list index
for (n in nSubj) {
for (delta in SMD) {
set.seed(n+delta)
temp <- t(replicate(nSim, computeBFs2way(simData2way(n, delta)))) # replicate the Bayes factors
reportBF[[index]] <- temp
tempRULE_H1 <- temp < 1/3 # support H1
tempRULE_H0 <- temp > 3 # support H0
tempRULE_inc <- temp >= 1/3 & temp <= 3 # inconclusive
# counts of matching decisions for H1
reportRULE_H1[[index]] <- colSums(tempRULE_H1 * tempRULE_H1[,1], na.rm=T)
# counts of matching decisions for H0
reportRULE_H0[[index]] <- colSums(tempRULE_H0 * tempRULE_H0[,1], na.rm=T)
# counts of matching inconclusiveness
reportRULE_inc[[index]] <- colSums(tempRULE_inc * tempRULE_inc[,1], na.rm=T)
tempRULE <- 1*(temp < 1/3) - 1*(temp > 3) # 1: support H1; 0: inconclusive; -1: support H0
reportRULE[[index]] <- colSums(tempRULE[,1] == tempRULE, na.rm=T) # counts of matching decisions
# colMeans(); a value near 1 suggests matching results; the first column should be all 1
temp[temp[,1]==0,1] <- NA # extremely small SD₀₁ values were rounded down to 0 <==
reportBF_avg[[index]] <- colMeans(temp, na.rm=T) # mean Bayes factors in favor of the null
reportBF_sd[[index]] <- apply(na.omit(temp), 2, sd) # sample standard deviations of them
tempERR <- 100* (temp - temp[,1]) / temp[,1] # percent errors; the first column should be all 0
reportERR[[index]] <- tempERR # (can take in NA)
reportERR_avg[[index]] <- colMeans(tempERR, na.rm=T) # mean percent errors
reportERR_sd[[index]] <- apply(na.omit(tempERR), 2, sd) # sample standard deviations of the percent errors
index <- index + 1
}
} # roughly 4.9 hours of run time
To test the hypotheses \(\mathcal{H}_0:\boldsymbol{\theta}=\boldsymbol{\theta}_0\) versus \(\mathcal{H}_1:\boldsymbol{\theta}\neq\boldsymbol{\theta}_0\), we define the extended Jeffreys approximate objective Bayes factor (eJAB) as \[\begin{equation} \mathit{eJAB}_{01}=\sqrt{N}\exp\left\{-\frac{1}{2}\frac{N^{1/q}-1}{N^{1/q}}\cdot Q_{\chi^{2}_{q}}(1-p)\right\}, \end{equation}\]
where \(q\) is the size of the parameter vector \(\boldsymbol{\theta}\), \(N\) is the sample size, \(Q_{\chi^{2}_{q}}(\cdot)\) is the quantile function of the chi-squared distribution with \(q\) degrees of freedom, and \(p\) is the \(p\)-value from a null-hypothesis significance test. We further define \(\mathit{eJAB}_{10}=1\ \!/\ \!\mathit{eJAB}_{01}\).
The sample size \(N\) is

- the total number of observations for \(t\)-tests, linear regression, logistic regression, one-way ANOVA, and chi-squared tests;
- the number of events for Cox models;
- the number of independent observations for one-way repeated-measures ANOVA (Nathoo & Masson, 2016).
The degrees of freedom are

- \(q=1\) for \(t\)-tests, linear regression, logistic regression, and Cox models, where \(\mathcal{H}_0\) specifies a point-null hypothesis (Wagenmakers, 2022);
- \(q=I-1\) for one-way ANOVA and one-way repeated-measures ANOVA, where \(I\) is the number of conditions;
- \(q=(R-1)(C-1)\) for chi-squared tests, where \(R\) and \(C\) are the numbers of rows and columns, respectively.
Since the Bayes factor is the ratio of the marginal likelihoods of two competing models, \(q=p_1-p_0\) also represents the difference in the number of free parameters between the alternative and null models.
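A minimal R sketch of this definition (the helper name eJAB01 is ours, not from any package):
eJAB01 <- function(p, N, q) sqrt(N) * exp(-0.5 * (N^(1/q) - 1) / N^(1/q) * qchisq(1 - p, df=q))
# e.g., a one-way ANOVA with I = 3 conditions and N = 30 observations: eJAB01(p=0.04, N=30, q=2)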
For further reading, please visit https://rpubs.com/sherloconan/1361280.