Open Science Framework: https://osf.io/qeprj/
To test the hypotheses \(\mathcal{H}_0:\boldsymbol{\theta}=\boldsymbol{\theta}_0\) versus \(\mathcal{H}_1:\boldsymbol{\theta}\neq\boldsymbol{\theta}_0\), we define the extended Jeffreys approximate objective Bayes factor (eJAB) in Eq. 1. \[\begin{equation} \tag{1} \mathit{eJAB}_{01}=\sqrt{N}\exp\left\{-\frac{1}{2}\frac{N^{1/q}-1}{N^{1/q}}\cdot Q_{\chi^{2}_{q}}(1-p)\right\}, \end{equation}\]
where \(q\) is the size of the parameter vector \(\boldsymbol{\theta}\), \(N\) is the sample size, \(Q_{\chi^{2}_{q}}(\cdot)\) is the quantile function of the chi-squared distribution with \(q\) degrees of freedom, and \(p\) is the \(p\)-value from a null-hypothesis significance test. We further define \(\mathit{eJAB}_{10}=1\ \!/\ \!\mathit{eJAB}_{01}\).
The sample size \(N\) is
the total number of observations for \(t\)-tests, linear regression, logistic regression, one-way ANOVA, and chi-squared tests.
the number of events for Cox models.
the number of independent observations for one-way repeated-measures ANOVA (Nathoo & Masson, 2016).
The degrees of freedom are
\(q=1\) for \(t\)-tests, linear regression, logistic regression, and Cox models, where \(\mathcal{H}_0\) specifies a point-null hypothesis (Wagenmakers, 2022).
\(q=I-1\) for one-way ANOVA and one-way repeated-measures ANOVA, where \(I\) is the number of conditions.
\(q=(R-1)(C-1)\) for chi-squared tests, where \(R\) and \(C\) are the numbers of rows and columns, respectively.
Since the Bayes factor is the ratio of the marginal likelihoods of two competing models, \(q=p_1-p_0\) also represents the difference in the number of free parameters between the alternative and null models. We will discuss the hypotheses and model specifications in Examples below and Section 5, particularly when \(q>1\).
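For illustration, Eq. 1 can be computed directly from a reported \(p\)-value, \(N\), and \(q\); the helper below is a minimal sketch (the name eJAB01 is ours).
eJAB01 <- function(p, N, q=1) {
  # Eq. 1: extended Jeffreys approximate objective Bayes factor in favor of H0
  sqrt(N) * exp(-0.5 * (N^(1/q) - 1) / N^(1/q) * stats::qchisq(1 - p, df=q))
}
eJAB01(p=0.04, N=100, q=1) # e.g., a two-sided test with p = .04, N = 100, and q = 1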
The Savage–Dickey density ratio is a special form of the Bayes factor for nested models: it divides the posterior density of the parameters under the alternative model, evaluated at the hypothesized value, by the prior density for the same model evaluated at the same point.
\[\begin{equation} \tag{2} \textit{BF}_{01}:=\frac{p(\boldsymbol{y}\mid \mathcal{H}_0)}{p(\boldsymbol{y}\mid \mathcal{H}_1)}=\frac{\color{#0055A4}{p(\boldsymbol{\theta}=\boldsymbol{\theta}_0\mid\boldsymbol{y},\mathcal{H}_1)}}{\color{#EF4135}{p(\boldsymbol{\theta}=\boldsymbol{\theta}_0\mid \mathcal{H}_1)}} \end{equation}\]
Under standard regularity conditions and with a large sample size \(N\), we can approximate \(p(\boldsymbol{\theta}\mid\boldsymbol{y},\mathcal{H}_1)\) as a multivariate Gaussian distribution centered at the maximum likelihood estimate (MLE) \(\hat{\boldsymbol{\theta}}\), with the observed Fisher information matrix \(I(\hat{\boldsymbol{\theta}})\) serving as the precision matrix (i.e., the inverse of the covariance matrix). \(\lvert\ \!\cdot\ \!\rvert\) denotes the determinant. Thus,
\[\begin{equation} \tag{3} p(\boldsymbol{\theta}=\boldsymbol{\theta}_0\mid\boldsymbol{y},\mathcal{H}_1)\approx(2\pi)^{-\frac{q}{2}}\cdot\lvert I(\hat{\boldsymbol{\theta}})\rvert^{\frac{1}{2}}\cdot\exp{\left\{-\frac{1}{2}(\boldsymbol{\theta}_0-\hat{\boldsymbol{\theta}})^\top\cdot I(\hat{\boldsymbol{\theta}})\cdot(\boldsymbol{\theta}_0-\hat{\boldsymbol{\theta}})\right\}} \end{equation}\]
Next, we assume the prior \(\boldsymbol{\theta}\sim\boldsymbol{\mathcal{N}}_q\!\left(\hat{\boldsymbol{\theta}},\ N^{1/q}\cdot I^{-1}(\hat{\boldsymbol{\theta}})\right)\), which, for \(q=1\), corresponds to the normal unit-information prior. For \(q>1\), the prior becomes more informative, placing more weight on the MLE.
\[\begin{equation} \tag{4} p(\boldsymbol{\theta}=\boldsymbol{\theta}_0\mid \mathcal{H}_1)=(2\pi)^{-\frac{q}{2}}\cdot N^{-\frac{1}{2}}\cdot\lvert I(\hat{\boldsymbol{\theta}})\rvert^{\frac{1}{2}}\cdot\exp{\left\{-\frac{1}{2N^{1/q}}(\boldsymbol{\theta}_0-\hat{\boldsymbol{\theta}})^\top\cdot I(\hat{\boldsymbol{\theta}})\cdot(\boldsymbol{\theta}_0-\hat{\boldsymbol{\theta}})\right\}} \end{equation}\]
The Wald test statistic is \(W=(\boldsymbol{\theta}_0-\hat{\boldsymbol{\theta}})^\top\cdot I(\hat{\boldsymbol{\theta}})\cdot(\boldsymbol{\theta}_0-\hat{\boldsymbol{\theta}})\mathrel{\dot\sim}\chi_{q}^2\ \) under \(\mathcal{H}_0\).
Plugging Eqs. 3 and 4 into Eq. 2, we obtain the following expression for \(\mathit{eJAB}_{01}\).
\[\begin{align} \mathit{eJAB}_{01}&=\frac{p(\boldsymbol{\theta}=\boldsymbol{\theta}_0\mid\boldsymbol{y},\mathcal{H}_1)}{p(\boldsymbol{\theta}=\boldsymbol{\theta}_0\mid \mathcal{H}_1)} \\ &=\sqrt{N}\exp\left\{-\frac{1}{2}\frac{N^{1/q}-1}{N^{1/q}}\cdot (\boldsymbol{\theta}_0-\hat{\boldsymbol{\theta}})^\top\cdot I(\hat{\boldsymbol{\theta}})\cdot(\boldsymbol{\theta}_0-\hat{\boldsymbol{\theta}})\right\} \\ &=\sqrt{N}\exp\left\{-\frac{1}{2}\frac{N^{1/q}-1}{N^{1/q}}\cdot W\right\} \\ &=\sqrt{N}\exp\left\{-\frac{1}{2}\frac{N^{1/q}-1}{N^{1/q}}\cdot Q_{\chi^{2}_{q}}(1-p)\right\} \end{align}\]
If, instead of a Wald test, the \(p\)-value is from a likelihood ratio test, then \(Q_{\chi^{2}_{q}}(1-p)=\Lambda:=-2(\ln{\mathcal{L}_0}-\ln{\mathcal{L}_1})\).
\(\mathcal{L}_i=p(\boldsymbol{y}\mid\hat{\boldsymbol{\theta}}_i,\mathcal{M}_i)\) is the maximized likelihood under model \(\mathcal{M}_i\), and the BIC approximation below assumes \(N\gg p_i\).
\[\begin{align} \text{BIC}(\mathcal{M}_1)-\text{BIC}(\mathcal{M}_0)&=-2\ln{\mathcal{L}_1}+p_1\ln{N}+2\ln{\mathcal{L}_0}-p_0\ln{N} \\ &=-2(\ln{\mathcal{L}_1}-\ln{\mathcal{L}_0})+(p_1-p_0)\ln{N} \\ &=-\Lambda+q\ln{N} \\ \\ \mathit{BF}_{01}&\approx\exp\left\{\frac{\text{BIC}(\mathcal{M}_1)-\text{BIC}(\mathcal{M}_0)}{2}\right\} \\ &=\exp\left\{\frac{-\Lambda+q\ln{N}}{2}\right\} \\ &=N^{\frac{q}{2}}\exp\left\{-\frac{1}{2}\cdot Q_{\chi^{2}_{q}}(1-p)\right\} \tag{5} \end{align}\]
For \(q=1\) and large \(N\), Eq. 1 is asymptotically equivalent to the BIC approximation of the Bayes factor in Eq. 5.
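A small numeric check with illustrative values (\(p=.01\), \(N=1000\)) shows that, for \(q=1\), the two expressions differ only by the factor \((N-1)/N\) in the exponent.
p <- 0.01; N <- 1000
sqrt(N) * exp(-0.5 * (N - 1) / N * qchisq(1 - p, df=1)) # Eq. 1 with q = 1
sqrt(N) * exp(-0.5 * qchisq(1 - p, df=1))               # Eq. 5 (BIC approximation)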
Assumptions
We observe that \(\mathit{eJAB}_{01}\) arises from the following assumptions:
The Savage–Dickey density ratio representation of the Bayes factor holds (Wagenmakers et al., 2010; Heck, 2019).
The posterior density is normal, centered at the MLE with precision given by the observed Fisher information. This assumption is justified by the Bayesian central limit theorem under standard regularity conditions.
A modified unit information prior is a normal distribution centered at the MLE, with precision given by the observed Fisher information divided by the \(q\)-th root of the sample size. The objective is to reduce the bias towards the null hypothesis (Section 6).
\(\frac{N^{1/q}-1}{N^{1/q}}\approx1\). For small samples, however, it is advisable to retain the term, as illustrated below.
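For illustration (the helper name corrFactor is ours), the correction term is close to 1 only when \(N^{1/q}\) is large:
corrFactor <- function(N, q) (N^(1/q) - 1) / N^(1/q) # correction term in Eq. 1
corrFactor(150, q=1) # ~0.99 for a t-test with N = 150
corrFactor(150, q=4) # ~0.71 for a one-way ANOVA with five groups (q = 4)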
We will further examine the \(p\)-values from:
A nonparametric test
A one-sided test
Multiple comparisons
Testing random effects (as opposed to fixed effects)
In a two-way ANOVA, to test \(\mathcal{H}_0:\alpha_i=0\) for all \(i\) versus \(\mathcal{H}_1:\) at least one \(\alpha_i\neq0\), we compare an additive model, \(\mathcal{M}_{\text{add}}\), to a model, \(\mathcal{M}_{\text{omitA}}\), that omits the factor being tested (say, factor \(A\)).
\[\begin{align} \mathcal{M}_{\text{full}}:\ Y_{ijk}&=\mu+\alpha_i+\beta_j+(\alpha\beta)_{ij}+\epsilon_{ijk} \\ \mathcal{M}_{\text{add}}:\ Y_{ijk}&=\mu+\alpha_i+\beta_j+\epsilon_{ijk} \\ \mathcal{M}_{\text{omitA}}:\ Y_{ijk}&=\mu+\beta_j+\epsilon_{ijk} \\ \mathcal{M}_{\text{omitB}}:\ Y_{ijk}&=\mu+\alpha_i+\epsilon_{ijk},\qquad\epsilon_{ijk}\overset{\text{i.i.d.}}{\sim}\mathcal{N}(0,\ \sigma_\epsilon^2) \\ &\qquad\text{for}\ i=1,\dotsb,I;\ j=1,\dotsb,J;\ \text{and}\ k=1,\dotsb,K\neq1 \end{align}\]
The sum-to-zero constraints impose that \(\sum_{i=1}^I\alpha_i=0\), \(\sum_{j=1}^J\beta_j=0\), and \(\sum_{i=1}^I(\alpha\beta)_{ij}=\sum_{j=1}^J(\alpha\beta)_{ij}=0\).
The number of free parameters in \(\mathcal{M}_{\text{add}}\) is given by \(p_1=1+(I-1)+(J-1)+1=I+J\) (the grand mean, the constrained main effects, and the error variance \(\sigma_\epsilon^2\)).
The number of free parameters in \(\mathcal{M}_{\text{omitA}}\) is given by \(p_0=1+(J-1)+1=J+1\).
Thus, the difference in free parameters is \(q=p_1-p_0=I-1\), which is equivalent to the number of free parameters in a one-way ANOVA.
The sample size in this case is \(N=IJK\).
ToothGrowth$dose <- factor(ToothGrowth$dose)
fit_full <- aov(len ~ supp * dose, ToothGrowth) # 2×3 (between-subjects) ANOVA
summary(fit_full) # full model <== significant interaction
## Df Sum Sq Mean Sq F value Pr(>F)
## supp 1 205.4 205.4 15.572 0.000231 ***
## dose 2 2426.4 1213.2 92.000 < 2e-16 ***
## supp:dose 2 108.3 54.2 4.107 0.021860 *
## Residuals 54 712.1 13.2
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
fit_add_aov <- aov(len ~ supp + dose, ToothGrowth)
summary(fit_add_aov) # additive model
## Df Sum Sq Mean Sq F value Pr(>F)
## supp 1 205.4 205.4 14.02 0.000429 ***
## dose 2 2426.4 1213.2 82.81 < 2e-16 ***
## Residuals 56 820.4 14.7
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
fit_add <- lm(len ~ supp + dose, ToothGrowth)
fit_omitSupp <- lm(len ~ dose, ToothGrowth)
anova(fit_add, fit_omitSupp) # testing the main effect `supp`
## Analysis of Variance Table
##
## Model 1: len ~ supp + dose
## Model 2: len ~ dose
## Res.Df RSS Df Sum of Sq F Pr(>F)
## 1 56 820.43
## 2 57 1025.78 -1 -205.35 14.017 0.0004293 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
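As an illustration (not part of the original output), plugging the model-comparison \(p\)-value above into Eq. 1 with \(q=1\) and \(N=60\) yields the corresponding \(\mathit{eJAB}_{01}\) for the main effect supp.
pVal <- anova(fit_add, fit_omitSupp)[2, "Pr(>F)"]             # p-value from the model comparison above
sqrt(60) * exp(-0.5 * (60 - 1) / 60 * qchisq(1 - pVal, df=1)) # eJAB01 with q = 1, N = 60; well below 1 (favors H1)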
The chi-squared test for independence examines whether two categorical variables are independent. Under the null hypothesis of independence, the expected frequency \(\mu_{ij}\) in each cell of a contingency table is calculated based on the assumption that the joint probability \(\pi_{ij}\) equals the product of the marginal probabilities \(\pi_{i\cdot}\) and \(\pi_{\cdot j}\), that is, \(\mu_{ij}=N\pi_{i\cdot}\pi_{\cdot j}\).
Assuming a fixed total and a joint multinomial distribution, the numbers of parameters are \(p_1=R\cdot C-1\) under \(\mathcal{H}_1\) and \(p_0=R+C-2\) under \(\mathcal{H}_0\). Assuming fixed row sums and independent multinomial distributions, the numbers of parameters are \(p_1=R\cdot(C-1)\) under \(\mathcal{H}_1\) and \(p_0=C-1\) under \(\mathcal{H}_0\).
In either case, \(q=p_1-p_0=(R-1)(C-1)\).
Alternatively, the table counts can be modeled with a Poisson log-linear model, \[\begin{align} Y_{ij}&\overset{\text{ind.}}{\sim}\operatorname{Pois}(\mu_{ij}) \\ \ln{\mu_{ij}}&=\mu+\alpha_i+\beta_j+(\alpha\beta)_{ij}\qquad\text{for}\ i=1,\dotsb,R\ \text{and}\ j=1,\dotsb,C, \end{align}\] in which the null hypothesis of independence is expressed by setting the interaction terms to zero, \(\mathcal{H}_0:(\alpha\beta)_{ij}=0\).
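For illustration (with a hypothetical \(3\times3\) table of counts), the likelihood-ratio test that drops the interaction terms from the saturated Poisson log-linear model has \((R-1)(C-1)\) degrees of freedom, matching the Pearson chi-squared test for independence.
tab <- matrix(c(20, 30, 25, 15, 40, 30, 35, 25, 30), nrow=3) # hypothetical 3x3 table of counts
df_long <- as.data.frame(as.table(tab))                      # columns Var1, Var2, Freq
fit_sat <- glm(Freq ~ Var1 * Var2, family=poisson, df_long)  # saturated log-linear model
fit_ind <- glm(Freq ~ Var1 + Var2, family=poisson, df_long)  # independence (no interaction)
anova(fit_ind, fit_sat, test="LRT")                          # LRT with q = (3-1)(3-1) = 4 df
chisq.test(tab)                                              # Pearson chi-squared test, also 4 df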
Lemma
For \(x\to1^-\), \[\begin{equation} \tag{6} g(x)=-2\ln(1-x)-2\ln(-\ln(1-x))-Q_{\chi^{2}_{1}}(x)<0 \end{equation}\]
Proof:
Let \(F_{\chi_1^2}(x)=2\Phi(\sqrt{x})-1\) for \(x>0\), where \(\Phi(x)\) denotes the cumulative distribution function of the standard normal distribution. Then, the probability density function is \(f_{\chi_1^2}(x):=F_{\chi_1^2}^\prime(x)=\varphi(\sqrt{x})\ /\ \sqrt{x}=\frac{1}{\sqrt{2\pi x}}\cdot\exp\{-x/2\}\), where \(\varphi(x)=\Phi^\prime(x)\).
The quantile function \(Q_{\chi^{2}_{1}}(x):=F_{\chi_1^2}^{-1}(x)\)—and therefore \(g(x)\)—is continuous on the domain \((0,1)\).
Using the inverse function rule, \(Q_{\chi_1^2}^\prime(x)=\left(F_{\chi_1^2}^{-1}(x)\right)^\prime=\frac{1}{F_{\chi_1^2}^\prime\ \!(F_{\chi_1^2}^{-1}(x))}=\frac{1}{f_{\chi_1^2}\ \!(Q_{\chi_1^2}(x))}=\sqrt{2\pi Q_{\chi_1^2}(x)}\cdot\exp\left\{ Q_{\chi_1^2}(x)\ \!/\ \! 2 \right\}>0\).
\[\begin{align} g^\prime(x)&=\frac{2+2\ln{(1-x)}}{(1-x)\ln{(1-x)}}-\sqrt{2\pi Q_{\chi^{2}_{1}}(x)}\cdot\exp\left\{Q_{\chi^{2}_{1}}(x)\ /\ 2\right\} \\ &<\frac{2+2\ln{(1-x)}}{(1-x)\ln{(1-x)}}=\frac{2-2\left(\sum_{m=1}^\infty\frac{x^m}{m}\right)}{(1-x)\ln{(1-x)}} \\ &<\frac{2-2x}{(1-x)\ln{(1-x)}}=\frac{2}{\ln{(1-x)}}<0 \end{align}\]
\(g^\prime(x)<0\) implies that \(g(x)\) is a monotonically decreasing function.
We note that \(g(.9)>0\) and \(g(.99)<0\), indicating that \(g(x^*)=0\) for only one \(x^*\in(.9,.99)\).
Thus, \(-2\ln(1-x)-2\ln(-\ln(1-x))<Q_{\chi^{2}_{1}}(x)\) for \(x^*<x<1\).
\(\hspace{56em}\blacksquare\)
curve(-2*log(1-x)-2*log(-log(1-x)), 0.9, 1,
ylab="", xlab=expression(italic(x)), main="Inequality")
curve(qchisq(x, 1), 0.9, 1, add=T, col="red", lty=2)
curve(qchisq(x, 2), 0.9, 1, add=T, col="blue", lty=3)
curve(qchisq(x, 3), 0.9, 1, add=T, col="blue", lty=4)
legend("topleft", c("LHS", "RHS (df=1)", "RHS (df=2)", "RHS (df=3)"),
col=c("black", "red", "blue", "blue"), lty=1:4, lwd=2, bty="n")
Theorem
R1: Assume that \(p\text{-value}\overset{D}{\to}\text{Unif}(0,1)\) under \(\mathcal{H}_0\).
R2: Assume that if \(\mathcal{H}_1\) is true, then the \(p\)-value decreases to zero with increasing \(N\) in such a way that \[\begin{equation} \tag{7} D_{N}=\sqrt{N}\cdot p\text{-value}\cdot(-\ln{p\text{-value}})\overset{P}{\to}0\quad\text{as}\ N\to\infty. \end{equation}\]
Then, \[\begin{equation} \tag{8} \DeclareMathOperator*{\plim}{plim} \plim_{N\to\infty}\mathit{eJAB}_{01}= \begin{cases} \infty & \text{under}\ \mathcal{H}_0 \\ 0 & \text{under}\ \mathcal{H}_1 \end{cases} \end{equation}\]
Note that R1 is weaker than the assumption that the \(p\)-value is distributed as \(\text{Unif}(0,1)\), and it is this stronger assumption that is violated by some permutation tests.
The stronger condition for R2 also holds: \[\begin{align} \tag{$7^*$} D_{N}^*&=\sqrt{N}\cdot\left(p\text{-value}\cdot(-\ln{p\text{-value}})\right)^{\frac{N^{1/q}-1}{N^{1/q}}} \\ &=N^{\frac{1}{2N^{1/q}}}\cdot D_N^{1-\frac{1}{N^{1/q}}}\overset{P}{\to}0, \end{align}\] by the continuous mapping theorem, noting that \(N^{\frac{1}{2N^{1/q}}}\to1\) and \(1-\frac{1}{N^{1/q}}\to1\) as \(N\to\infty\).
Here, we investigate the regularity conditions R1 and R2 for each of the seven tests considered in our simulations.
1. Two-sample \(t\)-test (balanced group sizes with equal variance)
2. Simple linear regression
3. Simple logistic regression (binomial family)
4. One-way analysis of variance (ANOVA, balanced group sizes among four groups)
5. One-way repeated-measures ANOVA (four conditions with high intraclass correlation)
6. Chi-squared test for independence (\(3\times3\) contingency table with a fixed total)
7. Cox proportional-hazards regression (random right-censoring)
Proof:
Case 1: Assume that \(\mathcal{H}_0\) is true.
From R1, we have that \(1-p\overset{D}{\to}\text{Unif}(0,1)\).
Then, by the continuous mapping theorem, \(W=Q_{\chi^{2}_{q}}(1-p) \overset{D}{\to} \chi^{2}_{q}\).
Let \(B>0\) and \(\delta\in(0,1)\) be arbitrary. Now, for \(N\geqslant N^*=\lceil B^2\exp\{Q_{\chi^{2}_{q}}(1-\delta)\}\rceil\), we have
\[\begin{align} \mathbb{P}(\mathit{eJAB}_{01} > B)&=\mathbb{P}\left(\sqrt{N}\exp\left\{-\frac{1}{2}\frac{N^{1/q}-1}{N^{1/q}}W\right\}>B\right) > \mathbb{P}\left(\sqrt{N}\exp\left\{-\frac{1}{2}W\right\}>B\right) \\ &=\mathbb{P}\left(W < -2 \ln{\frac{B}{\sqrt{N}}}\right)\geqslant\mathbb{P}\left(W < -2 \ln{\frac{B}{\sqrt{N^*}}}\right)\geqslant\mathbb{P}\left(W < -2 \ln{\frac{B}{\sqrt{B^{2}\exp\{Q_{\chi^{2}_{q}}(1-\delta)\}}}}\right) \\ &=\mathbb{P}\left(W< Q_{\chi^{2}_{q}}(1-\delta)\right)=1-\delta. \end{align}\]
Thus, for \(N\geqslant N^{*}\) we have that \(\mathbb{P}(\mathit{eJAB}_{01}>B)>1-\delta\). As \(B>0\) and \(\delta\in(0,1)\) are arbitrary, this shows that, under \(\mathcal{H}_0\), we can always find \(N\) sufficiently large so that \(\mathit{eJAB}_{01}\) will be arbitrarily large with probability arbitrarily close to 1.
Thus, \(\mathit{eJAB}_{01}\overset{P}{\to}\infty\) as \(N\to\infty\) under \(\mathcal{H}_0\).
Case 2: Assume that \(\mathcal{H}_1\) is true.
Let \(\epsilon>0\) and \(\delta\in(0,1)\) be arbitrary.
Note that, for \(x\to1^{-}\) and \(q\geqslant1\), the Lemma gives \(-2\ln(1-x)-2\ln(-\ln(1-x))<Q_{\chi^{2}_{1}}(x)\leqslant Q_{\chi^{2}_{q}}(x)\).
\[\begin{align} \mathbb{P}(\mathit{eJAB}_{01}<\epsilon)&=\mathbb{P}\left(\sqrt{N}\exp\left\{-\frac{1}{2}\frac{N^{1/q}-1}{N^{1/q}}\cdot Q_{\chi^{2}_{q}}(1-p)\right\}<\epsilon\right) \\ &>\mathbb{P}\left(\sqrt{N}\exp\left\{-\frac{1}{2}\frac{N^{1/q}-1}{N^{1/q}}(-2\ln{p} -2\ln(-\ln{p)})\right\}<\epsilon\right) \\ &=\mathbb{P}\left(\sqrt{N}\cdot p^{\frac{N^{1/q}-1}{N^{1/q}}}\cdot(-\ln{p})^{\frac{N^{1/q}-1}{N^{1/q}}}<\epsilon\right)>1-\delta. \end{align}\]
The final inequality is true for \(N\) sufficiently large by R2.
Thus, under \(\mathcal{H}_1\), for \(N\) sufficiently large, \(\mathit{eJAB}_{01}\) can be made arbitrarily small with probability arbitrarily close to 1.
Thus, \(\mathit{eJAB}_{01}\overset{P}{\to}0\) as \(N\to\infty\) under \(\mathcal{H}_1\).
\(\hspace{56em}\blacksquare\)
In this case, we assume \[\begin{align} X_{i} &\overset{\text{i.i.d.}}{\sim} \mathcal{N}(\mu_{1},\ \sigma_{1}^{2}), \\ Y_{i} &\overset{\text{i.i.d.}}{\sim} \mathcal{N}(\mu_{2},\ \sigma_{2}^{2})\quad\text{for}\ i=1,\dotsb,n. \end{align}\]
The sample size \(N=n_1+n_2=2n\) is the total number of observations. The degree of freedom is \(q=1\).
We set the model parameters to \(\sigma_1=\sigma_2=\sigma=1\) and \(\mu_1=\mu_2=0\), so that \(\mathcal{H}_0:\theta=\mu_1-\mu_2=0\) is true.
nSim <- 5000 # number of simulation runs for each setting
n <- c(10, 25, 50, 100, 250, 500) # numbers of observations per balanced group
computeDist1 <- function(n, mu1=0, mu2=0.5, sigma1=1, sigma2=1, loc.alt=F, kappa=0.5) {
#' Input -
#' n: number of observations per balanced group
#' mu1: population mean of the first group
#' mu2: population mean of the second group
#' sigma1: population standard deviation of the first group
#' sigma2: population standard deviation of the second group
#' loc.alt: whether to simulate under a local alternative
#' kappa: the power of the denominator for `loc.alt`
#'
#' Output -
#' a list of the p-value and $D_{N}=\sqrt{N}\cdot p\cdot(-\ln{p})$ in Eq. 7
if(loc.alt) mu2 <- mu1 + (mu2 - mu1) / (n^kappa)
pVal <- stats::t.test(stats::rnorm(n, mu1, sigma1),
stats::rnorm(n, mu2, sigma2),
var.equal=T)$p.value # p-value of the two-sample t-test
list("p"=pVal,
"D"=sqrt(n * 2) * pVal * -log(pVal) ) # use total number of observations
}
set.seed(277)
pMat1 <- sapply(n, function(x) replicate(nSim, computeDist1(x, mu2=0)$p)) # roughly 1.8 seconds of run time
for (i in 1:length(n)) {
hist(pMat1[,i], prob=T, main=bquote("Distribution of " * italic("p") * "-Value under "
* italic("H")[0] * ", " * italic("N = ") * .(2*n[i])),
xlab=expression(italic(p) * "-Value"), ylab="", col="lightblue")
cat("<br><br>")
} # create the histograms
The regularity condition R1 appears to hold for the \(t\)-test.
We set the model parameters to \(\sigma_1=\sigma_2=\sigma=1\), \(\mu_1=0\), and \(\mu_2=0.5\), so that \(\mathcal{H}_1:\theta=\mu_1-\mu_2\neq0\) is true.
Cohen’s \(d=\frac{\lvert\mu_2-\mu_1\rvert}{\sigma}=0.5\) indicates a medium effect size.
set.seed(277)
distMat1 <- sapply(n, function(x) replicate(nSim, computeDist1(x)$D)) # roughly 1.8 seconds of run time
boxplot(distMat1, names=n, ylab=expression(italic(D)[italic(N)]),
xlab="Number of Observations per Balanced Group",
main=expression("Two-Sample " * italic(t) * "-Test")) # create the boxplot
The regularity condition R2 appears to hold for the \(t\)-test.
We simulate under \(\mathcal{H}_1:\theta^*=\theta_0+\frac{c}{n^\kappa}\) for \(\kappa\in\{0.5,\ 0.25\}\).
set.seed(277)
distAlt1b <- sapply(n*100, function(x) replicate(nSim, computeDist1(x, loc.alt=T)$D))
set.seed(277)
distAlt1c <- sapply(n*100, function(x) replicate(nSim, computeDist1(x, loc.alt=T, kappa=0.25)$D))
boxplot(distAlt1b, names=n*100, ylab=expression(italic(D)[italic(N)]),
xlab="Number of Observations per Balanced Group",
main=expression("Two-Sample " * italic(t) * "-Test"),
sub=expression(italic(kappa)==0.5))
boxplot(distAlt1c, names=n*100, ylab=expression(italic(D)[italic(N)]),
xlab="Number of Observations per Balanced Group",
main=expression("Two-Sample " * italic(t) * "-Test"),
sub=expression(italic(kappa)==0.25))
The regularity condition R2 does not appear to hold for the \(t\)-test under the local alternative with \(\kappa=0.5\). When the sample size is extremely large and \(\kappa=0.25\), however, \(D_N\) does shrink toward zero.
In this case, we assume \[\begin{equation} y_i=\beta_0+\beta_1 x_i+\epsilon_i,\quad\epsilon_i\overset{\text{i.i.d.}}{\sim}\mathcal{N}(0,\ \sigma_\epsilon^2)\quad\text{for}\ i=1,\dotsb,N. \end{equation}\]
The sample size \(N\) is the total number of observations. The degree of freedom is \(q=1\).
We set the model parameters to \(\beta_0=1\), \(\beta_1=0\), and \(\sigma_\epsilon=1\). The test is two-sided with null hypothesis \(\mathcal{H}_0:\beta_1=0\).
nSim <- 5000 # number of simulation runs for each setting
N <- c(10, 25, 50, 100, 250, 500) # numbers of observations
computeDist2 <- function(N, intercept=1, slope=0.5, sigma=1) {
#' Input -
#' N: number of observations
#' intercept: true baseline at x = 0
#' slope: true regression coefficient
#' sigma: true error standard deviation
#'
#' Output -
#' a list of the p-value and $D_{N}=\sqrt{N}\cdot p\cdot(-\ln{p})$ in Eq. 7
x <- stats::rnorm(N) # independent variable
y <- intercept + slope * x + stats::rnorm(N, 0, sigma) # dependent variable
pVal <- summary(stats::lm(y ~ x))$coefficients["x", "Pr(>|t|)"] # p-value of the regression coefficient
list("p"=pVal,
"D"=sqrt(N) * pVal * -log(pVal) )
}
set.seed(277)
pMat2 <- sapply(N, function(x) replicate(nSim, computeDist2(x, slope=0)$p)) # roughly 7.8 seconds of run time
for (i in 1:length(N)) {
hist(pMat2[,i], prob=T, main=bquote("Distribution of " * italic("p") * "-Value under "
* italic("H")[0] * ", " * italic("N = ") * .(N[i])),
xlab=expression(italic(p) * "-Value"), ylab="", col="lightblue")
cat("<br><br>")
} # create the histograms
The regularity condition R1 appears to hold for linear regression.
We set the model parameters to \(\beta_0=1\), \(\beta_1=0.5\), and \(\sigma_\epsilon=1\).
Cohen’s \(r=\beta_1\frac{\sigma_X}{\sqrt{\beta_1^2\sigma_X^2+\sigma_\epsilon^2}}=0.5\ /\ \sqrt{0.5^2 + 1}\approx0.45\) indicates a large effect size.
set.seed(277)
distMat2 <- sapply(N, function(x) replicate(nSim, computeDist2(x)$D)) # roughly 7.3 seconds of run time
boxplot(distMat2, names=N, ylab=expression(italic(D)[italic(N)]), xlab="Number of Observations",
main="Simple Linear Regression") # create the boxplot
The regularity condition R2 appears to hold for linear regression.
In this case, we assume \[\begin{align} y_i&\overset{\text{ind.}}{\sim}\operatorname{Bin}(10,\ p_i) \\ &\ln{\frac{p_i}{1-p_i}}=\beta_0+\beta_1 x_i\quad\text{for}\ i=1,\dotsb,N. \end{align}\]
The sample size \(N\) is the total number of observations. The degree of freedom is \(q=1\).
We set the model parameters to \(\beta_0=0\) and \(\beta_1=0\). The test is two-sided with null hypothesis \(\mathcal{H}_0:\beta_1=0\).
nSim <- 5000 # number of simulation runs for each setting
N <- c(10, 25, 50, 100, 250, 500) # numbers of observations
computeDist3 <- function(N, size=10, intercept=0, slope=0.5) {
#' Input -
#' N: number of observations
#' size: number of trials
#' intercept: true intercept
#' slope: true coefficient
#'
#' Output -
#' a list of the p-value and $D_{N}=\sqrt{N}\cdot p\cdot(-\ln{p})$ in Eq. 7
x <- stats::rnorm(N) # predictor variable
p <- 1 / (1 + exp(-intercept - slope * x)) # success probability
y <- stats::rbinom(N, size, p) # outcome variable
pVal <- summary(stats::glm(cbind(y, size-y) ~ x, family=binomial))$coefficients["x", "Pr(>|z|)"]
list("p"=pVal,
"D"=sqrt(N) * pVal * -log(pVal) )
}
set.seed(277)
pMat3 <- sapply(N, function(x) replicate(nSim, computeDist3(x, slope=0)$p)) # roughly 16 seconds of run time
for (i in 1:length(N)) {
hist(pMat3[,i], prob=T, main=bquote("Distribution of " * italic("p") * "-Value under "
* italic("H")[0] * ", " * italic("N = ") * .(N[i])),
xlab=expression(italic(p) * "-Value"), ylab="", col="lightblue")
cat("<br><br>")
} # create the histograms
The regularity condition R1 appears to hold for logistic regression.
We set the model parameters to \(\beta_0=0\) and \(\beta_1=0.5\). The odds ratio is \(e^{\beta_1}\approx1.65\).
set.seed(277)
distMat3 <- sapply(N, function(x) replicate(nSim, computeDist3(x)$D)) # roughly 17 seconds of run time
boxplot(distMat3, names=N, ylab=expression(italic(D)[italic(N)]), xlab="Number of Observations",
main="Simple Logistic Regression") # create the boxplot
The regularity condition R2 appears to hold for logistic regression.
Let’s assume a balanced one-way design with \(I\) levels, where \(Y_{ij}\) denotes the response obtained from the \(j\)-th subject in group \(i\). \[\begin{equation} Y_{ij}=\mu+\alpha_i+\epsilon_{ij},\quad\epsilon_{ij}\overset{\text{i.i.d.}}{\sim}\mathcal{N}(0,\ \sigma_\epsilon^2)\quad\text{for}\ i=1,\dotsb,I\ \text{and}\ j=1,\dotsb,n. \end{equation}\]
The sample size \(N=nI\) is the total number of observations. The degrees of freedom are \(q=I-1\).
We set the model parameters to \(I=4\), \(\mu=2\), \(\alpha_1=\alpha_2=\alpha_3=\alpha_4=0\), and \(\sigma_\epsilon=1\).
\(\mu_i=\mu+\alpha_i\) and \(\mu_0=\frac{1}{I}\sum_{i=1}^I\mu_i\)
\(\mathcal{H}_0:\boldsymbol{\theta}=(\mu_1-\mu_0,\ \mu_2-\mu_0,\ \mu_3-\mu_0)^\top=\boldsymbol{0}\)
nSim <- 5000 # number of simulation runs for each setting
n <- c(10, 25, 50, 100, 250, 500) # numbers of observations per balanced group
computeDist4 <- function(n, m1=2, m2=2, m3=2.25, m4=2.5, sigma=1) {
#' Input -
#' n: number of observations per balanced group
#' m1: population mean for the first group
#' m2: population mean for the second group
#' m3: population mean for the third group
#' m4: population mean for the fourth group
#' sigma: true error standard deviation
#'
#' Output -
#' a list of the p-value and $D_{N}=\sqrt{N}\cdot p\cdot(-\ln{p})$ in Eq. 7
y <- stats::rnorm(n * 4, c(m1, m2, m3, m4), sigma) # responses
pVal <- summary(stats::aov(y ~ factor(rep(1:4, n))))[[1]][1, "Pr(>F)"] # p-value in one-way ANOVA
list("p"=pVal,
"D"=sqrt(n * 4) * pVal * -log(pVal) ) # use total number of observations
}
set.seed(277)
pMat4 <- sapply(n, function(x) replicate(nSim, computeDist4(x, m3=2, m4=2)$p)) # roughly 14 seconds of run time
for (i in 1:length(n)) {
hist(pMat4[,i], prob=T, main=bquote("Distribution of " * italic("p") * "-Value under "
* italic("H")[0] * ", " * italic("N = ") * .(4*n[i])),
xlab=expression(italic(p) * "-Value"), ylab="", col="lightblue")
cat("<br><br>")
} # create the histograms
The regularity condition R1 appears to hold for one-way ANOVA.
We set the model parameters to \(I=4\), \(\mu=2\), \(\alpha_1=0\), \(\alpha_2=0\), \(\alpha_3=0.25\), \(\alpha_4=0.5\), and \(\sigma_\epsilon=1\).
\(\mu_i=\mu+\alpha_i\) and \(\mu_0=\frac{1}{I}\sum_{i=1}^I\mu_i\)
\(\mathcal{H}_1:\boldsymbol{\theta}=(\mu_1-\mu_0,\ \mu_2-\mu_0,\ \mu_3-\mu_0)^\top\neq\boldsymbol{0}\)
Cohen’s \(f^2=\frac{\sigma_m^2}{\sigma_\epsilon^2}=\frac{1}{\sigma_\epsilon^2\cdot I}\sum_{i=1}^I (\mu_i-\mu_0)^2\) and \(\eta^2=\frac{f^2}{1+f^2}\approx0.04\) indicate a medium effect size.
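A quick arithmetic check of these effect-size values under the parameter settings above:
mu <- c(2, 2, 2.25, 2.5)            # mu_i = mu + alpha_i
f2 <- mean((mu - mean(mu))^2) / 1^2 # Cohen's f^2 with sigma_eps = 1
c(f2, f2 / (1 + f2))                # f^2 ~ 0.043 and eta^2 ~ 0.04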
set.seed(277)
distMat4 <- sapply(n, function(x) replicate(nSim, computeDist4(x)$D)) # roughly 14 seconds of run time
boxplot(distMat4, names=n, ylab=expression(italic(D)[italic(N)]),
xlab="Number of Observations per Balanced Group",
main="One-Way ANOVA with Four Groups") # create the boxplot
The regularity condition R2 appears to hold for one-way ANOVA.
Let’s assume a single factor within-subject design with \(I\) levels of the within-subject factor measured on each subject. Let \(Y_{ij}\) denote the response collected at the \(i\)-th level of the within-subject factor for subject \(j\). \[\begin{equation} Y_{ij}=\mu+\alpha_i+s_j+\epsilon_{ij},\quad s_{j}\overset{\text{i.i.d.}}{\sim}\mathcal{N}(0,\ \sigma_s^2)\ \perp \!\!\! \perp\ \epsilon_{ij}\overset{\text{i.i.d.}}{\sim}\mathcal{N}(0,\ \sigma_\epsilon^2)\quad\text{for}\ i=1,\dotsb,I\ \text{and}\ j=1,\dotsb,n. \end{equation}\]
The sample size \(N=n(I-1)\) is the number of independent observations. The degrees of freedom are \(q=I-1\).
We set the model parameters to \(I=4\), \(\mu=2\), \(\alpha_1=\alpha_2=\alpha_3=\alpha_4=0\), \(\sigma_s=3\), and \(\sigma_\epsilon=1\), resulting in a correlation of \(\rho=\sigma_s^2\ /\ (\sigma_s^2+\sigma_\epsilon^2)=.9\) between any two observations collected on the same subject.
\(\mu_i=\mu+\alpha_i\) and \(\mu_0=\frac{1}{I}\sum_{i=1}^I\mu_i\)
\(\mathcal{H}_0:\boldsymbol{\theta}=(\mu_1-\mu_0,\ \mu_2-\mu_0,\ \mu_3-\mu_0)^\top=\boldsymbol{0}\)
nSim <- 5000 # number of simulation runs for each setting
n <- c(10, 25, 50, 100, 250, 500) # numbers of subjects
computeDist5 <- function(n, m1=2, m2=2, m3=2.25, m4=2.5, sigma_e=1, sigma_s=3) {
#' Input -
#' n: number of subjects
#' m1: population mean for the first condition
#' m2: population mean for the second condition
#' m3: population mean for the third condition
#' m4: population mean for the fourth condition
#' sigma_e: true error standard deviation
#' sigma_s: standard deviation of the subject-specific random effects
#'
#' Output -
#' a list of the p-value and $D_{N}=\sqrt{N}\cdot p\cdot(-\ln{p})$ in Eq. 7
df <- data.frame("resp"=stats::rnorm(n*4, c(m1, m2, m3, m4), sigma_e) +
rep(stats::rnorm(n, 0, sigma_s), each=4),
"cond"=rep(paste0("cond", 1:4), n),
"ID"=rep(paste0("s", 1:n), each=4), stringsAsFactors=T)
pVal <- summary(stats::aov(resp ~ cond + Error(ID/cond), df))[[2]][[1]]["cond", "Pr(>F)"]
# pVal <- summary(stats::aov(resp ~ cond + ID, df))[[1]][1, "Pr(>F)"] # same results but significantly faster
list("p"=pVal,
"D"=sqrt(n*3) * pVal * -log(pVal) ) # use total number of independent observations
}
set.seed(277)
pMat5 <- sapply(n, function(x) replicate(nSim, computeDist5(x, m3=2, m4=2)$p)) # roughly 5.4 hours of run time
for (i in 1:length(n)) {
hist(pMat5[,i], prob=T, main=bquote("Distribution of " * italic("p") * "-Value under "
* italic("H")[0] * ", " * italic("N = ") * .(4*n[i])),
xlab=expression(italic(p) * "-Value"), ylab="", col="lightblue")
cat("<br><br>")
} # create the histograms
The regularity condition R1 appears to hold for one-way repeated-measures ANOVA.
We set the model parameters to \(I=4\), \(\mu=2\), \(\alpha_1=0\), \(\alpha_2=0\), \(\alpha_3=0.25\), \(\alpha_4=0.5\), \(\sigma_s=3\), and \(\sigma_\epsilon=1\), resulting in a correlation of \(\rho=\sigma_s^2\ /\ (\sigma_s^2+\sigma_\epsilon^2)=.9\) between any two observations collected on the same subject.
\(\mu_i=\mu+\alpha_i\) and \(\mu_0=\frac{1}{I}\sum_{i=1}^I\mu_i\)
\(\mathcal{H}_1:\boldsymbol{\theta}=(\mu_1-\mu_0,\ \mu_2-\mu_0,\ \mu_3-\mu_0)^\top\neq\boldsymbol{0}\)
For examining the regularity conditions, the sample size in \(D_N\) is taken to be \(n(I-1)\) to account for the repeated-measures correlation. This simple choice is made primarily for convenience and should not affect the conclusions about the behavior of \(D_N\), since it is the number of subjects that tends to infinity while the number of conditions is fixed at \(I=4\).
set.seed(277)
distMat5 <- sapply(n, function(x) replicate(nSim, computeDist5(x)$D)) # roughly 5.5 hours of run time
boxplot(distMat5, names=n, ylab=expression(italic(D)[italic(N)]), xlab="Number of Subjects",
main="One-Way RMANOVA with Four Conditions") # create the boxplot
The regularity condition R2 appears to hold for one-way repeated-measures ANOVA.
Design (Note that each row may not have been normalized.)
Under \(\mathcal{H}_1\):
|  | Column 1 | Column 2 | Column 3 | Row total |
|---|---|---|---|---|
| Row 1 | \(\pi_1\) | \(\pi_2\) | \(1-\pi_1-\pi_2\) | 1 |
| Row 2 | \(\pi_1-\delta\) | \(\pi_2+\delta\) | \(1-\pi_1-\pi_2\) | 1 |
| Row 3 | \(\pi_1+\delta\) | \(\pi_2-\delta\) | \(1-\pi_1-\pi_2\) | 1 |
| Total | \(3\pi_1\) | \(3\pi_2\) | \(3-3\pi_1-3\pi_2\) | 3 |
Under \(\mathcal{H}_0\):
|  | Column 1 | Column 2 | Column 3 | Row total |
|---|---|---|---|---|
| Row 1 | \(\pi_1\) | \(\pi_2\) | \(1-\pi_1-\pi_2\) | 1 |
| Row 2 | \(\pi_1\) | \(\pi_2\) | \(1-\pi_1-\pi_2\) | 1 |
| Row 3 | \(\pi_1\) | \(\pi_2\) | \(1-\pi_1-\pi_2\) | 1 |
| Total | \(3\pi_1\) | \(3\pi_2\) | \(3-3\pi_1-3\pi_2\) | 3 |
The sample size \(N\) is the total number of observations. The degrees of freedom are \(q=(R-1)(C-1)\).
Let \(\pi_1=\pi_2=.25\) and \(\delta=0\). Assume that the total number is
fixed (jointMulti).
nSim <- 5000 # number of simulation runs for each setting
size <- c(100, 250, 500, 1000, 2000, 3000) # average numbers of objects per row
computeDist6 <- function(size, p1=.25, p2=.25, delta=0.075) {
#' Input -
#' size: average number of objects per row
#' p1: proportion in the first row and first column of a 3×3 contingency table
#' p2: proportion in the first row and second column of a 3×3 contingency table
#' delta: designated proportion difference
#'
#' Output -
#' a list of the p-value and $D_{N}=\sqrt{N}\cdot p\cdot(-\ln{p})$ in Eq. 7
prob_H0 <- c(p1, p2, 1-p1-p2) # probabilities for the three column classes under the null
prob_delta <- c(-delta, delta, 0) # effect sizes
mat <- matrix(stats::rmultinom(1, size * 3, c(prob_H0,
prob_H0 + prob_delta,
prob_H0 - prob_delta)), ncol=3, byrow=T) # `jointMulti`
pVal <- stats::chisq.test(mat)$p.value # p-value of the chi-squared test
list("p"=pVal,
"D"=sqrt(size * 3) * pVal * -log(pVal) ) # use total number of objects
}
set.seed(277)
pMat6 <- sapply(size, function(x) replicate(nSim, computeDist6(x, delta=0)$p)) # roughly 0.8 seconds of run time
for (i in 1:length(size)) {
hist(pMat6[,i], prob=T, main=bquote("Distribution of " * italic("p") * "-Value under "
* italic("H")[0] * ", Size = " * .(3*size[i])),
xlab=expression(italic(p) * "-Value"), ylab="", col="lightblue")
cat("<br><br>")
} # create the histograms
The regularity condition R1 appears to hold for the \(\chi^2\)-test.
The smallest sample size is \(3\times100\) to ensure accuracy of the \(\chi^{2}\) test with no cells having an expected count lower than 5.
Let \(\pi_1=\pi_2=.25\) and \(\delta=0.075\). Assume that the total
number is fixed (jointMulti).
Cohen’s \(\omega=\sqrt{\sum_{k=1}^K\frac{(p_{1k}-p_{0k})^2}{p_{0k}}}=\delta\sqrt{\frac{2}{\pi_1}+\frac{2}{\pi_2}}=0.3\) indicates a medium effect size in this case, where \(p_{0k}\) and \(p_{1k}\) are the proportions of the \(k\)-th cell under \(\mathcal{H}_0\) and \(\mathcal{H}_1\), respectively. \(K\) is the number of cells.
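A quick check of the effect-size arithmetic under these settings:
delta <- 0.075; p1 <- 0.25; p2 <- 0.25
delta * sqrt(2 / p1 + 2 / p2) # Cohen's omega = 0.3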
set.seed(277)
distMat6 <- sapply(size, function(x) replicate(nSim, computeDist6(x)$D)) # roughly 0.8 seconds of run time
boxplot(distMat6, names=size*3, ylab=expression(italic(D)[italic(N)]), xlab="Total Number of Objects",
main="Chi-Squared Test for Independence (Fixed Total)") # create the boxplot
The regularity condition R2 appears to hold for the \(\chi^2\)-test.
The smallest sample size is \(3\times100\) to ensure accuracy of the \(\chi^{2}\) test with no cells having an expected count lower than 5.
We model the observed time \(Y_i=\min(T_i,\ C_i)\) and event indicator \(\delta_i=\mathbf{1}_{T_i\leqslant C_i}\), where \(C_i\overset{\text{i.i.d.}}{\sim}\operatorname{Exp}(0.2)\) are random and independent right-censoring times. The fitted model is a Cox proportional-hazards regression, and the \(p\)-value is from the Wald test based on the partial likelihood.
In this case, we assume \[\begin{align} T_i&\overset{\text{ind.}}{\sim}\operatorname{Exp}(\lambda_i) \\ &\ln{\lambda_i}=\beta_0+\beta_1 x_i\quad\text{for}\ i=1,\dotsb,N^*. \end{align}\]
The sample size \(N\) is the number of events. The degree of freedom is \(q=1\).
The expected censoring fraction is \(\mathbb{E}[\mathbb{P}(C<T\mid X)]=\mathbb{E}_{X\sim\mathcal{N}(0,1)}\left[\frac{0.2}{0.2+\exp\{\beta_0+\beta_1X\}} \right]\approx\operatorname{logit}^{-1}\left(\frac{\ln0.2-\beta_0}{\sqrt{1+\pi\beta_1^2/8}} \right)\), where \(\operatorname{logit}^{-1}(t)=\frac{1}{1+e^{-t}}\).
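A quick Monte Carlo check of this approximation (the helper name cenFrac is ours), for the two slope values used in the simulations below:
cenFrac <- function(beta0, beta1, nMC=1e6) {
  x <- stats::rnorm(nMC) # predictor values
  c("simulated"=mean(0.2 / (0.2 + exp(beta0 + beta1 * x))),
    "approximated"=stats::plogis((log(0.2) - beta0) / sqrt(1 + pi * beta1^2 / 8)))
}
set.seed(277)
cenFrac(0, 0)   # ~0.167 under the null (beta1 = 0)
cenFrac(0, 0.5) # ~0.18 under the alternative (beta1 = 0.5)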
We set the model parameters to \(\beta_0=0\) and \(\beta_1=0\). The test is two-sided with null hypothesis \(\mathcal{H}_0:\beta_1=0\).
nSim <- 5000 # number of simulation runs for each setting
N <- c(25, 50, 100, 250, 500) # numbers of observations
computeDist7 <- function(N, intercept=0, slope=0.5) {
#' Input -
#' N: number of observations
#' intercept: true intercept
#' slope: true coefficient
#'
#' Dependency - survival (v3.8-3)
#' Output -
#' a list of the p-value and $D_{N}=\sqrt{N}\cdot p\cdot(-\ln{p})$ in Eq. 7
x <- stats::rnorm(N) # predictor variable
lambda <- exp(intercept + slope * x) # rate parameter
survTime <- stats::rexp(N, lambda) # survival time
cenTime <- stats::rexp(N, 0.2) # right-censoring time
dat <- list(time=pmin(survTime, cenTime),
status=as.numeric(survTime <= cenTime),
x=x)
pVal <- summary(survival::coxph(survival::Surv(time, status) ~ x, dat))$coefficients["x", "Pr(>|z|)"]
N <- sum(dat$status==1)
list("p"=pVal,
"D"=sqrt(N) * pVal * -log(pVal) ) # use number of events
}
set.seed(277)
pMat7 <- sapply(N, function(x) replicate(nSim, computeDist7(x, slope=0)$p)) # roughly 27 seconds of run time
for (i in 1:length(N)) {
hist(pMat7[,i], prob=T, main=bquote("Distribution of " * italic("p") * "-Value under "
* italic("H")[0] * ", " * italic("N* = ") * .(N[i])),
xlab=expression(italic(p) * "-Value"), ylab="", col="lightblue")
cat("<br><br>")
} # create the histograms
Note that the smallest number of observations is \(N^*=25\), because, with right censoring, the Cox model cannot estimate parameters when \(N^*=10\).
The regularity condition R1 appears to hold for the Cox proportional-hazards model.
We set the model parameters to \(\beta_0=0\) and \(\beta_1=0.5\). The hazard ratio is \(e^{\beta_1}\approx1.65\).
set.seed(277)
distMat7 <- sapply(N, function(x) replicate(nSim, computeDist7(x)$D)) # roughly 27 seconds of run time
boxplot(distMat7, names=N, ylab=expression(italic(D)[italic(N)]), xlab="Number of Observations",
main="Cox Proportional-Hazards Regression") # create the boxplot
Note that the smallest number of observations is \(N^*=25\), because, with right censoring, the Cox model cannot estimate parameters when \(N^*=10\).
The regularity condition R2 appears to hold for the Cox proportional-hazards model.
We obtained unexpected results from the BIC approximation-based methods in a one-way ANOVA, specifically when the number of groups is relatively large (five in the example below).
In the analysis below, the \(p\)-value is around 0.04, and the anovaBF function returns a value around 1. However, the BIC approximation yields an extreme value of 143 in favor of the null hypothesis.
n <- 30 # number of observations per balanced group
a <- 5 # number of groups
m1 <- 0 # population mean of the first group (suppose the lowest)
delta <- 0.3 # evenly-spaced population mean difference, m2 - m1 = m3 - m2 = ···
set.seed(277)
data <- data.frame("resp" = rnorm(n * a, m1 + delta * (1:a - 1), 1),
"grp" = rep(paste0("grp", 1:a), n), stringsAsFactors=T)
summary(fit1 <- aov(resp ~ grp, data)) # full model fit
## Df Sum Sq Mean Sq F value Pr(>F)
## grp 4 10.03 2.5078 2.53 0.0431 *
## Residuals 145 143.74 0.9913
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
FVal <- summary(fit1)[[1]]["grp", "F value"] # F-ratio
1 / BayesFactor::anovaBF(resp ~ grp, data, progress=F) # JZS-BF₀₁
## Bayes factor analysis
## --------------
## [1] Intercept only : 1.13705 ±0%
##
## Against denominator:
## resp ~ grp
## ---
## Bayes factor type: BFlinearModel, JZS
# BIC approximation
fit0 <- aov(resp ~ 1, data) # null model fit
exp((BIC(fit1) - BIC(fit0)) / 2) # BIC-BF₀₁
## [1] 142.8294
# Jeffreys approximate Bayes factor (Λ → (a-1)⋅F)
(n*a)^((a-1)/2) * exp(-0.5 * (a-1) * FVal) # original JAB₀₁
## [1] 142.8158
sqrt(n*a) * exp(-0.5 * (a-1) * FVal * ((n*a)^(1/(a-1))-1) / (n*a)^(1/(a-1))) # extended JAB₀₁
## [1] 0.330016
We have encountered a similar issue with BIC approximation-based methods in chi-squared tests for independence.
Consider a scenario where the total number of observations is 200, and the contingency table is \(3\times4\). A \(p\)-value of 0.01 corresponds to an original \(\mathit{JAB}_{01}\) value of 1865 in favor of the null hypothesis, while a \(p\)-value of 0.0001 corresponds to an original \(\mathit{JAB}_{01}\) value of 7.66, also in favor of the null. These results seem paradoxical.
pVal <- .0001; N <- 200; R <- 3; C <- 4; df <- (R-1) * (C-1)
N^(df/2) * exp(-0.5 * qchisq(1-pVal, df) * (N-1) / N) # original JAB₀₁
## [1] 7.663144
sqrt(N) * exp(-0.5 * qchisq(1-pVal, df) * (N^(1/df)-1) / N^(1/df)) # extended JAB₀₁
## [1] 0.004008032
The Wilcoxon signed-rank test is a nonparametric alternative to the one-sample or paired \(t\)-test. It tests hypotheses about the median of differences.
The sample size \(N\) is the number of paired observations. The degree of freedom is \(q=1\).
nSim <- 5000 # number of simulation runs for each setting
n <- c(10, 25, 50, 100, 250, 500) # numbers of paired observations
computeDist8 <- function(n, mu=c(0, 0.5), S=matrix(c(1, 0.9, 0.9, 1), 2), nu=3) {
#' Input -
#' n: number of paired observations
#' mu: population means
#' S: scale matrix for paired observations
#' nu: degrees of freedom of the multivariate t
#'
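#' Dependency - LaplacesDemon (v16.1.6)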
#' Output -
#' a list of the p-value and $D_{N}=\sqrt{N}\cdot p\cdot(-\ln{p})$ in Eq. 7
Y <- LaplacesDemon::rmvt(n, mu, S, nu)
pVal <- stats::wilcox.test(Y[,1], Y[,2], paired=T)$p.value
list("p"=pVal,
"D"=sqrt(n) * pVal * -log(pVal) )
}
set.seed(277)
pMat8 <- sapply(n, function(x) replicate(nSim, computeDist8(x, c(0, 0))$p)) # roughly 7.8 seconds of run time
for (i in 1:length(n)) {
hist(pMat8[,i], prob=T, main=bquote("Distribution of " * italic("p") * "-Value under "
* italic("H")[0] * ", " * italic("N = ") * .(n[i])),
xlab=expression(italic(p) * "-Value"), ylab="", col="lightblue")
cat("<br><br>")
} # create the histograms
set.seed(277)
distMat8 <- sapply(n, function(x) replicate(nSim, computeDist8(x)$D)) # roughly 7.6 seconds of run time
boxplot(distMat8, names=n, ylab=expression(italic(D)[italic(N)]), xlab="Number of the Paired Observation",
main="Wilcoxon Signed-Rank Test") # create the boxplot
The Mann–Whitney \(U\)-test (also called the Wilcoxon rank-sum test) is a nonparametric alternative to the two-sample \(t\)-test. It tests the null hypothesis that two distributions are identical.
The sample size \(N=n_1+n_2\) is the total number of observations. The degree of freedom is \(q=1\).
nSim <- 5000 # number of simulation runs for each setting
n <- c(10, 25, 50, 100, 250, 500) # numbers of observations per balanced group
computeDist9 <- function(n, mu1=0, mu2=0.5, S1=1, S2=1, nu=3) {
#' Input -
#' n: numbers of observations per balanced group
#' mu1: population mean of the first group
#' mu2: population mean of the second group
#' S1, S2: scale parameters for the two population distributions
#' nu: degrees of freedom of the t-distributions
#'
#' Dependency - LaplacesDemon (v16.1.6)
#' Output -
#' a list of the p-value and $D_{N}=\sqrt{N}\cdot p\cdot(-\ln{p})$ in Eq. 7
y1 <- LaplacesDemon::rmvt(n, mu1, S1, nu)
y2 <- LaplacesDemon::rmvt(n, mu2, S2, nu)
pVal <- stats::wilcox.test(y1, y2)$p.value
list("p"=pVal,
"D"=sqrt(n * 2) * pVal * -log(pVal) ) # use total number of observations
}
set.seed(277)
pMat9 <- sapply(n, function(x) replicate(nSim, computeDist9(x, mu2=0)$p)) # roughly 12 seconds of run time
for (i in 1:length(n)) {
hist(pMat9[,i], prob=T, main=bquote("Distribution of " * italic("p") * "-Value under "
* italic("H")[0] * ", " * italic("N = ") * .(2*n[i])),
xlab=expression(italic(p) * "-Value"), ylab="", col="lightblue")
cat("<br><br>")
} # create the histograms
set.seed(277)
distMat9 <- sapply(n, function(x) replicate(nSim, computeDist9(x)$D)) # roughly 12 seconds of run time
boxplot(distMat9, names=n, ylab=expression(italic(D)[italic(N)]),
xlab="Number of Observations per Balanced Group",
main=expression("Mann–Whitney " * italic(U) * "-Test")) # create the boxplot
The Kruskal–Wallis \(H\)-test is a nonparametric alternative to the one-way ANOVA. It tests the null hypothesis that all groups have identical distributions.
The sample size \(N=nI\) is the total number of observations. The degrees of freedom are \(q=I-1\).
nSim <- 5000 # number of simulation runs for each setting
n <- c(10, 25, 50, 100, 250, 500) # numbers of observations per balanced group
computeDist10 <- function(n, a=5, delta=0.5, mu=0, sigma=1, alpha=0.5, nu=4) {
#' Input -
#' n: number of observations per balanced group
#' a: number of groups
#' delta: population mean range
#' mu: population mean of the first group (suppose the lowest)
#' sigma: scale parameter for each population
#' alpha: slant parameter for the skew-t distribution
#' nu: degrees of freedom of the skew-t distribution
#'
#' Dependency - sn (v2.1.1)
#' Output -
#' a list of the p-value and $D_{N}=\sqrt{N}\cdot p\cdot(-\ln{p})$ in Eq. 7
mu.vec <- seq(mu, mu+delta, length.out=a) # population means
df <- data.frame("resp"=sn::rst(n * a, mu.vec, sigma, alpha, nu),
"grp"=rep(paste0("grp", 1:a), n),
stringsAsFactors=T)
pVal <- stats::kruskal.test(resp ~ grp, df)$p.value
list("p"=pVal,
"D"=sqrt(n * a) * pVal * -log(pVal) ) # use total number of observations
}
set.seed(277)
pMat10 <- sapply(n, function(x) replicate(nSim, computeDist10(x, delta=0)$p)) # roughly 55 seconds of run time
for (i in 1:length(n)) {
hist(pMat10[,i], prob=T, main=bquote("Distribution of " * italic("p") * "-Value under "
* italic("H")[0] * ", " * italic("N = ") * .(5*n[i])),
xlab=expression(italic(p) * "-Value"), ylab="", col="lightblue")
cat("<br><br>")
} # create the histograms
set.seed(277)
distMat10 <- sapply(n, function(x) replicate(nSim, computeDist10(x)$D)) # roughly 56 seconds of run time
boxplot(distMat10, names=n, ylab=expression(italic(D)[italic(N)]),
xlab="Number of Observations per Balanced Group",
main=expression("Kruskal-Wallis " * italic(H) * "-Test with Five Groups")) # create the boxplot
In a 1:1 matched case–control study, we assume that the conditional likelihood is as follows
\[\begin{align} &\mathbb{P}\left(Y_{ij}=1\mid\mathbf{x}_{i1},\ \mathbf{x}_{i2},\ Y_{i1}+Y_{i2}=1\right)=\frac{\exp\left\{\boldsymbol{\beta}^\top\mathbf{x}_{ij}\right\}}{\exp\left\{\boldsymbol{\beta}^\top\mathbf{x}_{i1}\right\}+\exp\left\{\boldsymbol{\beta}^\top\mathbf{x}_{i2}\right\}} \\ &\mathbf{x}_{ij}=(x_{1,ij},\ x_{2,ij})^\top\quad\text{for the observation}\ j\in\{1,2\}\ \text{of the stratum}\ i=1,\dotsb,N^*. \end{align}\]
The sample size \(N\) is the number of matched pairs in which the case and control have different exposure values. The degree of freedom is \(q=1\).
We set the model parameters to \(\beta_1=0\) and \(\beta_2=0.25\). The test is two-sided with null hypothesis \(\mathcal{H}_0:\beta_1=0\).
nSim <- 5000 # number of simulation runs for each setting
n <- c(100, 220, 340, 460, 580, 700) # number of pairs
computeDist11 <- function(n, beta1=0.25, beta2=0.25) {
#' Input -
#' n: number of pairs (>= number of matched pairs)
#' beta1: true coefficient for the first predictor
#' beta2: true coefficient for the second predictor
#'
#' Dependency - survival (v3.8-3)
#' Output -
#' a list of the p-value and $D_{N}=\sqrt{N}\cdot p\cdot(-\ln{p})$ in Eq. 7
# two observations per stratum
x1 <- stats::rnorm(2 * n)
x2 <- stats::rbinom(2 * n, 1, .5)
logitP <- beta1 * x1 + beta2 * x2
strata <- rep(1:n, each=2)
y <- unlist(lapply(split(logitP, strata), function(x) stats::rbinom(2, 1, exp(x) / sum(exp(x)))))
# one case and one control per stratum
valid <- rep(tapply(y, strata, sum)==1, each=2)
df <- data.frame(y, x1, x2, "s"=strata)[valid,]
fit <- survival::clogit(y ~ x1 + x2 + survival::strata(s), df)
pVal <- summary(fit)$coefficients[,"Pr(>|z|)"][1]
N <- nrow(df) / 2
list("p"=pVal,
"D"=sqrt(N) * pVal * -log(pVal) ) # use number of matched pairs
}
set.seed(277)
pMat11 <- sapply(n, function(x) replicate(nSim, computeDist11(x, beta1=0)$p)) # roughly 112 seconds of run time
for (i in 1:length(n)) {
hist(pMat11[,i], prob=T, main=bquote("Distribution of " * italic("p") * "-Value under "
* italic("H")[0] * ", " * italic("N* = ") * .(n[i])),
xlab=expression(italic(p) * "-Value"), ylab="", col="lightblue")
cat("<br><br>")
} # create the histograms
The regularity condition R1 appears to hold for conditional logistic regression.
We set the model parameters to \(\beta_1=0.25\) and \(\beta_2=0.25\). The odds ratio is \(e^{\beta_1}\approx1.28\).
set.seed(277)
distMat11 <- sapply(n, function(x) replicate(nSim, computeDist11(x)$D)) # roughly 104 seconds of run time
boxplot(distMat11, names=n, ylab=expression(italic(D)[italic(N)]), xlab="Number of Pairs (Including Unmatched)",
main="Conditional Logistic Regression") # create the boxplot
The regularity condition R2 appears to hold for conditional logistic regression.