Generalized Jeffreys’s Approximate Objective Bayes Factor

============================================================================================================

0. Getting Started

\[\begin{equation} \tag{1} Y_i\mid x_i\sim\text{Bernoulli}(p_i)=p_i^{y}\cdot(1-p_i)^{1-y}\quad\text{for }y\in\{0,1\}\text{ and }i=1,\dotsb,n \end{equation}\]

$\mathbb{E}[Y_i\mid x_i]=p_i$ and the most commonly used link function is the log-odds $\eta_i=\text{logit}(p_i)\equiv\ln\left(\frac{p_i}{1-p_i}\right)$ for $p_i\in(0,1)$.

\[\begin{align} \tag{2} \mathcal{M}_1:\ \eta_i&=\beta_0+\beta_1 x_i\\ \text{versus}\quad\mathcal{M}_0:\ \eta_i&=\beta_0 \end{align}\]

Recall that the logistic sigmoid function is invertible, and its inverse is the logit function.

Thus, $p_i=\frac{\exp\{\eta_i\}}{\exp\{\eta_i\}+1}=\frac{1}{1+\exp\{-\beta_0-\beta_1 x_i\}}$, which maps any real-valued number into the range $(0,1)$.

Other link functions that can be used include the probit link for modeling normal latent variables
and the complementary log-log link when the probability $p$ is close to 1 or 0.

Generalized Linear Model (GLM)

Stochastic part

\[\begin{align} Y_i\sim f(y_i;\boldsymbol{\theta}_i)&=\exp\left\{a(y_i)\cdot b(\boldsymbol{\theta}_i)+c(\boldsymbol{\theta}_i)+d(y_i)\right\} \\ a(y_i)&=y_i\quad\text{canonical form}\implies\text{exponential family} \\ b(\boldsymbol{\theta}_i)&\quad\text{natural parameter(s)} \end{align}\]

Systematic part: linear predictors

\[\begin{equation} \boldsymbol{\eta}=\mathbf{X}\boldsymbol{\beta} \end{equation}\]

Link function(s)

\[\begin{equation} \eta_i=g\left(\mu_i(\boldsymbol{\theta}_i)\right),\ \text{where }\mu_i(\boldsymbol{\theta}_i)=\mathbb{E}[Y_i\mid\boldsymbol{x}_i] \end{equation}\]

$Y\sim\operatorname{Pois}(\lambda),\hspace{2.1em}\mathbb{E}[Y]=\text{Var}(Y),\hspace{2em}\text{equidispersion}$

$Y\sim\textit{B}(n,p),\hspace{2.25em}\mathbb{E}[Y]>\text{Var}(Y),\hspace{2.02em}\text{underdispersion}$

$Y\sim\textit{NB}(r,p),\hspace{1.7em}\mathbb{E}[Y]<\text{Var}(Y),\hspace{2em}\text{overdispersion}$

GLM 101

Why is there no error term in Eq. 2? See Wooldridge (2010, p. 565-567), “Econometric Bible”.

	mean	se_mean	sd	2.5%	25%	50%	75%	97.5%	n_eff	Rhat
beta0	-0.4950	0.0028	0.4197	-1.3453	-0.7707	-0.4866	-0.2094	0.3032	22213.30	0.9999
beta1	1.8821	0.0063	0.8985	0.2598	1.2569	1.8314	2.4454	3.7682	20395.86	1.0001
lp__	-24.3457	0.0094	1.0415	-27.1401	-24.7364	-24.0238	-23.6118	-23.3355	12269.44	1.0000

	mean	se_mean	sd	2.5%	25%	50%	75%	97.5%	n_eff	Rhat
beta0	-0.4167	0.0036	0.3793	-1.1727	-0.6672	-0.4093	-0.1636	0.3134	11194.66	1.0004
lp__	-24.1490	0.0063	0.7288	-26.2092	-24.3111	-23.8672	-23.6889	-23.6398	13273.91	1.0003

Generalized Jeffreys’s Approximate Objective Bayes Factor

Part Three: Simple Logistic Regression Simulation

Zhengxiao Wei*, Puneet Velidi, Shreena Nisha Kalaria, Yimeng Liu, Céline M. Laumont, Brad H. Nelson, Farouk S. Nathoo

2024-07-24 version 0.31.9000 ✉*: sherloconan{at}gmail{dot}com

0. Getting Started

1. Methods

“BFpack” R Package

\(t\)-Test

Regression

BIC Approximation

Approximate Bayes Factor

JAGS

Stan

2. Simulation

3. Graphic Results

Boxplot (Percent Errors)

Boxplot (\(\textit{BF}_{01}\))

Line Chart (Proportions of Agreement)

4. Decision Rules

5. Finite Sample Correction

6. clogit

Stan

Simulations

7. eJAB

Zhengxiao Wei*, Puneet Velidi, Shreena Nisha Kalaria, Yimeng Liu,
Céline M. Laumont, Brad H. Nelson, Farouk S. Nathoo