Corey S. Sparks, PhD
February 24, 2016
Up until now, we have used (G)LMM's to analyze data that were “clustered”
The next topic will introduce a modeling strategy that allows us to consider clustered data, but in a different fashion
GLMM's are commonly referred to as conditional models, because the model coefficients “\( \beta \)'s” are condition on the random effects in the model.
Likewise, the mean if conditional on the random effects. This is another way of saying that the mean for a given covariate pattern is conditional on the group that the particular person is in.
\( \mu_{ij}^c = E(Y_{ij} | u_j) = X_{ij}\beta + u_j \)
In contrast, Generalzed Estimating Equations are referred to as marginal models because they only estimate the overall mean.
\( \mu_{ij} = X_{ij}\beta \)
Lee and Nelder, 2004 provide a very good description of how these two methods compare to one another
A basic form of the model would be:
\( Y_{ij} = \beta_0 + \sum_k X_{ijk} \beta_k + CORR + error \)
Ordinary models will tend to over estimate the standard errors for the \( \beta \)'s for time varying predictors in a model with repeated observations, because these models do not account for the correlation within clusters \ observations over time.
Likewise, the standard errors of time invariant predictors will be under estimated
Given the mean function for the model and a specified correlation function, the model parameters may be estimated by finding the solution for:
\[ U(\beta) = \sum_i ^n \frac{\delta \mu_{ij}}{ \delta \beta_k} V_i ^{-1} (Y_{ij} - \mu(\beta)) \]
Which gives estimates of the \( \beta \)'s for the linear mean function.
For three time points per person, the ordinary regression model correlation in residuals within clusters/persons over time can be thought of as the matrix:
\[ \begin{bmatrix} \sigma^2 & 0 & 0 \\ 0 & \sigma^2 &0 \\ 0 & 0 & \sigma^2 \end{bmatrix} \]
which assumed the variances are constant and the residuals are independent over time
But in a GEE, the model include the actual correlation between measurements over time:
\[ \begin{bmatrix} \sigma_1 ^2 & a & c \\ a & \sigma_2 ^2 &b \\ b & c & \sigma_3 ^2 \end{bmatrix} \]
Which allows the variances over time to be different, as well as correlations between times to be present.
\[ \begin{bmatrix} 1 & 0 & 0 \\ 0 & 1 &0 \\ 0 & 0 & 1 \end{bmatrix} \]
\[ \begin{bmatrix} 1 & \rho & \rho \\ \rho & 1 &\rho \\ \rho &\rho & 1 \end{bmatrix} \]
\[ \begin{bmatrix} 1 & \rho & \rho^2 \\ \rho & 1 &\rho\\ \rho^2 & \rho & 1 \end{bmatrix} \]
\[ \begin{bmatrix} 1 & \rho_1 & \rho_2 \\ \rho_1 & 1 &\rho_3 \\ \rho_2 & \rho_3& 1 \end{bmatrix} \]