Health Literacy Dashboard

Welcome

This is a Quarto dashboard that provides an overview of health literacy in Newham. The dashboard is based on data from the Health Literacy Survey conducted by Newham Council and UCL. The survey collected data on health literacy, IT literacy, and numeracy among residents in Newham. The dashboard provides an interactive visualization of the survey data, allowing users to explore the relationship between health literacy and other factors.

Plot
Explanation

Overall, this plot visually summarizes the average effects of different variables on health literacy, showing both the point estimates and the uncertainty (confidence intervals) around those estimates. The vertical line at zero helps viewers quickly assess whether the effects are positive, negative, or insignificant.

This corresponds to the gradients in the AME scatter plots.

Plot
Explanation

Individual scatter plots for each variable, showing how values vary across different categories (name). Each plot includes jittered points for individual observations, red lines connecting the mean values of adjacent categories, and clearly labeled axes. Finally, all the plots are displayed together in a grid format, allowing for easy comparison across different variables.

The vertical spread of the point corresponds to the width of the bars in the forest plot.

Plot
Explanation

Calculates the probabilities of ranking for different variables based on their values. It creates a bar plot that visually represents these probabilities, showing how frequently each variable appears in the top ranks (1 to 4). This allows for easy comparison of ranking probabilities among different categories, making it clear which variables are more likely to be ranked highly.

Notice that the second most probable variable for ranking first isn’t necessarily the most probable variable for ranking second. This is an idnication of the uncertainty in the data.

Plot
Explanation

Calculates and visualizes the cumulative probability of ranking for different variables, effectively showing how likely each variable is to achieve a rank of 1, 2, 3, or 4 or higher. The resulting line plot allows for easy comparison between variables, illustrating their performance across ranks and helping to identify which variables consistently rank higher.

People talk about the area under the curve (AUC) as a measure of the impact. If we were to value the first 4 ranks equally, then the AUC would be 1. If we were to value the first rank more than the second, then the AUC would be less than 1.

Plot
Explanation

$sex


$age


$ethnicity


$uk_born


$english_lang


$qualification


$workingstatus


$job_status


$gross_income


$own_home


$imd

Compute average effects and uncertainty for various variables across different strata, then generates a series of plots to visualize these effects. Each plot displays the relationship between variable names and their average marginal effects on health literacy, highlighting how these effects vary by stratum.

TODO

Multilevel Regression and Post-Stratification (MRP)

Multilevel regression with poststratification (MRP) is a statistical technique that combines multilevel modeling with poststratification to estimate subnational or small-area characteristics from survey data . It accounts for hierarchical structures in the data (such as individuals nested within groups) while also adjusting for known population characteristics through poststratification, providing more precise estimates for small areas or subgroups.

The MRP equations can be represented as follows. The MRP regression is

\[ \pi_i = Pr(y_{i} = 1) = g^{-1} \left( \beta_0 + \beta_{1,j[i]} + \beta_{2,j[i]} + \ldots + \beta_{p,j[i]} \right) \]

with predictions

\[ \hat{\pi}_{i} = g^{-1}\left( \hat{\beta}_0 + \hat{\beta}_{1,j[i]} + \hat{\beta}_{2,j[i]} + \ldots + \hat{\beta}_{p,j[i]} + \hat{u}_{0j} \right) \] where \[\begin{align*} \hat{\pi}_{i} & \text{ is the predicted outcome for individual } i, \\ \beta_{0,j[i]}, \beta_{1,j[i]}, \ldots, \beta_{p,j[i]} & \text{ are coefficients,} \\ u_{0j} & \text{ is the random intercept for group } j, \\ \epsilon_{i} & \text{ is the error term for individual }. \end{align*}\]

After obtaining predictions at the group level, post-stratification involves weighting these predictions by the population distribution of key stratification variables, such as age, gender, or ethnicity. This can be represented as

\[ \hat{Y}^*_{g} = \sum_{j} w_{gj} \hat{Y}_{j} \] where \[\begin{align*} \hat{Y}^*_{g} & \text{ is the post-stratified estimate for stratum } g, \\ \hat{Y}_{j} & \text{ is the predicted outcome for group } j, \\ w_{gj} & \text{ is the weight for group } j \text{ in stratum } g. \end{align*}\]