Exploratory Analysis: Varying Slopes
Exploring whether there is variation in the relationship between unemployment rate and the share of population in long-term illness (selecting the 8 MSOAs containing OAs with the highest unemployment rates in Liverpool). A great variability in the relationship between unemployment rates and the percentage of population in long-term illness can be observed. This visual inspection suggests that accounting for differences in the way uenmployment rates relate to long-term illness is important.
Simple feature collection with 9 features and 2 fields
Geometry type: MULTIPOLYGON
Dimension: XY
Bounding box: xmin: 335032 ymin: 387777 xmax: 338576.1 ymax: 395022.4
Projected CRS: Transverse_Mercator
msoa_cd unemp geometry
1 E02001354 0.5000000 MULTIPOLYGON (((337491.2 39...
2 E02001369 0.4960630 MULTIPOLYGON (((335272.3 39...
3 E02001366 0.4461538 MULTIPOLYGON (((338198.1 39...
4 E02001365 0.4352941 MULTIPOLYGON (((336572.2 39...
5 E02001370 0.4024390 MULTIPOLYGON (((336328.3 39...
6 E02001390 0.3801653 MULTIPOLYGON (((335833.6 38...
7 E02001354 0.3750000 MULTIPOLYGON (((337403 3949...
8 E02001385 0.3707865 MULTIPOLYGON (((336251.6 38...
9 E02001368 0.3648649 MULTIPOLYGON (((335209.3 39...
Estimating Varying Intercept and Slopes Models
The correlation of fixed effects indicates a negative relationship between the intercept and slope of the average regression model: as the average model intercept tends to increase, the average strength of the relationship between unemployment rate and long-term illness decreases and vice versa.Linear mixed model fit by REML ['lmerMod']
Formula: unemp ~ lt_ill + (1 + lt_ill | msoa_cd)
Data: oa_shp
REML criterion at convergence: -4762.8
Scaled residuals:
Min 1Q Median 3Q Max
-3.6639 -0.5744 -0.0873 0.4565 5.4876
Random effects:
Groups Name Variance Std.Dev. Corr
msoa_cd (Intercept) 0.003428 0.05855
lt_ill 0.029425 0.17154 -0.73
Residual 0.002474 0.04974
Number of obs: 1584, groups: msoa_cd, 61
Fixed effects:
Estimate Std. Error t value
(Intercept) 0.047650 0.008635 5.519
lt_ill 0.301259 0.028162 10.697
Correlation of Fixed Effects:
(Intr)
lt_ill -0.786
Interpretation of coefficients:
The fixed intercept indicates that the average unemployment rate is 5% if the percentage of population with long-term illness is zero. The fixed slope indicates that the average relationship between unemployment rate and long-term illness is positive across MSOAs: as the percentage of population with long-term illness increases by 1 percentage point, the unemployment rate increases by 0.3.
Caterpillar plots allow for an understanding of the general pattern of the random effects. Only one MSOA seems to have a statistically significantly different intercept, or average unemployment rate. Significant variability exists in the association between unemployment rate and long-term illness across MSOAs. This means that geographical differences in the relationship between unemployment rate and long-term illness can explain the significant differences in average unemployment rates in the varying intercept only model.
Random Effects Map of Liverpool
As expected, a greater share of population in long-term illness is associated with higher local unemployment. The relationship between unemployment rate and long-term illness tends to be stronger and positive in northern MSOAs.
Linear mixed model fit by REML ['lmerMod']
Formula:
unemp ~ z_lt_ill + z_no_qual + (1 + z_lt_ill | msoa_cd)
Data: oa_shp
REML criterion at convergence: -4940.7
Scaled residuals:
Min 1Q Median 3Q Max
-3.6830 -0.5949 -0.0868 0.4631 6.3556
Random effects:
Groups Name Variance Std.Dev. Corr
msoa_cd (Intercept) 8.200e-04 0.02864
z_lt_ill 2.161e-06 0.00147 -0.04
Residual 2.246e-03 0.04739
Number of obs: 1584, groups: msoa_cd, 61
Fixed effects:
Estimate Std. Error t value
(Intercept) 0.1163682 0.0039201 29.68
z_lt_ill -0.0003130 0.0003404 -0.92
z_no_qual 0.3245811 0.0221347 14.66
Correlation of Fixed Effects:
(Intr) z_lt_l
z_lt_ill -0.007
z_no_qual -0.015 -0.679
In order to assess different models and test whether additional predictors improve model fit, look for lower scores for these indicators: Akaike Information Criterion (AIC), Bayesian Information Criterion (BIC), Loglik and Deviance (for nested models), and Chi-squared statistic.
Data: oa_shp
Models:
model6: unemp ~ lt_ill + (1 + lt_ill | msoa_cd)
model7: unemp ~ z_lt_ill + z_no_qual + (1 + z_lt_ill | msoa_cd)
npar AIC BIC logLik deviance Chisq Df
model6 6 -4764.7 -4732.5 2388.3 -4776.7
model7 7 -4956.5 -4918.9 2485.2 -4970.5 193.76 1
Pr(>Chisq)
model6
model7 < 2.2e-16 ***
---
Signif. codes:
0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
The results indicate that adding an individual-level predictor (the proportion of population with no qualification) provides a model with better fit.