Exploratory Analysis

Column

Exploratory Analysis: Varying Slopes

Exploring whether there is variation in the relationship between unemployment rate and the share of population in long-term illness (selecting the 8 MSOAs containing OAs with the highest unemployment rates in Liverpool). A great variability in the relationship between unemployment rates and the percentage of population in long-term illness can be observed. This visual inspection suggests that accounting for differences in the way uenmployment rates relate to long-term illness is important.

Simple feature collection with 9 features and 2 fields
Geometry type: MULTIPOLYGON
Dimension:     XY
Bounding box:  xmin: 335032 ymin: 387777 xmax: 338576.1 ymax: 395022.4
Projected CRS: Transverse_Mercator
    msoa_cd     unemp                       geometry
1 E02001354 0.5000000 MULTIPOLYGON (((337491.2 39...
2 E02001369 0.4960630 MULTIPOLYGON (((335272.3 39...
3 E02001366 0.4461538 MULTIPOLYGON (((338198.1 39...
4 E02001365 0.4352941 MULTIPOLYGON (((336572.2 39...
5 E02001370 0.4024390 MULTIPOLYGON (((336328.3 39...
6 E02001390 0.3801653 MULTIPOLYGON (((335833.6 38...
7 E02001354 0.3750000 MULTIPOLYGON (((337403 3949...
8 E02001385 0.3707865 MULTIPOLYGON (((336251.6 38...
9 E02001368 0.3648649 MULTIPOLYGON (((335209.3 39...

Varying Intercept and Slopes

Column

Estimating Varying Intercept and Slopes Models

The correlation of fixed effects indicates a negative relationship between the intercept and slope of the average regression model: as the average model intercept tends to increase, the average strength of the relationship between unemployment rate and long-term illness decreases and vice versa.
Linear mixed model fit by REML ['lmerMod']
Formula: unemp ~ lt_ill + (1 + lt_ill | msoa_cd)
   Data: oa_shp

REML criterion at convergence: -4762.8

Scaled residuals: 
    Min      1Q  Median      3Q     Max 
-3.6639 -0.5744 -0.0873  0.4565  5.4876 

Random effects:
 Groups   Name        Variance Std.Dev. Corr 
 msoa_cd  (Intercept) 0.003428 0.05855       
          lt_ill      0.029425 0.17154  -0.73
 Residual             0.002474 0.04974       
Number of obs: 1584, groups:  msoa_cd, 61

Fixed effects:
            Estimate Std. Error t value
(Intercept) 0.047650   0.008635   5.519
lt_ill      0.301259   0.028162  10.697

Correlation of Fixed Effects:
       (Intr)
lt_ill -0.786

Interpretation of coefficients:

The fixed intercept indicates that the average unemployment rate is 5% if the percentage of population with long-term illness is zero. The fixed slope indicates that the average relationship between unemployment rate and long-term illness is positive across MSOAs: as the percentage of population with long-term illness increases by 1 percentage point, the unemployment rate increases by 0.3.

RE Plots

Column

Caterpillar plots allow for an understanding of the general pattern of the random effects. Only one MSOA seems to have a statistically significantly different intercept, or average unemployment rate. Significant variability exists in the association between unemployment rate and long-term illness across MSOAs. This means that geographical differences in the relationship between unemployment rate and long-term illness can explain the significant differences in average unemployment rates in the varying intercept only model.

RE Map

Column

Random Effects Map of Liverpool

As expected, a greater share of population in long-term illness is associated with higher local unemployment. The relationship between unemployment rate and long-term illness tends to be stronger and positive in northern MSOAs.

Model Building

Column

Reestimating the model adding an individual-level predictor: the share of population with no educational qualification. Furthermore, predictors are now centered.
Linear mixed model fit by REML ['lmerMod']
Formula: 
unemp ~ z_lt_ill + z_no_qual + (1 + z_lt_ill | msoa_cd)
   Data: oa_shp

REML criterion at convergence: -4940.7

Scaled residuals: 
    Min      1Q  Median      3Q     Max 
-3.6830 -0.5949 -0.0868  0.4631  6.3556 

Random effects:
 Groups   Name        Variance  Std.Dev. Corr 
 msoa_cd  (Intercept) 8.200e-04 0.02864       
          z_lt_ill    2.161e-06 0.00147  -0.04
 Residual             2.246e-03 0.04739       
Number of obs: 1584, groups:  msoa_cd, 61

Fixed effects:
              Estimate Std. Error t value
(Intercept)  0.1163682  0.0039201   29.68
z_lt_ill    -0.0003130  0.0003404   -0.92
z_no_qual    0.3245811  0.0221347   14.66

Correlation of Fixed Effects:
          (Intr) z_lt_l
z_lt_ill  -0.007       
z_no_qual -0.015 -0.679

Model Comparison

Column

In order to assess different models and test whether additional predictors improve model fit, look for lower scores for these indicators: Akaike Information Criterion (AIC), Bayesian Information Criterion (BIC), Loglik and Deviance (for nested models), and Chi-squared statistic.

Data: oa_shp
Models:
model6: unemp ~ lt_ill + (1 + lt_ill | msoa_cd)
model7: unemp ~ z_lt_ill + z_no_qual + (1 + z_lt_ill | msoa_cd)
       npar     AIC     BIC logLik deviance  Chisq Df
model6    6 -4764.7 -4732.5 2388.3  -4776.7          
model7    7 -4956.5 -4918.9 2485.2  -4970.5 193.76  1
       Pr(>Chisq)    
model6               
model7  < 2.2e-16 ***
---
Signif. codes:  
0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

The results indicate that adding an individual-level predictor (the proportion of population with no qualification) provides a model with better fit.