Introduction The UK has comparatively high rates of working age economic inactivity (EI) due to poor health.
Methods This paper uses a novel modelling framework, and data from the UKHLS, to estimate how much of this health-related EI (HREI) may be ‘due to’ i) poor general health directly, or due to two types of socioeconomic driver of health and inactivity: ii) low qualifications; and iii) low wages.
Findings Considering each driver independently, poor general health may explain up to UNKNOWN% of HREI, whereas low wages may explain up to UNKNOWN%, and low qualifications may explain up to UNKNOWN%.
DiscussionWrite something insightful here
Notes/Todos
Adapt health code and run
Adapt quals code and run
Create wages code and run
Adapt hhincome code
Key Findings/Contributions
Introduction
Methods
The data
The data used to fit the models are all valid observations from wave a to j of the UKHLS. By valid we mean all predictor and response variables are included.
The model
The model uses multinomial logistic regression to predict the economic (in)activity state in the next time period (approximately one year) based on the economic activity state in the current time period, the individual’s age, sex, and those specific drivers of interest.
Foundational Model
The foundational model specification aims to adequately control for the effects that age, current state and sex have on transition probabilities between states. To recap, we know the following:
That state at time T influences state at time T+1, including that there is path dependence.
That transition propensities between states vary systematically by sex (in particular regarding the long-term carer state)
That transitions between states vary by age, but in different ways for different states, and in ways that aren’t linear with age.
The foundational model specification operationalises the above knowledge as follows:
i.e. that next state \(S_{T+1}\) is predicted on the current state \(S_T\), sex (the \(male\) term so female is the reference category), the interaction of current state and sex \(S_T * male\) , and a flexible function of age \(bspline(x, 5)\) .
The model is implemented using the multinom function of the nnet package as follows
Exposure models extend the foundation with one or more additional variables. These variables are the exposures of interest, and for which we want to estimate the influence on economic activity levels and flows.
For a single exposure \(Z\), the equation simply extends the foundational model specification as follows:
In some cases (as with estimating the effects of health as an exposure) interaction terms are included between exposure variables as well. The decision about whether to include such interactions is made based on both our understanding of the extent to which factors are likely to interact in practice, and the penalised model fit as assessed using metrics like AIC and BIC.
The simulation
We simulate three different population groups:
Populations of various representative working ages, male and female, whom we all assume start off as in employment
Populations of various representative working ages, male and female, who all start off as unemployed
A representative population of varying ages, sexes, current statuses, and driver states
Results
Descriptive Results
Observed transitions between states
Within each wave, people are observed in each of the two economically active states, and each of the five economically inactive states. As the UKHLS are longitudinal, they can be used to calculate the proportion of those observed in each state one wave who then either stay in that state the following wave, or migrate to any of the other six states. Table 1 shows these proportions as a single table. The rows in the first column indicate the state someone was observed for wave T, and the each of the states on the columns to the right indicate a possible state they could be observed the next wave (wave T+1). The order of the states is the same across rows and columns, meaning that the cells along the top-left to bottom-right diagonal indicate the proportions of those observed to stay in the same state from one wave to the next.
Table 1: Observed transition probabilities between economic (in)activity states between years. Rows indicate state transitioning from
Active
Inactive
Employed
Unemployed
Inactive student
Inactive care
Inactive long term sick
Inactive retired
Inactive other
Active
Employed
0.956
0.017
0.003
0.009
0.004
0.009
0.002
Unemployed
0.309
0.434
0.009
0.115
0.095
0.022
0.016
Inactive
Inactive student
0.368
0.089
0.461
0.048
0.015
0.003
0.015
Inactive care
0.136
0.081
0.007
0.705
0.030
0.024
0.017
Inactive long term sick
0.044
0.087
0.002
0.043
0.774
0.043
0.006
Inactive retired
0.071
0.016
0.001
0.034
0.042
0.827
0.008
Inactive other
0.308
0.129
0.020
0.218
0.052
0.057
0.217
Within Table 1 the diagonal cell values show that some economic (in)activity states are more persistent than others. For example, the overall probability of someone who is employed one wave remaining employed the next wave is over 95%, the proportion remaining retired is almost 83%. Conversely, the probability of someone unemployed remaining unemployed between waves is 43%, which is still higher than the probability of moving to employment (31%). From unemployment, there is also around a one-in-ten chance of moving either to inactive care, or to long-term sickness, but less than a 1% probability of becoming a full-time student in the next wave.
For those states other than employment, the conditional probability of moving into employment is worth comparing. We can see that the conditional probability of moving from full time study (third row) to employment (first column) is 37%, which is higher than the 31% conditional probability of moving from unemployment (second row) to employment (first column). In this sense, the state of being a full-time student is closer to employment than the state of being unemployed, even though unemployment is considered economic activity whereas full time study is considered economic inactivity. The high level of heterogeneity between economically inactive states is why it is so important not to collapse these states into a single category.
The transition rates observed vary markedly by sex, as shown in Table 2.
The main differences by sex shown in Table 2 relates to the full-time care state. For working age females who are unemployed, 20% transition into full-time care the next wave; for males who are unemployed, the rate of transition to full-time care is 3%. The rates of remaining in or moving out of full-time care also differ by sex. For females, the probability of remaining in full-time care between waves is 71%; for males, 55%. Rates transition from full-time care to either long-term sickness or employment are similar by sex, whereas rates of transition from long-term sickness to unemployment (and so job-seeking) are around twice as high for males (16%) than females (8%).
There are also marked differences in transition probabilities by age group, as illustrated in Table 3, which compares transition probabilities between states for persons aged between 25 and 45 years of age inclusive (Table 3 (a)), with those of working age aged over 45 years of age (Table 3 (b))
Table 3: Transition probabilities by broad age group
(a) Younger (25-45 years of age)
Active
Inactive
Employed
Unemployed
Inactive student
Inactive care
Inactive long term sick
Inactive retired
Inactive other
Active
Employed
0.962
0.017
0.004
0.012
0.003
0.000
0.002
Unemployed
0.328
0.433
0.014
0.133
0.076
0.000
0.016
Inactive
Inactive student
0.364
0.088
0.472
0.050
0.014
0.001
0.012
Inactive care
0.153
0.081
0.009
0.723
0.021
0.000
0.013
Inactive long term sick
0.066
0.117
0.005
0.060
0.740
0.004
0.008
Inactive retired
0.118
0.029
NA
0.029
0.382
0.441
NA
Inactive other
0.375
0.175
0.035
0.208
0.041
NA
0.167
(b) Older (56 years of age and above)
Active
Inactive
Employed
Unemployed
Inactive student
Inactive care
Inactive long term sick
Inactive retired
Inactive other
Active
Employed
0.948
0.017
0.001
0.006
0.005
0.020
0.003
Unemployed
0.283
0.435
0.003
0.090
0.121
0.052
0.015
Inactive
Inactive student
0.404
0.101
0.374
0.035
0.025
0.025
0.035
Inactive care
0.099
0.081
0.002
0.668
0.050
0.074
0.027
Inactive long term sick
0.034
0.073
0.001
0.035
0.790
0.062
0.005
Inactive retired
0.071
0.016
0.001
0.034
0.040
0.829
0.008
Inactive other
0.256
0.093
0.009
0.226
0.060
0.100
0.255
As might be expected, the rates of transition into retirement are considerably higher in older ages (Table 3 (b)) than younger ages (Table 3 (a)), and the probabilities of someone remaining retired almost twice as high in older than younger ages. Rates of employment are somewhat higher in the younger ages than higher ages, and so are the probabilities of moving from unemployment to employment. Rates of transition into full-time care are somewhat higher in the younger age category.
The two age categories presented above are somewhat arbitrary, and do not capture adequately how the probability of being in each of the states, and moving to other states, varies over the working age life course. As an example of this, figure X shows how the probability of remaining employed, unemployed, a full-time carer, a student, or long-term sick varies over five year intervals.
Show R Code
tempData <- ind_data_standardised |>mutate(age_group =cut(age, seq(25, 65, by =5)) ) |>filter(!is.na(age_group)) |>mutate(same_status = next_status == this_status) |>group_by( this_status, sex, age_group ) |>summarise(n =sum(same_status),N =length(same_status) ) |>mutate(proportion_stay = n / N) |>ungroup() age_group_labels <-unique(as.character(tempData$age_group))tempData |>filter(sex !="missing") |>mutate(age_num =as.numeric(age_group)) |>mutate(age_group =as.character(age_group)) |>filter(this_status %in%c("Employed", "Unemployed", "Inactive care", "Inactive student", "Inactive long term sick", "Inactive retired")) |>ggplot(aes(x=age_num, y = proportion_stay, colour = this_status)) +geom_line() +scale_x_continuous(breaks =1:7, labels = age_group_labels) +geom_point(aes(shape = this_status)) +facet_wrap(~sex) +labs(x ="Five year age group", y ="Probability of remaining in state between wave",title ="Probability of remaining in a state between wave by age group and sex" )
Figure 1: Probability of staying in a category by sex and age group
These stayer probabilities are somewhat artificial for some ages - for example of remaining retired at ages where retirement is unlikely - but do show how the probabilities of remaining in each state vary over the life course, as well as differ by sex. The probabilities of transition between each of these states also changes over sex and age, and so a foundational model which controls for these varying associations in these standard (unmodifiable) demographic variables is important before reasonable estimates of the additional (potentially modifiable) exposures can be produced. The purpose of the foundational model specification is to do this.
Simulation Model Results
Modelling health effects
The effect of suboptimal health as an exposure was assessed using SF-12 scores, subdivided into the physical health and mental health subdomains, and then standardised over the observed population to have a mean of 0 and standard deviation of 1.
Four different exposure model specifications were considered:
MH only
PH only
MH and PH as independent effects
MH and PH including an interaction term
Each of these was compared for penalised model fit against the foundational model specification using AIC and BIC, with lower scores preferred.
The penalised model fit of these specifications is shown