Richard Martin
Ministry of Post Secondary Education and Future Skills
Disclaimer: The views expressed are those of the author and do not necessarily reflect those of the Government of British Columbia.
Sources of structural change include:
technology
COVID-19
government policy
resource depletion
These shocks shift occupational labour demand
Entry and exit may account for part of the adjustment, but typically some workers must move across occupations to meet changing demand
What governs mobility between occupations?
It is natural to expect mobility to decline with occupational distance
We therefore ask which notion of occupational distance best rationalizes local mobility?
This requires:
Considering multiple notions of occupational distance
A method to compare model fit
Recognizing two mobility regimes
Local mobility: gradual movement in occupational space
Non-local mobility: jumps that bypass distance frictions
Distance should explain local transitions but fail to explain non-local moves
Local mobility can be modeled using entropy-regularized optimal transport, which produces flows consistent with observed origin and destination margins
This framework allows competing distance metrics to be compared in a clean “horse race”
In simulations, when the data-generating process is known, the method recovers the correct distance structure
This provides a framework for testing whether skill similarity or hierarchical structure better explains local mobility
Workers whose highest educational attainment is unchanged between Censuses may move locally in occupational space
A change in highest educational attainment between Censuses can act like a wormhole, enabling jumps across distant occupations
Implication:
Gravity-style mobility models predict that flows increase with occupation size and decline with occupational distance
This mirrors Newtonian gravity: \(F \propto \frac{m_1 m_2}{d^2}\)
Two natural candidates:
Skill similarity: based on 161 O*NET dimensions
Institutional hierarchy: based on 5 digit NOC classification
| Digit | Level | Description |
|---|---|---|
| 1 | Broad Occupational Category | 10 broad categories |
| 2 | TEER Category | 6 levels |
| 3 | Sub-major Group | Occupational cluster |
| 4 | Minor Group | Narrow family |
| 5 | Unit Group | Detailed occupation |
\[ P_{ij} = \exp(\alpha_i + \beta_j - \gamma C_{ij}) \]
In horse racing, handicaps add weight to stronger horses, making the outcome less informative about the horses’ true speed
If the friction specification \(C_{ij}\) is wrong — and it will be — fixed effects may absorb part of the misspecification
We prefer misspecification to appear as lack of fit rather than being masked by parameter adjustment
The goal of our horse race is to cleanly identify how well each friction rationalizes the observed flows
Of course, winning the horse race does not imply the winner was particularly “fast”
Entropy-regularized optimal transport enforces the observed marginals exactly:
\[ \sum_j P_{ij} = a_i, \qquad \sum_i P_{ij} = b_j \]
via iterative row and column rescaling
Equilibrium employment levels are taken as given; we model the mobility flows consistent with those levels
This corresponds to a decentralized random utility model where mobility utilities include Gumbel shocks, producing logit choice probabilities
\[ \mathcal{L} = \underbrace{\sum_{ij} P_{ij} C_{ij}}_{\text{mass}\times\text{distance}} + \varepsilon\times \underbrace{\sum_{ij} P_{ij}(\log P_{ij}-1)}_{\text{negative entropy}} + \sum_i f_i \underbrace{\left(a_i - \sum_j P_{ij}\right)}_{\text{origin constraint}} + \sum_j g_j \underbrace{\left(b_j - \sum_i P_{ij}\right)}_{\text{destination constraint}} \]
First-order conditions:
\[ \frac{\partial \mathcal{L}}{\partial P_{ij}} = C_{ij} + \varepsilon \log P_{ij} - f_i - g_j = 0 \]
Rearranging:
\[ \log P_{ij} = \frac{f_i}{\varepsilon} + \frac{g_j}{\varepsilon} - \frac{C_{ij}}{\varepsilon} \]
Exponentiating:
\[ P_{ij} = u_i\, e^{-C_{ij}/\varepsilon}\, v_j, \]
Because the objective is strictly convex, the optimal solution \(P^{\star}\) is unique
If the Sinkhorn algorithm converges, it must converge to \(P^{\star}\)
\(P^{\star}\)
Gravity estimates margins and frictions jointly
Sinkhorn conditions on the margins, so frictions are identified from the residual structure of flows
With a misspecified cost matrix, a gravity model merely measures sensitivity to the wrong notion of distance
When the goal is to compare alternative cost matrices, conditioning on the margins allows cleaner identification of frictions
Sinkhorn is the right tool for this job
\(\varepsilon\) is the scale of random utility shocks (Gumbel noise) affecting mobility choices
Low \(\varepsilon\)
High \(\varepsilon\)
Generate artificial flows using
\[ C = (1-w)C_{skill} + wC_{hier} \]
with
\[ \varepsilon = 1 \]
Then estimate models using:
We evaluate model fit using the Kullback–Leibler divergence between the observed transition matrix \(P\) and the predicted matrix \(\hat P\)
In our simulations \(P\) is the true transition matrix, so KL measures the divergence between the estimated model and the truth
\[ KL(P \parallel \hat P) = \sum_{ij} P_{ij}\log\frac{P_{ij}}{\hat P_{ij}} \]
Some occupations draw from very specific educational pipelines, while others recruit from a broad mix of credentials
\[ KL(p \parallel p_0) \]
\[ \text{specificity} = \log(KL) - \widehat{E}[\log(KL) \mid \log(T)] \]
Why?
Regress distance traveled on size-corrected KL divergence measures from the CIP–NOC table
Education specificity
→ How concentrated are graduates from a program across occupations
→ Interpreted as likelihood the program acts as a wormhole entrance
Destination gating
→ How concentrated are entrants to an occupation across programs
→ Interpreted as likelihood the occupation acts as a wormhole exit
Controls: 2-digit NOC origin fixed effects (2016 occupation)
→ Origin occupation may influence both education choice and distance traveled
Hypothesis
Race two distance metrics on two samples:
one where distance should rationalize local mobility (highest educational attainment constant)
one where it should not (highest educational attainment changed)
Test for heterogeneity in mobility frictions across the labour market:
Characterize the relationship between distance traveled and education/destination specificity among the inter-census attainment group