Overview

Purpose

  • This companion is designed to ensure transparency and reproducibility for the paper “Occupational Mobility Is Not One Thing: Evidence from the Canadian Labour Market”. It provides technical detail on distance metrics and normalization, specificity measures.

Contents

  • Skill distance: O*NET-based occupational skill vectors, dimensionality reduction via PCA, and the resulting continuous distance matrix.

  • Hierarchical distance: Taxonomy-based discrete distances derived from the NOC structure, resulting in labour market silos with internal TEER platforms.

  • Cost matrix normalization: percentile-based anchoring/normalization across distance metrics.

  • Occupation’s education specificity: A size-adjusted measure of concentration in educational inflows into occupations: how different is this occupation than the labour market as a whole in terms of educational pathways.

  • Education’s occupation specificity: A size-adjusted measure of concentration in occupational outcomes across educations (cross product of fields of study and attainment): how different is this education compared with education in general in terms of occupational pathways.

Skill Distance

Column

Rank Correlation

Column

Normalized ONET Skill Distance (left mouse button hold and drag to zoom)

Hierarchical distance

Column

Description

Hierarchical distances encode institutional and career-ladder barriers implied by occupational taxonomies:

  1. All five digits match: distance = 0
  2. First four digits match: distance = 1
  3. First three digits match: distance = 2
  4. First digit matches: distance = 3 + |ΔTEER|
  5. Otherwise: distance = 9
  • These distances are then normalized by division by 4, as described on the page “Cost matrix normalization”.
  • The Spearman correlation is 0.289, indicating that institutional proximity and skill similarity are positively but only modestly related.
  • The two metrics encode overlapping yet distinct occupational geometries.

Distance counts

Column

Normalized hierarchical distances (left mouse button hold and drag to zoom)

Cost matrix normalization

Anchoring to the Informative Cost Region

The hierarchical distance contains a large mass of maximally distant pairs (distance = 9), which represent transitions the taxonomy treats as categorically distant. Because these pairs provide little information about substitution intensity among plausible transitions, we define the informative region as the set of non-maximal, non-zero hierarchical distances.

Within this region, we compute the 25th, 50th, and 75th percentiles of hierarchical cost. These conditional quantiles represent increasingly broad but still economically meaningful transition margins.

Each anchor value is then mapped to its unconditional percentile in the full hierarchical distribution. The skill distance matrix is calibrated by selecting cost values at these same unconditional percentiles. This procedure ensures that calibration aligns comparable substitution margins across metrics, rather than matching arbitrary numerical magnitudes.

Percentile-based anchoring is invariant to monotonic transformations of the cost scale and therefore preserves the rank ordering of transition costs in each metric.

Calibration Anchors Across Distance Metrics
Conditional Quantile Hierarchical Anchor Unconditional Percentile Skill Anchor
0.25 3.000 0.040 7.145
0.50 4.000 0.081 8.590
0.75 5.000 0.105 9.253

The median non-maximal hierarchical transition lies at approximately the 8th percentile of the full hierarchical distribution; the skill anchor is defined at this same percentile to ensure comparable substitution intensity.

Occupation’s Education Specificity

Column

High specificity indicates few educational pathways into the occupation; residualized specificity is not correlated with log occupation size (Pearson=-0.03, Spearman=0.03).

Column

Education’s Occupation specificity

Column

High specificity indicates few occupational pathways from the education; residualized specificity is not correlated with log occupation size (Pearson=0.07, Spearman=-0.01).

Column