title: “UK Wellbeing Infrastructure Spine (UK-WIS)” output:
html_document ———————
Overview
This project builds a reproducible UK small-area data spine that
links multiple domains of place-based infrastructure and environment to
a common geographic unit.
The core unit is area11, which combines:
- LSOA11 for England and Wales
- Data Zone 2011 for Scotland
The term “area11” refers to the harmonised geographic unit
underpinning UK-WIS, combining LSOA 2011 geographies in England and
Wales with Data Zone 2011 geographies in Scotland.
These are harmonised into a single spine:
final/area11_master_features.csv
The aim is to create a modular, extensible dataset that can
support:
- place-based wellbeing and mental health research
- policy-relevant infrastructure mapping
- development of composite place-based indicators
- future development of a Wellbeing Systems Index
Current release
Version: v1 (15 June 2026)
Status: Stable release
The current release contains:
- 41,729 area11 units
- England: 32,844 LSOA11
- Scotland: 6,976 Data Zone 2011
- Wales: 1,909 LSOA11
Quality control status:
- 0 duplicate area11 identifiers
- 0 missing area11 identifiers
- Complete coverage for transport, greenspace, and cultural
infrastructure domains
- Libraries available for England only
- Structural missingness documented and retained
Structure of the data spine
The spine is built using a modular pipeline:
- 10_ingest_*
- 11_build_*_area11_features
- 12_build_*_admin_features
- 90_build_area11_master_dataset
- 91_build_data_dictionary_area11
- 95_build_analysis_extracts_area11
Each domain is processed independently and then joined to the master
dataset:
final/area11_master_features.csv
This approach allows domains to be added, removed, updated, or
rebuilt without affecting the wider system.
Domains currently included
Transport (NaPTAN)
Source:
- NaPTAN Stops dataset (active stops only)
Coverage:
- Full Great Britain coverage
- No structural missingness
Key variables:
- transport_stops_n
- transport_stops_density_per_km2
- transport_stops_dist_nearest_m
- transport_bus_n
- transport_rail_n
- transport_metro_tram_n
- transport_ferry_n
- transport_air_n
- transport_has_any_stop
Notes:
- Represents access to public transport infrastructure
- Distances are calculated from an internal area11 representative
point to the nearest stop
- Density measures are standardised by area11 area (km²)
Greenspace (OS Open Greenspace)
Source:
- OS Open Greenspace greenspace_site polygons
Coverage:
- Full Great Britain coverage
- No structural missingness
Key variables:
- greenspace_n
- greenspace_area_total_km2
- greenspace_area_prop
- greenspace_area_prop_capped
- greenspace_density_per_km2
- greenspace_dist_nearest_m
- greenspace_has_any
Notes:
- Based on polygon intersection with area11 units
- greenspace_area_prop may exceed 1 due to overlapping polygons
- greenspace_area_prop_capped provides an interpretable version
constrained to 0–1
Cultural infrastructure (Mapping Museums)
Source:
Coverage:
- Great Britain coverage where museum records have usable
coordinates
- No structural missingness
Key variables:
- culture_museums_n
- culture_museums_density_per_km2
- culture_museums_dist_nearest_m
- culture_museums_has_any
- culture_museums_source_mapping_museums
- culture_museums_structural_missing
Notes:
- Represents museum infrastructure
- Built using a point-based feature workflow
- Counts represent museum locations within each area11 unit
- Distances are calculated from an internal area11 representative
point to the nearest museum
Libraries
Source:
- English Public Libraries Location Dataset (2024)
Coverage:
- England only
- Scotland and Wales treated as structural missing
Key variables:
- libraries_n
- libraries_density_per_km2
- libraries_dist_nearest_m
- libraries_source_coverage
- libraries_count_interpretation
Notes:
- Structural missingness is retained explicitly
- Missing values represent absence of source coverage rather than
absence of libraries
- Scotland and Wales account for 8,885 structurally missing area11
units (21.3% of the release)
Domains under development
Access amenities (ONS)
Source:
- ONS Access to Amenities datasets
- Multiple LAD-level workbooks
Coverage:
- England and Wales only
- LAD-level only
- Scotland currently treated as structural missing
Current status:
- Raw data available
- Not yet linked to the master dataset
Potential variables:
- access_community_facilities_total_n
- access_community_facilities_per_100k
- access_community_facilities_area_total_km2
- access_community_facilities_area_per_100k
Notes:
- Administrative contextual indicators
- Values would be broadcast from LAD to constituent area11 units
- Suppressed values should be treated as missing rather than zero
- Scotland should remain structural missing unless an equivalent
source is identified
Master dataset
The canonical dataset is:
final/area11_master_features.csv
Contains:
- harmonised area11 geography
- administrative lookup variables
- transport infrastructure indicators
- greenspace indicators
- cultural infrastructure indicators
- library infrastructure indicators
Release statistics:
- n = 41,729
- unique area11_code = 41,729
- duplicate identifiers = 0
- missing identifiers = 0
Publication outputs
Release outputs include:
- area11_master_features.csv
- area11.release.summary.v1.15june2026.csv
- area11.country.descriptives.v1.15june2026.csv
- area11.country.domain.summary.v1.15june2026.csv
- qc.domain.summary.v1.15june2026.csv
- area11.table1.domain.summary.v1.15june2026.csv
Publication figures:
- map.transport.stops.density.rescaled.v1.15june2026.png
- map.greenspace.area.prop.rescaled.v1.15june2026.png
- map.culture.museums.density.rescaled.v1.15june2026.png
- map.four.panel.infrastructure.v1.15june2026.png
Supporting documentation
The following files are generated automatically:
- final/data_dictionary_area11.csv
- final/file_inventory.csv
- final/build_outputs_log.csv
These provide:
- variable-level documentation
- complete file inventory
- output registry and provenance tracking
Design principles
- Modularity: domains built independently
- Transparency: structural missingness retained
- Reproducibility: all outputs script-generated
- Interpretability: consistent naming conventions
- Scalability: easy to extend with new domains
- One canonical UK-wide spine with optional country-specific
extracts
Known limitations
- Libraries are currently England only
- Access amenities are not yet linked
- Access amenities are LAD-level rather than area11-level
- Greenspace polygons may overlap
- Transport indicators do not include service frequency or travel
time
- Museum coverage depends on Mapping Museums records with usable
coordinates
- Distance measures are Euclidean rather than network-based
Future development
Potential future additions include:
- Access amenities
- Health service accessibility
- Additional cultural infrastructure datasets
- Nature and environmental indicators
- Housing and built environment indicators
- Composite infrastructure indicators
- Wellbeing Systems Index development
New domains can be incorporated without modifying the existing spine
architecture.
Project status
Release status: v1 frozen (15 June 2026)
Completed:
- area11 geographic framework
- transport domain
- greenspace domain
- cultural infrastructure domain
- library domain
- master dataset construction
- data dictionary
- file inventory
- outputs log
- publication maps
- descriptive summaries
- release documentation
Current coverage:
- Transport: complete
- Greenspace: complete
- Cultural infrastructure: complete
- Libraries: England only
The area11 spine is now suitable for:
- methodological papers
- descriptive spatial analysis
- place-based wellbeing research
- small-area mental health research
- development of composite indicators
- future linkage to health and wellbeing outcomes
- Wellbeing Systems Index development