Systematic testing of methods for generating spatial microdata

library(devtools) # package for R developers
install_github("robinlovelace/testmsim") # install latest version
library(testmsim) # load the package
help(package = testmsim) # show package documentation

Barthelemy, J. and Toint, P. L. (2012) ‘Synthetic Population Generation Without a Sample’, Transportation Science. INFORMS, 47(2), pp. 266–279. doi: 10.1287/trsc.1120.0408.

Harland, K. (2013) ‘Microsimulation model user guide: flexible modelling framework’, National Centre for Research Methods. Leeds: University of Leeds; NCRM (NCRM working papers). Available at: http://eprints.ncrm.ac.uk/3177/2/microsimulation_model.pdf.

Lovelace, R. and Ballas, D. (2013) ‘“Truncate, replicate, sample”: A method for creating integer weights for spatial microsimulation’, Computers, Environment and Urban Systems. Elsevier Ltd, 41, pp. 1–11. doi: 10.1016/j.compenvurbsys.2013.03.004.

Lovelace, R., Ballas, D., Leeuwen, E. van and Birkin, M. (2015) ‘Evaluating the performance of Iterative Proportional Fitting for spatial microsimulation: new tests for an established technique’, Journal of Artificial Societies and Social Simulation.

Pritchard, D. R. and Miller, E. J. (2012) ‘Advances in population synthesis: fitting many attributes per agent and fitting to household and person margins simultaneously’, Transportation, 39(3), pp. 685–704. doi: 10.1007/s11116-011-9367-4.

Talk outline

What I'll be talking about

Introduction

What is spatial microsimulation?

Why spatial microsimulation?

1: To tackle the modifiable areal unit problem

Why spatial microsimulation?

2: To estimate missing data

Why spatial microsimulation?

3: As a foundation for modelling

What spatial microsimulation is not

Many ways to generate spatial microdata

Opportunities to broaden tests

Main methods of population synthesis

Motivation

Problem: each researcher has their own 'horse' in the race

Past testing efforts in the literature

The 'model experiment' genre

My model experiments 1

Testing optimised IPF in R (ipfp) vs 'FMF'

My model experiments 2

Testing different techniques for 'integerisation'

Are more tests warranted?

But how to measure performance?

Typical results from model tests

Testing the IPF algorithm

Based on recently accepted paper

In Journal of Artificial Societies and Social Simulation
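For readers new to the technique, the core of IPF can be sketched in a few lines of base R. This is a minimal two-constraint illustration, not the code used in the paper or in the ipfp package: the function name `ipf_2d`, the seed table, and the target totals are all made up for the example.

```r
# Minimal sketch of Iterative Proportional Fitting (IPF) in two dimensions.
# All names and numbers here are illustrative assumptions.

# Toy seed: cross-tabulation from a hypothetical survey (age group x sex)
seed <- matrix(c(1, 2,
                 3, 4), nrow = 2, byrow = TRUE)
row_target <- c(40, 60)  # assumed zone totals by age group
col_target <- c(45, 55)  # assumed zone totals by sex (same grand total)

ipf_2d <- function(seed, row_target, col_target, tol = 1e-8, max_iter = 100) {
  fitted <- seed
  for (i in seq_len(max_iter)) {
    # Scale each row so that row sums match the row targets
    fitted <- fitted * (row_target / rowSums(fitted))
    # Scale each column so that column sums match the column targets
    fitted <- sweep(fitted, 2, col_target / colSums(fitted), `*`)
    # Columns now match exactly; stop once rows also match within tolerance
    if (max(abs(rowSums(fitted) - row_target)) < tol) break
  }
  fitted
}

fit <- ipf_2d(seed, row_target, col_target)
rowSums(fit)  # should be close to row_target
colSums(fit)  # should be close to col_target
```

The sketch assumes a strictly positive seed and targets that share the same grand total; empty cells and inconsistent constraints need separate handling.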

Setting up the model experiments

Project organisation

Try it yourself!

Replicable results

Results 1: impacts of different parameters

Root mean-squared error (RMSE)
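As a reminder of what the measure computes, RMSE is the square root of the mean squared difference between simulated and observed counts. A minimal sketch in base R follows; the helper name `rmse` and the counts are hypothetical, chosen only to illustrate the calculation.

```r
# Hypothetical helper: RMSE between simulated and observed constraint counts
rmse <- function(simulated, observed) {
  stopifnot(length(simulated) == length(observed))
  sqrt(mean((simulated - observed)^2))
}

# Made-up counts for three constraint categories in a single zone
observed  <- c(12, 30, 8)
simulated <- c(10, 33, 7)

rmse(simulated, observed)
```

Lower values indicate a closer fit, and a value of zero means the simulated counts reproduce the observed counts exactly.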

Results 2: summary for modellers

Results 3: computational efficiency

Results 4: 'Goodness of fit' measures

Little impact on results

Broadening the tests

CO in FMF vs IPF in R

External validation

Work in progress

An R package for testing population synthesis methods

Introduction to testmsim

Wider context of spatial microsimulation

Issues within the field

Teaching spatial microsimulation

Spatial microsimulation introductory textbook

Discussion with SMART people

Collaboration for mutual benefit

Key References