- Introduction
- Data and Methods
- Policy Implications
2017-08-31
Project outline:
(Source: CfIT)
If for some reason you could not longer use a car, would you find it… really inconvenient more or less every day never?
Simple measures:
Complex measures:
A demonstration of how official datasets can be augmented with geographical data from OpenStreetMap.
An application of Machine Learning algorithms to widely available transport datasets.
A tutorial on Big Data and Machine Learning targetted at transport planners, consultants and policy-makers relatively new to data science.
Write-up of the policy implications of the research.
An R package, mlCars, containing the source code, example data and reports to ensure reproducibility. The package can be accessed here: https://github.com/Robinlovelace/mlCars/
Big data is an umbrella term. We define it as:
"unconventional datasets that are difficult to analyze using established methods. Often this difficulty relates to size but the form, format, and complexity are equally important" (Lovelace et al. 2016).

By machine learning, we mean simply that the functional form of the model is not specified by the user.
There are two main types of machine learning (Hastie, Tibshirani, and Friedman 2016):
supervised or unsupervised. In supervised learning, the goal is to pre- dict the value of an outcome measure based on a number of input measures; in unsupervised learning, there is no outcome measure, and the goal is to describe the associations and patterns among a set of input measures.
Many transport problems can be framed as supervised Machine Learning problems.
## Response Variable
Directed Acyclic Graphs (DAGs) were used to explore causality between the variables.
Tricky to create the DAG and implement - Rob Long.
Oportunties:
Risks:
install.packages("keras")
Car dependency is the cause of a variety of social, environmental and economic issues in West Yorkshire.
Creating more sustainable regions by reducing car dependency and developing multi-modal mobility requires a multi-faceted approach that involves both ‘hard’ and ‘soft’ policy interventions.
Machine Learning and big data present new opportunities for the transport sector.
Big data and Machine Learning offer the promise of more efficient and cheaper transport models based on rich, finely-grained data across a wider range of indicators.
However, they should not supplant domain expertise or contextual knowledge in transport policy and planning. They are best used as aides to human decision-making rather than replacements.
Barriers to the effective and ethical uptake of innovative new technologies in transport include: access to skills, investment in R&D and organisational culture.
Robin Lovelace - Institute for Transport Studies (ITS), University of Leeds. Email: r.lovelace@leeds.ac.uk
Liam Bolton - Red Ninja / University College London. Email: liamthomasbolton@gmail.com
Anable, Jillian. 2005. “‘Complacent Car Addicts’ or ‘Aspiring Environmentalists’? Identifying Travel Behaviour Segments Using Attitude Theory.” Transport Policy 12 (1): 65–78. doi:10.1016/j.tranpol.2004.11.004.
Hastie, Trevor, Robert Tibshirani, and Jerome Friedman. 2016. The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Second Edition. 2nd edition. New York, NY: Springer.
Lovelace, Robin, Mark Birkin, Philip Cross, and Martin Clarke. 2016. “From Big Noise to Big Data: Toward the Verification of Large Data Sets for Understanding Regional Retail Flows.” Geographical Analysis 48 (1): 59–81. doi:10.1111/gean.12081.
Lovelace, Robin, Anna Goodman, Rachel Aldred, Nikolai Berkoff, Ali Abbas, and James Woodcock. 2017. “The Propensity to Cycle Tool: An Open Source Online System for Sustainable Transport Planning.” Journal of Transport and Land Use 10 (1). doi:10.5198/jtlu.2016.862.