White Rose University Consortium logo
University of Leeds logo University of Sheffield logo York University logo

A Framework for Big Data in Urban Mobility and Movement Patterns Analysis

Odiari, Eusebio Amechi (2018) A Framework for Big Data in Urban Mobility and Movement Patterns Analysis. PhD thesis, University of Leeds.

[img]
Preview
Text (.pdf format)
PhD Thesis Eusebio Odiari.pdf - Final eThesis - complete (pdf)
Available under License Creative Commons Attribution-Noncommercial-Share Alike 2.0 UK: England & Wales.

Download (14Mb) | Preview
[img] Archive (zipped (compressed) data schema and metadata)
Thesis-software-data.zip - Final eThesis - complete (pdf)
Available under License Creative Commons Attribution-Noncommercial-Share Alike 2.0 UK: England & Wales.

Download (45Mb)

Abstract

Novel large consumer datasets (called ‘Big Data’) are increasingly readily available. These datasets are typically created for a particular purpose, and as such are skewed, and further do not have the broad spectrum of attributes required for their wider application. Railway ticket data are an example of consumer data, which often have little or no supplementary information about the passengers who purchase them, or the context in which the ticket was used (like crowding-level in the train). These gaps in consumer data present challenges in using these data for planning, and inference on the drivers of mobility choice. Heckman’s in-depth discussion of ‘sample selection’ bias and ‘omitted variables’ bias (Heckman, 1977), and Rubin’s seminal paper on ‘missing values’ (Rubin, 1976) laid the framework for addressing omitted variables and missing data problems today. On the strength of these, a powerful set of complementary concerted methodologies are developed to harness railways consumer (ticketing) data. A novel spatial microsimulation methodology suitable for skewed interaction data was developed to combine LENNON ticketing, National Rail Travel Survey, and Census interaction data, to yield an attribute-rich micro-population. The micro-population was used as input to a GIS network, logistically constrained by the transit feed specification (GTFS). This identifies the context of passenger mobility. Bayesian models then enable the identification of passenger behaviour, like missing daily trip rates with season tickets, and flows to group stations. Case studies using the micro-level synthetic data reveal a mechanism of rail-heading phenomena in West Yorkshire, and the impact of a new station at Kirkstall Forge. The spatial microsimulation and GIS-GTFS methods are potentially useful to network operators for the management and maintenance on the railways. The representativeness of the micro-level population created has the potential to alter multi-agent transport simulation genres, by precluding the need for the complexities of utility-maximizing traffic assignment.

Item Type: Thesis (PhD)
Related URLs:
Keywords: big-data, spatial microsimulation, population synthesis, GIS network, General-Transit-Feed-Specification, spatial analysis, Bayesian modelling, urban mobility, movement patterns
Academic Units: The University of Leeds > Faculty of Environment (Leeds)
The University of Leeds > Faculty of Environment (Leeds) > School of Geography (Leeds)
The University of Leeds > Faculty of Environment (Leeds) > Institute for Transport Studies (Leeds)
Identification Number/EthosID: uk.bl.ethos.772840
Depositing User: Dr Eusebio / E.A. Odiari
Date Deposited: 16 Apr 2019 09:40
Last Modified: 21 Feb 2020 13:33
URI: http://etheses.whiterose.ac.uk/id/eprint/23525

You do not need to contact us to get a copy of this thesis. Please use the 'Download' link(s) above to get a copy.
You can contact us about this thesis. If you need to make a general enquiry, please see the Contact us page.

Actions (repository staff only: login required)