White Rose University Consortium logo
University of Leeds logo University of Sheffield logo York University logo

Reducing the Errors in High Resolution Environmental Modelling

Makrai, Gabor (2018) Reducing the Errors in High Resolution Environmental Modelling. PhD thesis, University of York.

This is the latest version of this item.

gabormakrai_phd_thesis_revised_wreo.pdf - Examined Thesis (PDF)
Available under License Creative Commons Attribution-Noncommercial-No Derivative Works 2.0 UK: England & Wales.

Download (11Mb) | Preview


Air pollution modelling is one of the key tools for researchers, scientists, and urban planners to support the sustainable development of the urban environment. This modelling tool is critical for the users in the age of rapid urbanization to understand pollution distribution in the modelling area. Recent updates in air quality regulations are challenging the state-of-the-art air pollution modelling techniques by requiring accurate predictions on a high temporal level, i.e. predictions at the hourly level rather than the annual level. Current state-of-the-art models are designed to have good prediction accuracy on the low temporal resolution by assuming that the pollution is in steady state. Making predictions on higher temporal resolution violates this assumption and cause inaccurate predictions. There are existing statistical modelling approaches for air pollution modelling, however, these approaches also struggle to make accurate predictions on higher temporal resolution. This work is looking into the development of a statistical regression based air pollution model which produces accurate high temporal level predictions by utilizing advanced regression algorithm to exploit the hidden knowledge in data with high temporal resolution. The analysis of the predictions of multiple advanced statistical regression algorithms is investigated to determine the most accurate approach hence the Random Forest Regression method is proposed for the given regression task. A novel model ensemble method is then developed to utilize multiple Random Forest Regression models trained on the different subset of the available input data. Motivated by the high computational requirement of the developed methods, this thesis also investigates the scalability and the robustness of the developed methods. Based on the experience gained from this investigation, this work proposes further model ensemble methods to improve the accuracy of the statistical regression approach for air pollution modelling. The developed air pollution model presented in this thesis produces more accurate hourly concentration level predictions than the current state-of-the-art method, hence, the approach gives the opportunity for better understanding of the pollution in the urban area.

Item Type: Thesis (PhD)
Academic Units: The University of York > Computer Science (York)
Identification Number/EthosID: uk.bl.ethos.759921
Depositing User: Mr Gabor Makrai
Date Deposited: 23 Nov 2018 16:34
Last Modified: 19 Feb 2020 13:04
URI: http://etheses.whiterose.ac.uk/id/eprint/21979

Available Versions of this Item

  • Reducing the Errors in High Resolution Environmental Modelling. (deposited 23 Nov 2018 16:34) [Currently Displayed]

You do not need to contact us to get a copy of this thesis. Please use the 'Download' link(s) above to get a copy.
You can contact us about this thesis. If you need to make a general enquiry, please see the Contact us page.

Actions (repository staff only: login required)