Ma, Chunchao (2023) Multi-output Gaussian Processes for Large-scale Multi-class Classification and Hierarchical Data. PhD thesis, University of Sheffield.
Abstract
Multi-output Gaussian processes (MOGPs) can concurrently deal with multiple tasks by exploiting the correlation between different outputs. MOGPs have been mostly used for multi-output regression datasets, where the responses of each output are continuous values. However, MOGPs have inferior performance in some complex structured datasets. For example, MOGPs demand a large computational complexity in large-scale multi-class classification. The most common type of data in multi-class classification problems consists of image data, and MOGPs are not specifically designed to handle image datasets so MOGPs have poor performance on image data that has the nature of high dimensionality. Most applications of MOGPs are restricted to regression problems with a reduced number of outputs; and particularly, MOGPs present a limited performance on hierarchical datasets, i.e., datasets where the observations are connected to each other by means of parent-child relationships forming a tree structure. In this thesis, we address the aforementioned issues by proposing three new extensions of MOGPs separately. First, we develop a novel MOGP model to deal with large-scale multiclass classification by subsampling both training data sets and classes in each output. Second, we propose a novel model to deal with image input data sets by incorporating a convolutional kernel, which can effectively capture information from images, into our developed model above. Finally, we present a new hierarchical MOGP model with latent variables to handle hierarchical datasets, where we use a hierarchical kernel function to capture the correlation within hierarchical data structures and use latent variables to explore dependencies between outputs. The new models are applied in various synthetic and real datasets. The results of this thesis indicate that our proposed models can improve prediction performance in corresponding datasets.
Metadata
Supervisors: | Álvarez, Mauricio A. and Vasilaki, Eleni |
---|---|
Awarding institution: | University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Computer Science (Sheffield) The University of Sheffield > Faculty of Science (Sheffield) > Computer Science (Sheffield) |
Depositing User: | Mr Chunchao Ma |
Date Deposited: | 26 Sep 2023 08:57 |
Last Modified: | 26 Sep 2023 08:57 |
Download
Final eThesis - complete (pdf)
Filename: ChunchaoMa_thesis180233681.pdf
Licence:
This work is licensed under a Creative Commons Attribution NonCommercial NoDerivatives 4.0 International License
Export
Statistics
You do not need to contact us to get a copy of this thesis. Please use the 'Download' link(s) above to get a copy.
You can contact us about this thesis. If you need to make a general enquiry, please see the Contact us page.