Ma, Chunchao (2023) Multi-output Gaussian Processes for Large-scale Multi-class Classification and Hierarchical Data. PhD thesis, University of Sheffield.
Abstract
Multi-output Gaussian processes (MOGPs) can concurrently deal with multiple tasks by
exploiting the correlation between different outputs. MOGPs have been mostly used for
multi-output regression datasets, where the responses of each output are continuous values.
However, MOGPs have inferior performance in some complex structured datasets. For
example, MOGPs demand a large computational complexity in large-scale multi-class
classification. The most common type of data in multi-class classification problems consists
of image data, and MOGPs are not specifically designed to handle image datasets so MOGPs
have poor performance on image data that has the nature of high dimensionality. Most
applications of MOGPs are restricted to regression problems with a reduced number of
outputs; and particularly, MOGPs present a limited performance on hierarchical datasets,
i.e., datasets where the observations are connected to each other by means of parent-child
relationships forming a tree structure.
In this thesis, we address the aforementioned issues by proposing three new extensions
of MOGPs separately. First, we develop a novel MOGP model to deal with large-scale multiclass
classification by subsampling both training data sets and classes in each output. Second,
we propose a novel model to deal with image input data sets by incorporating a convolutional
kernel, which can effectively capture information from images, into our developed model
above. Finally, we present a new hierarchical MOGP model with latent variables to handle
hierarchical datasets, where we use a hierarchical kernel function to capture the correlation
within hierarchical data structures and use latent variables to explore dependencies between
outputs.
The new models are applied in various synthetic and real datasets. The results of this thesis
indicate that our proposed models can improve prediction performance in corresponding
datasets.
Metadata
Supervisors: | Álvarez, Mauricio A. and Vasilaki, Eleni |
---|---|
Awarding institution: | University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Computer Science (Sheffield) The University of Sheffield > Faculty of Science (Sheffield) > Computer Science (Sheffield) |
Depositing User: | Mr Chunchao Ma |
Date Deposited: | 26 Sep 2023 08:57 |
Last Modified: | 26 Sep 2023 08:57 |
Open Archives Initiative ID (OAI ID): | oai:etheses.whiterose.ac.uk:33129 |
Download
Final eThesis - complete (pdf)
Filename: ChunchaoMa_thesis180233681.pdf
Licence:
This work is licensed under a Creative Commons Attribution NonCommercial NoDerivatives 4.0 International License
Export
Statistics
You do not need to contact us to get a copy of this thesis. Please use the 'Download' link(s) above to get a copy.
You can contact us about this thesis. If you need to make a general enquiry, please see the Contact us page.