Zwiessele, Max (2017) Bringing Models to the Domain: Deploying Gaussian Processes in the Biological Sciences. PhD thesis, University of Sheffield.
Abstract
Recent developments in single cell sequencing allow us to elucidate
processes of individual cells in unprecedented detail. This detail
provides new insights into the progress of cells during cell type
differentiation. Cell type heterogeneity shows the complexity of cells
working together to produce organ function on a macro level. The
understanding of single cell transcriptomics promises to lead to the
ultimate goal of understanding the function of individual cells and
their contribution to higher level function in their environment.
Characterizing the transcriptome of single cells requires us to
understand and be able to model the latent processes of cell functions
that explain biological variance and richness of gene expression
measurements. In this thesis, we describe ways of jointly modelling
biological function and unwanted technical and biological confounding
variation using Gaussian process latent variable models. In addition
to mathematical modelling of latent processes, we provide insights
into the understanding of research code and the significance of
computer science in development of techniques for single cell
experiments.
We will describe the process of understanding complex
machine learning algorithms and translating them into usable
software. We then proceed to applying these algorithms. We show how
proper research software design underlying the implementation can lead
to a large user base in other areas of expertise, such as single cell gene
expression. To show the worth of properly designed software underlying
a research project, we show other software packages built upon the
software developed during this thesis and how they can be applied to
single cell gene expression experiments.
Understanding the underlying function of cells seems within reach
through these new techniques that allow us to unravel the
transcriptome of single cells. We describe probabilistic techniques of
identifying the latent functions of cells, while focusing on the
software and ease-of-use aspects of supplying proper research code to
be applied by other researchers.
Metadata
Supervisors: | Lawrence, Neil |
---|---|
Keywords: | Gaussian process, probabilistic modelling, life sciences, molecular biology, software engineering |
Awarding institution: | University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Computer Science (Sheffield) The University of Sheffield > Faculty of Science (Sheffield) > Computer Science (Sheffield) |
Identification Number/EthosID: | uk.bl.ethos.727282 |
Depositing User: | Mr Max Zwiessele |
Date Deposited: | 27 Nov 2017 09:12 |
Last Modified: | 12 Oct 2018 09:47 |
Open Archives Initiative ID (OAI ID): | oai:etheses.whiterose.ac.uk:18492 |
Download
Thesis
Filename: MaxZwiesseleThesis.pdf
Description: Thesis
Licence:
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 2.5 License
Export
Statistics
You do not need to contact us to get a copy of this thesis. Please use the 'Download' link(s) above to get a copy.
You can contact us about this thesis. If you need to make a general enquiry, please see the Contact us page.