West, Daniel ORCID: https://orcid.org/0000-0002-4522-170X (2021) Using self-organising maps to cluster complex biological data. MSc by research thesis, University of York.
Abstract
Cancer is a common disease in the modern age, and both its detection and the prediction of its development need to be accurate. Prostate cancer is an interesting form because it is rarely fatal, yet its removal requires surgical excision, which may itself have adverse effects. It is therefore important to assess each patient correctly, to minimise the risk from both cancer progression and treatment side effects.
Raman spectroscopy is an analytical technique that has gained interest for the analysis of biological specimens: it is robust and produces distinct molecular signals that can be used to identify biomolecules. The sheer volume and dimensionality of spectral data necessitate computational analysis; this work covers the use of self-organising maps for investigating such data.
Self-organising maps are a machine learning technique that spots patterns and reduces dimensionality in high-dimensional datasets in an unsupervised manner. Their use can help to discern clusters within a dataset that may not be readily apparent.
The use of self-organising maps (SOMs) to analyse Raman spectral data from human cell samples is an underexplored area of research. This work forms a feasibility study for the use of SOMs in such an application, and shows that, with optimum parameters, they are able to correctly cluster cancer and non-cancer samples from a blinded dataset. Moreover, the optimised SOM shows delineation into three clusters: one of normal prostate data and two of prostate cancer data. Analysis of these clusters shows spectral differences related to lipid composition, an observation which has been linked to more aggressive cancer progression.
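As an illustration of the technique the abstract describes, the following is a minimal sketch of a self-organising map in plain Python. It is not the thesis's MySOM module: the function names (`train_som`, `best_matching_unit`), the linear decay schedules, the Gaussian neighbourhood, and the toy three-feature data are all assumptions chosen for brevity. Real Raman spectra would have far more features per sample and would need a larger map.

```python
import math
import random


def best_matching_unit(weights, x):
    """Return the grid position of the node whose weights are closest to x."""
    return min(
        weights,
        key=lambda node: sum((w - v) ** 2 for w, v in zip(weights[node], x)),
    )


def train_som(data, rows, cols, epochs=100, lr0=0.5, sigma0=None, seed=0):
    """Train a rectangular SOM; returns {(row, col): weight vector}.

    Hypothetical helper: learning rate and neighbourhood radius both
    decay linearly, which is one common choice among several.
    """
    rng = random.Random(seed)
    dim = len(data[0])
    if sigma0 is None:
        sigma0 = max(rows, cols) / 2.0
    # Initialise every node with a copy of a randomly chosen sample.
    weights = {
        (r, c): list(rng.choice(data)) for r in range(rows) for c in range(cols)
    }
    total_steps = epochs * len(data)
    step = 0
    for _ in range(epochs):
        order = data[:]
        rng.shuffle(order)
        for x in order:
            frac = step / total_steps
            lr = lr0 * (1.0 - frac)
            sigma = sigma0 * (1.0 - frac) + 1e-3  # keep radius above zero
            bmu = best_matching_unit(weights, x)
            for node, w in weights.items():
                # Squared grid distance between this node and the BMU.
                d2 = (node[0] - bmu[0]) ** 2 + (node[1] - bmu[1]) ** 2
                h = math.exp(-d2 / (2.0 * sigma * sigma))
                # Pull the node towards the sample, scaled by neighbourhood.
                for i in range(dim):
                    w[i] += lr * h * (x[i] - w[i])
            step += 1
    return weights


# Toy data (assumed, not from the thesis): two well-separated groups.
group_a = [[0.10 + 0.01 * i, 0.2, 0.1] for i in range(10)]
group_b = [[0.90 - 0.01 * i, 0.8, 0.9] for i in range(10)]

som = train_som(group_a + group_b, rows=1, cols=2)
print("group A nodes:", {best_matching_unit(som, x) for x in group_a})
print("group B nodes:", {best_matching_unit(som, x) for x in group_b})
```

No labels are passed to `train_som`; any grouping emerges only from which node each sample maps to, which is what makes the method unsupervised. A deliberately tiny 1 x 2 map is used here so that the result is easy to inspect.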
Metadata
Supervisors: | Stepney, Susan and Hancock, Yvette |
---|---|
Publicly visible additional information: | The source code for the MySOM module is freely available at https://github.com/thenakedcellist/prostate/blob/master/mysom/mysom.py |
Awarding institution: | University of York |
Academic Units: | The University of York > Computer Science (York) |
Depositing User: | Dr Daniel West |
Date Deposited: | 12 Nov 2021 19:08 |
Last Modified: | 12 Nov 2021 19:08 |
Open Archives Initiative ID (OAI ID): | oai:etheses.whiterose.ac.uk:29618 |
Download
Examined Thesis (PDF)
Filename: West_107002947_CorrectedThesisClean.pdf
Licence:
This work is licensed under a Creative Commons Attribution NonCommercial NoDerivatives 4.0 International License