Watson, Matthew ORCID: https://orcid.org/0000-0001-8980-2699 (2022) A Data-Driven Investigation Into Similarity Measures For Global Chemical Products. MSc by research thesis, University of York.
Abstract
Croda is one of the largest chemical companies in the United Kingdom, producing and distributing products across the globe. It is the aim of this research to provide Croda a means for determining the similarity and therefore interchangeability of their products between manufacturing sites. To do this, numerous analytical approaches including the Bhattacharyya distance, Mahalanobis distance, hierarchical clustering, distribution modelling and separation are investigated. Novel approaches to outlier detection and exploratory analysis are also examined. These analyses are applied to three data sets, each corresponding to a chemical product - Tween20, BrijCS20 and Glycerox HE. These data sets consist of the mass charge ratios and their abundance obtained via MALDI-TOF mass spectrometry.
Of the analyses conducted, hierarchical clustering as well as distribution fitting yielded the most promise, although both methods were susceptible to outliers. The Gaussian model, for example, fits the data for the products quite accurately but is less accurate for higher masses. In conclusion, it is found that finding the desired similarity measure is extremely challenging.
Metadata
Supervisors: | Wilson, Julie |
---|---|
Awarding institution: | University of York |
Academic Units: | The University of York > Mathematics (York) |
Depositing User: | Mr Matthew Watson |
Date Deposited: | 05 Dec 2022 14:23 |
Last Modified: | 05 Dec 2022 14:23 |
Open Archives Initiative ID (OAI ID): | oai:etheses.whiterose.ac.uk:31959 |
Download
Examined Thesis (PDF)
Filename: WATSON_202031087_ThesisCleaned.pdf
Licence:
This work is licensed under a Creative Commons Attribution NonCommercial NoDerivatives 4.0 International License
Export
Statistics
You do not need to contact us to get a copy of this thesis. Please use the 'Download' link(s) above to get a copy.
You can contact us about this thesis. If you need to make a general enquiry, please see the Contact us page.