Obi, Jude Chukwura (2016) Application of Statistical Computing to Statistical Learning. PhD thesis, University of Leeds.
Abstract
This study focuses on supervised learning, an aspect of statistical learning. Supervised learning is concerned with prediction, and prediction problems are distinguished by the type of output predicted, which is either a categorical or a continuous variable. If the output is categorical, the problem is one of classification; otherwise, it is one of regression. We therefore identify classification and regression as two prediction tools.
We further identify many features shared by these prediction tools and, as a result, suggest that it may be possible to use a regression function in classification, or vice versa. We therefore direct our research towards classification, and intend to:
(i) Compare two main classifiers, namely Fisher's Discriminant Analysis (FDA) and the Support Vector Machine (SVM), identifying their differences and similarities.
(ii) Introduce a regression-based classification function, Regression Discriminant Analysis (RDA).
(iii) Provide proof that RDA and FDA are identical.
(iv) Introduce other classification functions based on variants of multiple regression (ridge regression and the Lasso), namely Lasso Discriminant Analysis (LaDA) and Ridge Regression Discriminant Analysis (RRDA).
We then conduct experiments on real-world datasets to verify whether RDA and FDA give identical error rates on the same datasets. We conduct similar experiments to test whether the error rates of LaDA, RRDA, FDA and Regularized Fisher's Discriminant Analysis (RFDA) on the same datasets differ statistically significantly from one another. Finally, we explore the benefits that may derive from using LaDA as a classifier, particularly in connection with variable selection.
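The claimed identity between a regression-based discriminant and FDA can be illustrated numerically for the two-class case. The sketch below is not the thesis's RDA construction, which the abstract does not spell out; it assumes ordinary least squares on a 0/1 coded class label and a small made-up dataset with two features. Under those assumptions the regression slope vector and Fisher's direction (within-class scatter inverse times the mean difference) should point the same way, differing only by a positive scale factor.

```python
# Toy two-class data with two features (hypothetical values, for illustration only).
class_a = [(1.0, 2.0), (2.0, 3.0), (3.0, 3.0), (2.0, 1.0)]
class_b = [(5.0, 6.0), (6.0, 5.0), (7.0, 8.0), (6.0, 7.0), (8.0, 7.0)]


def mean(points):
    n = len(points)
    return (sum(p[0] for p in points) / n, sum(p[1] for p in points) / n)


def scatter(points, centre):
    """Symmetric 2x2 scatter matrix about `centre`, stored as (s11, s12, s22)."""
    s11 = sum((p[0] - centre[0]) ** 2 for p in points)
    s12 = sum((p[0] - centre[0]) * (p[1] - centre[1]) for p in points)
    s22 = sum((p[1] - centre[1]) ** 2 for p in points)
    return (s11, s12, s22)


def solve_sym_2x2(m, v):
    """Solve [[a, b], [b, c]] w = v by Cramer's rule."""
    a, b, c = m
    det = a * c - b * b
    return ((c * v[0] - b * v[1]) / det, (a * v[1] - b * v[0]) / det)


def fisher_direction(a, b):
    """Fisher's direction: pooled within-class scatter inverse times mean difference."""
    ma, mb = mean(a), mean(b)
    within = tuple(x + y for x, y in zip(scatter(a, ma), scatter(b, mb)))
    return solve_sym_2x2(within, (mb[0] - ma[0], mb[1] - ma[1]))


def regression_direction(a, b):
    """Least-squares slopes (intercept fitted) of a 0/1 label on the two features."""
    points = a + b
    labels = [0.0] * len(a) + [1.0] * len(b)  # assumed coding: class a = 0, class b = 1
    mx = mean(points)
    my = sum(labels) / len(labels)
    total = scatter(points, mx)
    cross = (
        sum((p[0] - mx[0]) * (y - my) for p, y in zip(points, labels)),
        sum((p[1] - mx[1]) * (y - my) for p, y in zip(points, labels)),
    )
    return solve_sym_2x2(total, cross)


def normalise(w):
    norm = (w[0] ** 2 + w[1] ** 2) ** 0.5
    return (w[0] / norm, w[1] / norm)


w_fda = normalise(fisher_direction(class_a, class_b))
w_reg = normalise(regression_direction(class_a, class_b))
print("FDA direction:       ", w_fda)
print("Regression direction:", w_reg)
```

The two printed unit vectors should agree up to floating-point error. The sketch compares only the discriminant directions; whether the resulting classification rules coincide also depends on the threshold and label coding, which are not examined here.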
Metadata
Supervisors: | Thwaites, Peter |
---|---|
Keywords: | Statistical Learning |
Awarding institution: | University of Leeds |
Academic Units: | The University of Leeds > Faculty of Maths and Physical Sciences (Leeds); The University of Leeds > Faculty of Maths and Physical Sciences (Leeds) > School of Mathematics (Leeds) > Statistics (Leeds) |
Identification Number/EthosID: | uk.bl.ethos.707058 |
Depositing User: | Dr. Jude Obi |
Date Deposited: | 03 Apr 2017 11:29 |
Last Modified: | 29 Apr 2019 15:00 |
Open Archives Initiative ID (OAI ID): | oai:etheses.whiterose.ac.uk:16741 |