Woolston, Andrew Stephen (2012) Working with collinearity in epidemiology: development of collinearity diagnostics, identifying latent constructs in exploratory research and dealing with perfectly collinear variables in regression. PhD thesis, University of Leeds.
Abstract
Collinearity plays an integral role in regression studies involving epidemiological data. Variables often form part of a common biological mechanism or measure the same element of a latent structure. It is a natural feature of most data and as such it is rarely possible to physically control for collinearity in data collection. A focus is placed on the analytical assessment of the data. Departures from independence can severely distort the interpretation of a model and the role of each covariate. This leads to increased inaccuracy as expressed through the regression coefficients and increased uncertainty as expressed through coefficient standard errors. Such a feature has the potential to impact on the clinical conclusions formed from regression studies.
The work in this thesis first considers an assessment of the impact of collinearity on model parameters and the conclusions formed. A new collinearity index is developed which incorporates the role of the response in moderating the impact of collinearity. The idea for the new index is developed using vector geometry and extended to a general measure. The work in collinearity is later extended to consider the formation of a dependency structure from a collection of collinear variables. A novel methodology, labelled the matroid approach, is coded and implemented on a metabolic syndrome dataset to extract a latent structure that could represent this clinical construct. Comparisons are subsequently made to existing exploratory factor analysis and clustering methods in the literature. Finally, the unique problem of perfect collinearity is considered in a lifecourse and age-period-cohort setting. The justification of constraint and non-constraint regression methods is considered in an attempt to provide ‘solutions’ to the identification problem generated by collinearity.
Metadata
Supervisors: | Gilthorpe, M. and Tu, Y.K. and Baxter, P. |
---|---|
ISBN: | 978-0-85731-227-3 |
Awarding institution: | University of Leeds |
Academic Units: | The University of Leeds > Faculty of Medicine and Health (Leeds) > School of Medicine (Leeds) |
Identification Number/EthosID: | uk.bl.ethos.559148 |
Depositing User: | Repository Administrator |
Date Deposited: | 19 Nov 2012 14:08 |
Last Modified: | 07 Mar 2014 11:21 |
Open Archives Initiative ID (OAI ID): | oai:etheses.whiterose.ac.uk:2951 |
Download
Woolston_AS_Medicine_PhD_2012
Filename: Woolston_AS_Medicine_PhD_2012.pdf
Licence:
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 2.5 License
Export
Statistics
You do not need to contact us to get a copy of this thesis. Please use the 'Download' link(s) above to get a copy.
You can contact us about this thesis. If you need to make a general enquiry, please see the Contact us page.