Extending the Graphical Representation of four KEGG Pathways for a Better Understanding of Prostate Cancer Using Machine Learning of Graphical models

Abstract

This thesis shows a novel contribution to computational biology alongside with developed machine learning methods. It shows how the graphical representation of KEGG pathways can be refined using machine learning of graphical models. The focus mainly is on a set of graphical models called Bayesian networks. Throughout this thesis , different ways of learning Bayesian networks are discussed. The work is based on Affymetrix gene expression microarray profiles
and penalised Gaussian linear models. Penalisation in linear models includes choosing the most important parents and estimating the associated coefficients simultaneously using L1-regression. The sparse dataset that is generated from Affymetrix microarray technology is the key point in this thesis when learning Bayesian networks. Thus, the work in this thesis can be viewed as developing robust methods to avoid overfitting that usually associated with gene expression datasets and contributing to invoke more details about a well known discrepancy in KEGG pathways. So,the problem we have is to learn from a large number of candidates, small samples,(p>>n), and for such problem the goal is to apply model selection methods that hopefully achieve an accurate prediction , interpretable models, and stable models. The prediction and the most powerful predictors can be improved by using methods that trade-off between bias and variance. Also, providing which predictors are meaningful rather than using all predictors will provide interpretable models, and finally by choosing the most important predictors, a small change in the data will not result in large changes in the subset of predictors which consequently gives the stability to the models that are learnt.

Metadata

Supervisors:	Cussens, James
Awarding institution:	University of York
Academic Units:	The University of York > Computer Science (York)
Identification Number/EthosID:	uk.bl.ethos.547333
Depositing User:	MR ADEL ABDULLAH M ALORAINI
Date Deposited:	08 Nov 2011 15:13
Last Modified:	08 Sep 2016 12:21
Open Archives Initiative ID (OAI ID):	oai:etheses.whiterose.ac.uk:1711

Download

ThesisMain

Filename: ThesisMain.pdf

Licence:
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 2.5 License

CLICK TO DOWNLOAD

You do not need to contact us to get a copy of this thesis. Please use the 'Download' link(s) above to get a copy.
You can contact us about this thesis. If you need to make a general enquiry, please see the Contact us page.

Extending the Graphical Representation of four KEGG Pathways for a Better Understanding of Prostate Cancer Using Machine Learning of Graphical models

Abstract

Metadata

Download

ThesisMain

Export

Statistics