White Rose University Consortium logo
University of Leeds logo University of Sheffield logo York University logo

Computational studies of protein interactions and genetic regulation

Ward, Joseph (2013) Computational studies of protein interactions and genetic regulation. PhD thesis, University of Leeds.

Available under License Creative Commons Attribution-Noncommercial-Share Alike 2.0 UK: England & Wales.

Download (6Mb) | Preview


The work in this thesis is split into two parts. The introduction and following two chapters pertain to the investigation of gene regulation using Chip-seq data and linear modelling. The final chapter pertains to the prediction of hot-spot residues in protein-protein interactions. The rapid escalation in the speed and quality of DNA sequencing has lead to a wealth of data for the location of transcription factor binding and histone modifications across the genomes. Using Chromatin ImmunoPrecipitation followed by sequencing (ChIP-seq) data, we have generated a new binding metric based on the enrichment of the read-counts for each gene. Eight datasets from mouse macrophage cells (two histone modifications, five transcription factors, DNase I hypersensitivity) were used to model the binding of RNA polymerase II. It was found that a linear model just using the DNase I hypersensitivity and histone modification data was better than any of the models containing the transcription factor data. Investigation of the outlying genes for the model revealed no pattern in their Gene Ontology terms or macrophagespecific genes. Human embryonic stem cell data (23 transcription factor and 24 histone modification datasets) were used in combination with LASSO regression to model the binding of RNA polymerase II. The resultant models contrasted with the results from the mouse macrophage linear models in that using the histone modifications data in combination with the transcription factor data lead to the best models. A much more complicated picture of the regulation of RNA polymerase II binding was produced using the LASSO models. Protein-protein interactions are essential for every function within a cell and being able to predict them has large consequences for drug discovery and understanding the vast proteininteraction networks that occur within cells. Predicting protein-protein interactions is difficult due to the large number of possible conformations; predicting hot-spot residues can greatly reduce this. InterBasePro was compared with experimental data and subsequently adaptation was done to assess its usefulness for predicting hot-spot residues. An alternative approach was also made into classifying hot-spot residues based on atomic contacts.

Item Type: Thesis (PhD)
ISBN: 978-0-85731-417-8
Academic Units: The University of Leeds > Faculty of Biological Sciences (Leeds)
Identification Number/EthosID: uk.bl.ethos.581746
Depositing User: Repository Administrator
Date Deposited: 04 Nov 2013 12:51
Last Modified: 07 Mar 2014 11:28
URI: http://etheses.whiterose.ac.uk/id/eprint/4665

You do not need to contact us to get a copy of this thesis. Please use the 'Download' link(s) above to get a copy.
You can contact us about this thesis. If you need to make a general enquiry, please see the Contact us page.

Actions (repository staff only: login required)