Mehboob, Javeria (2025) Investigating determinants of periplasmic protein solubility in Escherichia coli through expression optimisation and N-terminal proteomics. PhD thesis, University of York.
Abstract
Escherichia coli (E. coli) is the organism of choice for recombinant protein production in both industrial and academic settings. Over the years, E. coli has revolutionised bioprocessing due to its well-characterised genome, fast growth kinetics and low cost, providing time and optimisation-based advantages. Most therapeutic proteins such as human insulin, monoclonal antibody fragments and pharmaceutically relevant proteins are produced in E. coli. Despite all of the aforementioned examples of periplasmic protein production, E. coli can present complications which incur additional solubilisation steps and costs in small and large-scale bioprocesses. Whilst protein production in the E. coli periplasm is well studied, key knowledge gaps remain regarding protein insolubility due to translocation.
In this research, substrate binding proteins from E. coli (SapA) and Serratia marcescens (YppA), produced and localised in the periplasm were used as a test set to understand protein solubility. Various expression optimisation approaches as well as proteomics methods were utilised to characterise factors affecting the solubility of the two target proteins.
For EcSapA, a wide variety of expression optimisation methods were applied: with particular emphasis on protein primary sequencing studying the effects of Lys to Arg ratio along with manipulating various plasmid strain combinations to solubilise SapA. Membrane protein production strains were shown to withstand soluble production of EcSapA, which enabled further exploration of the properties of production strain and the predictive structural basis of the protein.
For SmYppA, unlabelled and labelled proteomics approaches were utilised to reveal its temperature dependant solubility. This research has produced a novel proteomics dataset for studying the effects of temperature, induction and media on the solubility of a periplasmic substrate binding protein in E. coli. The HUNTER N-terminal proteomics method yielded high quality peptide data providing extensive information on the host proteome as well as detailed insights into the state and quality of the recombinant SmYppA. The data presented in this thesis highlights secretion defects along with evidence of mis-cleavages revealing novel signal recognition sequences for the signal peptidase. This work demonstrates the applicability of labelling-based mass spectrometry methods for understanding the complex solubility phenotype.
Metadata
| Supervisors: | Thomas, Gavin and Dowle, Adam |
|---|---|
| Keywords: | Escherichia coli, N-terminal proteomics, Mass spectrometry. |
| Awarding institution: | University of York |
| Academic Units: | The University of York > Biology (York) |
| Date Deposited: | 16 Jan 2026 15:24 |
| Last Modified: | 16 Jan 2026 15:24 |
| Open Archives Initiative ID (OAI ID): | oai:etheses.whiterose.ac.uk:38056 |
Download
Examined Thesis (PDF)
Embargoed until: 16 January 2027
Please use the button below to request a copy.
Filename: Mehboob_208059200_PhD_Corrected_Thesis_Clean.pdf
Export
Statistics
Please use the 'Request a copy' link(s) in the 'Downloads' section above to request this thesis. This will be sent directly to someone who may authorise access.
You can contact us about this thesis. If you need to make a general enquiry, please see the Contact us page.