Freeman, Timothy Martin ORCID: https://orcid.org/0000-0002-4497-681X (2021) Quality and clinical utility of genomic variants in complex diseases. PhD thesis, University of Sheffield.
Abstract
Continuous improvements in high-throughput genomic sequencing over the past two decades have made it exponentially faster and cheaper, enabling its routine use in the clinic and scientific research. Genomic prognostic tools make use of personalised genomic data to aid clinical decision making and inform patients of disease outcomes, allowing enhanced tailoring of treatment beyond traditional prognostic tools, which are insufficient for understanding the nuances of individual complex disease cases. This relies upon accurate sequencing data and effective quality control. We have developed improved genomic prognostic tools for use in the clinic and demonstrate a novel method for quality control of genomic sequencing data with broad applicability.
Non-small-cell lung cancer (NSCLC) is the second most common cancer type in both males and females globally. Previous attempts to predict survival time for cancer patients have used genomic prognostic tools based on the burden of tumour mutations and neoantigens, but with limited success. We developed greatly improved classifiers of tumour mutation and neoantigenic burden showing strong 5-year survival differences between early-stage NSCLC patients. By using these together, we showed additional increases in prognostic efficacy, with the best survival group displaying a ~92% decreased risk of death in a 5-year period compared to the worst survival group.
To improve the accuracy of sequencing data for uses such as this, we developed the first tool for automatically cataloguing systematic sequencing biases for a sequencing pipeline, and we demonstrated its value in human and SARS-CoV-2 sequencing quality control with Illumina and Oxford Nanopore sequencing. We discovered and blacklisted a range of false positive variants, and investigated the causes of these. Identifying these errors contributed to multiple studies, altering research conclusions. We share these tools to provide continued improvements to genomic prognostics and sequencing accuracy affecting a wide range of fields.
Metadata
Supervisors: | Wang, Dennis |
---|---|
Related URLs: |
|
Keywords: | cancer genomics; prognosis; mutation burden; copy number; patient stratification; cancer immunology; oncogenic pathways; patient-derived xenograft; sequencing bias; sequencing error; whole genome; quality control; COVID-19; SARS-CoV-2; Spike; diversity; evolution; |
Awarding institution: | University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Medicine, Dentistry and Health (Sheffield) > Medicine (Sheffield) |
Identification Number/EthosID: | uk.bl.ethos.846601 |
Depositing User: | Mr Timothy Freeman |
Date Deposited: | 10 Jan 2022 12:13 |
Last Modified: | 01 Mar 2022 10:53 |
Open Archives Initiative ID (OAI ID): | oai:etheses.whiterose.ac.uk:29976 |
Download
Final eThesis - complete (pdf)
Filename: PhD_thesis_Timothy_Freeman.pdf
Description: A thesis submitted in partial fulfilment of the requirements for the degree of Doctor of Philosophy
Licence:
This work is licensed under a Creative Commons Attribution NonCommercial NoDerivatives 4.0 International License
Export
Statistics
You do not need to contact us to get a copy of this thesis. Please use the 'Download' link(s) above to get a copy.
You can contact us about this thesis. If you need to make a general enquiry, please see the Contact us page.