White Rose University Consortium logo
University of Leeds logo University of Sheffield logo York University logo

Detecting Biomedical Relations using Distant Supervision

Roller, Roland (2015) Detecting Biomedical Relations using Distant Supervision. PhD thesis, University of Sheffield.

Available under License Creative Commons Attribution-Noncommercial-No Derivative Works 2.0 UK: England & Wales.

Download (797Kb) | Preview


This work concerns the detection of relationships between key information in biomedical publications, such as treatments for diseases or side-effects of drugs. Given a sentence containing some medical concepts the goal is to determine their relationship to each other. Supervised machine learning methods are a very popular way to address this problem and often provide reliable results. Those methods require manually labelled examples to extract characteristics of particular relationships in order to detect similar information in unlabelled data. However, manually labelled data is not always available and its generation is time consuming and expensive. The main objective of this thesis is the exploration of distant supervision, a method which generates those labelled examples automatically using prior knowledge to detect relationships between key facts. First, relation extraction using a limited amount of training data is explored to detect adverse-drug effects in natural language. Then, work focuses on automatically labelling data using a large biomedical knowledge base, the Unified Medical Language System (UMLS). The effectiveness of a popular evaluation method that does not require manually labelled data is examined in more detail. The main goal is the investigation of whether UMLS is suitable to be used to label data automatically so as to detect similar information in natural language. Finally, a method to reduce falsely labelled instances in the automatically generated data is presented and found to improve the detection of relationships.

Item Type: Thesis (PhD)
Academic Units: The University of Sheffield > Faculty of Engineering (Sheffield) > Computer Science (Sheffield)
The University of Sheffield > Faculty of Science (Sheffield) > Computer Science (Sheffield)
Identification Number/EthosID: uk.bl.ethos.695993
Depositing User: Roland Roller
Date Deposited: 04 Nov 2016 12:58
Last Modified: 12 Oct 2018 09:29
URI: http://etheses.whiterose.ac.uk/id/eprint/13892

You do not need to contact us to get a copy of this thesis. Please use the 'Download' link(s) above to get a copy.
You can contact us about this thesis. If you need to make a general enquiry, please see the Contact us page.

Actions (repository staff only: login required)