Roller, Roland (2015) Detecting Biomedical Relations using Distant Supervision. PhD thesis, University of Sheffield.
Abstract
This work concerns the detection of relationships between key information in biomedical publications, such as treatments for diseases or side-effects of drugs. Given a sentence containing some medical concepts the goal is to determine their relationship to each other.
Supervised machine learning methods are a very popular way to address this problem and often provide reliable results. Those methods require manually labelled examples to extract characteristics of particular relationships in order to detect similar information in unlabelled data. However, manually labelled data is not always available and its generation is time consuming and expensive.
The main objective of this thesis is the exploration of distant supervision, a method which generates those labelled examples automatically using prior knowledge to detect relationships between key facts.
First, relation extraction using a limited amount of training data is explored to detect adverse-drug effects in natural language. Then, work focuses on automatically labelling data using a large biomedical knowledge base, the Unified Medical Language System (UMLS). The effectiveness of a popular evaluation method that does not require manually labelled data is examined in more detail. The main goal is the investigation of whether UMLS is suitable to be used to label data automatically so as to detect similar information in natural language. Finally, a method to reduce falsely labelled instances in the automatically generated data is presented and found to improve the detection of relationships.
Metadata
Supervisors: | Mark, Stevenson |
---|---|
Awarding institution: | University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Computer Science (Sheffield) The University of Sheffield > Faculty of Science (Sheffield) > Computer Science (Sheffield) |
Identification Number/EthosID: | uk.bl.ethos.695993 |
Depositing User: | Roland Roller |
Date Deposited: | 04 Nov 2016 12:58 |
Last Modified: | 12 Oct 2018 09:29 |
Open Archives Initiative ID (OAI ID): | oai:etheses.whiterose.ac.uk:13892 |
Download
revised_version_roller
Filename: revised_version_roller.pdf
Licence:
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 2.5 License
Export
Statistics
You do not need to contact us to get a copy of this thesis. Please use the 'Download' link(s) above to get a copy.
You can contact us about this thesis. If you need to make a general enquiry, please see the Contact us page.