Sun, Tianda (2022) Relation Extraction from Financial Reports. MSc by research thesis, University of York.
Abstract
This project mainly focuses on using deep learning methods to extract relations
from the so called 10-K SEC financial reports, and adds them to an ontology for
further use. A 10-K report is a comprehensive report submitted by public compa-
nies each year to publish their financial performance. In the US, the 10-K reports
are required by the U.S Security and Exchange Commission (SEC) to provide the
investors with information on a company on which they can base their decisions to
invest. It is far more detailed than the annual report where it describes the com-
pany’s potential to succeed so it is useful for investors to refer to. In this research,
we mainly focus on the distant supervision method to construct the dataset from
the Financial Industry Business Ontology(FIBO) [2] and evaluate the performance
of two distant supervision relation extraction models. Additionally, we discuss the
potential flaws of distant supervision method on this task and investigate some pos-
sible improvements such as anaphora resolution to enhance the knowledge base,
and point out further research direction for the domain-specific relation extraction
area. In addition, this research provides results to Can Erten, a PhD student at the
University of York, who will use the ontology from the reports in his research.
Metadata
Supervisors: | Dimitar, Kazakov |
---|---|
Keywords: | Relation extraction, Natural language processing, Anaphora resolution |
Awarding institution: | University of York |
Academic Units: | The University of York > Computer Science (York) |
Depositing User: | Mr Tianda Sun |
Date Deposited: | 20 Jun 2022 10:41 |
Last Modified: | 20 Dec 2022 00:31 |
Open Archives Initiative ID (OAI ID): | oai:etheses.whiterose.ac.uk:30949 |
Download
Examined Thesis (PDF)
Filename: Sun_204000368_CorrectedThesisClean (3).pdf
Description: Thesis of Master by Research
Licence:
This work is licensed under a Creative Commons Attribution NonCommercial NoDerivatives 4.0 International License
Export
Statistics
You do not need to contact us to get a copy of this thesis. Please use the 'Download' link(s) above to get a copy.
You can contact us about this thesis. If you need to make a general enquiry, please see the Contact us page.