Setzer, Andrea (2002) Temporal information in newswire articles : an annotation scheme and corpus study. PhD thesis, University of Sheffield.
Abstract
Many natural language processing applications, such as information extraction, question
answering, topic detection and tracking, would benefit significantly from the ability to
accurately position reported events in time, either relatively with respect to other events or
absolutely with respect to calendrical time. However, relatively little work has been done
to date on the automatic extraction of temporal information from text.
Before we can progress to automatically position reported events in time, we must gain
an understanding of the mechanisms used to do this in language. This understanding can
be promoted through the development of all annotation scheme, which allows us to identify
the textual expressions conveying events, times and temporal relations in a corpus of 'real'
text.
This thesis describes a fine-grained annotation scheme with which we can capture all
events, times and temporal relations reported ill a text. To aid the application of the scheme
to text, a graphical annotation tool has been developed. This tool not only allows easy markup
of sophisticated temporal annotations, it also contains an interactive, inference-based
component supporting the gathering of temporal relations. The annotation scheme and the
tool have been evaluated through the construction of a trial corpus during a pilot study. In
this study, a group of annotators was supplied with a description of the annotation scheme
and asked to apply it to a trial corpus.
The pilot study showed that the annotation scheme was difficult to apply, but is feasible
with improvements to the definition of the annotation scheme and the tool. Analysis of
the resulting trial corpus also provides preliminary results on the relative extent to which
different linguistic mechanisms, explicit and implicit, are used to convey temporal relational
information in text.
Metadata
Keywords: | Natural language processing |
---|---|
Awarding institution: | University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Computer Science (Sheffield) The University of Sheffield > Faculty of Science (Sheffield) > Computer Science (Sheffield) |
Identification Number/EthosID: | uk.bl.ethos.247243 |
Depositing User: | EThOS Import Sheffield |
Date Deposited: | 06 Jan 2017 11:56 |
Last Modified: | 06 Jan 2017 11:56 |
Open Archives Initiative ID (OAI ID): | oai:etheses.whiterose.ac.uk:14436 |
You do not need to contact us to get a copy of this thesis. Please use the 'Download' link(s) above to get a copy.
You can contact us about this thesis. If you need to make a general enquiry, please see the Contact us page.