- contents -- entities -- signals -- coreference -- relationships -- general guidelines -- recipe -- histopath -- radiology -
- previous -- next -
The annotations: a summary
An annotation is a piece of information attached to some text, usually describing the text in some way. An annotation may be:
- attached to a particular region of a document, such as to a word or group of words.
- attached to the document as a whole, and independent of a particular span of text
In CLEF, we are interested in the sorts of entities found in clinical documents: drugs, body parts, diseases and so on. Annotating a document is the task of marking the mentions of these entities in a document, describing their type, and perhaps adding other annotations to the document to describe the relationships between these things. Typically, annotation software will display annotations by highlighting the text to which it is attached with some colour.
This section gives descriptions of the CLEF gold standard annotations for entities and relationships, and what things they refer to. It tells you:
- What CLEF annotations stand for
- How to make sense of annotations in a ready-annotated CLEF document
- What an annotation means if you find it attached to a piece of text in a CLEF document
It does not tell you:
- How to add annotations to an un-annotated document
- The detail of the mapping between surface text and either entities or relationships.
- It does not say how to decide whether a piece of surface text should have an annotation.
For these things, refer to the sections on annotating entites and annotating relationships in text.