White Rose University Consortium logo
University of Leeds logo University of Sheffield logo York University logo

A Ranking Approach to Summarising Twitter Home Timelines

Rout, Dominic (2015) A Ranking Approach to Summarising Twitter Home Timelines. PhD thesis, University of Sheffield.

[img]
Preview
Other (PDF File)
dominic_rout_thesis.pdf
Available under License Creative Commons Attribution-Noncommercial-No Derivative Works 2.0 UK: England & Wales.

Download (1405Kb) | Preview

Abstract

The rise of social media services has changed the ways in which users can communicate and consume content online. Whilst online social networks allow for fast and convenient delivery of knowledge, users are prone to information overload when too much information is presented for them to read and process. Automatic text summarisation is a tool to help mitigate information overload. In automatic text summarisation, short summaries are generated algorithmically from extended text, such as news articles or scientific papers. This thesis addresses the challenges in applying text summarisation to the Twitter social network. It also goes beyond text, exploiting additional information that is unique to social networks to create summaries which are personal to an intended reader. Unlike previous work in tweet summarisation, the experiments here address the home timelines of readers, which contain the incoming posts from authors to whom they have explicitly subscribed. A novel contribution is made in this work the form of a large gold standard ($19,350$ tweets), the majority of which will be shared with the research community. The gold standard is a collection of timelines that have been subjectively annotated by the readers to whom they belong, allowing fair evaluation of summaries which are not limited to tweets of general interest, but which are specific to the reader. Where the home timeline is used by professional users for social media analysis, automatic text summarisation can be applied to give results which beat all baselines. In the general case, where no limitation is placed on the types of readers, personalisation features which exploit the relationship between author and reader and the reader's own previous posts, were shown to outperform both automatic text summarisation and all baselines.

Item Type: Thesis (PhD)
Academic Units: The University of Sheffield > Faculty of Engineering (Sheffield) > Computer Science (Sheffield)
The University of Sheffield > Faculty of Science (Sheffield) > Computer Science (Sheffield)
Identification Number/EthosID: uk.bl.ethos.675076
Depositing User: Mr Dominic Rout
Date Deposited: 11 Dec 2015 15:29
Last Modified: 03 Oct 2016 12:19
URI: http://etheses.whiterose.ac.uk/id/eprint/11213

Actions (repository staff only: login required)