White Rose University Consortium logo
University of Leeds logo University of Sheffield logo York University logo

Answer Re-ranking with bilingual LDA and social QA forum corpus

Katsura, Akihiro (2014) Answer Re-ranking with bilingual LDA and social QA forum corpus. MSc by research thesis, University of York.

[img]
Preview
Text
Thesis for MSc by Research rev6.1.pdf
Available under License Creative Commons Attribution-Noncommercial-No Derivative Works 2.0 UK: England & Wales.

Download (33Mb) | Preview

Abstract

One of the most important tasks for AI is to find valuable information from the Web. In this research, we develop a question answering system for retrieving answers based on a topic model, bilingual latent Dirichlet allocation (Bi-LDA), and knowledge from social question answering (SQA) forum, such as Yahoo! Answers. Regarding question and answer pairs from a SQA forum as a bilingual corpus, a shared topic over question and answer documents is assigned to each term so that the answer re-ranking system can infer the correlation of terms between questions and answers. A query expansion approach based on the topic model obtains a 9% higher top-150 mean reciprocal rank (MRR@150) and a 16% better geometric mean rank as compared to a simple matching system via Okapi/BM25. In addition, this thesis compares the performance in several experimental settings to clarify the factor of the result.

Item Type: Thesis (MSc by research)
Academic Units: The University of York > Computer Science (York)
Depositing User: Mr Akihiro Katsura
Date Deposited: 24 Nov 2014 17:02
Last Modified: 24 Nov 2014 17:02
URI: http://etheses.whiterose.ac.uk/id/eprint/7358

You do not need to contact us to get a copy of this thesis. Please use the 'Download' link(s) above to get a copy.
You can contact us about this thesis. If you need to make a general enquiry, please see the Contact us page.

Actions (repository staff only: login required)