Audio-Visual Speech Processing for Multimedia Localisation

Benatan, Matthew Aaron (2016) Audio-Visual Speech Processing for Multimedia Localisation. PhD thesis, University of Leeds.

Abstract

Metadata

Supervisors: Ng, Kia and Bulpitt, Andy and Magee, Derek
Keywords: Voice Activity Detection, Visual Voice Activity Detection, Speech Processing, Visual Speech Processing, Multimedia Alignment, Audio Alignment
Awarding institution: University of Leeds
Academic Units: The University of Leeds > Faculty of Engineering (Leeds) > School of Computing (Leeds)
Identification Number/EthosID: uk.bl.ethos.703358
Depositing User: Mr Matthew Benatan
Date Deposited: 20 Feb 2017 13:15
Last Modified: 25 Jul 2018 09:54

Download

Final eThesis - complete (pdf)

Export

Statistics


You do not need to contact us to get a copy of this thesis. Please use the 'Download' link(s) above to get a copy.
You can contact us about this thesis. If you need to make a general enquiry, please see the Contact us page.