Park, Chanho ORCID: https://orcid.org/0000-0001-6671-1671
(2025)
Data Selection Methods for Semi-Supervised Learning in Automatic Speech Recognition.
PhD thesis, University of Sheffield.
Abstract
For an automatic speech recognition (ASR) system to perform well on new target data, it must be trained on data from the same domain as the target. Semi-supervised learning, which integrates both manually transcribed and untranscribed data, can be beneficial in this context. However, manually transcribed data are often unavailable. This challenge can be addressed by generating a training dataset from a data pool whenever new target data are introduced.
By employing the methods proposed in this thesis, training data can be effectively selected from a multi-domain untranscribed data pool, leading to improved ASR performance. The findings of this research demonstrate the potential for leveraging large amounts of untranscribed data from multiple sources to enhance ASR systems.
Metadata
Supervisors: | Hain, Thomas |
---|---|
Keywords: | semi-supervised learning, automatic speech recognition, data selection, word error rate, word error rate estimation, character error rate, character error rate estimation, domain similarity measurement, acoustic domain similarity, linguistic domain similarity, deep metric learning, multi-layer perceptron, multi-domain data |
Awarding institution: | University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Computer Science (Sheffield) The University of Sheffield > Faculty of Science (Sheffield) > Computer Science (Sheffield) |
Depositing User: | Dr Chanho Park |
Date Deposited: | 17 Feb 2025 16:55 |
Last Modified: | 17 Feb 2025 16:55 |
Open Archives Initiative ID (OAI ID): | oai:etheses.whiterose.ac.uk:36289 |
Download
Final eThesis - complete (pdf)
Embargoed until: 17 February 2026
Please use the button below to request a copy.
Filename: Park, Chanho, 200271413 (corrected).pdf

Export
Statistics
Please use the 'Request a copy' link(s) in the 'Downloads' section above to request this thesis. This will be sent directly to someone who may authorise access.
You can contact us about this thesis. If you need to make a general enquiry, please see the Contact us page.