Kempton, Timothy (2012) Machine-Assisted Phonemic Analysis. PhD thesis, University of Sheffield.
Abstract
There is a consensus between many linguists that half of all languages risk disappearing by the end of the century. Documentation is agreed to be a priority. This includes the process of phonemic analysis to discover the contrastive sounds of a language with the resulting benefits of further linguistic analysis, literacy, and access to speech technology. A machine-assisted approach to phonemic analysis has the potential to greatly speed up the process and make the analysis more objective.
Good computer tools are already available to help in a phonemic analysis, but these primarily provide search and sort database functionality, rather than automated analysis. In computational phonology there have been very few studies on the automated discovery of phonological patterns from surface level data such as narrow phonetic transcriptions or acoustics.
This thesis addresses the lack of research in this area. The key scientific question underpinning the work in this thesis is "To what extent can a machine algorithm contribute to the procedures needed for a phonemic analysis?". A secondary question is "What insights does such a quantitative evaluation give about the contribution of each of these procedures to a phonemic analysis?"
It is demonstrated that a machine-assisted approach can make a measurable contribution to a phonemic analysis for all the procedures investigated; phonetic similarity, phone recognition & alignment, complementary distribution, and minimal pairs. The evaluation measures introduced in this thesis allows a comprehensive quantitative comparison between these phonemic analysis procedures. Given the best available data and the machine-assisted procedures described, there is a strong indication that phonetic similarity is the most important piece of evidence in a phonemic analysis.
The tools and techniques developed in this thesis have resulted in tangible benefits to the analysis of two under-resourced languages and it is expected that many more languages will follow.
Metadata
Supervisors: | Moore, R. K. |
---|---|
Awarding institution: | University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Computer Science (Sheffield) The University of Sheffield > Faculty of Science (Sheffield) > Computer Science (Sheffield) |
Identification Number/EthosID: | uk.bl.ethos.564157 |
Depositing User: | Mr Timothy Kempton |
Date Deposited: | 09 Jan 2013 14:44 |
Last Modified: | 05 Dec 2023 12:22 |
Open Archives Initiative ID (OAI ID): | oai:etheses.whiterose.ac.uk:3122 |
Download
Final eThesis - complete (pdf)
Filename: KemptonPhdThesis21Nov2012.pdf
Description: PhD Thesis
Licence:
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 2.5 License
Export
Statistics
You do not need to contact us to get a copy of this thesis. Please use the 'Download' link(s) above to get a copy.
You can contact us about this thesis. If you need to make a general enquiry, please see the Contact us page.