Speech Analytics for the Detection of Neurological Conditions in Global Varieties of English

Abstract

Dementia refers to the decline of cognition and memory due to neurological conditions. Both speech content and acoustics can be analysed as biomarkers to detect dementia, offering a promising, cost-effective alternative to traditional methods. While speech-based dementia classification has shown success for L1 (native) speakers, this thesis explores the challenges posed by L2 (non-native) English speakers.

Investigating Automatic Speech Recognition (ASR) and its performance across global English varieties, we identify a disconnect between advertised and real-world system capabilities. ASR systems often reduce L2 language diversity, treating fine-tuned derivations of the same models as unique systems. We analyse ASR evaluation methods and propose a new multi-metric standard inspired by advances in Machine Translation. Examining ASR performance in dementia datasets, we find only a small deviation between L1 and L2 speech using the latest models, suggesting potential reliability for transcribing non-native speech. However, L2 speech produces speech analytics that cluster more closely with L1 speakers with dementia than with healthy L1 speakers, particularly in lexical and syntactic complexity metrics. This thesis proposes a framework for classification pipeline evaluation focused on measuring feature fluctuation as an interpretable framework for understanding inaccuracies within analytics directly fed into our classification models.

This system design approach aims to support clinician-facing reports and improve awareness of healthcare equity outcomes. To deploy these systems, we must better understand how different configurations impact minority voices. We propose longitudinal, speaker-dependent analytics as a means of calibrating dementia classification pipelines to a profile of speech analytics rather than single-session data. This approach would contextualise ‘healthy’ in the context of each speaker’s voice. Additionally, we suggest redefining healthy control classifications to better reflect cognitive concerns. Finally, we advocate for a more integrated approach to Speech Technology development for dementia detection, ensuring system outputs align with clinician-facing reports, patient-facing reports, and assurance protocols.

Metadata

Supervisors:	Christensen, Heidi and Blackburn, Daniel
Related URLs:	Evaluating the performance of state-of-the-art ASR systems on non-native English using corpora with extensive language background variation (Related publication) Towards Disfluency Features for Speech Technology Based Automatic Dementia Classification (Related publication)
Keywords:	Speech Technologies, Automatic Speech Recognition, Dementia Detection, EDI, Machine Learning
Awarding institution:	University of Sheffield
Academic Units:	The University of Sheffield > Faculty of Engineering (Sheffield) > Computer Science (Sheffield)
Date Deposited:	09 Feb 2026 13:50
Last Modified:	09 Feb 2026 13:50
Open Archives Initiative ID (OAI ID):	oai:etheses.whiterose.ac.uk:38079

Download

Final eThesis - complete (pdf)

Embargoed until: 9 February 2027

This file cannot be downloaded or requested.

Filename: PhD-Thesis_SCHMÜCK_WhiteRose.pdf

Description: PhD Thesis

You can contact us about this thesis. If you need to make a general enquiry, please see the Contact us page.

Speech Analytics for the Detection of Neurological Conditions in Global Varieties of English

Abstract

Metadata

Download

Final eThesis - complete (pdf)

Export

Statistics