Williams, Samantha ORCID: https://orcid.org/0000-0001-7661-6950
(2024)
Interpretable by design: An interpretability-centred approach to automatic L1 identification from L2 English speech.
PhD thesis, University of York.
Abstract
As AI and machine learning methods are increasingly used for high-stakes decision-making, the challenges surrounding understanding how these decisions are made has emerged as a critical concern, prompting a push towards the development of more interpretable methods. The field of forensic speech science (FSS) presents a particularly demanding context for exploring interpretability not only due to legal and regulatory requirements, but also the responsibility to the public. Although this is a broader challenge facing the field, the research presented in this thesis focuses on the gap in systems for automatic L1 identification from L2 speech, specifically its evidential applications.
This research adopts an interpretability-centred approach exploring: 1) how interpretability can be integrated into the design of an automatic L1 identification system, and 2) whether such design choices can address the limitations of existing systems for forensic applications. The thesis investigated three methods for integrating interpretability and contributed practical strategies for evaluation within an FSS context. A novel phoneme modelling method was introduced, the Variational Autoencoder with Spectrogram Reconstruction (VAE+SR) method, which enables real-time exploration of which underlying speech features are captured in the encodings. Additionally, existing forensic methods, MVKD and Bayesian Networks, are newly applied to the L1 identification task and explored as a means of improving different aspects of interpretability. Results demonstrated that while the presented methods offered ways to address the limitations of existing systems, there is still room for development as new limitations emerge.
As the field continues to evolve, this work demonstrates how deliberate design choices that prioritise interpretability can help forensic phonetics practitioners present data-driven decisions that can be explained and supported by existing research in linguistics and phonetics and encourages a rethinking of how we may integrate interpretability into system design.
Metadata
Supervisors: | Foulkes, Paul and Hughes, Vincent |
---|---|
Related URLs: | |
Keywords: | interpretability, L1 identification, explainable AI, acoustics, forensic speech science, L2 English |
Awarding institution: | University of York |
Academic Units: | The University of York > Language and Linguistic Science (York) |
Depositing User: | Samantha Williams |
Date Deposited: | 25 Jun 2025 15:43 |
Last Modified: | 25 Jun 2025 15:43 |
Open Archives Initiative ID (OAI ID): | oai:etheses.whiterose.ac.uk:37025 |
Download
Examined Thesis (PDF)
Embargoed until: 25 June 2026
Please use the button below to request a copy.
Filename: Williams_205043102_Thesis_2024.pdf

Export
Statistics
Please use the 'Request a copy' link(s) in the 'Downloads' section above to request this thesis. This will be sent directly to someone who may authorise access.
You can contact us about this thesis. If you need to make a general enquiry, please see the Contact us page.