White Rose University Consortium logo
University of Leeds logo University of Sheffield logo York University logo

Voice Synthesis using the Three-Dimensional Digital Waveguide Mesh

Speed, Matthew DA (2012) Voice Synthesis using the Three-Dimensional Digital Waveguide Mesh. PhD thesis, University of York.

[img]
Preview
Text
Speed_Thesis_Corrected.pdf
Available under License Creative Commons Attribution-Noncommercial-No Derivative Works 2.0 UK: England & Wales.

Download (7Mb)

Abstract

The acoustic response of the vocal tract is fundamental to our interpretation of voice production. As an acoustic filter, it shapes the spectral envelope of vocal fold vibration towards resonant modes, or formants, whose behaviours form the most basic building blocks of phonetics. Physical models of the voice exploit this effect by modelling the nature of wave propagation in abstracted cylindrical constructs. Whilst effective, the accuracy of such approaches is limited due to their limited geometrical analogue. Developments in numerical acoustics modelling meanwhile have seen the formalisation of higher dimensionality configurations of the same technologies, allowing a much closer geometrical representation of an acoustic field. The major focus of this thesis is the application of such a technique to the vocal tract, and comparison of its performance with lower dimensionality approaches. To afford the development of such models, a body of data is collected from Magnetic Resonance Imaging for a range of subjects, and procedures are developed for the decomposition of this imaging into suitable, efficient data structures for simulation. The simulation technique is exhaustively validated using a combination of bespoke measurement/inversion techniques and analytical determination of lower frequency behaviours. Finally, voice synthesis based on each numerical model is compared with acoustic recordings of the subjects involved and with equivalent simulations from lower dimensionality methods. It is found that application of a higher dimensionality method typically yields a more accurate frequency-domain representation of the voice, although in some cases lower dimensionality equivalents are seen to perform better at low frequencies.

Item Type: Thesis (PhD)
Keywords: Voice Synthesis, Digital Waveguide, Vocal Tract, Acoustics Simulation
Academic Units: The University of York > Electronics (York)
Depositing User: Mr Matthew DA Speed
Date Deposited: 04 Oct 2012 13:40
Last Modified: 08 Aug 2013 08:50
URI: http://etheses.whiterose.ac.uk/id/eprint/2800

Actions (repository staff only: login required)