White Rose University Consortium logo
University of Leeds logo University of Sheffield logo York University logo

Iterative Separation of Note Events from Single-Channel Polyphonic Recordings

Delgado Castro, Alejandro (2019) Iterative Separation of Note Events from Single-Channel Polyphonic Recordings. PhD thesis, University of York.

[img] Text
PhD_Thesis_DelgadoCastro.pdf - Examined Thesis (PDF)
Restricted until 29 October 2020.

Request a copy

Abstract

This thesis is concerned with the separation of audio sources from single-channel polyphonic musical recordings using the iterative estimation and separation of note events. Each event is defined as a section of audio containing largely harmonic energy identified as coming from a single sound source. Multiple events can be clustered to form separated sources. This solution is a model-based algorithm that can be applied to a large variety of audio recordings without requiring previous training stages. The proposed system embraces two principal stages. The first one considers the iterative detection and separation of note events from within the input mixture. In every iteration, the pitch trajectory of the predominant note event is automatically selected from an array of fundamental frequency estimates and used to guide the separation of the event's spectral content using two different methods: time-frequency masking and time-domain subtraction. A residual signal is then generated and used as the input mixture for the next iteration. After convergence, the second stage considers the clustering of all detected note events into individual audio sources. Performance evaluation is carried out at three different levels. Firstly, the accuracy of the note-event-based multipitch estimator is compared with that of the baseline algorithm used in every iteration to generate the initial set of pitch estimates. Secondly, the performance of the semi-supervised source separation process is compared with that of another semi-automatic algorithm. Finally, a listening test is conducted to assess the audio quality and naturalness of the separated sources when they are used to create stereo mixes from monaural recordings. Future directions for this research focus on the application of the proposed system to other music-related tasks. Also, a preliminary optimisation-based approach is presented as an alternative method for the separation of overlapping partials, and as a high resolution time-frequency representation for digital signals.

Item Type: Thesis (PhD)
Related URLs:
Keywords: Audio Source Separation, Multipitch Estimation, Iterative Separation of Note Events, Mono-to-Stereo Conversion, Time-Frequency Masking, Time-Domain Subtraction, Music Information Retrieval, Optimisation-based Separation of Overlapping Partials, Spectral Analysis and Filtering
Academic Units: The University of York > Electronics (York)
Depositing User: Mr Alejandro Delgado Castro
Date Deposited: 09 Dec 2019 12:22
Last Modified: 09 Dec 2019 12:22
URI: http://etheses.whiterose.ac.uk/id/eprint/25239

Please use the 'Request a copy' link(s) above to request this thesis. This will be sent directly to someone who may authorise access.
You can contact us about this thesis. If you need to make a general enquiry, please see the Contact us page.

Actions (repository staff only: login required)