Sound events separation and recognition using LiSparse complex nonnegative matrix factorization and multi-class mean supervector support vector machine

Parathai, Phetcharat, Tengtrairat, Naruephorn and Woo, Wai Lok (2018) Sound events separation and recognition using LiSparse complex nonnegative matrix factorization and multi-class mean supervector support vector machine. In: INCIT 2017 - 2nd International Conference on Information Technology, 2nd - 3rd November 2017, Nakhonpathom, Thailand.

Full text not available from this repository.
Official URL: http://dx.doi.org/10.1109/INCIT.2017.8257878

Abstract

This paper proposes a novel single-channel sound separation and event recognition method. First, the sound separation step is based on complex nonnegative matrix factorization (CMF) with probabilistically optimal L1 sparsity, which decomposes an information-bearing matrix into a two-dimensional convolution of factor matrices that represent the spectral basis and temporal code of the sources. The L1-sparsity CMF method can extract the recurrent patterns of magnitude spectra that underlie the observed complex spectra, together with the phase estimates of the constituent signals, thus enabling the features of the components to be extracted more efficiently. Second, the event recognition step is built using the multi-class mean supervector support vector machine (MS-SVM). The separated signal from the first step is segmented with a sliding window function, and features are extracted from each block. Three main features (zero-crossing rate, Mel frequency cepstral coefficients, and short-time energy) are investigated for classifying sound event signals into predefined classes. The mean supervector is encoded from the obtained features. The recognition performance of the multi-class MS-SVM method is examined by modeling with various features. The experimental results show the robustness and efficiency of the proposed method.
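
As a rough illustration of the recognition front end described in the abstract, the sketch below segments a signal with a sliding window, computes two of the named features (zero-crossing rate and short-time energy) for each block, and averages them into a single vector. It is a minimal sketch under stated assumptions, not the authors' implementation: the function names, the frame length, and the hop size are hypothetical, MFCCs are omitted, and plain averaging of per-block features is only one simple reading of "mean supervector", since the abstract does not specify the encoding.

```python
import numpy as np

def frame_signal(x, frame_len, hop):
    """Split a 1-D signal into overlapping blocks (sliding window)."""
    n_frames = 1 + (len(x) - frame_len) // hop
    return np.stack([x[i * hop : i * hop + frame_len] for i in range(n_frames)])

def zero_crossing_rate(frames):
    """Fraction of adjacent-sample pairs whose sign bit differs, per block."""
    signs = np.signbit(frames)
    return np.mean(signs[:, 1:] != signs[:, :-1], axis=1)

def short_time_energy(frames):
    """Mean squared amplitude of each block."""
    return np.mean(frames ** 2, axis=1)

def mean_supervector(x, frame_len=256, hop=128):
    """Average the per-block features into one fixed-length vector.

    Assumption: the supervector is the plain mean of the per-block
    feature vectors; frame_len and hop are illustrative values.
    """
    frames = frame_signal(np.asarray(x, dtype=float), frame_len, hop)
    feats = np.column_stack([zero_crossing_rate(frames),
                             short_time_energy(frames)])
    return feats.mean(axis=0)
```

A vector produced this way for each separated signal could then be passed to any multi-class SVM implementation for training and classification.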

Item Type: Conference or Workshop Item (Paper)
Uncontrolled Keywords: nonnegative matrix factorization, single-channel separation, sound event recognition, support vector machines
Subjects: G400 Computer Science
Department: Faculties > Engineering and Environment > Computer and Information Sciences
Depositing User: Paul Burns
Date Deposited: 29 Mar 2019 17:17
Last Modified: 10 Oct 2019 20:48
URI: http://nrl.northumbria.ac.uk/id/eprint/38654
