Cochleagram-based audio pattern separation using two-dimensional non-negative matrix factorization with automatic sparsity adaptation

Gao, Bin, Woo, Wai Lok and Khor, L. C. (2014) Cochleagram-based audio pattern separation using two-dimensional non-negative matrix factorization with automatic sparsity adaptation. The Journal of the Acoustical Society of America, 135 (3). pp. 1171-1185. ISSN 0001-4966

Full text not available from this repository.

Official URL: http://dx.doi.org/10.1121/1.4864294

Abstract

An unsupervised single channel audio separation method from pattern recognition viewpoint is presented. The proposed method does not require training knowledge and the separation system is based on non-uniform time-frequency (TF) analysis and feature extraction. Unlike conventional research that concentrates on the use of spectrogram or its variants, the proposed separation algorithm uses an alternative TF representation based on the gammatone filterbank. In particular, the monaural mixed audio signal is shown to be considerably more separable in this non-uniform TF domain. The analysis of signal separability to verify this finding is provided. In addition, a variational Bayesian approach is derived to learn the sparsity parameters for optimizing the matrix factorization. Experimental tests have been conducted, which show that the extraction of the spectral dictionary and temporal codes is more efficient using sparsity learning and subsequently leads to better separation performance.

Item Type:	Article
Subjects:	G900 Others in Mathematical and Computing Sciences
Department:	Faculties > Engineering and Environment > Computer and Information Sciences
Depositing User:	Becky Skoyles
Date Deposited:	11 Apr 2019 11:54
Last Modified:	10 Oct 2019 20:04
URI:	http://nrl.northumbria.ac.uk/id/eprint/38938

Actions (login required)

View Item

Downloads

Downloads per month over past year

View more statistics