Clustering NMF basis functions using shifted NMF for monaural sound source separation

Rajesh Jaiswal, Derry FitzGerald, Dan Barry, Eugene Coyle, Scott Rickard

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Non-negative Matrix Factorization (NMF) has found use in single channel separation of audio signals, as it gives a parts-based decomposition of audio spectrograms where the parts typically correspond to individual notes or chords. However, a notable shortcoming of NMF is the need to cluster the basis functions to their sources after decomposition. Despite recent improvements in algorithms for clustering the basis functions to sources, much work still remains to further improve these algorithms. To this end we present a novel clustering algorithm which overcomes some of the limitations of previous clustering methods. This involves the use of Shifted Nonnegative Matrix Factorization (SNMF) as a means of clustering the frequency basis functions obtained from NMF. Results show that this gives improved clustering of pitched basis functions over previous methods.

Original languageEnglish
Title of host publication2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011 - Proceedings
Pages245-248
Number of pages4
DOIs
Publication statusPublished - 2011
Event36th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011 - Prague, Czech Republic
Duration: 22 May 201127 May 2011

Publication series

NameICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
ISSN (Print)1520-6149

Conference

Conference36th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011
Country/TerritoryCzech Republic
CityPrague
Period22/05/1127/05/11

Keywords

  • Constant Q spectrogram
  • NMF basis functions
  • Shifted-NMF
  • Sound Source Separation

Fingerprint

Dive into the research topics of 'Clustering NMF basis functions using shifted NMF for monaural sound source separation'. Together they form a unique fingerprint.

Cite this