The synchronized short-time-fourier-transform: Properties and definitions for multichannel source separation

Ruairi De Frein, Scott T. Rickard

Research output: Contribution to journalArticlepeer-review

Abstract

This paper proposes the use of a synchronized linear transform, the synchronized short-time-Fourier-transform (sSTFT), for time-frequency analysis of anechoic mixtures. We address the short comings of the commonly used time-frequency linear transform in multichannel settings, namely the classical short-time-Fourier-transform (cSTFT). We propose a series of desirable properties for the linear transform used in a multichannel source separation scenario: stationary invertibility, relative delay, relative attenuation, and finally delay invariant relative windowed-disjoint orthogonality (DIRWDO). Multisensor source separation techniques which operate in the time-frequency domain, have an inherent error unless consideration is given to the multichannel properties proposed in this paper. The sSTFT preserves these relationships for multichannel data. The crucial innovation of the sSTFT is to locally synchronize the analysis to the observations as opposed to a global clock. Improvement in separation performance can be achieved because assumed properties of the time-frequency transform are satisfied when it is appropriately synchronized. Numerical experiments show the sSTFT improves instantaneous subsample relative parameter estimation in low noise conditions and achieves good synthesis.

Original languageEnglish
Article number5605264
Pages (from-to)91-103
Number of pages13
JournalIEEE Transactions on Signal Processing
Volume59
Issue number1
DOIs
Publication statusPublished - 2011
Externally publishedYes

Keywords

  • Signal analysis
  • source separation

Fingerprint

Dive into the research topics of 'The synchronized short-time-fourier-transform: Properties and definitions for multichannel source separation'. Together they form a unique fingerprint.

Cite this