Remedying Sound Source Separation via Azimuth Discrimination and Re-synthesis

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Citations (Scopus)

Abstract

Commercially recorded music since the 1950s has been mixed down from many input sound sources to a two-channel reproduction of these sources. The effect of this approach is to assign sources to locations in a stereo field using a pan-position for each source. The Adress algorithm is a popular way of extracting individual music sound sources from a stereo mixture. A drawback of the Adress algorithm is that when time-frequency components in the stereo mixture are shared between two or more sources, calculating the inter-aural intensity scaling parameter for each source for that time-frequency component is challenging. We show how to obtain a good quality inverse of the pan-mixing process in the time-frequency components which are shared between different sources using a new method called Redress. We demonstrate that we can estimate how much of each source is active in time-frequency components which are shared between sources for two and three-source music mixtures. The consequence of this is that audible artefacts are not as prominent in the source estimates.

Original languageEnglish
Title of host publication2020 31st Irish Signals and Systems Conference, ISSC 2020
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781728194189
DOIs
Publication statusPublished - Jun 2020
Event31st Irish Signals and Systems Conference, ISSC 2020 - Letterkenny, Ireland
Duration: 11 Jun 202012 Jun 2020

Publication series

Name2020 31st Irish Signals and Systems Conference, ISSC 2020

Conference

Conference31st Irish Signals and Systems Conference, ISSC 2020
Country/TerritoryIreland
CityLetterkenny
Period11/06/2012/06/20

Keywords

  • Music Signal Processing
  • Source Separation
  • Time-Frequency

Fingerprint

Dive into the research topics of 'Remedying Sound Source Separation via Azimuth Discrimination and Re-synthesis'. Together they form a unique fingerprint.

Cite this