Abstract
Binary masking forms the basis for a number of source separation approaches that have been successfully applied to the problem of de-mixing music sources from a stereo recording. A well-known problem with binary masking is that, when music sources overlap in the time-frequency domain, only one of the overlapping sources can be assigned the energy in a particular time-frequency bin. To overcome this problem, we reformulate the classical pan-pot source separation problem for music sources as a non-negative quadratic program. This reformulation gives rise to an algorithm, called Redress, which extends the popular Adress algorithm. It works by defining an azimuth trajectory for each source based on its spatial position within the stereo field. Redress allows for the allocation of energy in one time-frequency bin to multiple sources. We present results that show that for music recordings Redress improves the SNR, SAR, and SDR in comparison to the Adress algorithm.
| Original language | English |
|---|---|
| Article number | 1373 |
| Pages (from-to) | 1-18 |
| Number of pages | 18 |
| Journal | Electronics (Switzerland) |
| Volume | 9 |
| Issue number | 9 |
| DOIs | |
| Publication status | Published - Sep 2020 |
Keywords
- Binary masking
- Music signal processing
- Source separation
- Time-frequency
Fingerprint
Dive into the research topics of 'Reformulating the binary masking approach of adress as soft masking'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver