TY - GEN
T1 - Elevato-CDR
T2 - 9th International Conference on Frontiers of Signal Processing, ICFSP 2024
AU - Bagchi, Swarnadeep
AU - de Fréin, Ruairí
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Speech enhancement algorithms are needed for Assisted Living (AL) environments, which are characterized by large inter-microphone spacings, significant phase-wraparound, and high reverberation times (T60). We contribute Elevato-CDR, which addresses the task of enhancing speech given the large relative delays and high levels of reverberation, which are characteristic of AL scenarios. The purpose of this paper is to evaluate Elevato-CDR's performance and compare it with benchmark reverberation schemes, using speech mixtures which experience reverberation and background noise. Our findings indicate that Elevato-CDR successfully outperforms classical dereverberation algorithms in terms of subjective (MARS, PESQ) and objective (SDR, SRR) performance measures in environments where the T60 = 4.1 s and the inter-microphone spacing is up to 3.7 m. These findings demonstrate that combining a robust, big relative delay estimation technique, called the Elevatogram with a coherence-based reverberation attenuation method extends the useful range of operation of speech enhancement to AL environments.
AB - Speech enhancement algorithms are needed for Assisted Living (AL) environments, which are characterized by large inter-microphone spacings, significant phase-wraparound, and high reverberation times (T60). We contribute Elevato-CDR, which addresses the task of enhancing speech given the large relative delays and high levels of reverberation, which are characteristic of AL scenarios. The purpose of this paper is to evaluate Elevato-CDR's performance and compare it with benchmark reverberation schemes, using speech mixtures which experience reverberation and background noise. Our findings indicate that Elevato-CDR successfully outperforms classical dereverberation algorithms in terms of subjective (MARS, PESQ) and objective (SDR, SRR) performance measures in environments where the T60 = 4.1 s and the inter-microphone spacing is up to 3.7 m. These findings demonstrate that combining a robust, big relative delay estimation technique, called the Elevatogram with a coherence-based reverberation attenuation method extends the useful range of operation of speech enhancement to AL environments.
KW - Ambient Noise (AN)
KW - Assisted Living
KW - Big Delay
KW - Coherence-to-Diffuse Ratio
KW - Cross-Power Spectrum (CPS)
KW - Ideal Binary Mask (IBM)
KW - Interference Suppression (IS)
KW - phase-wraparound
KW - Source Enhancement
KW - Source-to-Distortion Ratio (SDR)
KW - Time Difference-of-Arrival
UR - https://www.scopus.com/pages/publications/85215667441
U2 - 10.1109/ICFSP62546.2024.10785349
DO - 10.1109/ICFSP62546.2024.10785349
M3 - Conference contribution
AN - SCOPUS:85215667441
T3 - 2024 9th International Conference on Frontiers of Signal Processing, ICFSP 2024
SP - 153
EP - 157
BT - 2024 9th International Conference on Frontiers of Signal Processing, ICFSP 2024
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 12 September 2024 through 14 September 2024
ER -