Elevato-CDR: Speech Enhancement in Large Delay and Reverberant Assisted Living Scenarios

Swarnadeep Bagchi, Ruairí de Fréin

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Speech enhancement algorithms are needed for Assisted Living (AL) environments, which are characterized by large inter-microphone spacings, significant phase-wraparound, and high reverberation times (T60). We contribute Elevato-CDR, which addresses the task of enhancing speech given the large relative delays and high levels of reverberation, which are characteristic of AL scenarios. The purpose of this paper is to evaluate Elevato-CDR's performance and compare it with benchmark reverberation schemes, using speech mixtures which experience reverberation and background noise. Our findings indicate that Elevato-CDR successfully outperforms classical dereverberation algorithms in terms of subjective (MARS, PESQ) and objective (SDR, SRR) performance measures in environments where the T60 = 4.1 s and the inter-microphone spacing is up to 3.7 m. These findings demonstrate that combining a robust, big relative delay estimation technique, called the Elevatogram with a coherence-based reverberation attenuation method extends the useful range of operation of speech enhancement to AL environments.

Original languageEnglish
Title of host publication2024 9th International Conference on Frontiers of Signal Processing, ICFSP 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages153-157
Number of pages5
ISBN (Electronic)9798350353235
DOIs
Publication statusPublished - 2024
Event9th International Conference on Frontiers of Signal Processing, ICFSP 2024 - Paris, France
Duration: 12 Sep 202414 Sep 2024

Publication series

Name2024 9th International Conference on Frontiers of Signal Processing, ICFSP 2024

Conference

Conference9th International Conference on Frontiers of Signal Processing, ICFSP 2024
Country/TerritoryFrance
CityParis
Period12/09/2414/09/24

Keywords

  • Ambient Noise (AN)
  • Assisted Living
  • Big Delay
  • Coherence-to-Diffuse Ratio
  • Cross-Power Spectrum (CPS)
  • Ideal Binary Mask (IBM)
  • Interference Suppression (IS)
  • phase-wraparound
  • Source Enhancement
  • Source-to-Distortion Ratio (SDR)
  • Time Difference-of-Arrival

Fingerprint

Dive into the research topics of 'Elevato-CDR: Speech Enhancement in Large Delay and Reverberant Assisted Living Scenarios'. Together they form a unique fingerprint.

Cite this