Abstract
A model of human speech quality perception has been developed to provide an objective measure for predicting subjective quality assessments. The Virtual Speech Quality Objective Listener (ViSQOL) model is a signal-based, full-reference metric that uses a spectro-temporal measure of similarity between a reference and a test speech signal. This paper describes the algorithm and compares its results with those of PESQ for two common problems in VoIP: clock drift with its associated time warping, and jitter. The results indicate that ViSQOL is less prone to underestimating speech quality in both scenarios than PESQ, the ITU standard.
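As a rough illustration of what a spectro-temporal similarity measure between a reference and a test signal can look like, the sketch below computes short-time magnitude spectrograms of both signals and scores them with a cosine similarity. This is not the ViSQOL algorithm, which the abstract does not detail. The function names `spectrogram` and `similarity`, the frame length, the hop size, the naive DFT, the cosine score, and the synthetic tones are all assumptions chosen to keep the example short and dependency-free.

```python
import cmath
import math


def spectrogram(signal, frame_len=64, hop=32):
    """Magnitude spectrogram (frames x frequency bins) from a Hann-windowed naive DFT."""
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / (frame_len - 1))
              for n in range(frame_len)]
    frames = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = [signal[start + n] * window[n] for n in range(frame_len)]
        mags = []
        for k in range(frame_len // 2 + 1):  # keep non-negative frequencies only
            acc = sum(frame[n] * cmath.exp(-2j * math.pi * k * n / frame_len)
                      for n in range(frame_len))
            mags.append(abs(acc))
        frames.append(mags)
    return frames


def similarity(ref_spec, test_spec):
    """Cosine similarity between two magnitude spectrograms (1.0 means identical shape)."""
    n_frames = min(len(ref_spec), len(test_spec))  # compare the overlapping frames
    ref = [v for frame in ref_spec[:n_frames] for v in frame]
    test = [v for frame in test_spec[:n_frames] for v in frame]
    dot = sum(r * t for r, t in zip(ref, test))
    norm = math.sqrt(sum(r * r for r in ref) * sum(t * t for t in test))
    return dot / norm if norm > 0 else 0.0


# Hypothetical signals: a 440 Hz tone as the reference, and the same tone
# with an added 1500 Hz component standing in for a degradation.
sample_rate = 8000
reference = [math.sin(2 * math.pi * 440 * n / sample_rate) for n in range(512)]
degraded = [x + 0.5 * math.sin(2 * math.pi * 1500 * n / sample_rate)
            for n, x in enumerate(reference)]

ref_spec = spectrogram(reference)
deg_spec = spectrogram(degraded)
score = similarity(ref_spec, deg_spec)
print(f"similarity to reference: {score:.3f}")
```

A full-reference metric of this kind needs both signals at scoring time. A plain frame-by-frame comparison like this one assumes the two signals are time-aligned, so it would be sensitive to exactly the clock drift and jitter effects the paper studies.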
Original language | English |
---|---|
Publication status | Published - 2012 |
Event | International Workshop on Acoustic Signal Enhancement, Aachen, Germany. Duration: 4 Sep 2012 → 6 Sep 2012 |
Conference
Conference | International Workshop on Acoustic Signal Enhancement |
---|---|
Country/Territory | Germany |
City | Aachen |
Period | 4/09/12 → 6/09/12 |
Keywords
- human speech quality perception
- objective measure
- subjective quality assessments
- Virtual Speech Quality Objective Listener
- ViSQOL
- signal based full reference metric
- spectro-temporal measure
- reference speech signal
- test speech signal
- algorithm
- PESQ
- VoIP
- clock drift
- time warping
- jitter
- ITU standard