Abstract
A model of human speech quality perception has been developed to provide an objective measure for predicting subjective quality assessments. The Virtual Speech Quality Objective Listener (ViSQOL) model is a signal based full reference metric that uses a spectro-temporal measure of similarity between a reference and a test speech signal. This paper describes the algorithm and compares the results with PESQ for common problems in VoIP: clock drift, associated time warping and jitter. The results indicate that ViSQOL is less prone to underestimation of speech quality in both scenarios than the ITU standard.
| Original language | English |
|---|---|
| DOIs | |
| Publication status | Published - 2012 |
| Event | International Workshop on Acoustic Signal Enhancement - Aachen, Germany Duration: 4 Sep 2012 → 6 Sep 2012 |
Conference
| Conference | International Workshop on Acoustic Signal Enhancement |
|---|---|
| Country/Territory | Germany |
| City | Aachen |
| Period | 4/09/12 → 6/09/12 |
Keywords
- human speech quality perception
- objective measure
- subjective quality assessments
- Virtual Speech Quality Objective Listener
- ViSQOL
- signal based full reference metric
- spectro-temporal measure
- reference speech signal
- test speech signal
- algorithm
- PESQ
- VoIP
- clock drift
- time warping
- jitter
- ITU standard