VISQOL: The Virtual Speech Quality Objective Listener

Andrew Hines, Jan Skoglund, Anil Kokaram, Naomi Harte

Research output: Contribution to conference › Paper › peer-review

Abstract

A model of human speech quality perception has been developed to provide an objective measure for predicting subjective quality assessments. The Virtual Speech Quality Objective Listener (ViSQOL) model is a signal-based, full-reference metric that uses a spectro-temporal measure of similarity between a reference and a test speech signal. This paper describes the algorithm and compares its results with those of PESQ for common problems in VoIP: clock drift, associated time warping and jitter. The results indicate that ViSQOL is less prone to underestimation of speech quality in both scenarios than the ITU standard.
Original language: English
Publication status: Published - 2012
Event: International Workshop on Acoustic Signal Enhancement - Aachen, Germany
Duration: 4 Sep 2012 to 6 Sep 2012

Conference

Conference: International Workshop on Acoustic Signal Enhancement
Country/Territory: Germany
City: Aachen
Period: 4/09/12 to 6/09/12

Keywords

  • human speech quality perception
  • objective measure
  • subjective quality assessments
  • Virtual Speech Quality Objective Listener
  • ViSQOL
  • signal based full reference metric
  • spectro-temporal measure
  • reference speech signal
  • test speech signal
  • algorithm
  • PESQ
  • VoIP
  • clock drift
  • time warping
  • jitter
  • ITU standard
