Streaming VR for immersion: Quality aspects of compressed spatial audio

Miroslaw Narbutt, Sean O'Leary, Andrew Allen, Jan Skoglund, Andrew Hines

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; peer-review

Abstract

Delivering a 360-degree soundscape that matches full-sphere visuals is an essential aspect of immersive VR. Ambisonics is a full-sphere surround sound technique that takes into account the azimuth and elevation of sound sources, portraying source location above and below as well as around the horizontal plane of the listener. In contrast to channel-based methods, the ambisonics representation offers the advantage of being independent of a specific loudspeaker set-up. Streaming ambisonics over networks requires efficient encoding techniques that compress the raw audio content without compromising quality of experience (QoE). This work investigates the effect of audio channel compression via the OPUS 1.2 codec on the quality of spatial audio as perceived by listeners. In particular, we evaluate the listening quality and localization accuracy of first-order ambisonic audio (FOA) and third-order ambisonic audio (HOA) compressed at various bitrates (32, 64 and 128 kbps for FOA; 128, 256 and 512 kbps for HOA). To assess the impact of OPUS compression on spatial audio, a number of subjective listening tests were carried out. The sample set for the tests comprises both recorded and synthetic audio clips with a wide range of time-frequency characteristics. To evaluate the localization accuracy of compressed audio, a number of fixed and dynamic (moving vertically and horizontally) source positions were selected for the test samples. The results show that for compressed spatial audio, perceived quality and localization accuracy are influenced more by compression scheme, bitrate and ambisonic order than by sample content. The insights provided by this work into the factors and parameters influencing QoE will guide future development of an objective spatial audio quality metric.

Original language: English
Title of host publication: Proceedings of the 2017 23rd International Conference on Virtual Systems and Multimedia, VSMM 2017
Editors: Alonzo Addison, Lizbeth Goodman
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 1-6
Number of pages: 6
ISBN (Electronic): 9781538644935
Publication status: Published - 24 Apr 2018
Event: 23rd International Conference on Virtual Systems and Multimedia, VSMM 2017 - Dublin, Ireland
Duration: 31 Oct 2017 to 4 Nov 2017

Publication series

Name: Proceedings of the 2017 23rd International Conference on Virtual Systems and Multimedia, VSMM 2017
Volume: 2018-January

Conference

Conference: 23rd International Conference on Virtual Systems and Multimedia, VSMM 2017
Country/Territory: Ireland
City: Dublin
Period: 31/10/17 to 4/11/17

Keywords

  • ambisonics
  • audio coding
  • audio compression
  • MUSHRA
  • opus codec
  • spatial audio
  • virtual reality
