High quality time-scale modification of speech using a peak alignment overlap-add algorithm (PAOLA)

David Dorran, Robert Lawlor, Eugene Coyle

Research output: Contribution to journal › Conference article › peer-review

Abstract

The duration of a speech passage can be altered using audio time-scale modification techniques. Time-scale modification can be achieved in the time domain by segmenting the input signal into overlapping frames and recombining the frames with an overlap differing from the analysis overlap. We present a time-scale modification algorithm that uses a simple peak alignment technique to synchronize overlapping synthesis frames. The peak alignment overlap-add (PAOLA) algorithm also takes advantage of waveform properties to ensure a high quality output for the minimum number of iterations. The new algorithm produces a time-scaled output of approximately equal quality to that of an adaptive implementation of the commercially popular synchronised overlap-add (SOLA) algorithm, but offers a computational saving ranging from a factor of 15 (for a time-scale factor of 0.5) to 170 (for a time-scale factor of 1.1).
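The time-domain approach described in the abstract can be illustrated with a minimal sketch. This is not the published PAOLA algorithm, whose details the abstract does not give. It only shows the general idea of cutting overlapping frames from the input and overlap-adding them at a different spacing, with a simple peak-matching step to keep the overlapping waveforms in phase. The function name `time_scale`, its parameters (`alpha`, `frame_len`, `max_shift`), the linear cross-fade, and the convention that `alpha > 1` lengthens the signal are all assumptions made for illustration.

```python
import math


def time_scale(x, alpha, frame_len=400, max_shift=100):
    """Illustrative overlap-add time scaling with a naive peak-matching step.

    x         -- list of samples
    alpha     -- assumed convention: alpha > 1 lengthens, alpha < 1 shortens
    frame_len -- frame length in samples (hypothetical default)
    max_shift -- how far a frame may slide to line up a peak; should cover
                 at least one pitch period of the signal
    """
    overlap = frame_len // 2
    synth_hop = frame_len - overlap      # spacing of frames in the output
    analysis_hop = synth_hop / alpha     # spacing of frames in the input

    y = list(x[:frame_len])              # first frame is copied unchanged
    m = 1
    while True:
        nominal = int(round(m * analysis_hop))
        if nominal + max_shift + frame_len > len(x):
            break

        # Locate the largest peak in the output region to be overlapped.
        tail = y[-overlap:]
        p_out = max(range(overlap), key=lambda i: tail[i])

        # Slide the input frame by k samples so that a large input sample
        # lands on that same position in the overlap region.
        base = nominal + p_out
        k = max(range(max_shift + 1), key=lambda j: x[base + j])
        start = nominal + k
        frame = x[start:start + frame_len]

        # Linear cross-fade over the overlap, then append the remainder.
        for i in range(overlap):
            w = i / overlap
            y[-overlap + i] = (1.0 - w) * tail[i] + w * frame[i]
        y.extend(frame[overlap:])
        m += 1
    return y


# Usage: stretch a synthetic periodic signal (period of 100 samples).
signal = [math.sin(2.0 * math.pi * n / 100) for n in range(8000)]
stretched = time_scale(signal, 1.5)
print(len(signal), len(stretched))
```

Picking a single peak position costs one pass over the overlap region and one pass over the shift range per frame. A correlation-based search of the kind used in SOLA-style methods instead evaluates a full cross-correlation for every candidate shift, which is the sort of cost difference the abstract's computational-saving figures refer to.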

Original language: English
Pages (from-to): 700-703
Number of pages: 4
Journal: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Volume: 1
Publication status: Published - 2003
Event: 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing - Hong Kong, Hong Kong
Duration: 6 Apr 2003 to 10 Apr 2003
