Benchmarking classification models for emotion recognition in natural speech: A multi-corporal study

Alexey Tarasov, Sarah Jane Delany

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

9 Citations (Scopus)

Abstract

A significant amount of the research on automatic emotion recognition from speech focuses on acted speech that is produced by professional actors. This approach often leads to overoptimistic results as the recognition of emotion in real-life conditions is more challenging due the propensity of mixed and less intense emotions in natural speech. The paper presents an empirical study of the most widely used classifiers in the domain of emotion recognition from speech, across multiple non-acted emotional speech corpora. The results indicate that Support Vector Machines have the best performance and that they along with Multi-Layer Perceptron networks and k-nearest neighbour classifiers perform significantly better (using the appropriate statistical tests) than decision trees, Nave Bayes classifiers and Radial Basis Function networks.

Original languageEnglish
Title of host publication2011 IEEE International Conference on Automatic Face and Gesture Recognition and Workshops, FG 2011
Pages841-845
Number of pages5
DOIs
Publication statusPublished - 2011
Event2011 IEEE International Conference on Automatic Face and Gesture Recognition and Workshops, FG 2011 - Santa Barbara, CA, United States
Duration: 21 Mar 201125 Mar 2011

Publication series

Name2011 IEEE International Conference on Automatic Face and Gesture Recognition and Workshops, FG 2011

Conference

Conference2011 IEEE International Conference on Automatic Face and Gesture Recognition and Workshops, FG 2011
Country/TerritoryUnited States
CitySanta Barbara, CA
Period21/03/1125/03/11

Fingerprint

Dive into the research topics of 'Benchmarking classification models for emotion recognition in natural speech: A multi-corporal study'. Together they form a unique fingerprint.

Cite this