Sorry, you need to enable JavaScript to visit this website.

facebooktwittermailshare

Automatic Speech Emotion Recognition Using Recurrent Neural Networks with Local Attention

Abstract: 

Automatic emotion recognition from speech is a challenging task which relies heavily on the effectiveness of the speech features used for classification. In this work, we study the use of deep learning to automatically discover emotionally relevant features from speech. It is shown that using a deep recurrent neural network, we can learn both the short-time frame-level acoustic features that are emotionally relevant, as well as an appropriate temporal aggregation of those features into a compact utterance-level representation. Moreover, we propose a novel strategy for feature pooling over time which uses local attention in order to focus on specific regions of a speech signal that are more emotionally salient. The proposed solution is evaluated on the IEMOCAP corpus, and is shown to provide more accurate predictions compared to existing emotion recognition algorithms.

up
0 users have voted:

Paper Details

Authors:
Emad Barsoum, Cha Zhang
Submitted On:
15 March 2017 - 12:33am
Short Link:
Type:
Research Manuscript
Event:
Presenter's Name:
Seyedmahdad Mirsamadi
Paper Code:
MLSP-L1.05
Document Year:
2017
Cite

Document Files

icassp2017.pptx

(80 downloads)

icassp2017.pdf

(156 downloads)

Subscribe

[1] Emad Barsoum, Cha Zhang, "Automatic Speech Emotion Recognition Using Recurrent Neural Networks with Local Attention", IEEE SigPort, 2017. [Online]. Available: http://sigport.org/1667. Accessed: Sep. 20, 2017.
@article{1667-17,
url = {http://sigport.org/1667},
author = {Emad Barsoum; Cha Zhang },
publisher = {IEEE SigPort},
title = {Automatic Speech Emotion Recognition Using Recurrent Neural Networks with Local Attention},
year = {2017} }
TY - EJOUR
T1 - Automatic Speech Emotion Recognition Using Recurrent Neural Networks with Local Attention
AU - Emad Barsoum; Cha Zhang
PY - 2017
PB - IEEE SigPort
UR - http://sigport.org/1667
ER -
Emad Barsoum, Cha Zhang. (2017). Automatic Speech Emotion Recognition Using Recurrent Neural Networks with Local Attention. IEEE SigPort. http://sigport.org/1667
Emad Barsoum, Cha Zhang, 2017. Automatic Speech Emotion Recognition Using Recurrent Neural Networks with Local Attention. Available at: http://sigport.org/1667.
Emad Barsoum, Cha Zhang. (2017). "Automatic Speech Emotion Recognition Using Recurrent Neural Networks with Local Attention." Web.
1. Emad Barsoum, Cha Zhang. Automatic Speech Emotion Recognition Using Recurrent Neural Networks with Local Attention [Internet]. IEEE SigPort; 2017. Available from : http://sigport.org/1667