Sorry, you need to enable JavaScript to visit this website.

Detection of Mood Disorder Using Speech Emotion Profiles and LSTM

Citation Author(s):
Tsung-Hsien Yang, Kun-Yi Huang, and Ming-Hsiang Su
Submitted by:
Chung-Hsien Wu
Last updated:
14 October 2016 - 9:30pm
Document Type:
Presentation Slides
Document Year:
Chung-Hsien Wu
Paper Code:


In mood disorder diagnosis, bipolar disorder (BD) patients are often misdiagnosed as unipolar depression (UD) on initial presentation. It is crucial to establish an accurate distinction between BD and UD to make a correct and early diagnosis, leading to improvements in treatment and course of illness. To deal with this misdiagnosis problem, in this study, we experimented on eliciting subjects’ emotions by watching six eliciting emotional video clips. After watching each video clips, their speech responses were collected when they were interviewing with a clinician. In mood disorder detection, speech emotions play an import role to detect manic or depressive symptoms. Therefore, speech emotion profiles (EP) are obtained by using the support vector machine (SVM) which are built via speech features adapted from selected databases using a denoising autoencoder-based method. Finally, a Long Short-Term Memory (LSTM) recurrent neural network is employed to characterize the temporal information of the EPs with respect to six emotional videos. Comparative experiments clearly show the promising advantage and efficacy of the LSTM-based approach for mood disorder detection.

0 users have voted: