Sorry, you need to enable JavaScript to visit this website.

facebooktwittermailshare

Study on the Relation of Fundamental and Formant Frequencies for Affective Speech Synthesis

Abstract: 

Directions into Velocities of Articulators (DIVA) model is a kind of self-adaptive neural network model which controls movements of a simulated vocal tract to produce words, syllables or phonemes. However, DIVA model lacks of emotion functions. To implement the emotion function in DIVA model, we investigate the process of affective speech production based on the combination of fundamental frequency (F0) and formant frequencies, as well as the relations between F0 and formants of emotional speech. The relations between F0 and formants of the speech with different emotions are investigated using the logistic regression (LR) models on the emotional databases. For a given emotion-related F0, the formants can be predicted correctly using the LR models. An affective speech synthesizer was constructed by implementing the relation of F0 and formants in an improved formant synthesis method. Experiments on affective speech synthesis were conducted on three different emotional speech datasets, and affective speech with negative or positive emotion can also be effectively synthesized from neutral speech.

up
0 users have voted:

Paper Details

Authors:
Bogu Li, Zhilei Liu, Jianwu Dang
Submitted On:
11 October 2016 - 12:11am
Short Link:
Type:
Poster
Event:
Presenter's Name:
Bogu Li
Paper Code:
ISCSLP2016-81
Document Year:
2016
Cite

Document Files

poster

(257 downloads)

Subscribe

[1] Bogu Li, Zhilei Liu, Jianwu Dang, "Study on the Relation of Fundamental and Formant Frequencies for Affective Speech Synthesis", IEEE SigPort, 2016. [Online]. Available: http://sigport.org/1155. Accessed: Sep. 20, 2018.
@article{1155-16,
url = {http://sigport.org/1155},
author = {Bogu Li; Zhilei Liu; Jianwu Dang },
publisher = {IEEE SigPort},
title = {Study on the Relation of Fundamental and Formant Frequencies for Affective Speech Synthesis},
year = {2016} }
TY - EJOUR
T1 - Study on the Relation of Fundamental and Formant Frequencies for Affective Speech Synthesis
AU - Bogu Li; Zhilei Liu; Jianwu Dang
PY - 2016
PB - IEEE SigPort
UR - http://sigport.org/1155
ER -
Bogu Li, Zhilei Liu, Jianwu Dang. (2016). Study on the Relation of Fundamental and Formant Frequencies for Affective Speech Synthesis. IEEE SigPort. http://sigport.org/1155
Bogu Li, Zhilei Liu, Jianwu Dang, 2016. Study on the Relation of Fundamental and Formant Frequencies for Affective Speech Synthesis. Available at: http://sigport.org/1155.
Bogu Li, Zhilei Liu, Jianwu Dang. (2016). "Study on the Relation of Fundamental and Formant Frequencies for Affective Speech Synthesis." Web.
1. Bogu Li, Zhilei Liu, Jianwu Dang. Study on the Relation of Fundamental and Formant Frequencies for Affective Speech Synthesis [Internet]. IEEE SigPort; 2016. Available from : http://sigport.org/1155