Sorry, you need to enable JavaScript to visit this website.

RECURRENT CONVOLUTIONAL NEURAL NETWORK FOR SPEECH PROCESSING

Citation Author(s):
Yue Zhao, Xingyu Jin, Xiaolin Hu
Submitted by:
Xiaolin Hu
Last updated:
5 March 2017 - 10:18am
Document Type:
Poster
Document Year:
2017
Event:
Presenters:
Xiaolin Hu
Paper Code:
1332
 

Different neural networks have exhibited excellent performance on various speech processing tasks, and they usually have specific advantages and disadvantages. We propose to use a recently developed deep learning model, recurrent convolutional neural network (RCNN), for speech processing, which inherits some merits of recurrent neural network (RNN) and convolutional neural network (CNN). The core module can be viewed as a convolutional layer embedded with an RNN, which enables the model to capture both temporal and frequency dependence in the spectrogram of the speech in an efficient way. The model is tested on speech corpus TIMIT for phoneme recognition and IEMOCAP for emotion recognition. Experimental results show that the model is competitive with previous methods in terms of accuracy and efficiency.

up
0 users have voted: