Documents
Poster
RECURRENT CONVOLUTIONAL NEURAL NETWORK FOR SPEECH PROCESSING
- Citation Author(s):
- Submitted by:
- Xiaolin Hu
- Last updated:
- 5 March 2017 - 10:18am
- Document Type:
- Poster
- Document Year:
- 2017
- Event:
- Presenters:
- Xiaolin Hu
- Paper Code:
- 1332
- Categories:
- Log in to post comments
Different neural networks have exhibited excellent performance on various speech processing tasks, and they usually have specific advantages and disadvantages. We propose to use a recently developed deep learning model, recurrent convolutional neural network (RCNN), for speech processing, which inherits some merits of recurrent neural network (RNN) and convolutional neural network (CNN). The core module can be viewed as a convolutional layer embedded with an RNN, which enables the model to capture both temporal and frequency dependence in the spectrogram of the speech in an efficient way. The model is tested on speech corpus TIMIT for phoneme recognition and IEMOCAP for emotion recognition. Experimental results show that the model is competitive with previous methods in terms of accuracy and efficiency.