Sorry, you need to enable JavaScript to visit this website.

facebooktwittermailshare

FILTERBANK LEARNING USING CONVOLUTIONAL RESTRICTED BOLTZMANN MACHINE FOR SPEECH RECOGNITION

Examples of subband filters learned using ConvRBM: (a) filters in time-domain (i.e., impulse responses), (b) filters in frequency-domain (i.e., frequency responses).
Abstract: 

Convolutional Restricted Boltzmann Machine (ConvRBM) as a model for speech signal is presented in this paper. We have
developed ConvRBM with sampling from noisy rectified linear units (NReLUs). ConvRBM is trained in an unsupervised way to model speech signal of arbitrary lengths. Weights of the model can represent an auditory-like filterbank. Our
proposed learned filterbank is also nonlinear with respect to center frequencies of subband filters similar to standard filterbanks(such as Mel, Bark, ERB, etc.). We have used our proposed model as a front-end to learn features and applied to speech recognition task. Performance of ConvRBM features is improved compared to MFCC with relative improvement of 5% on TIMIT test set and 7% on WSJ0 database for both Nov’92 test sets using GMM-HMM systems. With DNNHMM systems, we achieved relative improvement of 3% on TIMIT test set over MFCC and Mel filterbank (FBANK). On WSJ0 Nov’92 test sets, we achieved relative improvement of 4-14% using ConvRBM features over MFCC features and 3.6-5.6% using ConvRBM filterbank over FBANK features.

poster.pdf

PDF icon poster.pdf (900)
Media Folder: 
up
0 users have voted:

Paper Details

Authors:
Hardik B. Sailor, Hemant A. Patil
Submitted On:
31 March 2016 - 4:04am
Short Link:
Type:
Poster
Event:
Presenter's Name:
Hardik Sailor
Paper Code:
2705
Document Year:
2016
Cite

Document Files

poster.pdf

(900)

Subscribe

[1] Hardik B. Sailor, Hemant A. Patil, "FILTERBANK LEARNING USING CONVOLUTIONAL RESTRICTED BOLTZMANN MACHINE FOR SPEECH RECOGNITION", IEEE SigPort, 2016. [Online]. Available: http://sigport.org/1075. Accessed: Jul. 18, 2019.
@article{1075-16,
url = {http://sigport.org/1075},
author = {Hardik B. Sailor; Hemant A. Patil },
publisher = {IEEE SigPort},
title = {FILTERBANK LEARNING USING CONVOLUTIONAL RESTRICTED BOLTZMANN MACHINE FOR SPEECH RECOGNITION},
year = {2016} }
TY - EJOUR
T1 - FILTERBANK LEARNING USING CONVOLUTIONAL RESTRICTED BOLTZMANN MACHINE FOR SPEECH RECOGNITION
AU - Hardik B. Sailor; Hemant A. Patil
PY - 2016
PB - IEEE SigPort
UR - http://sigport.org/1075
ER -
Hardik B. Sailor, Hemant A. Patil. (2016). FILTERBANK LEARNING USING CONVOLUTIONAL RESTRICTED BOLTZMANN MACHINE FOR SPEECH RECOGNITION. IEEE SigPort. http://sigport.org/1075
Hardik B. Sailor, Hemant A. Patil, 2016. FILTERBANK LEARNING USING CONVOLUTIONAL RESTRICTED BOLTZMANN MACHINE FOR SPEECH RECOGNITION. Available at: http://sigport.org/1075.
Hardik B. Sailor, Hemant A. Patil. (2016). "FILTERBANK LEARNING USING CONVOLUTIONAL RESTRICTED BOLTZMANN MACHINE FOR SPEECH RECOGNITION." Web.
1. Hardik B. Sailor, Hemant A. Patil. FILTERBANK LEARNING USING CONVOLUTIONAL RESTRICTED BOLTZMANN MACHINE FOR SPEECH RECOGNITION [Internet]. IEEE SigPort; 2016. Available from : http://sigport.org/1075