SPEECH RECOGNITION MODEL COMPRESSION

Abstract: 

Deep Neural Network-based speech recognition systems are widely used in speech processing applications. To achieve better robustness and accuracy, these networks are built with millions of parameters, which makes them storage- and compute-intensive. In this paper, we propose Bin & Quant (B&Q), a compression technique that reduces the size of the Deep Speech 2 speech recognition model by a factor of 7 with a negligible loss in accuracy. The algorithm is generally beneficial: it is also effective on two other speech recognition models and on the VGG16 model. We show empirically that Recurrent Neural Networks (RNNs) are the most sensitive to model parameter perturbation, followed by Convolutional Neural Networks (CNNs) and then fully connected (FC) networks. Using B&Q, we also show that parameters can be shared across layers rather than only within a single layer.
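
The abstract does not spell out how B&Q bins or quantizes weights, so the following is only a minimal sketch of the general idea of sharing parameters across layers through one quantization codebook. It assumes uniform bins over the pooled weight range, which may differ from the authors' scheme, and every name in it (`build_shared_codebook`, `quantize`, `dequantize`, `compression_ratio`, the toy `layers`) is hypothetical.

```python
import math
import random


def build_shared_codebook(layers, num_bins):
    """Build ONE set of uniform bins over the pooled weights of all layers.

    Assumption: uniform binning; this is an illustration, not the B&Q method.
    """
    pooled = [w for weights in layers.values() for w in weights]
    lo, hi = min(pooled), max(pooled)
    width = (hi - lo) / num_bins or 1.0  # guard against a zero-width range
    centers = [lo + (i + 0.5) * width for i in range(num_bins)]
    return lo, width, centers


def quantize(layers, lo, width, num_bins):
    """Replace every weight by the index of the shared bin it falls into."""
    return {
        name: [min(int((w - lo) / width), num_bins - 1) for w in weights]
        for name, weights in layers.items()
    }


def dequantize(indices, centers):
    """Recover approximate weights by looking up the shared bin centers."""
    return {name: [centers[i] for i in idx] for name, idx in indices.items()}


def compression_ratio(num_weights, num_bins, float_bits=32):
    """Original size divided by (bin indices + one shared codebook)."""
    index_bits = max(1, math.ceil(math.log2(num_bins)))
    compressed_bits = num_weights * index_bits + num_bins * float_bits
    return num_weights * float_bits / compressed_bits


# Toy "model": three layers with random weights (purely illustrative data).
random.seed(0)
layers = {
    "rnn": [random.gauss(0.0, 0.5) for _ in range(400)],
    "cnn": [random.gauss(0.0, 0.5) for _ in range(400)],
    "fc": [random.gauss(0.0, 0.5) for _ in range(200)],
}
NUM_BINS = 16

lo, width, centers = build_shared_codebook(layers, NUM_BINS)
indices = quantize(layers, lo, width, NUM_BINS)
restored = dequantize(indices, centers)

total = sum(len(w) for w in layers.values())
print("bins shared by all layers:", len(centers))
print("size ratio on toy data:", round(compression_ratio(total, NUM_BINS), 2))
```

Because the bins are built from the pooled weights, all layers index into the same codebook, so its storage cost is paid once rather than once per layer. The ratio printed here describes only the toy data and says nothing about the results reported for Deep Speech 2.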

Paper Details

Authors: Ahmed Tewfik, Raj Pawate
Submitted On: 25 May 2020 - 2:17pm
Short Link:
Type: Presentation Slides
Event:
Presenter's Name: Madhumitha Sakthi
Paper Code: 4186
Document Year: 2020

Document Files

ICASSP 2020 slides.pptx

[1] Ahmed Tewfik, Raj Pawate, "SPEECH RECOGNITION MODEL COMPRESSION", IEEE SigPort, 2020. [Online]. Available: http://sigport.org/5434. Accessed: Oct. 01, 2020.