Sorry, you need to enable JavaScript to visit this website.

facebooktwittermailshare

AN INVESTIGATION INTO INSTANTANEOUS FREQUENCY ESTIMATION METHODS FOR IMPROVED SPEECH RECOGNITION FEATURES

Abstract: 

There have been several studies, in the recent past, pointing to the
importance of analytic phase of the speech signal in human percep-
tion, especially in noisy conditions. However, phase information is
still not used in state-of-the-art speech recognition systems. In this
paper, we illustrate the importance of analytic phase of the speech
signal for automatic speech recognition. As the computation of ana-
lytic phase suffers from inevitable phase wrapping problem, we ex-
tract features from its time derivative, referred to as instantaneous
frequency (IF). In this work, we highlight the issues involved in IF
extraction from speech-like signals, and propose suitable modifica-
tions for IF extraction from speech signals. We used the deep neural
network (DNN) framework to build a speech recognition system us-
ing features extracted from the IF of speech signals. The speech
recognition system based on IF features delivered a phoneme er-
ror rate of 21.8% on TIMIT database, while the baseline system
based on mel-frequency cepstral coefficients (MFCCs) delivered a
phoneme error rate of 18.4%. The combination of IF and MFCC fea-
tures based systems, using minimum Bayes risk (MBR) decoding,
provided a relative improvement of 8.7% over the baseline system,
illustrating the significance of analytic phase for speech recognition.

up
0 users have voted:

Paper Details

Authors:
Saurabhchand Bhati
Submitted On:
11 November 2017 - 8:10am
Short Link:
Type:
Poster
Event:
Presenter's Name:
Shekhar Nayak
Paper Code:
GS-SIPA-P.1.5
Document Year:
2017
Cite

Document Files

Paper #1418

(61 downloads)

Subscribe

[1] Saurabhchand Bhati, "AN INVESTIGATION INTO INSTANTANEOUS FREQUENCY ESTIMATION METHODS FOR IMPROVED SPEECH RECOGNITION FEATURES", IEEE SigPort, 2017. [Online]. Available: http://sigport.org/2305. Accessed: Apr. 25, 2018.
@article{2305-17,
url = {http://sigport.org/2305},
author = {Saurabhchand Bhati },
publisher = {IEEE SigPort},
title = {AN INVESTIGATION INTO INSTANTANEOUS FREQUENCY ESTIMATION METHODS FOR IMPROVED SPEECH RECOGNITION FEATURES},
year = {2017} }
TY - EJOUR
T1 - AN INVESTIGATION INTO INSTANTANEOUS FREQUENCY ESTIMATION METHODS FOR IMPROVED SPEECH RECOGNITION FEATURES
AU - Saurabhchand Bhati
PY - 2017
PB - IEEE SigPort
UR - http://sigport.org/2305
ER -
Saurabhchand Bhati. (2017). AN INVESTIGATION INTO INSTANTANEOUS FREQUENCY ESTIMATION METHODS FOR IMPROVED SPEECH RECOGNITION FEATURES. IEEE SigPort. http://sigport.org/2305
Saurabhchand Bhati, 2017. AN INVESTIGATION INTO INSTANTANEOUS FREQUENCY ESTIMATION METHODS FOR IMPROVED SPEECH RECOGNITION FEATURES. Available at: http://sigport.org/2305.
Saurabhchand Bhati. (2017). "AN INVESTIGATION INTO INSTANTANEOUS FREQUENCY ESTIMATION METHODS FOR IMPROVED SPEECH RECOGNITION FEATURES." Web.
1. Saurabhchand Bhati. AN INVESTIGATION INTO INSTANTANEOUS FREQUENCY ESTIMATION METHODS FOR IMPROVED SPEECH RECOGNITION FEATURES [Internet]. IEEE SigPort; 2017. Available from : http://sigport.org/2305