

END-TO-END LANGUAGE RECOGNITION USING ATTENTION BASED HIERARCHICAL GATED RECURRENT UNIT MODELS

Abstract: 

Automatic language identification (LID) across multiple dialects of the same language family is a challenging problem, and it becomes harder still for short-duration audio snippets recorded in the presence of noise sources. In these scenarios, the identity of the language or dialect may be reliably present only in parts of the speech, embedded in the temporal sequence. Conventional approaches to LID (and to speaker recognition) ignore the sequence information: they extract a long-term statistical summary of the recording, assuming independence of the feature frames. In this paper, we propose an end-to-end neural network framework that utilizes short-sequence information for language recognition. A hierarchical gated recurrent unit (HGRU) model with an attention module is proposed for incorporating relevance in language recognition, where parts of the speech data are weighted more heavily according to their relevance to the task. Experiments are performed on the language recognition task of the NIST LRE 2017 Challenge using clean, noisy and multi-speaker speech data. In these experiments, the proposed approach yields significant improvements over conventional i-vector based language recognition approaches as well as over a previously proposed approach to language recognition using recurrent networks.
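
To make the contrast in the abstract concrete, the following is a minimal sketch of attention-weighted pooling next to the uniform averaging implied by a frame-independent statistical summary. It is not the authors' HGRU model: the recurrent layers are omitted, the frame vectors stand in for hidden states, and the scoring vector `w`, the function names, and the toy values are all hypothetical choices made for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(frames, w):
    """Score each frame with a dot product against w (an assumed, hypothetical
    scoring vector), normalise the scores with softmax, and return the
    weights together with the weighted sum of the frames."""
    scores = [sum(wi * xi for wi, xi in zip(w, f)) for f in frames]
    weights = softmax(scores)
    dim = len(frames[0])
    pooled = [sum(a * f[d] for a, f in zip(weights, frames)) for d in range(dim)]
    return weights, pooled


def mean_pool(frames):
    """Uniform average over frames: every frame counts equally."""
    dim = len(frames[0])
    return [sum(f[d] for f in frames) / len(frames) for d in range(dim)]


# Toy sequence (made-up values): only the third frame carries a strong cue.
frames = [[0.1, 0.0], [0.0, 0.1], [2.0, 1.0], [0.1, 0.1]]
w = [1.0, 1.0]  # hypothetical; in a trained model this would be learned

weights, pooled = attention_pool(frames, w)
average = mean_pool(frames)
```

In this toy case the third frame receives most of the attention weight, so the pooled vector stays close to it, whereas the uniform average dilutes it with the three uninformative frames.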


Paper Details

Authors:
Bharat Padi, Anand Mohan, Sriram Ganapathy
Submitted On:
10 May 2019 - 2:30am
Short Link:
Type:
Poster
Event:
Presenter's Name:
Bharat Kumar Padi
Paper Code:
SLP-P1.4
Document Year:
2019

Document Files

ICASSP19_3253_poster.pdf



[1] Bharat Padi, Anand Mohan, Sriram Ganapathy, "END-TO-END LANGUAGE RECOGNITION USING ATTENTION BASED HIERARCHICAL GATED RECURRENT UNIT MODELS", IEEE SigPort, 2019. [Online]. Available: http://sigport.org/4275. Accessed: Aug. 25, 2019.