Sorry, you need to enable JavaScript to visit this website.

facebooktwittermailshare

Leveraging LSTM Models for Overlap Detection in Multi-Party Meetings

Abstract: 

The detection of overlapping speech segments is of key importance in speech applications involving analysis of multi-party conversations. The detection problem is challenging because overlapping speech segments are typically captured as short speech utterances far-field microphone recordings. In this paper, we propose detection of overlap segments using a neural network architecture consisting of long-short term memory (LSTM) models. The neural network architecture learns the presence of overlap in speech by identifying the spectrotemporal structure of overlapping speech segments. In order to evaluate the model performance, we perform experiments on simulated overlapped speech generated from the TIMIT database, and natural multi-talker conversational speech in the augmented multi-party interaction (AMI) meeting corpus. The proposed approach yields improvements over a Gaussian mixture model based overlap detection system. Furthermore, as an application of overlap detection, integration of overlap detection into speaker diarization task is shown to give improvement in diarization error rate.

up
0 users have voted:

Paper Details

Authors:
Neeraj Sajjan, Shobhana Ganesh, Neeraj Sharma, Sriram Ganapathy, Neville Ryant
Submitted On:
14 April 2018 - 2:54am
Short Link:
Type:
Poster
Event:
Presenter's Name:
SRIRAM GANAPATHY
Paper Code:
4021
Document Year:
2018
Cite

Document Files

neeraj_conference_poster.pdf

(38 downloads)

Subscribe

[1] Neeraj Sajjan, Shobhana Ganesh, Neeraj Sharma, Sriram Ganapathy, Neville Ryant, "Leveraging LSTM Models for Overlap Detection in Multi-Party Meetings", IEEE SigPort, 2018. [Online]. Available: http://sigport.org/2803. Accessed: Jul. 20, 2018.
@article{2803-18,
url = {http://sigport.org/2803},
author = {Neeraj Sajjan; Shobhana Ganesh; Neeraj Sharma; Sriram Ganapathy; Neville Ryant },
publisher = {IEEE SigPort},
title = {Leveraging LSTM Models for Overlap Detection in Multi-Party Meetings},
year = {2018} }
TY - EJOUR
T1 - Leveraging LSTM Models for Overlap Detection in Multi-Party Meetings
AU - Neeraj Sajjan; Shobhana Ganesh; Neeraj Sharma; Sriram Ganapathy; Neville Ryant
PY - 2018
PB - IEEE SigPort
UR - http://sigport.org/2803
ER -
Neeraj Sajjan, Shobhana Ganesh, Neeraj Sharma, Sriram Ganapathy, Neville Ryant. (2018). Leveraging LSTM Models for Overlap Detection in Multi-Party Meetings. IEEE SigPort. http://sigport.org/2803
Neeraj Sajjan, Shobhana Ganesh, Neeraj Sharma, Sriram Ganapathy, Neville Ryant, 2018. Leveraging LSTM Models for Overlap Detection in Multi-Party Meetings. Available at: http://sigport.org/2803.
Neeraj Sajjan, Shobhana Ganesh, Neeraj Sharma, Sriram Ganapathy, Neville Ryant. (2018). "Leveraging LSTM Models for Overlap Detection in Multi-Party Meetings." Web.
1. Neeraj Sajjan, Shobhana Ganesh, Neeraj Sharma, Sriram Ganapathy, Neville Ryant. Leveraging LSTM Models for Overlap Detection in Multi-Party Meetings [Internet]. IEEE SigPort; 2018. Available from : http://sigport.org/2803