Sorry, you need to enable JavaScript to visit this website.

facebooktwittermailshare

Frame-based Overlapping Speech Detection using Convolutional Neural Networks

Abstract: 

Naturalistic speech recordings usually contain speech signals from multiple speakers. This phenomenon can degrade the performance of speech technologies due to the complexity of tracing and recognizing individual speakers. In this study, we investigate the detection of overlapping speech on segments as short as 25 ms using Convolutional Neural Networks. We evaluate the detection performance using different spectral features, and show that pyknogram features outperforms other commonly used speech features. The proposed system can predict overlapping speech with an accuracy of 84% and Fscore of 88% on a dataset of mixed speech generated based on the GRID dataset.

up
0 users have voted:

Paper Details

Authors:
Midia Yousefi, Hohn H.L. Hansen
Submitted On:
15 May 2020 - 6:57pm
Short Link:
Type:
Presentation Slides
Event:
Presenter's Name:
Midia Yousefi
Paper Code:
4733
Document Year:
2020
Cite

Document Files

ICASSP2020-overlap-detection_MY-JH-Mar30-2020.pdf

(14)

Subscribe

[1] Midia Yousefi, Hohn H.L. Hansen, "Frame-based Overlapping Speech Detection using Convolutional Neural Networks", IEEE SigPort, 2020. [Online]. Available: http://sigport.org/5358. Accessed: Jul. 14, 2020.
@article{5358-20,
url = {http://sigport.org/5358},
author = {Midia Yousefi; Hohn H.L. Hansen },
publisher = {IEEE SigPort},
title = {Frame-based Overlapping Speech Detection using Convolutional Neural Networks},
year = {2020} }
TY - EJOUR
T1 - Frame-based Overlapping Speech Detection using Convolutional Neural Networks
AU - Midia Yousefi; Hohn H.L. Hansen
PY - 2020
PB - IEEE SigPort
UR - http://sigport.org/5358
ER -
Midia Yousefi, Hohn H.L. Hansen. (2020). Frame-based Overlapping Speech Detection using Convolutional Neural Networks. IEEE SigPort. http://sigport.org/5358
Midia Yousefi, Hohn H.L. Hansen, 2020. Frame-based Overlapping Speech Detection using Convolutional Neural Networks. Available at: http://sigport.org/5358.
Midia Yousefi, Hohn H.L. Hansen. (2020). "Frame-based Overlapping Speech Detection using Convolutional Neural Networks." Web.
1. Midia Yousefi, Hohn H.L. Hansen. Frame-based Overlapping Speech Detection using Convolutional Neural Networks [Internet]. IEEE SigPort; 2020. Available from : http://sigport.org/5358