Sorry, you need to enable JavaScript to visit this website.

facebooktwittermailshare

WASPAA 2019 POSTER: MULTIPLE HYPOTHESIS TRACKING FOR OVERLAPPING SPEAKER SEGMENTATION

Abstract: 

Speaker segmentation is an essential part of any diarization system.Applications of diarization include tasks such as speaker indexing, improving automatic speech recognition (ASR) performance and making single speaker-based algorithms available for use in multi-speaker environments.This paper proposes a multiple hypothesis tracking (MHT) method that exploits the harmonic structure associated with the pitch in voiced speech in order to segment the onsets and end-points of speech from multiple, overlapping speakers. The proposed method is evaluated against a segmentation system from the literature that uses a spectral representation and is based on employing bidirectional long short term memory networks (BLSTM). The proposed method is shown to achieve comparable performance for segmenting overlapping speakers only using the pitch harmonic information in the MHT framework.

up
0 users have voted:

Paper Details

Authors:
Aidan O. T. Hogg, Christine Evers, Patrick A. Naylor
Submitted On:
16 October 2019 - 7:03am
Short Link:
Type:
Poster
Event:
Presenter's Name:
Aidan Hogg
Paper Code:
1570546902
Document Year:
2019
Cite

Document Files

Poster___WASPAA_2019.pdf

(22)

Subscribe

[1] Aidan O. T. Hogg, Christine Evers, Patrick A. Naylor, "WASPAA 2019 POSTER: MULTIPLE HYPOTHESIS TRACKING FOR OVERLAPPING SPEAKER SEGMENTATION", IEEE SigPort, 2019. [Online]. Available: http://sigport.org/4874. Accessed: Nov. 11, 2019.
@article{4874-19,
url = {http://sigport.org/4874},
author = {Aidan O. T. Hogg; Christine Evers; Patrick A. Naylor },
publisher = {IEEE SigPort},
title = {WASPAA 2019 POSTER: MULTIPLE HYPOTHESIS TRACKING FOR OVERLAPPING SPEAKER SEGMENTATION},
year = {2019} }
TY - EJOUR
T1 - WASPAA 2019 POSTER: MULTIPLE HYPOTHESIS TRACKING FOR OVERLAPPING SPEAKER SEGMENTATION
AU - Aidan O. T. Hogg; Christine Evers; Patrick A. Naylor
PY - 2019
PB - IEEE SigPort
UR - http://sigport.org/4874
ER -
Aidan O. T. Hogg, Christine Evers, Patrick A. Naylor. (2019). WASPAA 2019 POSTER: MULTIPLE HYPOTHESIS TRACKING FOR OVERLAPPING SPEAKER SEGMENTATION. IEEE SigPort. http://sigport.org/4874
Aidan O. T. Hogg, Christine Evers, Patrick A. Naylor, 2019. WASPAA 2019 POSTER: MULTIPLE HYPOTHESIS TRACKING FOR OVERLAPPING SPEAKER SEGMENTATION. Available at: http://sigport.org/4874.
Aidan O. T. Hogg, Christine Evers, Patrick A. Naylor. (2019). "WASPAA 2019 POSTER: MULTIPLE HYPOTHESIS TRACKING FOR OVERLAPPING SPEAKER SEGMENTATION." Web.
1. Aidan O. T. Hogg, Christine Evers, Patrick A. Naylor. WASPAA 2019 POSTER: MULTIPLE HYPOTHESIS TRACKING FOR OVERLAPPING SPEAKER SEGMENTATION [Internet]. IEEE SigPort; 2019. Available from : http://sigport.org/4874