SP-P5: Speaker Diarization & Identification

dklement_dvbx_slides

Read more about dklement_dvbx_slides
Log in to post comments

Bayesian HMM clustering of x-vector sequences (VBx) has become a widely adopted diarization baseline model in publications and challenges. It uses an HMM to model speaker turns, a generatively trained probabilistic linear discriminant analysis (PLDA) for speaker distribution modeling, and Bayesian inference to estimate the assignment of x-vectors to speakers. This paper presents a new framework for updating the VBx parameters using discriminative training, which directly optimizes a predefined loss.

DVBx-slides_fin.pdf

DVBx-slides_fin.pdf (788)

Categories:: Other

24 Views

Optimizing Bayesian HMM Based x-vector Clustering for theSecond DIHARD Speech Diarization Challenge

ICASSP2020_DIHARD_BHMM_Slides.pdf

ICASSP2020_DIHARD_BHMM_Slides.pdf (646)

Categories:: Speaker Recognition and Characterization (SPE-SPKR)

15 Views

Speaker Diarization with Session-level Speaker Embedding Refinement using Graph Neural Networks

Deep speaker embedding models have been commonly used as a building block for speaker diarization systems; however, the speaker embedding model is usually trained according to a global loss defined on the training data, which could be sub-optimal for distinguishing speakers locally in a specific meeting session. In this work we present the first use of graph neural networks (GNNs) for the speaker diarization problem, utilizing a GNN to refine speaker embeddings locally using the structural information between speech segments inside each session.

icassp2020_slides.pdf

Slides for the paper "Speaker Diarization with Session-level Speaker Embedding Refinement using Graph Neural Networks" (674)

Categories:: Speaker Recognition and Characterization (SPE-SPKR)

40 Views

MULTISTREAM DIARIZATION FUSION USING THE MINIMUM VARIANCE BAYESIAN INFORMATION CRITERION

Poster_ICASSP_2018.pdf

Poster_ICASSP_2018.pdf (539)

Categories:: Speaker Recognition and Characterization (SPE-SPKR)

8 Views