Sorry, you need to enable JavaScript to visit this website.

Information Maximized Variational Domain Adversarial Learning for Speaker Verification

Citation Author(s):
Youzhi Tu, Man-Wai Mak, Jen-Tzung Chien
Submitted by:
Man-Wai Mak
Last updated:
13 May 2020 - 10:02pm
Document Type:
Presentation Slides
Document Year:
2020
Event:
Presenters:
Man-Wai MAK
Paper Code:
5091
 

Domain mismatch is a common problem in speaker ver- ification. This paper proposes an information-maximized variational domain adversarial neural network (InfoVDANN) to reduce domain mismatch by incorporating an InfoVAE into domain adversarial training (DAT). DAT aims to pro- duce speaker discriminative and domain-invariant features. The InfoVAE has two roles. First, it performs variational regularization on the learned features so that they follow a Gaussian distribution, which is essential for the standard PLDA backend. Second, it preserves mutual information be- tween the features and the training set to extract extra speaker discriminative information. Experiments on both SRE16 and SRE18-CMN2 show that the InfoVDANN outperforms the recent VDANN, which suggests that increasing the mutual information between the latent features and input features enables the InfoVDANN to extract extra speaker information that is otherwise not possible.

up
0 users have voted: