Adaptive Blind Audio Source Extraction Supervised by Dominant Speaker Identification using X-vectors

Citation Author(s):: Jiri Malek

Jansky, Malek, Cmejla, Kounovsky, Koldovsky, Zdansky
Submitted by:: Jiri Malek
Last updated:: 14 May 2020 - 3:35am
Document Type:: Presentation Slides
Document Year:: 2020
Event:: ICASSP 2020

Categories:: Source Separation and Signal Enhancement

We propose a novel algorithm for adaptive blind audio source extraction. The proposed method is based on independent vector analysis and utilizes the auxiliary function optimization to achieve high convergence speed. The algorithm is partially supervised by a pilot signal related to the source of interest (SOI), which ensures that the method correctly extracts the utterance of the desired speaker. The pilot is based on the identification of a dominant speaker in the mixture using x-vectors. The properties of the x-vectors computed in the presence of cross-talk are experimentally analyzed. The proposed approach is verified in a scenario with a moving SOI, static interfering speaker and environmental noise.

icassp2020_JanskyMalek_paper1967_final.pdf

icassp2020_JanskyMalek_paper1967_final.pdf (384)

Thumbs Up

CITE

Documents

Presentation Slides

Adaptive Blind Audio Source Extraction Supervised by Dominant Speaker Identification using X-vectors

icassp2020_JanskyMalek_paper1967_final.pdf

QUESTIONS?