- Read more about DISENTANGLED SPEAKER EMBEDDING FOR ROBUST SPEAKER VERIFICATION
- Log in to post comments
Entanglement of speaker features and redundant features may lead to poor performance when evaluating speaker verification systems on an unseen domain. To address this issue, we propose an InfoMax domain separation and adaptation network (InfoMax–DSAN) to disentangle the domain-specific features and domain-invariant speaker features based on domain adaptation techniques. A frame-based mutual information neural estimator is proposed to maximize the mutual information between frame-level features and input acoustic features, which can help retain more useful information.
slides.pdf
- Categories:
- Read more about Robust speaker verification using Population-based Data Augmentation Poster
- Log in to post comments
- Categories:
- Read more about Robust speaker verification using Population-based Data Augmentation
- Log in to post comments
- Categories:
- Read more about "Self-Supervised Speaker Recognition Training using Human-Machine Dialogues" Presentation
- Log in to post comments
- Categories:
- Read more about ATTACK ON PRACTICAL SPEAKER VERIFICATION SYSTEM USING UNIVERSAL ADVERSARIAL PERTURBATIONS
- Log in to post comments
5375slide.pdf
- Categories:
State-of-the-art speaker verification systems take frame-level acoustics features as input and produce fixed-dimensional embeddings as utterance-level representations. Thus, how to aggregate information from frame-level features is vital for achieving high performance. This paper introduces short-time spectral pooling (STSP) for better aggregation of frame-level information. STSP transforms the temporal feature maps of a speaker embedding network into the spectral domain and extracts the lowest spectral components of the averaged spectrograms for aggregation.
- Categories:
- Read more about DeepTalk: Vocal Style Encoding for Speaker Recognition and Speech Synthesis
- 1 comment
- Log in to post comments
4229-poster.pdf
4229-slides.pdf
- Categories:
- Read more about I-VECTOR TRANSFORMATION USING K-NEAREST NEIGHBORS FOR SPEAKER VERIFICATION
- Log in to post comments
- Categories:
- Read more about Optimizing Bayesian HMM Based x-vector Clustering for theSecond DIHARD Speech Diarization Challenge
- Log in to post comments
- Categories: