Speaker Verification

Self-supervised Speaker Verification with Adaptive Threshold and Hierarchical Training

Read more about Self-supervised Speaker Verification with Adaptive Threshold and Hierarchical Training
1 comment
Log in to post comments

This is a poster material of recent research accepted by IEEE ICASSP 2024.
Title: SELF-SUPERVISED SPEAKER VERIFICATION WITH ADAPTIVE THRESHOLD AND HIERARCHICAL TRAINING

For more inforamation, please check out the publication at IEEE Xplore:
https://ieeexplore.ieee.org/document/10448455

Self-supervised Speaker Verification with Adaptive Threshold and Hierarchical Training_POSTER.pdf

Self-supervised Speaker Verification with Adaptive Threshold and Hierarchical Training_POSTER.pdf (365)

Categories:: Spoken Language Processing

37 Views

TB-RESNET: BRIDGING THE GAP FROM TDNN TO RESNET IN AUTOMATIC SPEAKER VERIFICATION WITH TEMPORAL-BOTTLENECK ENHANCEMENT

This paper focuses on the transition of automatic speaker verification systems from time delay neural networks (TDNN) to ResNet-based networks. TDNN-based systems use a statistics pooling layer to aggregate temporal information which is suitable for two-dimensional tensors. Even though ResNet-based models produce three-dimensional tensors, they continue to incorporate the statistics pooling layer.

TB_RESNET_POSTER.pdf

TB_RESNET_POSTER.pdf (270)

Categories:: Speaker Recognition and Characterization (SPE-SPKR)

40 Views

Using LSF Features for Speaker Verification in Noise

Read more about Using LSF Features for Speaker Verification in Noise
Log in to post comments

An automatic, text-independent speaker verification (SV) system is proposed using Line Spectral Frequency (LSF) features. The state-of-the-art Gaussian Mixture Model with Universal Background Model (GMM-UBM) framework is used for speaker modeling and verification. A score-level fusion based technique is employed in order to extract complementary information from static and dynamic LSF features and improve the noise-robustness of the SV system. In addition, the speaker-discriminative power of different speech zones such as vowels, non-vowels, and transitions are investigated.

Presentation_Raman_Pujita.pdf

Presentation_Raman_Pujita.pdf (463)

24 Views