
Speaker Recognition and Characterization (SPE-SPKR)

Digit-dependent Local I-Vector for Text-Prompted Speaker Verification with


The widely adopted i-vector performs well in text-independent speaker verification with long speech duration.

Paper Details

Authors:
Peixin Chen, Wu Guo, Guoping Hu
Submitted On:
14 October 2016 - 10:24pm

Document Files

ISCSLP2016_PeixinChen.pdf


[1] Peixin Chen, Wu Guo, Guoping Hu, "Digit-dependent Local I-Vector for Text-Prompted Speaker Verification with", IEEE SigPort, 2016. [Online]. Available: http://sigport.org/1206. Accessed: Nov. 21, 2019.

A study of variational method for text-independent speaker recognition


The i-vector has become the state-of-the-art approach for text-independent recognition. Most related work treats i-vector extraction as a black box, relying on open-source software (e.g. Kaldi, Alize), and focuses on vector-based back-end algorithms such as length normalization, WCCN, or PLDA. In this paper, we study the variational method and present a concise derivation of the i-vector. Based on our proposed methods, three criteria for the derivation are compared: maximum likelihood (ML), maximum a posteriori (MAP) and maximum

Paper Details

Authors:
Liang He, Yao Tian, Yi Liu, Fang Dong, WeiQiang Zhang, Jia Liu
Submitted On:
13 October 2016 - 11:19pm

Document Files

The poster in ISCSLP2016


[1] Liang He, Yao Tian, Yi Liu, Fang Dong, WeiQiang Zhang, Jia Liu, "A study of variational method for text-independent speaker recognition", IEEE SigPort, 2016. [Online]. Available: http://sigport.org/1179. Accessed: Nov. 21, 2019.

First Investigation of Universal Speech Attributes for Speaker Verification


This paper addresses the application of universal speech attributes to speaker verification (SV). The manner and place of articulation form the universal attribute unit inventory, and a deep neural network (DNN) is used as the acoustic model.

Paper Details

Authors:
Sheng Zhang, Wu Guo, Guoping Hu
Submitted On:
13 October 2016 - 4:25am

Document Files

ISCSLP-张圣.pdf


[1] Sheng Zhang, Wu Guo, Guoping Hu, "First Investigation of Universal Speech Attributes for Speaker Verification", IEEE SigPort, 2016. [Online]. Available: http://sigport.org/1169. Accessed: Nov. 21, 2019.

Segment-oriented evaluation of speaker diarisation performance


High-performance diarisation is a necessity for a variety of applications, and the task has been
studied extensively in the context of broadcast news and meeting processing. When the task was
introduced in NIST-led evaluations, diarisation error rate (DER) was adopted as the standard metric
for evaluation, and it has been consistently used to compare systems ever since. DER is a frame-based
metric that does not penalise producing many short segments. However, practical systems

Paper Details

Authors:
Rosanna Milner, Thomas Hain
Submitted On:
23 March 2016 - 4:37am

Document Files

dia_scoring_poster.pdf


[1] Rosanna Milner, Thomas Hain, "Segment-oriented evaluation of speaker diarisation performance", IEEE SigPort, 2016. [Online]. Available: http://sigport.org/985. Accessed: Nov. 21, 2019.

Towards PLDA-RBM based Speaker Recognition in Mobile Environment: Designing Stacked/Deep PLDA-RBM Systems

Paper Details

Authors:
Andreas Nautsch, Hong Hao, Themos Stafylakis, Christian Rathgeb, Christoph Busch
Submitted On:
22 March 2016 - 8:12pm

Document Files

icassp16-slides.pdf


[1] Andreas Nautsch, Hong Hao, Themos Stafylakis, Christian Rathgeb, Christoph Busch, "Towards PLDA-RBM based Speaker Recognition in Mobile Environment: Designing Stacked/Deep PLDA-RBM Systems", IEEE SigPort, 2016. [Online]. Available: http://sigport.org/842. Accessed: Nov. 21, 2019.

Feature Mapping, Score-, and Feature-Level Fusion for Improved Normal and Whispered Speech Speaker Verification


What happens when a standard speaker verification system is tested with whispered speech?

In this paper, automatic speaker verification using normal and whispered speech is explored. Typically, for speaker verification systems with varying vocal effort inputs, standard solutions such as feature mapping or addition of data during parameter estimation (training) and enrollment stages result in a trade-off between accuracy gains with whispered test data and accuracy losses (up to 70% in equal error rate, EER) with normal test data. To overcome this shortcoming, this paper proposes two innovations.
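The equal error rate (EER) mentioned above is the operating point at which the miss rate and the false-alarm rate coincide. A minimal sketch of how it can be approximated from trial scores follows; the function name `equal_error_rate` and the threshold sweep over observed scores are illustrative assumptions, not the scoring procedure used in the paper.

```python
import numpy as np


def equal_error_rate(target_scores, nontarget_scores):
    """Approximate the EER by sweeping a threshold over all observed scores.

    Hypothetical helper for illustration; higher scores are assumed to mean
    "more likely the claimed speaker".
    """
    target = np.asarray(target_scores, dtype=float)
    nontarget = np.asarray(nontarget_scores, dtype=float)
    thresholds = np.sort(np.concatenate([target, nontarget]))

    # Miss rate: fraction of target trials scored below the threshold.
    miss = np.array([(target < t).mean() for t in thresholds])
    # False-alarm rate: fraction of non-target trials scored at or above it.
    false_alarm = np.array([(nontarget >= t).mean() for t in thresholds])

    # Pick the threshold where the two error rates are closest and average them.
    idx = np.argmin(np.abs(miss - false_alarm))
    return (miss[idx] + false_alarm[idx]) / 2.0
```

Computing this value separately for normal and whispered test trials would expose the kind of trade-off the abstract describes, where a gain on one condition comes with a loss on the other.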

Paper Details

Authors:
Milton Sarria-Paja, Mohammed Senoussaoui, Douglas O'Shaughnessy, Tiago H. Falk
Submitted On:
19 March 2016 - 6:18pm

Document Files

ICASSP2016.pdf


[1] Milton Sarria-Paja, Mohammed Senoussaoui, Douglas O'Shaughnessy, Tiago H. Falk, "Feature Mapping, Score-, and Feature-Level Fusion for Improved Normal and Whispered Speech Speaker Verification", IEEE SigPort, 2016. [Online]. Available: http://sigport.org/837. Accessed: Nov. 21, 2019.

Local variability modeling for text-independent speaker verification

Paper Details

Authors:
KongAik Lee, Bin Ma, Wu Guo, Haizhou Li, LiRong Dai
Submitted On:
23 February 2016 - 1:43pm

Document Files

Poster_Odyssey_landscape.pdf


[1] KongAik Lee, Bin Ma, Wu Guo, Haizhou Li, LiRong Dai, "Local variability modeling for text-independent speaker verification", IEEE SigPort, 2015. [Online]. Available: http://sigport.org/157. Accessed: Nov. 21, 2019.

Local Variability Vector for Text-Independent Speaker Verification (presentation)

Paper Details

Authors:
KongAik Lee, Bin Ma, Wu Guo, Haizhou Li, LiRong Dai
Submitted On:
23 February 2016 - 1:44pm

Document Files

Constrained Local Variability__ Modeling for Text Independent Speaker Verification.pdf


[1] KongAik Lee, Bin Ma, Wu Guo, Haizhou Li, LiRong Dai, "Local Variability Vector for Text-Independent Speaker Verification (presentation)", IEEE SigPort, 2015. [Online]. Available: http://sigport.org/156. Accessed: Nov. 21, 2019.

Minimum Divergence Estimation of Speaker Prior in Multi-session PLDA Scoring

Paper Details

Authors:
KongAik Lee, Bin Ma, Wu Guo, Haizhou Li, LiRong Dai
Submitted On:
23 February 2016 - 1:43pm

Document Files

ICASSP2014_ka.pdf


[1] KongAik Lee, Bin Ma, Wu Guo, Haizhou Li, LiRong Dai, "Minimum Divergence Estimation of Speaker Prior in Multi-session PLDA Scoring", IEEE SigPort, 2015. [Online]. Available: http://sigport.org/155. Accessed: Nov. 21, 2019.
