Sorry, you need to enable JavaScript to visit this website.

ICASSP 2019

ICASSP is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The 2019 conference will feature world-class presentations by internationally renowned speakers, cutting-edge session topics and provide a fantastic opportunity to network with like-minded professionals from around the world. Visit website

HIERARCHY-AWARE LOSS FUNCTION ON A TREE STRUCTURED LABEL SPACE FOR AUDIO EVENT DETECTION

Paper Details

Authors:
Arindam Jati, Naveen Kumar, Ruxin Chen, Panayiotis Georgiou
Submitted On:
14 May 2019 - 7:11am
Short Link:
Type:
Event:
Presenter's Name:
Paper Code:
Document Year:
Cite

Document Files

aed_hal

(85)

Subscribe

[1] Arindam Jati, Naveen Kumar, Ruxin Chen, Panayiotis Georgiou, "HIERARCHY-AWARE LOSS FUNCTION ON A TREE STRUCTURED LABEL SPACE FOR AUDIO EVENT DETECTION", IEEE SigPort, 2019. [Online]. Available: http://sigport.org/4481. Accessed: May. 30, 2020.
@article{4481-19,
url = {http://sigport.org/4481},
author = {Arindam Jati; Naveen Kumar; Ruxin Chen; Panayiotis Georgiou },
publisher = {IEEE SigPort},
title = {HIERARCHY-AWARE LOSS FUNCTION ON A TREE STRUCTURED LABEL SPACE FOR AUDIO EVENT DETECTION},
year = {2019} }
TY - EJOUR
T1 - HIERARCHY-AWARE LOSS FUNCTION ON A TREE STRUCTURED LABEL SPACE FOR AUDIO EVENT DETECTION
AU - Arindam Jati; Naveen Kumar; Ruxin Chen; Panayiotis Georgiou
PY - 2019
PB - IEEE SigPort
UR - http://sigport.org/4481
ER -
Arindam Jati, Naveen Kumar, Ruxin Chen, Panayiotis Georgiou. (2019). HIERARCHY-AWARE LOSS FUNCTION ON A TREE STRUCTURED LABEL SPACE FOR AUDIO EVENT DETECTION. IEEE SigPort. http://sigport.org/4481
Arindam Jati, Naveen Kumar, Ruxin Chen, Panayiotis Georgiou, 2019. HIERARCHY-AWARE LOSS FUNCTION ON A TREE STRUCTURED LABEL SPACE FOR AUDIO EVENT DETECTION. Available at: http://sigport.org/4481.
Arindam Jati, Naveen Kumar, Ruxin Chen, Panayiotis Georgiou. (2019). "HIERARCHY-AWARE LOSS FUNCTION ON A TREE STRUCTURED LABEL SPACE FOR AUDIO EVENT DETECTION." Web.
1. Arindam Jati, Naveen Kumar, Ruxin Chen, Panayiotis Georgiou. HIERARCHY-AWARE LOSS FUNCTION ON A TREE STRUCTURED LABEL SPACE FOR AUDIO EVENT DETECTION [Internet]. IEEE SigPort; 2019. Available from : http://sigport.org/4481

The discrete cosine transform on triangles

Paper Details

Authors:
Knut Hüper
Submitted On:
13 May 2019 - 8:12am
Short Link:
Type:
Event:
Presenter's Name:
Paper Code:
Document Year:
Cite

Document Files

main.pdf

(76)

Subscribe

[1] Knut Hüper, "The discrete cosine transform on triangles", IEEE SigPort, 2019. [Online]. Available: http://sigport.org/4480. Accessed: May. 30, 2020.
@article{4480-19,
url = {http://sigport.org/4480},
author = {Knut Hüper },
publisher = {IEEE SigPort},
title = {The discrete cosine transform on triangles},
year = {2019} }
TY - EJOUR
T1 - The discrete cosine transform on triangles
AU - Knut Hüper
PY - 2019
PB - IEEE SigPort
UR - http://sigport.org/4480
ER -
Knut Hüper. (2019). The discrete cosine transform on triangles. IEEE SigPort. http://sigport.org/4480
Knut Hüper, 2019. The discrete cosine transform on triangles. Available at: http://sigport.org/4480.
Knut Hüper. (2019). "The discrete cosine transform on triangles." Web.
1. Knut Hüper. The discrete cosine transform on triangles [Internet]. IEEE SigPort; 2019. Available from : http://sigport.org/4480

DEEP SPEAKER REPRESENTATION USING ORTHOGONAL DECOMPOSITION AND RECOMBINATION FOR SPEAKER VERIFICATION


Speech signal contains intrinsic and extrinsic variations such as accent, emotion, dialect, phoneme, speaking manner, noise, music, and reverberation. Some of these variations are unnecessary and are unspecified factors of variation. These factors lead to increased variability in speaker representation. In this paper, we assume that unspecified factors of variation exist in speaker representations, and we attempt to minimize variability in speaker representation.

Paper Details

Authors:
Insoo Kim, Kyuhong Kim, Jiwhan Kim, Changkyu Choi
Submitted On:
13 May 2019 - 2:29am
Short Link:
Type:
Event:
Presenter's Name:
Paper Code:
Document Year:
Cite

Document Files

Poster_InsooKim.pdf

(140)

Subscribe

[1] Insoo Kim, Kyuhong Kim, Jiwhan Kim, Changkyu Choi, "DEEP SPEAKER REPRESENTATION USING ORTHOGONAL DECOMPOSITION AND RECOMBINATION FOR SPEAKER VERIFICATION", IEEE SigPort, 2019. [Online]. Available: http://sigport.org/4477. Accessed: May. 30, 2020.
@article{4477-19,
url = {http://sigport.org/4477},
author = {Insoo Kim; Kyuhong Kim; Jiwhan Kim; Changkyu Choi },
publisher = {IEEE SigPort},
title = {DEEP SPEAKER REPRESENTATION USING ORTHOGONAL DECOMPOSITION AND RECOMBINATION FOR SPEAKER VERIFICATION},
year = {2019} }
TY - EJOUR
T1 - DEEP SPEAKER REPRESENTATION USING ORTHOGONAL DECOMPOSITION AND RECOMBINATION FOR SPEAKER VERIFICATION
AU - Insoo Kim; Kyuhong Kim; Jiwhan Kim; Changkyu Choi
PY - 2019
PB - IEEE SigPort
UR - http://sigport.org/4477
ER -
Insoo Kim, Kyuhong Kim, Jiwhan Kim, Changkyu Choi. (2019). DEEP SPEAKER REPRESENTATION USING ORTHOGONAL DECOMPOSITION AND RECOMBINATION FOR SPEAKER VERIFICATION. IEEE SigPort. http://sigport.org/4477
Insoo Kim, Kyuhong Kim, Jiwhan Kim, Changkyu Choi, 2019. DEEP SPEAKER REPRESENTATION USING ORTHOGONAL DECOMPOSITION AND RECOMBINATION FOR SPEAKER VERIFICATION. Available at: http://sigport.org/4477.
Insoo Kim, Kyuhong Kim, Jiwhan Kim, Changkyu Choi. (2019). "DEEP SPEAKER REPRESENTATION USING ORTHOGONAL DECOMPOSITION AND RECOMBINATION FOR SPEAKER VERIFICATION." Web.
1. Insoo Kim, Kyuhong Kim, Jiwhan Kim, Changkyu Choi. DEEP SPEAKER REPRESENTATION USING ORTHOGONAL DECOMPOSITION AND RECOMBINATION FOR SPEAKER VERIFICATION [Internet]. IEEE SigPort; 2019. Available from : http://sigport.org/4477

Adversarial Speaker Adaptation


We propose a novel adversarial speaker adaptation (ASA) scheme, in which adversarial learning is applied to regularize the distribution of deep hidden features in a speaker-dependent (SD) deep neural network (DNN) acoustic model to be close to that of a fixed speaker-independent (SI) DNN acoustic model during adaptation. An additional discriminator network is introduced to distinguish the deep features generated by the SD model from those produced by the SI model.

Paper Details

Authors:
Zhong Meng, Jinyu Li, Yifan Gong
Submitted On:
12 May 2019 - 9:26pm
Short Link:
Type:
Event:
Presenter's Name:
Paper Code:
Document Year:
Cite

Document Files

asa_oral_v3.pptx

(137)

Subscribe

[1] Zhong Meng, Jinyu Li, Yifan Gong, "Adversarial Speaker Adaptation", IEEE SigPort, 2019. [Online]. Available: http://sigport.org/4475. Accessed: May. 30, 2020.
@article{4475-19,
url = {http://sigport.org/4475},
author = {Zhong Meng; Jinyu Li; Yifan Gong },
publisher = {IEEE SigPort},
title = {Adversarial Speaker Adaptation},
year = {2019} }
TY - EJOUR
T1 - Adversarial Speaker Adaptation
AU - Zhong Meng; Jinyu Li; Yifan Gong
PY - 2019
PB - IEEE SigPort
UR - http://sigport.org/4475
ER -
Zhong Meng, Jinyu Li, Yifan Gong. (2019). Adversarial Speaker Adaptation. IEEE SigPort. http://sigport.org/4475
Zhong Meng, Jinyu Li, Yifan Gong, 2019. Adversarial Speaker Adaptation. Available at: http://sigport.org/4475.
Zhong Meng, Jinyu Li, Yifan Gong. (2019). "Adversarial Speaker Adaptation." Web.
1. Zhong Meng, Jinyu Li, Yifan Gong. Adversarial Speaker Adaptation [Internet]. IEEE SigPort; 2019. Available from : http://sigport.org/4475

Attentive Adversarial Learning for Domain-Invariant Training


Adversarial domain-invariant training (ADIT) proves to be effective in suppressing the effects of domain variability in acoustic modeling and has led to improved performance in automatic speech recognition (ASR). In ADIT, an auxiliary domain classifier takes in equally-weighted deep features from a deep neural network (DNN) acoustic model and is trained to improve their domain-invariance by optimizing an adversarial loss function.

Paper Details

Authors:
Zhong Meng, Jinyu Li, Yifan Gong
Submitted On:
12 May 2019 - 9:03pm
Short Link:
Type:
Event:
Presenter's Name:
Paper Code:
Document Year:
Cite

Document Files

aadit_poster.pptx

(102)

Subscribe

[1] Zhong Meng, Jinyu Li, Yifan Gong, "Attentive Adversarial Learning for Domain-Invariant Training", IEEE SigPort, 2019. [Online]. Available: http://sigport.org/4474. Accessed: May. 30, 2020.
@article{4474-19,
url = {http://sigport.org/4474},
author = {Zhong Meng; Jinyu Li; Yifan Gong },
publisher = {IEEE SigPort},
title = {Attentive Adversarial Learning for Domain-Invariant Training},
year = {2019} }
TY - EJOUR
T1 - Attentive Adversarial Learning for Domain-Invariant Training
AU - Zhong Meng; Jinyu Li; Yifan Gong
PY - 2019
PB - IEEE SigPort
UR - http://sigport.org/4474
ER -
Zhong Meng, Jinyu Li, Yifan Gong. (2019). Attentive Adversarial Learning for Domain-Invariant Training. IEEE SigPort. http://sigport.org/4474
Zhong Meng, Jinyu Li, Yifan Gong, 2019. Attentive Adversarial Learning for Domain-Invariant Training. Available at: http://sigport.org/4474.
Zhong Meng, Jinyu Li, Yifan Gong. (2019). "Attentive Adversarial Learning for Domain-Invariant Training." Web.
1. Zhong Meng, Jinyu Li, Yifan Gong. Attentive Adversarial Learning for Domain-Invariant Training [Internet]. IEEE SigPort; 2019. Available from : http://sigport.org/4474

Adversarial Speaker Verification


The use of deep networks to extract embeddings for speaker recognition has proven successfully. However, such embeddings are susceptible to performance degradation due to the mismatches among the training, enrollment, and test conditions. In this work, we propose an adversarial speaker verification (ASV) scheme to learn the condition-invariant deep embedding via adversarial multi-task training. In ASV, a speaker classification network and a condition identification network are jointly optimized to minimize the speaker classification loss and simultaneously mini-maximize the condition loss.

Paper Details

Authors:
Zhong Meng, Yong Zhao, Jinyu Li, Yifan Gong
Submitted On:
12 May 2019 - 9:24pm
Short Link:
Type:
Event:
Presenter's Name:
Paper Code:
Document Year:
Cite

Document Files

asv_poster_v3.pptx

(108)

Subscribe

[1] Zhong Meng, Yong Zhao, Jinyu Li, Yifan Gong, "Adversarial Speaker Verification", IEEE SigPort, 2019. [Online]. Available: http://sigport.org/4473. Accessed: May. 30, 2020.
@article{4473-19,
url = {http://sigport.org/4473},
author = {Zhong Meng; Yong Zhao; Jinyu Li; Yifan Gong },
publisher = {IEEE SigPort},
title = {Adversarial Speaker Verification},
year = {2019} }
TY - EJOUR
T1 - Adversarial Speaker Verification
AU - Zhong Meng; Yong Zhao; Jinyu Li; Yifan Gong
PY - 2019
PB - IEEE SigPort
UR - http://sigport.org/4473
ER -
Zhong Meng, Yong Zhao, Jinyu Li, Yifan Gong. (2019). Adversarial Speaker Verification. IEEE SigPort. http://sigport.org/4473
Zhong Meng, Yong Zhao, Jinyu Li, Yifan Gong, 2019. Adversarial Speaker Verification. Available at: http://sigport.org/4473.
Zhong Meng, Yong Zhao, Jinyu Li, Yifan Gong. (2019). "Adversarial Speaker Verification." Web.
1. Zhong Meng, Yong Zhao, Jinyu Li, Yifan Gong. Adversarial Speaker Verification [Internet]. IEEE SigPort; 2019. Available from : http://sigport.org/4473

Conditional Teacher-Student Learning


The teacher-student (T/S) learning has been shown to be effective for a variety of problems such as domain adaptation and model compression. One shortcoming of the T/S learning is that a teacher model, not always perfect, sporadically produces wrong guidance in form of posterior probabilities that misleads the student model towards a suboptimal performance.

Paper Details

Authors:
Zhong Meng, Jinyu Li, Yong Zhao, Yifan Gong
Submitted On:
12 May 2019 - 9:23pm
Short Link:
Type:
Event:
Presenter's Name:
Paper Code:
Document Year:
Cite

Document Files

cts_poster.pptx

(98)

Subscribe

[1] Zhong Meng, Jinyu Li, Yong Zhao, Yifan Gong, "Conditional Teacher-Student Learning", IEEE SigPort, 2019. [Online]. Available: http://sigport.org/4472. Accessed: May. 30, 2020.
@article{4472-19,
url = {http://sigport.org/4472},
author = {Zhong Meng; Jinyu Li; Yong Zhao; Yifan Gong },
publisher = {IEEE SigPort},
title = {Conditional Teacher-Student Learning},
year = {2019} }
TY - EJOUR
T1 - Conditional Teacher-Student Learning
AU - Zhong Meng; Jinyu Li; Yong Zhao; Yifan Gong
PY - 2019
PB - IEEE SigPort
UR - http://sigport.org/4472
ER -
Zhong Meng, Jinyu Li, Yong Zhao, Yifan Gong. (2019). Conditional Teacher-Student Learning. IEEE SigPort. http://sigport.org/4472
Zhong Meng, Jinyu Li, Yong Zhao, Yifan Gong, 2019. Conditional Teacher-Student Learning. Available at: http://sigport.org/4472.
Zhong Meng, Jinyu Li, Yong Zhao, Yifan Gong. (2019). "Conditional Teacher-Student Learning." Web.
1. Zhong Meng, Jinyu Li, Yong Zhao, Yifan Gong. Conditional Teacher-Student Learning [Internet]. IEEE SigPort; 2019. Available from : http://sigport.org/4472

QUALITY CONTROL OF VOICE RECORDINGS IN REMOTE PARKINSON'S DISEASE MONITORING USING THE INFINITE HIDDEN MARKOV MODEL


The performance of voice-based systems for remote monitoring of Parkinson’s disease is highly dependent on the degree of adherence of the recordings to the test protocols, which probe for specific symptoms. Identifying segments of the signal that adhere to the protocol assumptions is typically performed manually by experts. This process is costly, time consuming, and often infeasible for large-scale data sets. In this paper, we propose a method to automatically identify the segments of signals that violate the test protocol with a high accuracy.

Paper Details

Authors:
Amir Hossein Poorjam, Yordan P. Raykov, Reham Badawy, Jesper Rindom Jensen, Mads Græsbøll Christensen, Max A. Little
Submitted On:
12 May 2019 - 6:13pm
Short Link:
Type:
Event:
Presenter's Name:
Paper Code:
Document Year:
Cite

Document Files

Infinite hidden Markov model for quality control

(68)

Keywords

Additional Categories

Subscribe

[1] Amir Hossein Poorjam, Yordan P. Raykov, Reham Badawy, Jesper Rindom Jensen, Mads Græsbøll Christensen, Max A. Little, "QUALITY CONTROL OF VOICE RECORDINGS IN REMOTE PARKINSON'S DISEASE MONITORING USING THE INFINITE HIDDEN MARKOV MODEL", IEEE SigPort, 2019. [Online]. Available: http://sigport.org/4471. Accessed: May. 30, 2020.
@article{4471-19,
url = {http://sigport.org/4471},
author = {Amir Hossein Poorjam; Yordan P. Raykov; Reham Badawy; Jesper Rindom Jensen; Mads Græsbøll Christensen; Max A. Little },
publisher = {IEEE SigPort},
title = {QUALITY CONTROL OF VOICE RECORDINGS IN REMOTE PARKINSON'S DISEASE MONITORING USING THE INFINITE HIDDEN MARKOV MODEL},
year = {2019} }
TY - EJOUR
T1 - QUALITY CONTROL OF VOICE RECORDINGS IN REMOTE PARKINSON'S DISEASE MONITORING USING THE INFINITE HIDDEN MARKOV MODEL
AU - Amir Hossein Poorjam; Yordan P. Raykov; Reham Badawy; Jesper Rindom Jensen; Mads Græsbøll Christensen; Max A. Little
PY - 2019
PB - IEEE SigPort
UR - http://sigport.org/4471
ER -
Amir Hossein Poorjam, Yordan P. Raykov, Reham Badawy, Jesper Rindom Jensen, Mads Græsbøll Christensen, Max A. Little. (2019). QUALITY CONTROL OF VOICE RECORDINGS IN REMOTE PARKINSON'S DISEASE MONITORING USING THE INFINITE HIDDEN MARKOV MODEL. IEEE SigPort. http://sigport.org/4471
Amir Hossein Poorjam, Yordan P. Raykov, Reham Badawy, Jesper Rindom Jensen, Mads Græsbøll Christensen, Max A. Little, 2019. QUALITY CONTROL OF VOICE RECORDINGS IN REMOTE PARKINSON'S DISEASE MONITORING USING THE INFINITE HIDDEN MARKOV MODEL. Available at: http://sigport.org/4471.
Amir Hossein Poorjam, Yordan P. Raykov, Reham Badawy, Jesper Rindom Jensen, Mads Græsbøll Christensen, Max A. Little. (2019). "QUALITY CONTROL OF VOICE RECORDINGS IN REMOTE PARKINSON'S DISEASE MONITORING USING THE INFINITE HIDDEN MARKOV MODEL." Web.
1. Amir Hossein Poorjam, Yordan P. Raykov, Reham Badawy, Jesper Rindom Jensen, Mads Græsbøll Christensen, Max A. Little. QUALITY CONTROL OF VOICE RECORDINGS IN REMOTE PARKINSON'S DISEASE MONITORING USING THE INFINITE HIDDEN MARKOV MODEL [Internet]. IEEE SigPort; 2019. Available from : http://sigport.org/4471

ADVERSARIAL INPAINTING OF MEDICAL IMAGE MODALITIES

Paper Details

Authors:
Karim Armanious, Youssef Mecky, Sergios Gatidis, Bin Yang
Submitted On:
12 May 2019 - 4:02pm
Short Link:
Type:
Event:
Presenter's Name:
Paper Code:
Document Year:
Cite

Document Files

poster_icassp2019.pdf

(91)

Keywords

Subscribe

[1] Karim Armanious, Youssef Mecky, Sergios Gatidis, Bin Yang, "ADVERSARIAL INPAINTING OF MEDICAL IMAGE MODALITIES", IEEE SigPort, 2019. [Online]. Available: http://sigport.org/4470. Accessed: May. 30, 2020.
@article{4470-19,
url = {http://sigport.org/4470},
author = {Karim Armanious; Youssef Mecky; Sergios Gatidis; Bin Yang },
publisher = {IEEE SigPort},
title = {ADVERSARIAL INPAINTING OF MEDICAL IMAGE MODALITIES},
year = {2019} }
TY - EJOUR
T1 - ADVERSARIAL INPAINTING OF MEDICAL IMAGE MODALITIES
AU - Karim Armanious; Youssef Mecky; Sergios Gatidis; Bin Yang
PY - 2019
PB - IEEE SigPort
UR - http://sigport.org/4470
ER -
Karim Armanious, Youssef Mecky, Sergios Gatidis, Bin Yang. (2019). ADVERSARIAL INPAINTING OF MEDICAL IMAGE MODALITIES. IEEE SigPort. http://sigport.org/4470
Karim Armanious, Youssef Mecky, Sergios Gatidis, Bin Yang, 2019. ADVERSARIAL INPAINTING OF MEDICAL IMAGE MODALITIES. Available at: http://sigport.org/4470.
Karim Armanious, Youssef Mecky, Sergios Gatidis, Bin Yang. (2019). "ADVERSARIAL INPAINTING OF MEDICAL IMAGE MODALITIES." Web.
1. Karim Armanious, Youssef Mecky, Sergios Gatidis, Bin Yang. ADVERSARIAL INPAINTING OF MEDICAL IMAGE MODALITIES [Internet]. IEEE SigPort; 2019. Available from : http://sigport.org/4470

TENSOR MATCHED KRONECKER-STRUCTURED SUBSPACE DETECTION FOR MISSING INFORMATION


We consider the problem of detecting whether a tensor signal having many missing entities lies within a given low dimensional Kronecker-Structured (KS) subspace. This is a matched subspace detection problem. Tensor matched subspace detection problem is more challenging because of the intertwined signal dimensions. We solve this problem by projecting the signal onto the KS subspace, which is a Kronecker product of different subspaces corresponding to each signal dimension. Under this framework, we define the KS subspaces and the orthogonal projection of the signal onto the KS subspace.

Paper Details

Authors:
Ishan Jindal, Matthew Nokleby
Submitted On:
12 May 2019 - 1:39pm
Short Link:
Type:
Event:
Presenter's Name:
Paper Code:
Document Year:
Cite

Document Files

Poster for the paper titled TENSOR MATCHED KRONECKER-STRUCTURED SUBSPACE DETECTION FOR MISSING INFORMATION

(69)

Subscribe

[1] Ishan Jindal, Matthew Nokleby, "TENSOR MATCHED KRONECKER-STRUCTURED SUBSPACE DETECTION FOR MISSING INFORMATION", IEEE SigPort, 2019. [Online]. Available: http://sigport.org/4468. Accessed: May. 30, 2020.
@article{4468-19,
url = {http://sigport.org/4468},
author = {Ishan Jindal; Matthew Nokleby },
publisher = {IEEE SigPort},
title = {TENSOR MATCHED KRONECKER-STRUCTURED SUBSPACE DETECTION FOR MISSING INFORMATION},
year = {2019} }
TY - EJOUR
T1 - TENSOR MATCHED KRONECKER-STRUCTURED SUBSPACE DETECTION FOR MISSING INFORMATION
AU - Ishan Jindal; Matthew Nokleby
PY - 2019
PB - IEEE SigPort
UR - http://sigport.org/4468
ER -
Ishan Jindal, Matthew Nokleby. (2019). TENSOR MATCHED KRONECKER-STRUCTURED SUBSPACE DETECTION FOR MISSING INFORMATION. IEEE SigPort. http://sigport.org/4468
Ishan Jindal, Matthew Nokleby, 2019. TENSOR MATCHED KRONECKER-STRUCTURED SUBSPACE DETECTION FOR MISSING INFORMATION. Available at: http://sigport.org/4468.
Ishan Jindal, Matthew Nokleby. (2019). "TENSOR MATCHED KRONECKER-STRUCTURED SUBSPACE DETECTION FOR MISSING INFORMATION." Web.
1. Ishan Jindal, Matthew Nokleby. TENSOR MATCHED KRONECKER-STRUCTURED SUBSPACE DETECTION FOR MISSING INFORMATION [Internet]. IEEE SigPort; 2019. Available from : http://sigport.org/4468

Pages