
ICASSP 2020

ICASSP is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The ICASSP 2020 conference will feature world-class presentations by internationally renowned speakers and cutting-edge session topics, and will provide an opportunity to network with like-minded professionals from around the world.

PROSODIC BOUNDARY PREDICTION MODEL FOR VIETNAMESE TEXT-TO-SPEECH


A prosodic boundary is a crucial cue for prosodic phrasing. This research aims to build a prosodic boundary prediction model to improve the naturalness of Vietnamese speech synthesis. The model can be used directly to predict prosodic boundaries in the synthesis phase of statistical parametric speech synthesis (e.g. Hidden Markov Model, HMM; Deep Neural Network, DNN). It can also be used to improve the quality of the training phase in end-to-end speech synthesis (e.g. Tacotron).

Paper Details

Authors:
Nguyen Thi Thu Trang, Nguyen Hoang Ky, Albert Rilliard, Christophe D’Alessandro
Submitted On:
22 October 2020 - 8:15am

Document Files

ICASSP_2021_Prosodic_Boundary_Prediction.pdf


[1] Nguyen Thi Thu Trang, Nguyen Hoang Ky, Albert Rilliard, Christophe D’Alessandro, "PROSODIC BOUNDARY PREDICTION MODEL FOR VIETNAMESE TEXT-TO-SPEECH", IEEE SigPort, 2020. [Online]. Available: http://sigport.org/5466. Accessed: Oct. 29, 2020.

EFFECTS OF F0 ESTIMATION ALGORITHMS ON ULTRASOUND-BASED SILENT SPEECH INTERFACES


This paper presents recent progress on our Silent Speech Interface (SSI), which translates tongue motion into audible speech. In our previous work and in the current study, fundamental frequency (F0) was predicted from Ultrasound Tongue Images (UTI) using articulatory-to-acoustic mapping methods based on deep learning. Here we investigate several traditional discontinuous speech-based F0 estimation algorithms for use as the target of the UTI-based SSI system.

Paper Details

Submitted On:
21 October 2020 - 10:35am

Document Files

dai.pdf


[1] "EFFECTS OF F0 ESTIMATION ALGORITHMS ON ULTRASOUND-BASED SILENT SPEECH INTERFACES", IEEE SigPort, 2020. [Online]. Available: http://sigport.org/5465. Accessed: Oct. 29, 2020.

EFFECTS OF F0 ESTIMATION ALGORITHMS ON ULTRASOUND-BASED SILENT SPEECH INTERFACES


This paper presents recent progress on our Silent Speech Interface (SSI), which translates tongue motion into audible speech. In our previous work and in the current study, fundamental frequency (F0) was predicted from Ultrasound Tongue Images (UTI) using articulatory-to-acoustic mapping methods based on deep learning. Here we investigate several traditional discontinuous speech-based F0 estimation algorithms for use as the target of the UTI-based SSI system.


Paper Details

Submitted On:
21 October 2020 - 11:11am

Document Files

a.pdf


[1] "EFFECTS OF F0 ESTIMATION ALGORITHMS ON ULTRASOUND-BASED SILENT SPEECH INTERFACES", IEEE SigPort, 2020. [Online]. Available: http://sigport.org/5464. Accessed: Oct. 29, 2020.

EnerGAN: A Generative Adversarial Network for Energy Disaggregation


An efficient, appliance-level approach to energy disaggregation that exploits the benefits of Generative Adversarial Networks is presented. Adversarial training supports the creation of fine-tuned disaggregators, which produce more detailed load estimations for a specific appliance than state-of-the-art deep learning models. The Generator and Discriminator of the model are adapted to fit the particularities of the non-intrusive load monitoring (NILM) problem, and a Seeder component is added to provide compact encoded input vectors to the Generator.

Paper Details

Authors:
Maria Kaselimi, Athanasios Voulodimos, Eftychios Protopapadakis, Nikolaos Doulamis, Anastasios Doulamis
Submitted On:
27 June 2020 - 2:06pm

Document Files

EnerGAN_ICASSP_2020_final.pdf


[1] Maria Kaselimi, Athanasios Voulodimos, Eftychios Protopapadakis, Nikolaos Doulamis, Anastasios Doulamis, "EnerGAN: A Generative Adversarial Network for Energy Disaggregation", IEEE SigPort, 2020. [Online]. Available: http://sigport.org/5463. Accessed: Oct. 29, 2020.

Learning to Fool the Speaker Recognition (poster)


Due to the widespread deployment of fingerprint, face, and speaker recognition systems, attacks on deep-learning-based biometric systems have drawn increasing attention. Previous research mainly studied attacks on vision-based systems, such as fingerprint and face recognition. Attacks on speaker recognition have not yet been investigated, although speaker recognition is widely used in daily life.

Paper Details

Submitted On:
9 June 2020 - 11:26am

Document Files

preid_453146_learning_to_fool_the_speaker_recognition_ICASSP2020.pdf


[1] "Learning to Fool the Speaker Recognition (poster)", IEEE SigPort, 2020. [Online]. Available: http://sigport.org/5462. Accessed: Oct. 29, 2020.

dMazeRunner: Optimizing Convolutions on Dataflow Accelerators

Paper Details

Authors:
Shail Dave, Aviral Shrivastava, Youngbin Kim, Sasikanth Avancha, Kyoungwoo Lee
Submitted On:
7 June 2020 - 8:48pm

Document Files

Dave2020ICASSP.pdf


[1] Shail Dave, Aviral Shrivastava, Youngbin Kim, Sasikanth Avancha, Kyoungwoo Lee, "dMazeRunner: Optimizing Convolutions on Dataflow Accelerators", IEEE SigPort, 2020. [Online]. Available: http://sigport.org/5461. Accessed: Oct. 29, 2020.

dMazeRunner: Optimizing Convolutions on Dataflow Accelerators

Paper Details

Submitted On:
7 June 2020 - 8:31pm

Document Files

Dave2020ICASSP.pdf


[1] "dMazeRunner: Optimizing Convolutions on Dataflow Accelerators", IEEE SigPort, 2020. [Online]. Available: http://sigport.org/5460. Accessed: Oct. 29, 2020.

dMazeRunner: Optimizing Convolutions on Dataflow Accelerators

Paper Details

Submitted On:
7 June 2020 - 8:31pm

Document Files

Dave2020ICASSP.pdf


[1] "dMazeRunner: Optimizing Convolutions on Dataflow Accelerators", IEEE SigPort, 2020. [Online]. Available: http://sigport.org/5459. Accessed: Oct. 29, 2020.

Libri-Light: A Benchmark for ASR with Limited or No Supervision- ICASSP 2020 Slides

Paper Details

Authors:
Morgan Rivière, Weiyi Zheng, Evgeny Kharitonov, Qiantong Xu, Pierre-Emmanuel Mazaré, Julien Karadayi, Vitaliy Liptchinsky, Ronan Collobert, Christian Fuegen, Tatiana Likhomanenko, Gabriel Synnaeve, Armand Joulin, Abdelrahman Mohamed, Emmanuel Dupoux
Submitted On:
6 June 2020 - 10:30pm

Document Files

Libri-Light - A Benchmark for ASR with Limited or No Supervision -- ICASSP 2020.pdf


[1] Morgan Rivière, Weiyi Zheng, Evgeny Kharitonov, Qiantong Xu, Pierre-Emmanuel Mazaré, Julien Karadayi, Vitaliy Liptchinsky, Ronan Collobert, Christian Fuegen, Tatiana Likhomanenko, Gabriel Synnaeve, Armand Joulin, Abdelrahman Mohamed, Emmanuel Dupoux, "Libri-Light: A Benchmark for ASR with Limited or No Supervision- ICASSP 2020 Slides", IEEE SigPort, 2020. [Online]. Available: http://sigport.org/5458. Accessed: Oct. 29, 2020.

Self-Training for End-to-End Speech Recognition - ICASSP 2020 Slides

Paper Details

Authors:
Ann Lee, Awni Hannun
Submitted On:
6 June 2020 - 10:19pm

Document Files

Self-Training for End-to-End Speech Recognition - ICASSP 2020.pdf


[1] Ann Lee, Awni Hannun, "Self-Training for End-to-End Speech Recognition - ICASSP 2020 Slides", IEEE SigPort, 2020. [Online]. Available: http://sigport.org/5457. Accessed: Oct. 29, 2020.
