Language and Noise Transfer in Speech Enhancement Generative Adversarial Network

Abstract: 

Deep learning systems for speech enhancement usually require large amounts of training data to operate in broad conditions or real applications. This makes adapting those systems to new, low-resource environments an important topic. In this work, we present the results of adapting a speech enhancement generative adversarial network by fine-tuning the generator with small amounts of data. We investigate the minimum requirements for stable behavior in terms of several objective metrics in two very different languages: Catalan and Korean. We also study how test performance on unseen noise varies with the number of noise types available for training. Results show that adapting a pre-trained English model with 10 min of data already achieves performance comparable to that obtained with two orders of magnitude more data. They also show that test performance is relatively stable with respect to the number of training noise types.

Paper Details

Authors: Maruchan Park, Joan Serrà, Antonio Bonafonte, Kang-Hun Ahn
Submitted On: 19 April 2018 - 4:44pm
Short Link:
Type: Presentation Slides
Event:
Presenter's Name: Santiago Pascual
Paper Code: 4272
Document Year: 2018

Document Files

language-noise-transfer.pdf

[1] Maruchan Park, Joan Serrà, Antonio Bonafonte, Kang-Hun Ahn, "Language and Noise Transfer in Speech Enhancement Generative Adversarial Network", IEEE SigPort, 2018. [Online]. Available: http://sigport.org/3025. Accessed: Dec. 11, 2018.