Sorry, you need to enable JavaScript to visit this website.

facebooktwittermailshare

Dictionary Update for NMF-based Voice Conversion Using an Encoder-Decoder Network

Abstract: 

In this paper, we propose a dictionary update method for Nonnegative Matrix Factorization (NMF) with high dimensional data in a spectral conversion (SC) task. Voice conversion has been widely studied due to its potential applications such as personalized speech synthesis and speech enhancement. Exemplar-based NMF (ENMF) emerges as an effective and probably the simplest choice among all techniques for SC, as long as a source-target parallel speech corpus is given. ENMF-based SC systems usually need a large amount of bases (exemplars) to ensure the quality of the converted speech. However, a small and effective dictionary is desirable but hard to obtain via dictionary update, in particular when high-dimensional features such as STRAIGHT spectra are used. Therefore, we propose a dictionary update framework for NMF by means of an encoder-decoder reformulation. Regarding NMF as an encoder-decoder network makes it possible to exploit the whole parallel corpus more effectively and efficiently when applied to SC. Our experiments demonstrate significant gains of the proposed system with small dictionaries over conventional ENMF-based systems with dictionaries of same or much larger size.

up
0 users have voted:

Paper Details

Authors:
Chin-Cheng Hsu, Hsin-Te Hwang, Yi-Chiao Wu, Yu Tsao, and Hsin-Min Wang
Submitted On:
13 October 2016 - 4:15am
Short Link:
Type:
Presentation Slides
Event:
Presenter's Name:
Chin-Cheng Hsu
Document Year:
2016
Cite

Document Files

2016-10-20-ISCSLP-v1.0-SigPort.pptx

(351)

Subscribe

[1] Chin-Cheng Hsu, Hsin-Te Hwang, Yi-Chiao Wu, Yu Tsao, and Hsin-Min Wang, "Dictionary Update for NMF-based Voice Conversion Using an Encoder-Decoder Network", IEEE SigPort, 2016. [Online]. Available: http://sigport.org/1168. Accessed: Feb. 27, 2020.
@article{1168-16,
url = {http://sigport.org/1168},
author = {Chin-Cheng Hsu; Hsin-Te Hwang; Yi-Chiao Wu; Yu Tsao; and Hsin-Min Wang },
publisher = {IEEE SigPort},
title = {Dictionary Update for NMF-based Voice Conversion Using an Encoder-Decoder Network},
year = {2016} }
TY - EJOUR
T1 - Dictionary Update for NMF-based Voice Conversion Using an Encoder-Decoder Network
AU - Chin-Cheng Hsu; Hsin-Te Hwang; Yi-Chiao Wu; Yu Tsao; and Hsin-Min Wang
PY - 2016
PB - IEEE SigPort
UR - http://sigport.org/1168
ER -
Chin-Cheng Hsu, Hsin-Te Hwang, Yi-Chiao Wu, Yu Tsao, and Hsin-Min Wang. (2016). Dictionary Update for NMF-based Voice Conversion Using an Encoder-Decoder Network. IEEE SigPort. http://sigport.org/1168
Chin-Cheng Hsu, Hsin-Te Hwang, Yi-Chiao Wu, Yu Tsao, and Hsin-Min Wang, 2016. Dictionary Update for NMF-based Voice Conversion Using an Encoder-Decoder Network. Available at: http://sigport.org/1168.
Chin-Cheng Hsu, Hsin-Te Hwang, Yi-Chiao Wu, Yu Tsao, and Hsin-Min Wang. (2016). "Dictionary Update for NMF-based Voice Conversion Using an Encoder-Decoder Network." Web.
1. Chin-Cheng Hsu, Hsin-Te Hwang, Yi-Chiao Wu, Yu Tsao, and Hsin-Min Wang. Dictionary Update for NMF-based Voice Conversion Using an Encoder-Decoder Network [Internet]. IEEE SigPort; 2016. Available from : http://sigport.org/1168