Sorry, you need to enable JavaScript to visit this website.

facebooktwittermailshare

NOVEL AMPLITUDE SCALING METHOD FOR BILINEAR FREQUENCY WARPING-BASED VOICE CONVERSION

Abstract: 

In frequency warping (FW)-based Voice Conversion (VC), the source spectrum is modified to match the frequency-axis of the target spectrum followed by an Amplitude Scaling (AS) to compensate the amplitude differences between the warped spectrum and the actual target spectrum. In this paper, we propose a novel AS technique which linearly transfers the amplitude of the frequency warped spectrum using the knowledge of a Gaussian Mixture Model (GMM)-based converted spectrum without adding any spurious peaks. The novelty of the proposed approach lies in avoiding a perceptual impression of wrong formant location (due to perfect match assumption between the warped spectrum and the actual target spectrum in state-of-the-art AS method) leading to deterioration in converted voice quality. From subjective analysis, it is evident that the proposed system has been preferred 33.81 % and 12.37 % times more compared to the GMM and state-of-the-art AS method for voice quality, respectively. Similar to the quality conversion trade-offs observed by other studies in the literature, speaker identity conversion was 0.73 % times more and 9.09 % times less preferred over GMM and state-of-the-art AS-based method, respectively.

up
0 users have voted:

Paper Details

Authors:
Nirmesh J. Shah and Hemant A. Patil
Submitted On:
28 February 2017 - 4:49am
Short Link:
Type:
Poster
Event:
Presenter's Name:
Nirmesh Shah
Paper Code:
1380
Document Year:
2017
Cite

Document Files

ICASSP_2017_NH.pdf

(344)

Subscribe

[1] Nirmesh J. Shah and Hemant A. Patil, "NOVEL AMPLITUDE SCALING METHOD FOR BILINEAR FREQUENCY WARPING-BASED VOICE CONVERSION", IEEE SigPort, 2017. [Online]. Available: http://sigport.org/1493. Accessed: Aug. 10, 2020.
@article{1493-17,
url = {http://sigport.org/1493},
author = {Nirmesh J. Shah and Hemant A. Patil },
publisher = {IEEE SigPort},
title = {NOVEL AMPLITUDE SCALING METHOD FOR BILINEAR FREQUENCY WARPING-BASED VOICE CONVERSION},
year = {2017} }
TY - EJOUR
T1 - NOVEL AMPLITUDE SCALING METHOD FOR BILINEAR FREQUENCY WARPING-BASED VOICE CONVERSION
AU - Nirmesh J. Shah and Hemant A. Patil
PY - 2017
PB - IEEE SigPort
UR - http://sigport.org/1493
ER -
Nirmesh J. Shah and Hemant A. Patil. (2017). NOVEL AMPLITUDE SCALING METHOD FOR BILINEAR FREQUENCY WARPING-BASED VOICE CONVERSION. IEEE SigPort. http://sigport.org/1493
Nirmesh J. Shah and Hemant A. Patil, 2017. NOVEL AMPLITUDE SCALING METHOD FOR BILINEAR FREQUENCY WARPING-BASED VOICE CONVERSION. Available at: http://sigport.org/1493.
Nirmesh J. Shah and Hemant A. Patil. (2017). "NOVEL AMPLITUDE SCALING METHOD FOR BILINEAR FREQUENCY WARPING-BASED VOICE CONVERSION." Web.
1. Nirmesh J. Shah and Hemant A. Patil. NOVEL AMPLITUDE SCALING METHOD FOR BILINEAR FREQUENCY WARPING-BASED VOICE CONVERSION [Internet]. IEEE SigPort; 2017. Available from : http://sigport.org/1493