HIGH-QUALITY SPEECH CODING WITH SAMPLE RNN

Abstract: 

We provide a speech coding scheme employing a generative model based on SampleRNN that, while operating at significantly lower bitrates, matches or surpasses the perceptual quality of state-of-the-art classic wide-band codecs. Moreover, it is demonstrated that the proposed scheme can provide a meaningful rate-distortion trade-off without retraining. We evaluate the proposed scheme in a series of listening tests and discuss limitations of the approach.


Paper Details

Authors:
Janusz Klejsa, Per Hedelin, Cong Zhou, Roy Fejgin, Lars Villemoes
Submitted On:
23 May 2019 - 7:33am
Short Link:
Type:
Poster
Event:
Presenter's Name:
Janusz Klejsa
Paper Code:
SLP-P24.7
Document Year:
2019

Document Files

Audio demo
Poster

[1] Janusz Klejsa, Per Hedelin, Cong Zhou, Roy Fejgin, Lars Villemoes, "HIGH-QUALITY SPEECH CODING WITH SAMPLE RNN", IEEE SigPort, 2019. [Online]. Available: http://sigport.org/3895. Accessed: Aug. 24, 2019.
@article{3895-19,
url = {http://sigport.org/3895},
author = {Janusz Klejsa and Per Hedelin and Cong Zhou and Roy Fejgin and Lars Villemoes},
publisher = {IEEE SigPort},
title = {HIGH-QUALITY SPEECH CODING WITH SAMPLE RNN},
year = {2019} }