Speaker-dependent WaveNet-based delay-free ADPCM speech coding

Submitted by:
Takenori Yoshimura
Last updated:
7 May 2019 - 10:45pm
Document Type:
Poster
Document Year:
2019
Presenters:
Takenori Yoshimura

This paper proposes a WaveNet-based delay-free adaptive differential pulse code modulation (ADPCM) speech coding system. The WaveNet generative model, which is a state-of-the-art model for neural-network-based speech waveform synthesis, is used as the adaptive predictor in ADPCM. To further improve speech quality, mel-cepstrum-based noise shaping and postfiltering are integrated with the proposed ADPCM system. Both objective and subjective evaluation results indicate that the proposed ADPCM system outperforms not only the conventional ADPCM system based on ITU-T Recommendation G.726 but also the ADPCM system based on adaptive mel-cepstral analysis.
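
To make the role of the predictor concrete, the following is a minimal sketch of a sample-by-sample ADPCM loop, not the system described in the paper. The function names (`predict`, `encode`, `decode`), the fixed step size, and the 4-bit code width are all assumptions chosen for illustration. The predictor here is a trivial stand-in that repeats the previous reconstructed sample; in the proposed system that slot is filled by WaveNet, and a practical coder would also adapt the quantizer step. Noise shaping and postfiltering are not shown.

```python
import math


def predict(history):
    """Hypothetical stand-in predictor: repeat the last reconstructed sample.

    This is a placeholder for the adaptive predictor; it only looks at
    past reconstructed samples, so no look-ahead is needed.
    """
    return history[-1] if history else 0.0


def encode(samples, step=0.05, bits=4):
    """Quantize the prediction residual of each sample to a `bits`-bit code."""
    lo, hi = -(1 << (bits - 1)), (1 << (bits - 1)) - 1
    history, codes = [], []
    for x in samples:
        p = predict(history)
        # Uniform quantization of the residual, clipped to the code range.
        code = max(lo, min(hi, round((x - p) / step)))
        codes.append(code)
        # Track the reconstructed signal so the encoder stays in sync
        # with what the decoder will compute.
        history.append(p + code * step)
    return codes


def decode(codes, step=0.05):
    """Rebuild the waveform from codes using the same predictor."""
    history = []
    for code in codes:
        history.append(predict(history) + code * step)
    return history


if __name__ == "__main__":
    signal = [0.5 * math.sin(2 * math.pi * 5 * n / 100) for n in range(100)]
    codes = encode(signal)
    restored = decode(codes)
    worst = max(abs(a - b) for a, b in zip(signal, restored))
    print(f"max reconstruction error: {worst:.4f}")
```

Because the encoder and decoder both run the predictor on previously reconstructed samples only, each code can be emitted and decoded as soon as its sample arrives. In this sketch the reconstruction error stays within half a quantizer step as long as the residual does not clip; a better predictor shrinks the residual and so allows a smaller step at the same bit rate.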
