Documents
Presentation Slides
Presentation Slides
Source Coding of Audio Signals with a Generative Model
- Citation Author(s):
- Submitted by:
- Roy Fejgin
- Last updated:
- 27 May 2020 - 2:04pm
- Document Type:
- Presentation Slides
- Event:
- Presenters:
- Roy Fejgin
- Paper Code:
- AUD-P2.3
- Categories:
- Log in to post comments
These are the slides from the video presentation at ICASSP 2020 of the paper "Source Coding of Audio Signals with a Generative Model".
Paper abstract:
We consider source coding of audio signals with the help of a generative model. We use a construction where a waveform is first quantized, yielding a finite bitrate representation. The waveform is then reconstructed by random sampling from a model conditioned on the quantized waveform. The proposed coding scheme is theoretically
analyzed. Using SampleRNN as the generative model, we demonstrate that the proposed coding structure provides performance competitive with state-of-the-art source coding tools for specific categories of audio signals.
Comments
Additional materials
The video of this presentation is at:
https://2020.ieeeicassp-virtual.org/presentation/poster/source-coding-au...
Audio samples:
https://sigport.org/documents/source-coding-audio-signals-generative-model
The paper itself:
https://ieeexplore.ieee.org/abstract/document/9053220