Documents
Presentation Slides
Modeling nonlinear audio effects with end-to-end deep neural networks
- Citation Author(s):
- Submitted by:
- Marco A. Martin...
- Last updated:
- 10 May 2019 - 12:06pm
- Document Type:
- Presentation Slides
- Document Year:
- 2019
- Event:
- Presenters:
- Marco A. Martinez Ramirez
- Paper Code:
- AASP-L6.5
- Categories:
- Log in to post comments
Audio processors whose parameters are modified periodically
over time are often referred as time-varying or modulation based
audio effects. Most existing methods for modeling these type of
effect units are often optimized to a very specific circuit and cannot
be efficiently generalized to other time-varying effects. Based on
convolutional and recurrent neural networks, we propose a deep
learning architecture for generic black-box modeling of audio processors
with long-term memory. We explore the capabilities of
deep neural networks to learn such long temporal dependencies
and we show the network modeling various linear and nonlinear,
time-varying and time-invariant audio effects. In order to measure
the performance of the model, we propose an objective metric
based on the psychoacoustics of modulation frequency perception.
We also analyze what the model is actually learning and how the
given task is accomplished.