pith. sign in

arxiv: 2408.04829 · v1 · pith:7T67ZZ4Anew · submitted 2024-08-09 · 💻 cs.SD · eess.AS

Hyper Recurrent Neural Network: Condition Mechanisms for Black-box Audio Effect Modeling

classification 💻 cs.SD eess.AS
keywords audioconditioningmechanismsmodelmodelinganalogblack-boxcontrol
0
0 comments X
read the original abstract

Recurrent neural networks (RNNs) have demonstrated impressive results for virtual analog modeling of audio effects. These networks process time-domain audio signals using a series of matrix multiplication and nonlinear activation functions to emulate the behavior of the target device accurately. To additionally model the effect of the knobs for an RNN-based model, existing approaches integrate control parameters by concatenating them channel-wisely with some intermediate representation of the input signal. While this method is parameter-efficient, there is room to further improve the quality of generated audio because the concatenation-based conditioning method has limited capacity in modulating signals. In this paper, we propose three novel conditioning mechanisms for RNNs, tailored for black-box virtual analog modeling. These advanced conditioning mechanisms modulate the model based on control parameters, yielding superior results to existing RNN- and CNN-based architectures across various evaluation metrics.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.