Towards zero-shot amplifier modeling: One-to-many amplifier modeling via tone embedding control

arXiv - CS - Sound Pub Date : 2024-07-15 DOI:arxiv-2407.10646

Yu-Hua Chen, Yen-Tung Yeh, Yuan-Chiao Cheng, Jui-Te Wu, Yu-Hsiang Ho, Jyh-Shing Roger Jang, Yi-Hsuan Yang

引用次数: 0

Abstract

Replicating analog device circuits through neural audio effect modeling has garnered increasing interest in recent years. Existing work has predominantly focused on a one-to-one emulation strategy, modeling specific devices individually. In this paper, we tackle the less-explored scenario of one-to-many emulation, utilizing conditioning mechanisms to emulate multiple guitar amplifiers through a single neural model. For condition representation, we use contrastive learning to build a tone embedding encoder that extracts style-related features of various amplifiers, leveraging a dataset of comprehensive amplifier settings. Targeting zero-shot application scenarios, we also examine various strategies for tone embedding representation, evaluating referenced tone embedding against two retrieval-based embedding methods for amplifiers unseen in the training time. Our findings showcase the efficacy and potential of the proposed methods in achieving versatile one-to-many amplifier modeling, contributing a foundational step towards zero-shot audio modeling applications.

查看原文本刊更多论文

迈向零射频放大器建模：通过音调嵌入控制进行一对多放大器建模

近年来，通过神经音频效果建模复制模拟设备电路的研究越来越受到关注。现有工作主要集中在一对一仿真策略上，对特定设备进行单独建模。在本文中，我们利用调节机制，通过单个神经模型模拟多个吉他放大器，从而解决了一对多仿真这一探索较少的问题。在条件表示方面，我们利用对比学习建立了一个音调嵌入编码器，该编码器可以提取各种放大器的风格相关特征，并利用一个综合放大器设置的数据集。针对零镜头应用场景，我们还研究了音调嵌入表示的各种策略，针对训练时间内未见的放大器，评估了参考音调嵌入和两种基于检索的嵌入方法。我们的研究结果展示了所提方法在实现多用途一对多放大器建模方面的功效和潜力，为实现零镜头音频建模应用迈出了奠基性的一步。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

arXiv - CS - Sound

自引率

0.00%

发文量