Amp-Space: A Large-Scale Dataset for Fine-Grained Timbre Transformation

2021 24th International Conference on Digital Audio Effects (DAFx) Pub Date : 2021-09-08 DOI:10.23919/DAFx51585.2021.9768241

Jason Naradowsky

引用次数: 1

Abstract

We release Amp-Space, a large-scale dataset of paired audio samples: a source audio signal, and an output signal, the result of a timbre transformation. The types of transformations we study are from blackbox musical tools (amplifiers, stompboxes, studio effects) traditionally used to shape the sound of guitar, bass, or synthesizer sounds. For each sample of transformed audio, the set of parameters used to create it are given. Samples are from both real and simulated devices, the latter allowing for orders of magnitude greater data than found in comparable datasets. We demonstrate potential use cases of this data by (a) pre-training a conditional WaveNet model on synthetic data and show that it reduces the number of samples necessary to digitally reproduce a real musical device, and (b) training a variational autoencoder to shape a continuous space of timbre transformations for creating new sounds through interpolation.

查看原文本刊更多论文

放大器空间:用于细粒度音色变换的大规模数据集

我们发布了Amp-Space，配对音频样本的大规模数据集:源音频信号和输出信号，音色变换的结果。我们研究的转换类型是从黑盒子音乐工具(放大器，stompboxes，工作室效果)传统上用于塑造吉他，贝斯或合成器声音的声音。对于转换音频的每个样本，给出了用于创建它的参数集。样本来自真实和模拟设备，后者允许比可比数据集中发现的数据大几个数量级。我们通过(a)在合成数据上预训练条件WaveNet模型来演示该数据的潜在用例，并表明它减少了数字再现真实音乐设备所需的样本数量，以及(b)训练变分自编码器来塑造音色转换的连续空间，以便通过插值创建新的声音。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 24th International Conference on Digital Audio Effects (DAFx)

自引率

0.00%

发文量