International Society for Music Information Retrieval Conference最新文献_第4页

Generating Coherent Drum Accompaniment With Fills And Improvisations 产生连贯的鼓伴奏与填充和即兴

International Society for Music Information Retrieval Conference Pub Date : 2022-09-01 DOI: 10.48550/arXiv.2209.00291

Rishabh A. Dahale, Vaibhav Talwadker, P. Rao, Prateek Verma

{"title":"Generating Coherent Drum Accompaniment With Fills And Improvisations","authors":"Rishabh A. Dahale, Vaibhav Talwadker, P. Rao, Prateek Verma","doi":"10.48550/arXiv.2209.00291","DOIUrl":"https://doi.org/10.48550/arXiv.2209.00291","url":null,"abstract":"Creating a complex work of art like music necessitates profound creativity. With recent advancements in deep learning and powerful models such as transformers, there has been huge progress in automatic music generation. In an accompaniment generation context, creating a coherent drum pattern with apposite fills and improvisations at proper locations in a song is a challenging task even for an experienced drummer. Drum beats tend to follow a repetitive pattern through stanzas with fills or improvisation at section boundaries. In this work, we tackle the task of drum pattern generation conditioned on the accompanying music played by four melodic instruments: Piano, Guitar, Bass, and Strings. We use the transformer sequence to sequence model to generate a basic drum pattern conditioned on the melodic accompaniment to find that improvisation is largely absent, attributed possibly to its expectedly relatively low representation in the training data. We propose a novelty function to capture the extent of improvisation in a bar relative to its neighbors. We train a model to predict improvisation locations from the melodic accompaniment tracks. Finally, we use a novel BERT-inspired in-filling architecture, to learn the structure of both the drums and melody to in-fill elements of improvised music.","PeriodicalId":309903,"journal":{"name":"International Society for Music Information Retrieval Conference","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127929361","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Cadence Detection in Symbolic Classical Music using Graph Neural Networks 基于图神经网络的符号古典音乐节奏检测

International Society for Music Information Retrieval Conference Pub Date : 2022-08-31 DOI: 10.48550/arXiv.2208.14819

E. Karystinaios, G. Widmer

{"title":"Cadence Detection in Symbolic Classical Music using Graph Neural Networks","authors":"E. Karystinaios, G. Widmer","doi":"10.48550/arXiv.2208.14819","DOIUrl":"https://doi.org/10.48550/arXiv.2208.14819","url":null,"abstract":"Cadences are complex structures that have been driving music from the beginning of contrapuntal polyphony until today. Detecting such structures is vital for numerous MIR tasks such as musicological analysis, key detection, or music segmentation. However, automatic cadence detection remains challenging mainly because it involves a combination of high-level musical elements like harmony, voice leading, and rhythm. In this work, we present a graph representation of symbolic scores as an intermediate means to solve the cadence detection task. We approach cadence detection as an imbalanced node classification problem using a Graph Convolutional Network. We obtain results that are roughly on par with the state of the art, and we present a model capable of making predictions at multiple levels of granularity, from individual notes to beats, thanks to the fine-grained, note-by-note representation. Moreover, our experiments suggest that graph convolution can learn non-local features that assist in cadence detection, freeing us from the need of having to devise specialized features that encode non-local context. We argue that this general approach to modeling musical scores and classification tasks has a number of potential advantages, beyond the specific recognition task presented here.","PeriodicalId":309903,"journal":{"name":"International Society for Music Information Retrieval Conference","volume":"59 4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116522414","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 5

Sketching the Expression: Flexible Rendering of Expressive Piano Performance with Self-Supervised Learning 写生表达:自我监督学习下钢琴表现力演奏的灵活呈现

International Society for Music Information Retrieval Conference Pub Date : 2022-08-31 DOI: 10.48550/arXiv.2208.14867

Seungyeon Rhyu, Sarah Kim, Kyogu Lee

引用次数: 1

Evaluating generative audio systems and their metrics 评估生成音频系统及其参数

International Society for Music Information Retrieval Conference Pub Date : 2022-08-31 DOI: 10.48550/arXiv.2209.00130

Ashvala Vinay, Alexander Lerch

引用次数: 7

MeloForm: Generating Melody with Musical Form based on Expert Systems and Neural Networks MeloForm:基于专家系统和神经网络的曲式旋律生成

International Society for Music Information Retrieval Conference Pub Date : 2022-08-30 DOI: 10.48550/arXiv.2208.14345

Peiling Lu, Xu Tan, Botao Yu, Tao Qin, Sheng Zhao, Tie-Yan Liu

{"title":"MeloForm: Generating Melody with Musical Form based on Expert Systems and Neural Networks","authors":"Peiling Lu, Xu Tan, Botao Yu, Tao Qin, Sheng Zhao, Tie-Yan Liu","doi":"10.48550/arXiv.2208.14345","DOIUrl":"https://doi.org/10.48550/arXiv.2208.14345","url":null,"abstract":"Human usually composes music by organizing elements according to the musical form to express music ideas. However, for neural network-based music generation, it is difficult to do so due to the lack of labelled data on musical form. In this paper, we develop MeloForm, a system that generates melody with musical form using expert systems and neural networks. Specifically, 1) we design an expert system to generate a melody by developing musical elements from motifs to phrases then to sections with repetitions and variations according to pre-given musical form; 2) considering the generated melody is lack of musical richness, we design a Transformer based refinement model to improve the melody without changing its musical form. MeloForm enjoys the advantages of precise musical form control by expert systems and musical richness learning via neural models. Both subjective and objective experimental evaluations demonstrate that MeloForm generates melodies with precise musical form control with 97.79% accuracy, and outperforms baseline systems in terms of subjective evaluation score by 0.75, 0.50, 0.86 and 0.89 in structure, thematic, richness and overall quality, without any labelled musical form data. Besides, MeloForm can support various kinds of forms, such as verse and chorus form, rondo form, variational form, sonata form, etc.","PeriodicalId":309903,"journal":{"name":"International Society for Music Information Retrieval Conference","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131240265","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 5

HPPNet: Modeling the Harmonic Structure and Pitch Invariance in Piano Transcription 钢琴转录中的和声结构和音高不变性建模

International Society for Music Information Retrieval Conference Pub Date : 2022-08-30 DOI: 10.48550/arXiv.2208.14339

Weixing Wei, P. Li, Yi Yu, Wei Li

引用次数: 5

Towards robust music source separation on loud commercial music 在嘈杂的商业音乐上实现健壮的音乐源分离

International Society for Music Information Retrieval Conference Pub Date : 2022-08-30 DOI: 10.48550/arXiv.2208.14355

Chang-Bin Jeon, Kyogu Lee

引用次数: 4

Music Separation Enhancement with Generative Modeling 音乐分离增强与生成建模

International Society for Music Information Retrieval Conference Pub Date : 2022-08-26 DOI: 10.48550/arXiv.2208.12387

N. Schaffer, Boaz Cogan, Ethan Manilow, Max Morrison, Prem Seetharaman, Bryan Pardo

引用次数: 6

MuLan: A Joint Embedding of Music Audio and Natural Language 木兰:音乐音频与自然语言的联合嵌入

International Society for Music Information Retrieval Conference Pub Date : 2022-08-26 DOI: 10.48550/arXiv.2208.12415

Qingqing Huang, A. Jansen, Joonseok Lee, R. Ganti, Judith Yue Li, D. Ellis

引用次数: 46

Multi-objective Hyper-parameter Optimization of Behavioral Song Embeddings 行为歌曲嵌入的多目标超参数优化

International Society for Music Information Retrieval Conference Pub Date : 2022-08-26 DOI: 10.48550/arXiv.2208.12724

Massimo Quadrana, Antoine Larreche-Mouly, Matthias Mauch

引用次数: 2