An Emotional Symbolic Music Generation System based on LSTM Networks
Kun Zhao, Siqi Li, Juanjuan Cai, Hui Wang, Jingling Wang
2019 IEEE 3rd Information Technology, Networking, Electronic and Automation Control Conference (ITNEC), 15 March 2019. DOI: 10.1109/ITNEC.2019.8729266
With the development of AI technology in recent years, neural networks have been applied to algorithmic music composition and have achieved desirable results. Music is highly associated with human emotion; however, there have been few attempts at intelligent music composition aimed at expressing different emotions. In this work, Biaxial LSTM networks are used to generate polyphonic music, and the LookBack mechanism is introduced into the architecture to improve long-term structure. On top of this, we design a novel system for emotional music generation with steerable parameters for four basic emotions, divided according to Russell's two-dimensional valence-arousal (VA) emotional space. The evaluation indices of the music generated by this model are closer to those of real music, and a human listening test shows that the different affects expressed by the generated emotional samples can be correctly distinguished in the majority of cases.
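To illustrate the kind of architecture the abstract describes, the following is a minimal PyTorch sketch (not the authors' code) of emotion-conditioned biaxial LSTM generation: a time-axis LSTM models dependencies across timesteps, a note-axis LSTM models dependencies across pitches, and a valence-arousal (VA) vector is concatenated to every input so one of the four basic emotion quadrants can be steered at sampling time. Layer sizes, the feature layout, and the class and constant names are assumptions, and the LookBack mechanism is omitted for brevity.

```python
# Minimal sketch (assumed layout, not the paper's implementation) of a
# VA-conditioned biaxial LSTM for polyphonic (piano-roll) music generation.
import torch
import torch.nn as nn

N_PITCHES = 48      # assumed pitch range of the piano-roll
NOTE_FEATS = 16     # assumed per-note input features (pitch class, beat, ...)

class BiaxialEmotionLSTM(nn.Module):
    def __init__(self, time_hidden=128, note_hidden=64):
        super().__init__()
        # +2 everywhere for the (valence, arousal) conditioning vector
        self.time_lstm = nn.LSTM(NOTE_FEATS + 2, time_hidden,
                                 num_layers=2, batch_first=True)
        self.note_lstm = nn.LSTM(time_hidden + 2, note_hidden,
                                 num_layers=2, batch_first=True)
        self.out = nn.Linear(note_hidden, 2)  # P(play), P(articulate) per note

    def forward(self, x, va):
        # x:  (batch, time, pitch, NOTE_FEATS) piano-roll-derived features
        # va: (batch, 2) valence-arousal in [-1, 1], one of 4 quadrants
        b, t, p, f = x.shape
        va_tp = va[:, None, None, :].expand(b, t, p, 2)

        # time axis: run an LSTM over timesteps, independently for each pitch
        xt = torch.cat([x, va_tp], dim=-1).permute(0, 2, 1, 3).reshape(b * p, t, f + 2)
        ht, _ = self.time_lstm(xt)                        # (b*p, t, time_hidden)
        ht = ht.reshape(b, p, t, -1).permute(0, 2, 1, 3)  # (b, t, p, time_hidden)

        # note axis: run an LSTM over pitches, independently for each timestep
        xn = torch.cat([ht, va_tp], dim=-1).reshape(b * t, p, -1)
        hn, _ = self.note_lstm(xn)                        # (b*t, p, note_hidden)
        return torch.sigmoid(self.out(hn)).reshape(b, t, p, 2)

# e.g. steering toward the high-valence, high-arousal ("happy") quadrant
model = BiaxialEmotionLSTM()
x = torch.zeros(1, 32, N_PITCHES, NOTE_FEATS)
probs = model(x, va=torch.tensor([[0.8, 0.8]]))
print(probs.shape)  # torch.Size([1, 32, 48, 2])
```

At generation time, the per-note play/articulate probabilities would be sampled step by step and fed back as the next input; changing only the VA vector while keeping the weights fixed is what makes the emotional expression steerable.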