Support System for Improvisational Ensemble Based on Long Short-Term Memory Using Smartphone Sensor

Haruya Takase, Shun Shiramatsu
{"title":"Support System for Improvisational Ensemble Based on Long Short-Term Memory Using Smartphone Sensor","authors":"Haruya Takase, Shun Shiramatsu","doi":"10.5281/ZENODO.4813434","DOIUrl":null,"url":null,"abstract":"Our goal is to develop an improvisational ensemble support system for music beginners who do not have knowledge of chord progressions and do not have enough experience of playing an instrument. We hypothesized that a music beginner cannot determine tonal pitches of melody over a particular chord but can use body movements to specify the pitch contour (i.e., melodic outline) and the attack timings (i.e., rhythm). We aim to realize a performance interface for supporting expressing intuitive pitch contour and attack timings using body motion and outputting harmonious pitches over the chord progression of the background music. Since the intended users of this system are not limited to people with music experience, we plan to develop a system that uses Android smartphones, which many people have. Our system consists of three modules: a module for specifying attack timing using smartphone sensors, module for estimating the vertical movement of the smartphone using smartphone sensors, and module for estimating the sound height using smartphone vertical movement and background chord progression. Each estimation module is developed using long short-term memory (LSTM), which is often used to estimate time series data. We conduct evaluation experiments for each module. As a result, the attack timing estimation had zero misjudgments, and the mean error time of the estimated attack timing was smaller than the sensor-acquisition interval. The accuracy of the vertical motion estimation was 64%, and that of the pitch estimation was 7.6%. The results indicate that the attack timing is accurate enough, but the vertical motion estimation and the pitch estimation need to be improved for actual use.","PeriodicalId":161317,"journal":{"name":"New Interfaces for Musical Expression","volume":"41 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"New Interfaces for Musical Expression","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5281/ZENODO.4813434","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Our goal is to develop an improvisational ensemble support system for music beginners who do not have knowledge of chord progressions and do not have enough experience of playing an instrument. We hypothesized that a music beginner cannot determine tonal pitches of melody over a particular chord but can use body movements to specify the pitch contour (i.e., melodic outline) and the attack timings (i.e., rhythm). We aim to realize a performance interface for supporting expressing intuitive pitch contour and attack timings using body motion and outputting harmonious pitches over the chord progression of the background music. Since the intended users of this system are not limited to people with music experience, we plan to develop a system that uses Android smartphones, which many people have. Our system consists of three modules: a module for specifying attack timing using smartphone sensors, module for estimating the vertical movement of the smartphone using smartphone sensors, and module for estimating the sound height using smartphone vertical movement and background chord progression. Each estimation module is developed using long short-term memory (LSTM), which is often used to estimate time series data. We conduct evaluation experiments for each module. As a result, the attack timing estimation had zero misjudgments, and the mean error time of the estimated attack timing was smaller than the sensor-acquisition interval. The accuracy of the vertical motion estimation was 64%, and that of the pitch estimation was 7.6%. The results indicate that the attack timing is accurate enough, but the vertical motion estimation and the pitch estimation need to be improved for actual use.
基于智能手机传感器的长短期记忆即兴合奏支持系统
我们的目标是为音乐初学者开发一个即兴合奏支持系统,他们没有和弦进行的知识,也没有足够的演奏乐器的经验。我们假设,音乐初学者不能确定特定和弦旋律的音调,但可以使用身体动作来指定音高轮廓(即旋律轮廓)和攻击时间(即节奏)。我们的目标是实现一个表演界面,支持用身体运动来表达直观的音高轮廓和攻击时序,并在背景音乐的和弦进行中输出和谐的音高。由于这个系统的目标用户并不局限于有音乐经验的人,所以我们计划开发一个使用安卓智能手机的系统,很多人都有安卓智能手机。我们的系统由三个模块组成:一个模块用于使用智能手机传感器指定攻击时间,一个模块用于使用智能手机传感器估计智能手机的垂直运动,一个模块用于使用智能手机垂直运动和背景和弦进行估计声音高度。每个估计模块都是使用长短期记忆(LSTM)开发的,它通常用于估计时间序列数据。我们对每个模块进行了评估实验。结果表明,攻击时间估计为零误判,估计攻击时间的平均误差时间小于传感器采集间隔。垂直运动估计精度为64%,俯仰估计精度为7.6%。实验结果表明,该方法的攻击定时精度较高,但在垂直运动估计和俯仰估计方面有待改进。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信