Yue Wu, Anran Qiu, Liuxuan Ruan, Xuejie Li, Jinhao Huang, Stephen Jia Wang
{"title":"DJaytopia: a hybrid intelligent DJ co-remixing system","authors":"Yue Wu, Anran Qiu, Liuxuan Ruan, Xuejie Li, Jinhao Huang, Stephen Jia Wang","doi":"10.54941/ahfe1004114","DOIUrl":null,"url":null,"abstract":"Nowadays, musical mixing platforms are featured with programmed\n interventions and digitized information visualization to support DJ's\n performance (Montano 2010), however, the visualization is always obscure to\n the average music consumers (Beamish, Maclean, and Fels 2004). Being a\n well-performed DJ requires the level of expertise and experience that most\n average music consumers lack (Cliff 2000), as every audience has a\n completely different taste in music (Schäfer and Sedlmeier 2010). This study\n aims at developing an AI / ML-based system to lower the bar for novice DJs\n and even average music consumers to create personalized music\n remixes.Generally, music can be intelligently composed by analyzing harmonic\n and melodic features to generate genre-specific compositional elements or to\n alter the compositional structure of a song (Tan and Li 2021). Despite the\n technical breakthroughs that have been made, listeners have reacted\n negatively to this music due to the lack of user data to back it up and the\n neglect of the user's perception of the piece (Tigre Moura and Maw 2021). In\n a conventional scenario, DJs can express their attitudes towards music\n preferences by listening to the music directly, which requires a well\n understanding of the audience's mind. Following the recent launch and\n explosion of ChatGPT, which has evidenced that an intelligent system could\n help users innovate by solving their problems in textual form through\n conversational interactions (Dis et al. 2023; Dwivedi et al. 2023); also\n collecting the users' feedback through conversations, observing user\n reactions, and inviting user reviews. Such AI-enabled systems are able to\n learn about the user's preferred music style and various DJ mixing\n techniques. This study adopts a typical human-in-the-loop (HITL) approach to\n develop a crowd-learning music mixing system implementing AI and Virtual\n Reality technologies. The proposed HITL-based co-music arrangement system\n should be able to collect musical data and techniques; a VR environment is\n built to provide users with a platform to record user-created music and\n corresponding applied methods as well as audience ratings worldwide. After\n processing the data, users can try out a compilation of songs assisted by a\n robotic arm. With the help of the robotic arm, it will be easier and faster\n for users to create collections with a personal touch and more specific\n techniques. The essential functions include: a) Providing users with an\n immersive environment to learn the basic operations of the DJ console. b)\n Collecting the user's preferences for compilation techniques and the content\n of different DJ's compositions for use through an “immersive online\n multiplayer music compilation platform” to generate a personalized library\n of methods to help the user compile songs; c) Assisting the user in creating\n their preferred individual compilation style faster as they try out the DJ's\n operations; d) Indicating to the user where the music needs to be equalized,\n switched or arranged. Instead of showing the user the digital music signal\n to assist in creating more efficiently, the system directly operates on the\n DJ console.User experience experiments were conducted with both novice DJs\n and experienced DJs to validate whether the proposed system could help\n humans in creating more engaging music with stronger musicality. Five\n participants, respectively three novice DJs and two experienced DJs, joined\n two experiments of half an hour on a virtual DJ and an actual DJ console.\n They started the experiment by experiencing the virtual DJ console and DJ\n community in VR. They remixed independently first and then collaborated with\n the robotic arm together for music production on the actual DJ console.\n Three different audience also joined the experiment to evaluate the\n performance of users. The result was that the music produced with the\n robotic arm had better musicality. The user's attitude towards the whole\n experience, reflected in whether the music was rhythmic or the system was\n inspiring was recorded in the feedback. Overall, the users had a satisfying\n and smooth experience, and the collaborative music remixing had a certain\n level of musicality, but there is still some room for improvement in terms\n of user understanding. However, the users expressed that this fresh\n collaborative approach made them more interested in DJing and motivated\n their desire to learn and create.","PeriodicalId":231376,"journal":{"name":"Human Systems Engineering and Design (IHSED 2023): Future Trends\n and Applications","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Human Systems Engineering and Design (IHSED 2023): Future Trends\n and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.54941/ahfe1004114","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Nowadays, musical mixing platforms are featured with programmed
interventions and digitized information visualization to support DJ's
performance (Montano 2010), however, the visualization is always obscure to
the average music consumers (Beamish, Maclean, and Fels 2004). Being a
well-performed DJ requires the level of expertise and experience that most
average music consumers lack (Cliff 2000), as every audience has a
completely different taste in music (Schäfer and Sedlmeier 2010). This study
aims at developing an AI / ML-based system to lower the bar for novice DJs
and even average music consumers to create personalized music
remixes.Generally, music can be intelligently composed by analyzing harmonic
and melodic features to generate genre-specific compositional elements or to
alter the compositional structure of a song (Tan and Li 2021). Despite the
technical breakthroughs that have been made, listeners have reacted
negatively to this music due to the lack of user data to back it up and the
neglect of the user's perception of the piece (Tigre Moura and Maw 2021). In
a conventional scenario, DJs can express their attitudes towards music
preferences by listening to the music directly, which requires a well
understanding of the audience's mind. Following the recent launch and
explosion of ChatGPT, which has evidenced that an intelligent system could
help users innovate by solving their problems in textual form through
conversational interactions (Dis et al. 2023; Dwivedi et al. 2023); also
collecting the users' feedback through conversations, observing user
reactions, and inviting user reviews. Such AI-enabled systems are able to
learn about the user's preferred music style and various DJ mixing
techniques. This study adopts a typical human-in-the-loop (HITL) approach to
develop a crowd-learning music mixing system implementing AI and Virtual
Reality technologies. The proposed HITL-based co-music arrangement system
should be able to collect musical data and techniques; a VR environment is
built to provide users with a platform to record user-created music and
corresponding applied methods as well as audience ratings worldwide. After
processing the data, users can try out a compilation of songs assisted by a
robotic arm. With the help of the robotic arm, it will be easier and faster
for users to create collections with a personal touch and more specific
techniques. The essential functions include: a) Providing users with an
immersive environment to learn the basic operations of the DJ console. b)
Collecting the user's preferences for compilation techniques and the content
of different DJ's compositions for use through an “immersive online
multiplayer music compilation platform” to generate a personalized library
of methods to help the user compile songs; c) Assisting the user in creating
their preferred individual compilation style faster as they try out the DJ's
operations; d) Indicating to the user where the music needs to be equalized,
switched or arranged. Instead of showing the user the digital music signal
to assist in creating more efficiently, the system directly operates on the
DJ console.User experience experiments were conducted with both novice DJs
and experienced DJs to validate whether the proposed system could help
humans in creating more engaging music with stronger musicality. Five
participants, respectively three novice DJs and two experienced DJs, joined
two experiments of half an hour on a virtual DJ and an actual DJ console.
They started the experiment by experiencing the virtual DJ console and DJ
community in VR. They remixed independently first and then collaborated with
the robotic arm together for music production on the actual DJ console.
Three different audience also joined the experiment to evaluate the
performance of users. The result was that the music produced with the
robotic arm had better musicality. The user's attitude towards the whole
experience, reflected in whether the music was rhythmic or the system was
inspiring was recorded in the feedback. Overall, the users had a satisfying
and smooth experience, and the collaborative music remixing had a certain
level of musicality, but there is still some room for improvement in terms
of user understanding. However, the users expressed that this fresh
collaborative approach made them more interested in DJing and motivated
their desire to learn and create.
如今,音乐混音平台的特点是通过编程干预和数字化信息可视化来支持DJ的表演(Montano 2010),然而,对于普通音乐消费者来说,可视化总是模糊不清的(Beamish, Maclean, and Fels 2004)。作为一个表现良好的DJ需要的专业知识和经验水平,大多数普通音乐消费者缺乏(Cliff 2000),因为每个听众都有完全不同的音乐品味(Schäfer和Sedlmeier 2010)。本研究旨在开发一个基于AI / ml的系统,以降低新手dj甚至普通音乐消费者制作个性化音乐混音的门槛。一般来说,音乐可以通过分析和声和旋律特征来智能创作,从而产生特定类型的作曲元素或改变歌曲的作曲结构(Tan and Li 2021)。尽管已经取得了技术上的突破,但由于缺乏用户数据来支持它,并且忽视了用户对作品的感知,听众对这首音乐的反应是负面的(Tigre Moura和Maw 2021)。在传统的场景中,dj可以通过直接听音乐来表达他们对音乐偏好的态度,这需要对听众的心理有很好的了解。随着最近ChatGPT的推出和爆炸式增长,这证明了智能系统可以通过对话交互以文本形式解决用户的问题,从而帮助用户进行创新(Dis et al. 2023;Dwivedi et al. 2023);同时通过对话收集用户反馈,观察用户反应,邀请用户评论。这种人工智能系统能够了解用户喜欢的音乐风格和各种DJ混音技术。本研究采用典型的人在环(HITL)方法来开发一个采用人工智能和虚拟现实技术的群体学习音乐混音系统。提出的基于hitl的合曲编曲系统应该能够收集音乐数据和技术;构建VR环境,为用户提供一个平台,记录用户创作的音乐和相应的应用方法,以及全球的收视率。在处理完数据后,用户可以在机械臂的帮助下尝试歌曲汇编。在机械臂的帮助下,用户将更容易、更快地创建具有个人风格和更具体技术的收藏品。其基本功能包括:a)为用户提供一个身临其境的环境来学习DJ控制台的基本操作。b)通过“沉浸式在线多人音乐编曲平台”收集用户对编曲技术的偏好和不同DJ的作曲内容,生成个性化的方法库,帮助用户编曲;c)在用户尝试DJ的操作时,帮助他们更快地创建自己喜欢的个人编译风格;d)指示用户在何处需要对音乐进行均衡、切换或编排。该系统不是向用户显示数字音乐信号以帮助更有效地创作,而是直接在DJ控制台上操作。用户体验实验在新手dj和资深dj中进行,以验证所提出的系统是否可以帮助人类创造出更具音乐性的更吸引人的音乐。五名参与者,分别是三名新手DJ和两名经验丰富的DJ,在虚拟DJ和实际DJ控制台上参加了两个半小时的实验。他们在VR中体验了虚拟DJ控制台和DJ社区,开始了实验。他们首先独立混音,然后与机械臂一起在实际的DJ控制台进行音乐制作。三个不同的观众也加入了这个实验来评估用户的表现。结果是,用机械臂产生的音乐具有更好的音乐性。用户对整个体验的态度,反映在音乐是否有节奏感或系统是否鼓舞人心,都记录在反馈中。总体而言,用户体验满意且流畅,协作音乐混音具有一定的音乐性,但在用户理解方面仍有一定的提升空间。然而,用户们表示,这种全新的合作方式让他们对dj更感兴趣,激发了他们学习和创造的欲望。