Frontiers in Neurorobotics最新文献

筛选
英文 中文
Swimtrans Net: a multimodal robotic system for swimming action recognition driven via Swin-Transformer. Swimtrans Net:通过斯温变换器驱动的游泳动作识别多模态机器人系统。
IF 2.6 4区 计算机科学
Frontiers in Neurorobotics Pub Date : 2024-09-24 eCollection Date: 2024-01-01 DOI: 10.3389/fnbot.2024.1452019
He Chen, Xiaoyu Yue
{"title":"Swimtrans Net: a multimodal robotic system for swimming action recognition driven via Swin-Transformer.","authors":"He Chen, Xiaoyu Yue","doi":"10.3389/fnbot.2024.1452019","DOIUrl":"10.3389/fnbot.2024.1452019","url":null,"abstract":"<p><strong>Introduction: </strong>Currently, using machine learning methods for precise analysis and improvement of swimming techniques holds significant research value and application prospects. The existing machine learning methods have improved the accuracy of action recognition to some extent. However, they still face several challenges such as insufficient data feature extraction, limited model generalization ability, and poor real-time performance.</p><p><strong>Methods: </strong>To address these issues, this paper proposes an innovative approach called Swimtrans Net: A multimodal robotic system for swimming action recognition driven via Swin-Transformer. By leveraging the powerful visual data feature extraction capabilities of Swin-Transformer, Swimtrans Net effectively extracts swimming image information. Additionally, to meet the requirements of multimodal tasks, we integrate the CLIP model into the system. Swin-Transformer serves as the image encoder for CLIP, and through fine-tuning the CLIP model, it becomes capable of understanding and interpreting swimming action data, learning relevant features and patterns associated with swimming. Finally, we introduce transfer learning for pre-training to reduce training time and lower computational resources, thereby providing real-time feedback to swimmers.</p><p><strong>Results and discussion: </strong>Experimental results show that Swimtrans Net has achieved a 2.94% improvement over the current state-of-the-art methods in swimming motion analysis and prediction, making significant progress. This study introduces an innovative machine learning method that can help coaches and swimmers better understand and improve swimming techniques, ultimately improving swimming performance.</p>","PeriodicalId":12628,"journal":{"name":"Frontiers in Neurorobotics","volume":"18 ","pages":"1452019"},"PeriodicalIF":2.6,"publicationDate":"2024-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11458561/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142389758","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Human in the collaborative loop: a strategy for integrating human activity recognition and non-invasive brain-machine interfaces to control collaborative robots. 协作回路中的人:整合人类活动识别和非侵入式脑机接口以控制协作机器人的策略。
IF 2.6 4区 计算机科学
Frontiers in Neurorobotics Pub Date : 2024-09-24 eCollection Date: 2024-01-01 DOI: 10.3389/fnbot.2024.1383089
Artur Pilacinski, Lukas Christ, Marius Boshoff, Ioannis Iossifidis, Patrick Adler, Michael Miro, Bernd Kuhlenkötter, Christian Klaes
{"title":"Human in the collaborative loop: a strategy for integrating human activity recognition and non-invasive brain-machine interfaces to control collaborative robots.","authors":"Artur Pilacinski, Lukas Christ, Marius Boshoff, Ioannis Iossifidis, Patrick Adler, Michael Miro, Bernd Kuhlenkötter, Christian Klaes","doi":"10.3389/fnbot.2024.1383089","DOIUrl":"https://doi.org/10.3389/fnbot.2024.1383089","url":null,"abstract":"<p><p>Human activity recognition (HAR) and brain-machine interface (BMI) are two emerging technologies that can enhance human-robot collaboration (HRC) in domains such as industry or healthcare. HAR uses sensors or cameras to capture and analyze the movements and actions of humans, while BMI uses human brain signals to decode action intentions. Both technologies face challenges impacting accuracy, reliability, and usability. In this article, we review the state-of-the-art techniques and methods for HAR and BMI and highlight their strengths and limitations. We then propose a hybrid framework that fuses HAR and BMI data, which can integrate the complementary information from the brain and body motion signals and improve the performance of human state decoding. We also discuss our hybrid method's potential benefits and implications for HRC.</p>","PeriodicalId":12628,"journal":{"name":"Frontiers in Neurorobotics","volume":"18 ","pages":"1383089"},"PeriodicalIF":2.6,"publicationDate":"2024-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11458527/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142389756","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MLFGCN: short-term residential load forecasting via graph attention temporal convolution network. MLFGCN:通过图注意时间卷积网络进行短期住宅负荷预测。
IF 2.6 4区 计算机科学
Frontiers in Neurorobotics Pub Date : 2024-09-23 eCollection Date: 2024-01-01 DOI: 10.3389/fnbot.2024.1461403
Ding Feng, Dengao Li, Yu Zhou, Wei Wang
{"title":"MLFGCN: short-term residential load forecasting via graph attention temporal convolution network.","authors":"Ding Feng, Dengao Li, Yu Zhou, Wei Wang","doi":"10.3389/fnbot.2024.1461403","DOIUrl":"https://doi.org/10.3389/fnbot.2024.1461403","url":null,"abstract":"<p><strong>Introduction: </strong>Residential load forecasting is a challenging task due to the random fluctuations caused by complex correlations and individual differences. The existing short-term load forecasting models usually introduce external influencing factors such as climate and date. However, these additional information not only bring computational burden to the model, but also have uncertainty. To address these issues, we propose a novel multi-level feature fusion model based on graph attention temporal convolutional network (MLFGCN) for short-term residential load forecasting.</p><p><strong>Methods: </strong>The proposed MLFGCN model fully considers the potential long-term dependencies in a single load series and the correlations between multiple load series, and does not require any additional information to be added. Temporal convolutional network (TCN) with gating mechanism is introduced to learn potential long-term dependencies in the original load series. In addition, we design two graph attentive convolutional modules to capture potential multi-level dependencies in load data. Finally, the outputs of each module are fused through an information fusion layer to obtain the highly accurate forecasting results.</p><p><strong>Results: </strong>We conduct validation experiments on two real-world datasets. The results show that the proposed MLFGCN model achieves 0.25, 7.58% and 0.50 for MAE, MAPE and RMSE, respectively. These values are significantly better than those of baseline models.</p><p><strong>Discussion: </strong>The MLFGCN algorithm proposed in this paper can significantly improve the accuracy of short-term residential load forecasting. This is achieved through high-quality feature reconstruction, comprehensive information graph construction and spatiotemporal features capture.</p>","PeriodicalId":12628,"journal":{"name":"Frontiers in Neurorobotics","volume":"18 ","pages":"1461403"},"PeriodicalIF":2.6,"publicationDate":"2024-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11457015/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142389757","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An efficient grasping shared control architecture for unpredictable and unspecified tasks 针对不可预测和未指定任务的高效抓取共享控制架构
IF 3.1 4区 计算机科学
Frontiers in Neurorobotics Pub Date : 2024-09-11 DOI: 10.3389/fnbot.2024.1429952
Shaowen Cheng, Yongbin Jin, Yanhong Liang, Lei Jiang, Hongtao Wang
{"title":"An efficient grasping shared control architecture for unpredictable and unspecified tasks","authors":"Shaowen Cheng, Yongbin Jin, Yanhong Liang, Lei Jiang, Hongtao Wang","doi":"10.3389/fnbot.2024.1429952","DOIUrl":"https://doi.org/10.3389/fnbot.2024.1429952","url":null,"abstract":"Robot control in complex and unpredictable scenarios presents challenges such as adaptability, robustness, and human-robot interaction. These scenarios often require robots to perform tasks that involve unknown objects in unstructured environments with high levels of uncertainty. Traditional control methods, such as automatic control, may not be suitable due to their limited adaptability and reliance on prior knowledge. Human-in-the-loop method faces issues such as insufficient feedback, increased failure rates due to noise and delays, and lack of operator immersion, preventing the achievement of human-level performance. This study proposed a shared control framework to achieve a trade-off between efficiency and adaptability by combing the advantages of both teleoperation and automatic control method. The proposed approach combines the advantages of both human and automatic control methods to achieve a balance between performance and adaptability. We developed a linear model to compare three control methods and analyzed the impact of position noise and communication delays on performance. The real-world implementation of the shared control system demonstrates its effectiveness in object grasping and manipulation tasks. The results suggest that shared control can significantly improve grasping efficiency while maintaining adaptability in task execution for practical robotics applications.","PeriodicalId":12628,"journal":{"name":"Frontiers in Neurorobotics","volume":"22 1","pages":""},"PeriodicalIF":3.1,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142222781","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A novel signal channel attention network for multi-modal emotion recognition 用于多模态情感识别的新型信号通道注意力网络
IF 3.1 4区 计算机科学
Frontiers in Neurorobotics Pub Date : 2024-09-11 DOI: 10.3389/fnbot.2024.1442080
Ziang Du, Xia Ye, Pujie Zhao
{"title":"A novel signal channel attention network for multi-modal emotion recognition","authors":"Ziang Du, Xia Ye, Pujie Zhao","doi":"10.3389/fnbot.2024.1442080","DOIUrl":"https://doi.org/10.3389/fnbot.2024.1442080","url":null,"abstract":"Physiological signal recognition is crucial in emotion recognition, and recent advancements in multi-modal fusion have enabled the integration of various physiological signals for improved recognition tasks. However, current models for emotion recognition with hyper complex multi-modal signals face limitations due to fusion methods and insufficient attention mechanisms, preventing further enhancement in classification performance. To address these challenges, we propose a new model framework named Signal Channel Attention Network (SCA-Net), which comprises three main components: an encoder, an attention fusion module, and a decoder. In the attention fusion module, we developed five types of attention mechanisms inspired by existing research and performed comparative experiments using the public dataset MAHNOB-HCI. All of these experiments demonstrate the effectiveness of the attention module we addressed for our baseline model in improving both accuracy and F1 score metrics. We also conducted ablation experiments within the most effective attention fusion module to verify the benefits of multi-modal fusion. Additionally, we adjusted the training process for different attention fusion modules by employing varying early stopping parameters to prevent model overfitting.","PeriodicalId":12628,"journal":{"name":"Frontiers in Neurorobotics","volume":"10 1","pages":""},"PeriodicalIF":3.1,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142222782","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Flexible control and trajectory planning of medical two-arm surgical robot 医用双臂手术机器人的灵活控制和轨迹规划
IF 3.1 4区 计算机科学
Frontiers in Neurorobotics Pub Date : 2024-09-02 DOI: 10.3389/fnbot.2024.1451055
Yanchun Xie, Xue Zhao, Yang Jiang, Yao Wu, Hailong Yu
{"title":"Flexible control and trajectory planning of medical two-arm surgical robot","authors":"Yanchun Xie, Xue Zhao, Yang Jiang, Yao Wu, Hailong Yu","doi":"10.3389/fnbot.2024.1451055","DOIUrl":"https://doi.org/10.3389/fnbot.2024.1451055","url":null,"abstract":"This paper introduces the flexible control and trajectory planning medical two-arm surgical robots, and employs effective collision detection methods to ensure the safety and precision during tasks. Firstly, the DH method is employed to establish relative rotation matrices between coordinate systems, determining the relative relationships of each joint link. A neural network based on a multilayer perceptron is proposed to solve FKP problem in real time. Secondly, a universal interpolator based on Non-Uniform Rational B-Splines (NURBS) is developed, capable of handling any geometric shape to ensure smooth and flexible motion trajectories. Finally, we developed a generalized momentum observer to detect external collisions, eliminating the need for external sensors and thereby reducing mechanical complexity and cost. The experiments verify the effectiveness of the kinematics solution and trajectory planning, demonstrating that the improved momentum torque observer can significantly reduce system overshoot, enabling the two-arm surgical robot to perform precise and safe surgical tasks under algorithmic guidance.","PeriodicalId":12628,"journal":{"name":"Frontiers in Neurorobotics","volume":"177 1","pages":""},"PeriodicalIF":3.1,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142222785","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Target detection and classification via EfficientDet and CNN over unmanned aerial vehicles 通过 EfficientDet 和 CNN 在无人机上进行目标检测和分类
IF 3.1 4区 计算机科学
Frontiers in Neurorobotics Pub Date : 2024-08-30 DOI: 10.3389/fnbot.2024.1448538
Muhammad Ovais Yusuf, Muhammad Hanzla, Naif Al Mudawi, Touseef Sadiq, Bayan Alabdullah, Hameedur Rahman, Asaad Algarni
{"title":"Target detection and classification via EfficientDet and CNN over unmanned aerial vehicles","authors":"Muhammad Ovais Yusuf, Muhammad Hanzla, Naif Al Mudawi, Touseef Sadiq, Bayan Alabdullah, Hameedur Rahman, Asaad Algarni","doi":"10.3389/fnbot.2024.1448538","DOIUrl":"https://doi.org/10.3389/fnbot.2024.1448538","url":null,"abstract":"IntroductionAdvanced traffic monitoring systems face significant challenges in vehicle detection and classification. Conventional methods often require substantial computational resources and struggle to adapt to diverse data collection methods.MethodsThis research introduces an innovative technique for classifying and recognizing vehicles in aerial image sequences. The proposed model encompasses several phases, starting with image enhancement through noise reduction and Contrast Limited Adaptive Histogram Equalization (CLAHE). Following this, contour-based segmentation and Fuzzy C-means segmentation (FCM) are applied to identify foreground objects. Vehicle detection and identification are performed using EfficientDet. For feature extraction, Accelerated KAZE (AKAZE), Oriented FAST and Rotated BRIEF (ORB), and Scale Invariant Feature Transform (SIFT) are utilized. Object classification is achieved through a Convolutional Neural Network (CNN) and ResNet Residual Network.ResultsThe proposed method demonstrates improved performance over previous approaches. Experiments on datasets including Vehicle Aerial Imagery from a Drone (VAID) and Unmanned Aerial Vehicle Intruder Dataset (UAVID) reveal that the model achieves an accuracy of 96.6% on UAVID and 97% on VAID.DiscussionThe results indicate that the proposed model significantly enhances vehicle detection and classification in aerial images, surpassing existing methods and offering notable improvements for traffic monitoring systems.","PeriodicalId":12628,"journal":{"name":"Frontiers in Neurorobotics","volume":"27 1","pages":""},"PeriodicalIF":3.1,"publicationDate":"2024-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142222784","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Frontiers | Multi-Modal Remote Perception Learning for Object Sensory Data 物体感知数据的多模式远程感知学习前沿
IF 3.1 4区 计算机科学
Frontiers in Neurorobotics Pub Date : 2024-08-26 DOI: 10.3389/fnbot.2024.1427786
Nouf A. Almujally, Adnan A. Rafique, Naif Al Mudawi, Abdulwahab Alazeb, Mohammed Alonazi, Asaad Algarni, Ahmad Jalal, Hui Liu
{"title":"Frontiers | Multi-Modal Remote Perception Learning for Object Sensory Data","authors":"Nouf A. Almujally, Adnan A. Rafique, Naif Al Mudawi, Abdulwahab Alazeb, Mohammed Alonazi, Asaad Algarni, Ahmad Jalal, Hui Liu","doi":"10.3389/fnbot.2024.1427786","DOIUrl":"https://doi.org/10.3389/fnbot.2024.1427786","url":null,"abstract":"IntroductionWhen it comes to interpreting visual input, intelligent systems make use of contextual scene learning, which significantly improves both resilience and context awareness. The management of enormous amounts of data is a driving force behind the growing interest in computational frameworks, particularly in the context of autonomous cars.MethodThe purpose of this study is to introduce a novel approach known as Deep Fused Networks (DFN), which improves contextual scene comprehension by merging multi-object detection and semantic analysis.ResultsTo enhance accuracy and comprehension in complex situations, DFN makes use of a combination of deep learning and fusion techniques. With a minimum gain of 6.4% in accuracy for the SUN-RGB-D dataset and 3.6% for the NYU-Dv2 dataset.DiscussionFindings demonstrate considerable enhancements in object detection and semantic analysis when compared to the methodologies that are currently being utilized.","PeriodicalId":12628,"journal":{"name":"Frontiers in Neurorobotics","volume":"16 1","pages":""},"PeriodicalIF":3.1,"publicationDate":"2024-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263139","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Soft ankle exoskeleton to counteract dropfoot and excessive inversion. 柔软的踝关节外骨骼可抵御足下垂和过度内翻。
IF 2.6 4区 计算机科学
Frontiers in Neurorobotics Pub Date : 2024-08-21 eCollection Date: 2024-01-01 DOI: 10.3389/fnbot.2024.1372763
Xiaochen Zhang, Yi-Xing Liu, Ruoli Wang, Elena M Gutierrez-Farewik
{"title":"Soft ankle exoskeleton to counteract dropfoot and excessive inversion.","authors":"Xiaochen Zhang, Yi-Xing Liu, Ruoli Wang, Elena M Gutierrez-Farewik","doi":"10.3389/fnbot.2024.1372763","DOIUrl":"10.3389/fnbot.2024.1372763","url":null,"abstract":"<p><strong>Introduction: </strong>Wearable exoskeletons are emerging technologies for providing movement assistance and rehabilitation for people with motor disorders. In this study, we focus on the specific gait pathology dropfoot, which is common after a stroke. Dropfoot makes it difficult to achieve foot clearance during swing and heel contact at early stance and often necessitates compensatory movements.</p><p><strong>Methods: </strong>We developed a soft ankle exoskeleton consisting of actuation and transmission systems to assist two degrees of freedom simultaneously: dorsiflexion and eversion, then performed several proof-of-concept experiments on non-disabled persons. The actuation system consists of two motors worn on a waist belt. The transmission system provides assistive force to the medial and lateral sides of the forefoot via Bowden cables. The coupling design enables variable assistance of dorsiflexion and inversion at the same time, and a force-free controller is proposed to compensate for device resistance. We first evaluated the performance of the exoskeleton in three seated movement tests: assisting dorsiflexion and eversion, controlling plantarflexion, and compensating for device resistance, then during walking tests. In all proof-of-concept experiments, dropfoot tendency was simulated by fastening a weight to the shoe over the lateral forefoot.</p><p><strong>Results: </strong>In the first two seated tests, errors between the target and the achieved ankle joint angles in two planes were low; errors of <1.5° were achieved in assisting dorsiflexion and/or controlling plantarflexion and of <1.4° in assisting ankle eversion. The force-free controller in test three significantly compensated for the device resistance during ankle joint plantarflexion. In the gait tests, the exoskeleton was able to normalize ankle joint and foot segment kinematics, specifically foot inclination angle and ankle inversion angle at initial contact and ankle angle and clearance height during swing.</p><p><strong>Discussion: </strong>Our findings support the feasibility of the new ankle exoskeleton design in assisting two degrees of freedom at the ankle simultaneously and show its potential to assist people with dropfoot and excessive inversion.</p>","PeriodicalId":12628,"journal":{"name":"Frontiers in Neurorobotics","volume":"18 ","pages":"1372763"},"PeriodicalIF":2.6,"publicationDate":"2024-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11371749/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142132504","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
EEG-driven automatic generation of emotive music based on transformer. 基于变压器的脑电图驱动自动生成情感音乐。
IF 2.6 4区 计算机科学
Frontiers in Neurorobotics Pub Date : 2024-08-19 eCollection Date: 2024-01-01 DOI: 10.3389/fnbot.2024.1437737
Hui Jiang, Yu Chen, Di Wu, Jinlin Yan
{"title":"EEG-driven automatic generation of emotive music based on transformer.","authors":"Hui Jiang, Yu Chen, Di Wu, Jinlin Yan","doi":"10.3389/fnbot.2024.1437737","DOIUrl":"10.3389/fnbot.2024.1437737","url":null,"abstract":"<p><p>Utilizing deep features from electroencephalography (EEG) data for emotional music composition provides a novel approach for creating personalized and emotionally rich music. Compared to textual data, converting continuous EEG and music data into discrete units presents significant challenges, particularly the lack of a clear and fixed vocabulary for standardizing EEG and audio data. The lack of this standard makes the mapping relationship between EEG signals and musical elements (such as rhythm, melody, and emotion) blurry and complex. Therefore, we propose a method of using clustering to create discrete representations and using the Transformer model to reverse mapping relationships. Specifically, the model uses clustering labels to segment signals and independently encodes EEG and emotional music data to construct a vocabulary, thereby achieving discrete representation. A time series dictionary was developed using clustering algorithms, which more effectively captures and utilizes the temporal and structural relationships between EEG and audio data. In response to the insensitivity to temporal information in heterogeneous data, we adopted a multi head attention mechanism and positional encoding technology to enable the model to focus on information in different subspaces, thereby enhancing the understanding of the complex internal structure of EEG and audio data. In addition, to address the mismatch between local and global information in emotion driven music generation, we introduce an audio masking prediction loss learning method. Our method generates music that <i>Hits@</i>20 On the indicator, a performance of 68.19% was achieved, which improved the score by 4.9% compared to other methods, indicating the effectiveness of this method.</p>","PeriodicalId":12628,"journal":{"name":"Frontiers in Neurorobotics","volume":"18 ","pages":"1437737"},"PeriodicalIF":2.6,"publicationDate":"2024-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11366740/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142119502","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信