基于cbam - lstm -注意力的人类情绪识别

IF 4.9 2区 医学 Q1 ENGINEERING, BIOMEDICAL
Jingqi Le , Yanghui Wang , Yong Zhou , Sheng Zou
{"title":"基于cbam - lstm -注意力的人类情绪识别","authors":"Jingqi Le ,&nbsp;Yanghui Wang ,&nbsp;Yong Zhou ,&nbsp;Sheng Zou","doi":"10.1016/j.bspc.2025.108767","DOIUrl":null,"url":null,"abstract":"<div><div>Human emotion recognition seeks to facilitate machines in comprehending human emotional states. EEG signals, being non-invasive measurements, are among the signals that most accurately represent human emotions. Nevertheless, the abundance of channels in EEG signals leads to heightened computational complexity and a greater risk of overfitting the model. In this research, eXtreme Gradient Boosting (XGBoost) was employed to choose the suitable EEG channel, and a unique model named convolutional block attention module − long short-term memory − attention module (CBAM-LSTM-Attention) was introduced for emotion recognition. The model combines residual blocks, LSTM networks, and attention mechanisms to efficiently incorporate important channel-spatial and temporal domain features of EEG signals. To achieve high prediction accuracy in emotion recognition, the following innovations can be implemented: (i) Integrating XGBoost algorithm to evaluate each channel based on power spectral density (PSD), and selecting the channel with the highest score as the input for the model. This approach reduces the computational complexity of the model and minimises the risk of overfitting. (ii) Introducing a channel-spatial attention module in the residual block to enhance the model’s ability to extract channel-spatial domain features in the convolutional block attention module (CBAM) model. (iii) Utilising a multi-head attention mechanism to improve the model’s capability to extract temporal domain features, enabling global feature perception and input to the LSTM layer for decoding. The results indicated that the proposed CBAM-LSTM-Attention model achieved 95.108 % accuracy for arousal and 94.862 % for valence on the DEAP dataset using single-channel data. Using multi-channel data, the model achieved 98.790 % for arousal and 97.249 %for valence. This suggests that the model effectively enables correct classification of human emotion recognition.</div></div>","PeriodicalId":55362,"journal":{"name":"Biomedical Signal Processing and Control","volume":"112 ","pages":"Article 108767"},"PeriodicalIF":4.9000,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"CBAM-LSTM-attention enabled human emotion recognition using EEG signals\",\"authors\":\"Jingqi Le ,&nbsp;Yanghui Wang ,&nbsp;Yong Zhou ,&nbsp;Sheng Zou\",\"doi\":\"10.1016/j.bspc.2025.108767\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Human emotion recognition seeks to facilitate machines in comprehending human emotional states. EEG signals, being non-invasive measurements, are among the signals that most accurately represent human emotions. Nevertheless, the abundance of channels in EEG signals leads to heightened computational complexity and a greater risk of overfitting the model. In this research, eXtreme Gradient Boosting (XGBoost) was employed to choose the suitable EEG channel, and a unique model named convolutional block attention module − long short-term memory − attention module (CBAM-LSTM-Attention) was introduced for emotion recognition. The model combines residual blocks, LSTM networks, and attention mechanisms to efficiently incorporate important channel-spatial and temporal domain features of EEG signals. To achieve high prediction accuracy in emotion recognition, the following innovations can be implemented: (i) Integrating XGBoost algorithm to evaluate each channel based on power spectral density (PSD), and selecting the channel with the highest score as the input for the model. This approach reduces the computational complexity of the model and minimises the risk of overfitting. (ii) Introducing a channel-spatial attention module in the residual block to enhance the model’s ability to extract channel-spatial domain features in the convolutional block attention module (CBAM) model. (iii) Utilising a multi-head attention mechanism to improve the model’s capability to extract temporal domain features, enabling global feature perception and input to the LSTM layer for decoding. The results indicated that the proposed CBAM-LSTM-Attention model achieved 95.108 % accuracy for arousal and 94.862 % for valence on the DEAP dataset using single-channel data. Using multi-channel data, the model achieved 98.790 % for arousal and 97.249 %for valence. This suggests that the model effectively enables correct classification of human emotion recognition.</div></div>\",\"PeriodicalId\":55362,\"journal\":{\"name\":\"Biomedical Signal Processing and Control\",\"volume\":\"112 \",\"pages\":\"Article 108767\"},\"PeriodicalIF\":4.9000,\"publicationDate\":\"2025-10-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Biomedical Signal Processing and Control\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1746809425012789\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, BIOMEDICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Signal Processing and Control","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1746809425012789","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0

摘要

人类情感识别旨在帮助机器理解人类的情感状态。脑电图信号是一种非侵入性测量,是最准确地代表人类情绪的信号之一。然而,脑电图信号中丰富的通道导致计算复杂度增加和模型过拟合的风险增大。本研究采用极限梯度增强(XGBoost)方法选择合适的脑电通道,并引入卷积块注意模块-长短期记忆-注意模块(CBAM-LSTM-Attention)模型进行情绪识别。该模型结合残差块、LSTM网络和注意机制,有效地融合了脑电信号的重要通道空间和时域特征。为了在情绪识别中实现较高的预测精度,可以实现以下创新:(i)集成XGBoost算法,基于功率谱密度(power spectral density, PSD)对各个通道进行评估,选择得分最高的通道作为模型的输入。这种方法降低了模型的计算复杂度,最大限度地降低了过拟合的风险。(ii)在残差块中引入通道空间注意模块,增强卷积块注意模块(CBAM)模型提取通道空间域特征的能力。(iii)利用多头注意机制提高模型提取时域特征的能力,实现全局特征感知并输入到LSTM层进行解码。结果表明,所提出的CBAM-LSTM-Attention模型在单通道DEAP数据集上的唤醒准确率为95.108%,效价准确率为94.862%。使用多通道数据,该模型的唤醒率达到98.790%,效价率达到97.249%。这表明该模型有效地实现了人类情感识别的正确分类。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
CBAM-LSTM-attention enabled human emotion recognition using EEG signals
Human emotion recognition seeks to facilitate machines in comprehending human emotional states. EEG signals, being non-invasive measurements, are among the signals that most accurately represent human emotions. Nevertheless, the abundance of channels in EEG signals leads to heightened computational complexity and a greater risk of overfitting the model. In this research, eXtreme Gradient Boosting (XGBoost) was employed to choose the suitable EEG channel, and a unique model named convolutional block attention module − long short-term memory − attention module (CBAM-LSTM-Attention) was introduced for emotion recognition. The model combines residual blocks, LSTM networks, and attention mechanisms to efficiently incorporate important channel-spatial and temporal domain features of EEG signals. To achieve high prediction accuracy in emotion recognition, the following innovations can be implemented: (i) Integrating XGBoost algorithm to evaluate each channel based on power spectral density (PSD), and selecting the channel with the highest score as the input for the model. This approach reduces the computational complexity of the model and minimises the risk of overfitting. (ii) Introducing a channel-spatial attention module in the residual block to enhance the model’s ability to extract channel-spatial domain features in the convolutional block attention module (CBAM) model. (iii) Utilising a multi-head attention mechanism to improve the model’s capability to extract temporal domain features, enabling global feature perception and input to the LSTM layer for decoding. The results indicated that the proposed CBAM-LSTM-Attention model achieved 95.108 % accuracy for arousal and 94.862 % for valence on the DEAP dataset using single-channel data. Using multi-channel data, the model achieved 98.790 % for arousal and 97.249 %for valence. This suggests that the model effectively enables correct classification of human emotion recognition.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Biomedical Signal Processing and Control
Biomedical Signal Processing and Control 工程技术-工程:生物医学
CiteScore
9.80
自引率
13.70%
发文量
822
审稿时长
4 months
期刊介绍: Biomedical Signal Processing and Control aims to provide a cross-disciplinary international forum for the interchange of information on research in the measurement and analysis of signals and images in clinical medicine and the biological sciences. Emphasis is placed on contributions dealing with the practical, applications-led research on the use of methods and devices in clinical diagnosis, patient monitoring and management. Biomedical Signal Processing and Control reflects the main areas in which these methods are being used and developed at the interface of both engineering and clinical science. The scope of the journal is defined to include relevant review papers, technical notes, short communications and letters. Tutorial papers and special issues will also be published.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信