基于cbam - lstm -注意力的人类情绪识别

IF 4.9 2区医学 Q1 ENGINEERING, BIOMEDICAL

Biomedical Signal Processing and Control Pub Date : 2025-10-08 DOI:10.1016/j.bspc.2025.108767

Jingqi Le , Yanghui Wang , Yong Zhou , Sheng Zou

{"title":"基于cbam - lstm -注意力的人类情绪识别","authors":"Jingqi Le , Yanghui Wang , Yong Zhou , Sheng Zou","doi":"10.1016/j.bspc.2025.108767","DOIUrl":null,"url":null,"abstract":"<div><div>Human emotion recognition seeks to facilitate machines in comprehending human emotional states. EEG signals, being non-invasive measurements, are among the signals that most accurately represent human emotions. Nevertheless, the abundance of channels in EEG signals leads to heightened computational complexity and a greater risk of overfitting the model. In this research, eXtreme Gradient Boosting (XGBoost) was employed to choose the suitable EEG channel, and a unique model named convolutional block attention module − long short-term memory − attention module (CBAM-LSTM-Attention) was introduced for emotion recognition. The model combines residual blocks, LSTM networks, and attention mechanisms to efficiently incorporate important channel-spatial and temporal domain features of EEG signals. To achieve high prediction accuracy in emotion recognition, the following innovations can be implemented: (i) Integrating XGBoost algorithm to evaluate each channel based on power spectral density (PSD), and selecting the channel with the highest score as the input for the model. This approach reduces the computational complexity of the model and minimises the risk of overfitting. (ii) Introducing a channel-spatial attention module in the residual block to enhance the model’s ability to extract channel-spatial domain features in the convolutional block attention module (CBAM) model. (iii) Utilising a multi-head attention mechanism to improve the model’s capability to extract temporal domain features, enabling global feature perception and input to the LSTM layer for decoding. The results indicated that the proposed CBAM-LSTM-Attention model achieved 95.108 % accuracy for arousal and 94.862 % for valence on the DEAP dataset using single-channel data. Using multi-channel data, the model achieved 98.790 % for arousal and 97.249 %for valence. This suggests that the model effectively enables correct classification of human emotion recognition.</div></div>","PeriodicalId":55362,"journal":{"name":"Biomedical Signal Processing and Control","volume":"112 ","pages":"Article 108767"},"PeriodicalIF":4.9000,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"CBAM-LSTM-attention enabled human emotion recognition using EEG signals\",\"authors\":\"Jingqi Le , Yanghui Wang , Yong Zhou , Sheng Zou\",\"doi\":\"10.1016/j.bspc.2025.108767\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Human emotion recognition seeks to facilitate machines in comprehending human emotional states. EEG signals, being non-invasive measurements, are among the signals that most accurately represent human emotions. Nevertheless, the abundance of channels in EEG signals leads to heightened computational complexity and a greater risk of overfitting the model. In this research, eXtreme Gradient Boosting (XGBoost) was employed to choose the suitable EEG channel, and a unique model named convolutional block attention module − long short-term memory − attention module (CBAM-LSTM-Attention) was introduced for emotion recognition. The model combines residual blocks, LSTM networks, and attention mechanisms to efficiently incorporate important channel-spatial and temporal domain features of EEG signals. To achieve high prediction accuracy in emotion recognition, the following innovations can be implemented: (i) Integrating XGBoost algorithm to evaluate each channel based on power spectral density (PSD), and selecting the channel with the highest score as the input for the model. This approach reduces the computational complexity of the model and minimises the risk of overfitting. (ii) Introducing a channel-spatial attention module in the residual block to enhance the model’s ability to extract channel-spatial domain features in the convolutional block attention module (CBAM) model. (iii) Utilising a multi-head attention mechanism to improve the model’s capability to extract temporal domain features, enabling global feature perception and input to the LSTM layer for decoding. The results indicated that the proposed CBAM-LSTM-Attention model achieved 95.108 % accuracy for arousal and 94.862 % for valence on the DEAP dataset using single-channel data. Using multi-channel data, the model achieved 98.790 % for arousal and 97.249 %for valence. This suggests that the model effectively enables correct classification of human emotion recognition.</div></div>\",\"PeriodicalId\":55362,\"journal\":{\"name\":\"Biomedical Signal Processing and Control\",\"volume\":\"112 \",\"pages\":\"Article 108767\"},\"PeriodicalIF\":4.9000,\"publicationDate\":\"2025-10-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Biomedical Signal Processing and Control\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1746809425012789\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, BIOMEDICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Signal Processing and Control","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1746809425012789","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}

引用次数: 0

摘要

人类情感识别旨在帮助机器理解人类的情感状态。脑电图信号是一种非侵入性测量，是最准确地代表人类情绪的信号之一。然而，脑电图信号中丰富的通道导致计算复杂度增加和模型过拟合的风险增大。本研究采用极限梯度增强（XGBoost）方法选择合适的脑电通道，并引入卷积块注意模块-长短期记忆-注意模块（CBAM-LSTM-Attention）模型进行情绪识别。该模型结合残差块、LSTM网络和注意机制，有效地融合了脑电信号的重要通道空间和时域特征。为了在情绪识别中实现较高的预测精度，可以实现以下创新：(i)集成XGBoost算法，基于功率谱密度（power spectral density， PSD）对各个通道进行评估，选择得分最高的通道作为模型的输入。这种方法降低了模型的计算复杂度，最大限度地降低了过拟合的风险。（ii）在残差块中引入通道空间注意模块，增强卷积块注意模块（CBAM）模型提取通道空间域特征的能力。（iii）利用多头注意机制提高模型提取时域特征的能力，实现全局特征感知并输入到LSTM层进行解码。结果表明，所提出的CBAM-LSTM-Attention模型在单通道DEAP数据集上的唤醒准确率为95.108%，效价准确率为94.862%。使用多通道数据，该模型的唤醒率达到98.790%，效价率达到97.249%。这表明该模型有效地实现了人类情感识别的正确分类。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

CBAM-LSTM-attention enabled human emotion recognition using EEG signals

Human emotion recognition seeks to facilitate machines in comprehending human emotional states. EEG signals, being non-invasive measurements, are among the signals that most accurately represent human emotions. Nevertheless, the abundance of channels in EEG signals leads to heightened computational complexity and a greater risk of overfitting the model. In this research, eXtreme Gradient Boosting (XGBoost) was employed to choose the suitable EEG channel, and a unique model named convolutional block attention module − long short-term memory − attention module (CBAM-LSTM-Attention) was introduced for emotion recognition. The model combines residual blocks, LSTM networks, and attention mechanisms to efficiently incorporate important channel-spatial and temporal domain features of EEG signals. To achieve high prediction accuracy in emotion recognition, the following innovations can be implemented: (i) Integrating XGBoost algorithm to evaluate each channel based on power spectral density (PSD), and selecting the channel with the highest score as the input for the model. This approach reduces the computational complexity of the model and minimises the risk of overfitting. (ii) Introducing a channel-spatial attention module in the residual block to enhance the model’s ability to extract channel-spatial domain features in the convolutional block attention module (CBAM) model. (iii) Utilising a multi-head attention mechanism to improve the model’s capability to extract temporal domain features, enabling global feature perception and input to the LSTM layer for decoding. The results indicated that the proposed CBAM-LSTM-Attention model achieved 95.108 % accuracy for arousal and 94.862 % for valence on the DEAP dataset using single-channel data. Using multi-channel data, the model achieved 98.790 % for arousal and 97.249 %for valence. This suggests that the model effectively enables correct classification of human emotion recognition.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Biomedical Signal Processing and Control 工程技术-工程：生物医学

CiteScore

9.80

自引率

13.70%

发文量

822

审稿时长

4 months

期刊介绍： Biomedical Signal Processing and Control aims to provide a cross-disciplinary international forum for the interchange of information on research in the measurement and analysis of signals and images in clinical medicine and the biological sciences. Emphasis is placed on contributions dealing with the practical, applications-led research on the use of methods and devices in clinical diagnosis, patient monitoring and management. Biomedical Signal Processing and Control reflects the main areas in which these methods are being used and developed at the interface of both engineering and clinical science. The scope of the journal is defined to include relevant review papers, technical notes, short communications and letters. Tutorial papers and special issues will also be published.