Mtfsfn：一种基于脑电图的多视点时频空间融合网络。

IF 3.9 3区工程技术 Q2 NEUROSCIENCES

Cognitive Neurodynamics Pub Date : 2025-12-01 Epub Date: 2025-09-27 DOI:10.1007/s11571-025-10342-x

Zhongmin Wang, Shengyang Gao

{"title":"Mtfsfn：一种基于脑电图的多视点时频空间融合网络。","authors":"Zhongmin Wang, Shengyang Gao","doi":"10.1007/s11571-025-10342-x","DOIUrl":null,"url":null,"abstract":"Over the recent years, emotion recognition based on electroencephalogram (EEG) has emerged as a prominent research area. Nevertheless, EEG signals present spatially discrete and non-stationary characteristics, to represent spatiotemporal information and extract more discriminative features from complex signals is still a challenge. This study proposed a multi-view time-frequency-space fusion network, referred to as MTFSFN. To effectively utilize complementary information from different frequency bands, we employ a frequency-domain attention mechanism to allocate weights to features of different frequency bands. A multi-view Transformer model was designed, integrating Transformer with two-dimensional positional embeddings to extract discrete spatial information. Following the fusion of multi-view features, we utilize LSTM to capture dynamic time-frequency-space relationships. Finally, a subject-independent leave-one-subject-out cross-validation strategy was used to validate extensively on three public datasets, DEAP, SEED, and SEED-IV. On the DEAP dataset, the average accuracies of valence and arousal are 78.64% and 77.42%, respectively. On the SEED dataset, the average accuracy is 86.91%. On the SEED-IV dataset, the average accuracy is 75.51%. The experimental results show that the proposed MTFSFN model achieves excellent recognition performance.","PeriodicalId":10500,"journal":{"name":"Cognitive Neurodynamics","volume":"19 1","pages":"155"},"PeriodicalIF":3.9000,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12476352/pdf/","citationCount":"0","resultStr":"{\"title\":\"Mtfsfn: a multi-view time-frequency-space fusion network for EEG-based emotion recognition.\",\"authors\":\"Zhongmin Wang, Shengyang Gao\",\"doi\":\"10.1007/s11571-025-10342-x\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Over the recent years, emotion recognition based on electroencephalogram (EEG) has emerged as a prominent research area. Nevertheless, EEG signals present spatially discrete and non-stationary characteristics, to represent spatiotemporal information and extract more discriminative features from complex signals is still a challenge. This study proposed a multi-view time-frequency-space fusion network, referred to as MTFSFN. To effectively utilize complementary information from different frequency bands, we employ a frequency-domain attention mechanism to allocate weights to features of different frequency bands. A multi-view Transformer model was designed, integrating Transformer with two-dimensional positional embeddings to extract discrete spatial information. Following the fusion of multi-view features, we utilize LSTM to capture dynamic time-frequency-space relationships. Finally, a subject-independent leave-one-subject-out cross-validation strategy was used to validate extensively on three public datasets, DEAP, SEED, and SEED-IV. On the DEAP dataset, the average accuracies of valence and arousal are 78.64% and 77.42%, respectively. On the SEED dataset, the average accuracy is 86.91%. On the SEED-IV dataset, the average accuracy is 75.51%. The experimental results show that the proposed MTFSFN model achieves excellent recognition performance.\",\"PeriodicalId\":10500,\"journal\":{\"name\":\"Cognitive Neurodynamics\",\"volume\":\"19 1\",\"pages\":\"155\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2025-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12476352/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Cognitive Neurodynamics\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1007/s11571-025-10342-x\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/9/27 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q2\",\"JCRName\":\"NEUROSCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cognitive Neurodynamics","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1007/s11571-025-10342-x","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/9/27 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"NEUROSCIENCES","Score":null,"Total":0}

引用次数: 0

摘要

近年来，基于脑电图（EEG）的情绪识别已成为一个突出的研究领域。然而，脑电图信号具有空间离散和非平稳的特点，如何从复杂信号中表达时空信息并提取更多的判别特征仍然是一个挑战。本研究提出了一种多视点时频空融合网络，简称MTFSFN。为了有效利用不同频带的互补信息，我们采用频域注意机制对不同频带的特征分配权重。设计了一个多视图Transformer模型，将Transformer与二维位置嵌入相结合，提取离散空间信息。在融合多视图特征后，利用LSTM捕获动态时频空间关系。最后，采用主题独立的“留一个主题”交叉验证策略，在三个公共数据集（DEAP、SEED和SEED- iv）上进行了广泛的验证。在DEAP数据集上，效价和唤醒的平均准确率分别为78.64%和77.42%。在SEED数据集上，平均准确率为86.91%。在SEED-IV数据集上，平均准确率为75.51%。实验结果表明，所提出的MTFSFN模型具有良好的识别性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Mtfsfn: a multi-view time-frequency-space fusion network for EEG-based emotion recognition.

Over the recent years, emotion recognition based on electroencephalogram (EEG) has emerged as a prominent research area. Nevertheless, EEG signals present spatially discrete and non-stationary characteristics, to represent spatiotemporal information and extract more discriminative features from complex signals is still a challenge. This study proposed a multi-view time-frequency-space fusion network, referred to as MTFSFN. To effectively utilize complementary information from different frequency bands, we employ a frequency-domain attention mechanism to allocate weights to features of different frequency bands. A multi-view Transformer model was designed, integrating Transformer with two-dimensional positional embeddings to extract discrete spatial information. Following the fusion of multi-view features, we utilize LSTM to capture dynamic time-frequency-space relationships. Finally, a subject-independent leave-one-subject-out cross-validation strategy was used to validate extensively on three public datasets, DEAP, SEED, and SEED-IV. On the DEAP dataset, the average accuracies of valence and arousal are 78.64% and 77.42%, respectively. On the SEED dataset, the average accuracy is 86.91%. On the SEED-IV dataset, the average accuracy is 75.51%. The experimental results show that the proposed MTFSFN model achieves excellent recognition performance.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Cognitive Neurodynamics 医学-神经科学

CiteScore

6.90

自引率

18.90%

发文量

140

审稿时长

12 months

期刊介绍： Cognitive Neurodynamics provides a unique forum of communication and cooperation for scientists and engineers working in the field of cognitive neurodynamics, intelligent science and applications, bridging the gap between theory and application, without any preference for pure theoretical, experimental or computational models. The emphasis is to publish original models of cognitive neurodynamics, novel computational theories and experimental results. In particular, intelligent science inspired by cognitive neuroscience and neurodynamics is also very welcome. The scope of Cognitive Neurodynamics covers cognitive neuroscience, neural computation based on dynamics, computer science, intelligent science as well as their interdisciplinary applications in the natural and engineering sciences. Papers that are appropriate for non-specialist readers are encouraged. 1. There is no page limit for manuscripts submitted to Cognitive Neurodynamics. Research papers should clearly represent an important advance of especially broad interest to researchers and technologists in neuroscience, biophysics, BCI, neural computer and intelligent robotics. 2. Cognitive Neurodynamics also welcomes brief communications: short papers reporting results that are of genuinely broad interest but that for one reason and another do not make a sufficiently complete story to justify a full article publication. Brief Communications should consist of approximately four manuscript pages. 3. Cognitive Neurodynamics publishes review articles in which a specific field is reviewed through an exhaustive literature survey. There are no restrictions on the number of pages. Review articles are usually invited, but submitted reviews will also be considered.