{"title":"Mtfsfn:一种基于脑电图的多视点时频空间融合网络。","authors":"Zhongmin Wang, Shengyang Gao","doi":"10.1007/s11571-025-10342-x","DOIUrl":null,"url":null,"abstract":"<p><p>Over the recent years, emotion recognition based on electroencephalogram (EEG) has emerged as a prominent research area. Nevertheless, EEG signals present spatially discrete and non-stationary characteristics, to represent spatiotemporal information and extract more discriminative features from complex signals is still a challenge. This study proposed a multi-view time-frequency-space fusion network, referred to as MTFSFN. To effectively utilize complementary information from different frequency bands, we employ a frequency-domain attention mechanism to allocate weights to features of different frequency bands. A multi-view Transformer model was designed, integrating Transformer with two-dimensional positional embeddings to extract discrete spatial information. Following the fusion of multi-view features, we utilize LSTM to capture dynamic time-frequency-space relationships. Finally, a subject-independent leave-one-subject-out cross-validation strategy was used to validate extensively on three public datasets, DEAP, SEED, and SEED-IV. On the DEAP dataset, the average accuracies of valence and arousal are 78.64% and 77.42%, respectively. On the SEED dataset, the average accuracy is 86.91%. On the SEED-IV dataset, the average accuracy is 75.51%. The experimental results show that the proposed MTFSFN model achieves excellent recognition performance.</p>","PeriodicalId":10500,"journal":{"name":"Cognitive Neurodynamics","volume":"19 1","pages":"155"},"PeriodicalIF":3.9000,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12476352/pdf/","citationCount":"0","resultStr":"{\"title\":\"Mtfsfn: a multi-view time-frequency-space fusion network for EEG-based emotion recognition.\",\"authors\":\"Zhongmin Wang, Shengyang Gao\",\"doi\":\"10.1007/s11571-025-10342-x\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Over the recent years, emotion recognition based on electroencephalogram (EEG) has emerged as a prominent research area. Nevertheless, EEG signals present spatially discrete and non-stationary characteristics, to represent spatiotemporal information and extract more discriminative features from complex signals is still a challenge. This study proposed a multi-view time-frequency-space fusion network, referred to as MTFSFN. To effectively utilize complementary information from different frequency bands, we employ a frequency-domain attention mechanism to allocate weights to features of different frequency bands. A multi-view Transformer model was designed, integrating Transformer with two-dimensional positional embeddings to extract discrete spatial information. Following the fusion of multi-view features, we utilize LSTM to capture dynamic time-frequency-space relationships. Finally, a subject-independent leave-one-subject-out cross-validation strategy was used to validate extensively on three public datasets, DEAP, SEED, and SEED-IV. On the DEAP dataset, the average accuracies of valence and arousal are 78.64% and 77.42%, respectively. On the SEED dataset, the average accuracy is 86.91%. On the SEED-IV dataset, the average accuracy is 75.51%. The experimental results show that the proposed MTFSFN model achieves excellent recognition performance.</p>\",\"PeriodicalId\":10500,\"journal\":{\"name\":\"Cognitive Neurodynamics\",\"volume\":\"19 1\",\"pages\":\"155\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2025-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12476352/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Cognitive Neurodynamics\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1007/s11571-025-10342-x\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/9/27 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q2\",\"JCRName\":\"NEUROSCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cognitive Neurodynamics","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1007/s11571-025-10342-x","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/9/27 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"NEUROSCIENCES","Score":null,"Total":0}
Mtfsfn: a multi-view time-frequency-space fusion network for EEG-based emotion recognition.
Over the recent years, emotion recognition based on electroencephalogram (EEG) has emerged as a prominent research area. Nevertheless, EEG signals present spatially discrete and non-stationary characteristics, to represent spatiotemporal information and extract more discriminative features from complex signals is still a challenge. This study proposed a multi-view time-frequency-space fusion network, referred to as MTFSFN. To effectively utilize complementary information from different frequency bands, we employ a frequency-domain attention mechanism to allocate weights to features of different frequency bands. A multi-view Transformer model was designed, integrating Transformer with two-dimensional positional embeddings to extract discrete spatial information. Following the fusion of multi-view features, we utilize LSTM to capture dynamic time-frequency-space relationships. Finally, a subject-independent leave-one-subject-out cross-validation strategy was used to validate extensively on three public datasets, DEAP, SEED, and SEED-IV. On the DEAP dataset, the average accuracies of valence and arousal are 78.64% and 77.42%, respectively. On the SEED dataset, the average accuracy is 86.91%. On the SEED-IV dataset, the average accuracy is 75.51%. The experimental results show that the proposed MTFSFN model achieves excellent recognition performance.
期刊介绍:
Cognitive Neurodynamics provides a unique forum of communication and cooperation for scientists and engineers working in the field of cognitive neurodynamics, intelligent science and applications, bridging the gap between theory and application, without any preference for pure theoretical, experimental or computational models.
The emphasis is to publish original models of cognitive neurodynamics, novel computational theories and experimental results. In particular, intelligent science inspired by cognitive neuroscience and neurodynamics is also very welcome.
The scope of Cognitive Neurodynamics covers cognitive neuroscience, neural computation based on dynamics, computer science, intelligent science as well as their interdisciplinary applications in the natural and engineering sciences. Papers that are appropriate for non-specialist readers are encouraged.
1. There is no page limit for manuscripts submitted to Cognitive Neurodynamics. Research papers should clearly represent an important advance of especially broad interest to researchers and technologists in neuroscience, biophysics, BCI, neural computer and intelligent robotics.
2. Cognitive Neurodynamics also welcomes brief communications: short papers reporting results that are of genuinely broad interest but that for one reason and another do not make a sufficiently complete story to justify a full article publication. Brief Communications should consist of approximately four manuscript pages.
3. Cognitive Neurodynamics publishes review articles in which a specific field is reviewed through an exhaustive literature survey. There are no restrictions on the number of pages. Review articles are usually invited, but submitted reviews will also be considered.