{"title":"Multi-scale spatiotemporal representation learning for EEG-based emotion recognition","authors":"Xin Zhou, Xiaojing Peng","doi":"arxiv-2409.07589","DOIUrl":null,"url":null,"abstract":"EEG-based emotion recognition holds significant potential in the field of\nbrain-computer interfaces. A key challenge lies in extracting discriminative\nspatiotemporal features from electroencephalogram (EEG) signals. Existing\nstudies often rely on domain-specific time-frequency features and analyze\ntemporal dependencies and spatial characteristics separately, neglecting the\ninteraction between local-global relationships and spatiotemporal dynamics. To\naddress this, we propose a novel network called Multi-Scale Inverted Mamba\n(MS-iMamba), which consists of Multi-Scale Temporal Blocks (MSTB) and\nTemporal-Spatial Fusion Blocks (TSFB). Specifically, MSTBs are designed to\ncapture both local details and global temporal dependencies across different\nscale subsequences. The TSFBs, implemented with an inverted Mamba structure,\nfocus on the interaction between dynamic temporal dependencies and spatial\ncharacteristics. The primary advantage of MS-iMamba lies in its ability to\nleverage reconstructed multi-scale EEG sequences, exploiting the interaction\nbetween temporal and spatial features without the need for domain-specific\ntime-frequency feature extraction. Experimental results on the DEAP, DREAMER,\nand SEED datasets demonstrate that MS-iMamba achieves classification accuracies\nof 94.86%, 94.94%, and 91.36%, respectively, using only four-channel EEG\nsignals, outperforming state-of-the-art methods.","PeriodicalId":501034,"journal":{"name":"arXiv - EE - Signal Processing","volume":"10 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - EE - Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.07589","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 0
Abstract
EEG-based emotion recognition holds significant potential for brain-computer interfaces. A key challenge lies in extracting discriminative spatiotemporal features from electroencephalogram (EEG) signals. Existing studies often rely on domain-specific time-frequency features and analyze temporal dependencies and spatial characteristics separately, neglecting the interaction between local-global relationships and spatiotemporal dynamics. To address this, we propose a novel network called Multi-Scale Inverted Mamba (MS-iMamba), which consists of Multi-Scale Temporal Blocks (MSTB) and Temporal-Spatial Fusion Blocks (TSFB). Specifically, the MSTBs are designed to capture both local details and global temporal dependencies across subsequences at different scales. The TSFBs, implemented with an inverted Mamba structure, focus on the interaction between dynamic temporal dependencies and spatial characteristics. The primary advantage of MS-iMamba is that it operates on reconstructed multi-scale EEG sequences, exploiting the interaction between temporal and spatial features without requiring domain-specific time-frequency feature extraction. Experiments on the DEAP, DREAMER, and SEED datasets demonstrate that MS-iMamba achieves classification accuracies of 94.86%, 94.94%, and 91.36%, respectively, using only four-channel EEG signals, outperforming state-of-the-art methods.
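
To make the two ideas in the abstract concrete, below is a minimal, hypothetical PyTorch sketch of (1) multi-scale temporal subsequences obtained by downsampling the raw EEG, and (2) an "inverted" layout in which each EEG channel's series becomes a token so that cross-channel (spatial) interactions are modeled directly. The abstract does not specify the architecture's internals, so all module names, sizes, and the attention layer standing in for the inverted Mamba block are assumptions, not the authors' implementation (a real Mamba layer would come from a package such as mamba_ssm).

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class MultiScaleInvertedSketch(nn.Module):
    """Conceptual sketch only: downsample raw EEG to several temporal
    scales, embed each channel's (downsampled) series as one token, and
    mix tokens across channels before classifying."""

    def __init__(self, n_channels=4, seq_len=128, d_model=64,
                 scales=(1, 2, 4), n_classes=2):
        super().__init__()
        self.scales = scales
        # One linear embedding per scale: (seq_len // s) samples -> d_model.
        self.embeds = nn.ModuleList(
            nn.Linear(seq_len // s, d_model) for s in scales
        )
        # Placeholder cross-channel mixer; stands in for the inverted
        # Mamba block described in the abstract.
        self.mixer = nn.MultiheadAttention(d_model, num_heads=4,
                                           batch_first=True)
        self.norm = nn.LayerNorm(d_model)
        self.head = nn.Linear(d_model, n_classes)

    def forward(self, x):
        # x: (batch, n_channels, seq_len) raw EEG, no time-frequency features.
        tokens = []
        for s, embed in zip(self.scales, self.embeds):
            xs = F.avg_pool1d(x, kernel_size=s) if s > 1 else x
            tokens.append(embed(xs))          # (batch, n_channels, d_model)
        tokens = torch.cat(tokens, dim=1)     # channel tokens from all scales
        mixed, _ = self.mixer(tokens, tokens, tokens)
        tokens = self.norm(tokens + mixed)    # residual cross-channel mixing
        return self.head(tokens.mean(dim=1))  # (batch, n_classes)


# Usage: a batch of 8 four-channel EEG windows, 128 samples each.
model = MultiScaleInvertedSketch()
logits = model(torch.randn(8, 4, 128))
print(logits.shape)  # torch.Size([8, 2])
```

Note the design choice the sketch illustrates: because tokens are channels rather than time steps, a four-channel montage yields only a handful of tokens per scale, which is consistent with the abstract's claim that competitive accuracy is reached from four-channel raw EEG without hand-crafted time-frequency features.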