mst -eegnet:基于初始和时间金字塔池的多尺度时空特征提取用于运动图像分类。

IF 3.9 3区 工程技术 Q2 NEUROSCIENCES
Cognitive Neurodynamics Pub Date : 2025-12-01 Epub Date: 2025-09-20 DOI:10.1007/s11571-025-10337-8
Rashmi Mishra, R K Agrawal, Jyoti Singh Kirar
{"title":"mst -eegnet:基于初始和时间金字塔池的多尺度时空特征提取用于运动图像分类。","authors":"Rashmi Mishra, R K Agrawal, Jyoti Singh Kirar","doi":"10.1007/s11571-025-10337-8","DOIUrl":null,"url":null,"abstract":"<p><p>Motor imagery classification is an essential component of Brain-computer interface systems to interpret and recognize brain signals generated during the visualization of motor imagery tasks by a subject. The objective of this work is to develop a novel DL model to extract discriminative features for better generalization performance to recognize motor imagery tasks. This paper presents a novel Multi-scale spatio-temporal network (MSST-EEGNet) to extract discriminative temporal, spectral, and spatial features for motor imagery task classification. The proposed MSST-EEGNet model includes three modules namely the inception module with dilated convolution, the temporal pyramid pooling module, and the classification module. Multi-scale temporal features along with spatial features are extracted using the inception block with the dilated convolution module. A set of multi-level fine-grained and coarse-grained features are extracted using a temporal pyramid pooling module. Further, categorical cross-entropy in combination with center loss is used as a loss function. Experiments are carried out on three benchmark datasets including the BCI Competition IV-2a dataset, the BCI Competition IV-2b dataset, and the OpenBMI dataset. The evaluation results shows that the proposed MSST-EEGNet model outperforms eight existing DL models in terms of classification accuracy for subject-specific and cross-session settings. It also outperforms eight existing DL models and six existing transfer-learning models for cross-subject setting. For the subject-specific classification the proposed MSST-EEGNet model achieved an accuracy of 0.8426 ± 0.1061, 0.7779 ± 0.0938, and 0.7365 ± 0.1477 on the BCI Competition IV-2a dataset, the BCI Competition IV-2b dataset, and the OpenBMI dataset respectively. For the cross-session setting, the proposed MSST-EEGNet model achieved an accuracy of 0.7709 ± 0.1098, 0.7524 ± 0.1017, and 0.6860 ± 0.0990 on the BCI Competition IV-2a dataset, the BCI Competition IV-2b dataset, and the OpenBMI dataset respectively. For the cross-subject setting, the proposed MSST-EEGNet model achieved an accuracy of 0.7288 ± 0.0730, 0.8161 ± 0.963, and 0.7075 ± 0.0746 on the BCI Competition IV-2a dataset, the BCI Competition IV-2b dataset, and the OpenBMI dataset respectively. Furthermore, a non-parametric Friedman statistical test demonstrates statistically significant superior performance of the proposed MSST-EEGNet model over the existing models.</p>","PeriodicalId":10500,"journal":{"name":"Cognitive Neurodynamics","volume":"19 1","pages":"150"},"PeriodicalIF":3.9000,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12450197/pdf/","citationCount":"0","resultStr":"{\"title\":\"Msst-eegnet: multi-scale spatio-temporal feature extraction using inception and temporal pyramid pooling for motor imagery classification.\",\"authors\":\"Rashmi Mishra, R K Agrawal, Jyoti Singh Kirar\",\"doi\":\"10.1007/s11571-025-10337-8\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Motor imagery classification is an essential component of Brain-computer interface systems to interpret and recognize brain signals generated during the visualization of motor imagery tasks by a subject. The objective of this work is to develop a novel DL model to extract discriminative features for better generalization performance to recognize motor imagery tasks. This paper presents a novel Multi-scale spatio-temporal network (MSST-EEGNet) to extract discriminative temporal, spectral, and spatial features for motor imagery task classification. The proposed MSST-EEGNet model includes three modules namely the inception module with dilated convolution, the temporal pyramid pooling module, and the classification module. Multi-scale temporal features along with spatial features are extracted using the inception block with the dilated convolution module. A set of multi-level fine-grained and coarse-grained features are extracted using a temporal pyramid pooling module. Further, categorical cross-entropy in combination with center loss is used as a loss function. Experiments are carried out on three benchmark datasets including the BCI Competition IV-2a dataset, the BCI Competition IV-2b dataset, and the OpenBMI dataset. The evaluation results shows that the proposed MSST-EEGNet model outperforms eight existing DL models in terms of classification accuracy for subject-specific and cross-session settings. It also outperforms eight existing DL models and six existing transfer-learning models for cross-subject setting. For the subject-specific classification the proposed MSST-EEGNet model achieved an accuracy of 0.8426 ± 0.1061, 0.7779 ± 0.0938, and 0.7365 ± 0.1477 on the BCI Competition IV-2a dataset, the BCI Competition IV-2b dataset, and the OpenBMI dataset respectively. For the cross-session setting, the proposed MSST-EEGNet model achieved an accuracy of 0.7709 ± 0.1098, 0.7524 ± 0.1017, and 0.6860 ± 0.0990 on the BCI Competition IV-2a dataset, the BCI Competition IV-2b dataset, and the OpenBMI dataset respectively. For the cross-subject setting, the proposed MSST-EEGNet model achieved an accuracy of 0.7288 ± 0.0730, 0.8161 ± 0.963, and 0.7075 ± 0.0746 on the BCI Competition IV-2a dataset, the BCI Competition IV-2b dataset, and the OpenBMI dataset respectively. Furthermore, a non-parametric Friedman statistical test demonstrates statistically significant superior performance of the proposed MSST-EEGNet model over the existing models.</p>\",\"PeriodicalId\":10500,\"journal\":{\"name\":\"Cognitive Neurodynamics\",\"volume\":\"19 1\",\"pages\":\"150\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2025-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12450197/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Cognitive Neurodynamics\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1007/s11571-025-10337-8\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/9/20 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q2\",\"JCRName\":\"NEUROSCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cognitive Neurodynamics","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1007/s11571-025-10337-8","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/9/20 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"NEUROSCIENCES","Score":null,"Total":0}
引用次数: 0

摘要

运动意象分类是脑机接口系统解释和识别被试在运动意象任务可视化过程中产生的脑信号的重要组成部分。本工作的目的是开发一种新的深度学习模型来提取判别特征,以获得更好的泛化性能来识别运动图像任务。本文提出了一种新的多尺度时空网络(mst - eegnet),用于提取具有区别性的时间、光谱和空间特征,用于运动图像任务分类。提出的mst - eegnet模型包括三个模块,即扩展卷积初始模块、时间金字塔池化模块和分类模块。利用扩展卷积模块提取多尺度时间特征和空间特征。使用时间金字塔池模块提取一组多级细粒度和粗粒度特征。进一步,将分类交叉熵与中心损失相结合作为损失函数。在BCI Competition IV-2a数据集、BCI Competition IV-2b数据集和OpenBMI数据集三个基准数据集上进行了实验。评估结果表明,所提出的mst - eegnet模型在特定主题和跨会话设置的分类精度方面优于现有的8个DL模型。它也优于现有的八个深度学习模型和六个现有的跨学科迁移学习模型。在主题分类方面,mst - eegnet模型在BCI Competition IV-2a数据集、BCI Competition IV-2b数据集和OpenBMI数据集上的准确率分别为0.8426±0.1061、0.7779±0.0938和0.7365±0.1477。对于跨会话设置,所提出的mst - eegnet模型在BCI Competition IV-2a数据集、BCI Competition IV-2b数据集和OpenBMI数据集上的准确率分别为0.7709±0.1098、0.7524±0.1017和0.6860±0.0990。对于跨主题设置,本文提出的mst - eegnet模型在BCI Competition IV-2a数据集、BCI Competition IV-2b数据集和OpenBMI数据集上的准确率分别为0.7288±0.0730、0.8161±0.963和0.7075±0.0746。此外,非参数Friedman统计检验表明,所提出的mst - eegnet模型在统计上优于现有模型。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Msst-eegnet: multi-scale spatio-temporal feature extraction using inception and temporal pyramid pooling for motor imagery classification.

Motor imagery classification is an essential component of Brain-computer interface systems to interpret and recognize brain signals generated during the visualization of motor imagery tasks by a subject. The objective of this work is to develop a novel DL model to extract discriminative features for better generalization performance to recognize motor imagery tasks. This paper presents a novel Multi-scale spatio-temporal network (MSST-EEGNet) to extract discriminative temporal, spectral, and spatial features for motor imagery task classification. The proposed MSST-EEGNet model includes three modules namely the inception module with dilated convolution, the temporal pyramid pooling module, and the classification module. Multi-scale temporal features along with spatial features are extracted using the inception block with the dilated convolution module. A set of multi-level fine-grained and coarse-grained features are extracted using a temporal pyramid pooling module. Further, categorical cross-entropy in combination with center loss is used as a loss function. Experiments are carried out on three benchmark datasets including the BCI Competition IV-2a dataset, the BCI Competition IV-2b dataset, and the OpenBMI dataset. The evaluation results shows that the proposed MSST-EEGNet model outperforms eight existing DL models in terms of classification accuracy for subject-specific and cross-session settings. It also outperforms eight existing DL models and six existing transfer-learning models for cross-subject setting. For the subject-specific classification the proposed MSST-EEGNet model achieved an accuracy of 0.8426 ± 0.1061, 0.7779 ± 0.0938, and 0.7365 ± 0.1477 on the BCI Competition IV-2a dataset, the BCI Competition IV-2b dataset, and the OpenBMI dataset respectively. For the cross-session setting, the proposed MSST-EEGNet model achieved an accuracy of 0.7709 ± 0.1098, 0.7524 ± 0.1017, and 0.6860 ± 0.0990 on the BCI Competition IV-2a dataset, the BCI Competition IV-2b dataset, and the OpenBMI dataset respectively. For the cross-subject setting, the proposed MSST-EEGNet model achieved an accuracy of 0.7288 ± 0.0730, 0.8161 ± 0.963, and 0.7075 ± 0.0746 on the BCI Competition IV-2a dataset, the BCI Competition IV-2b dataset, and the OpenBMI dataset respectively. Furthermore, a non-parametric Friedman statistical test demonstrates statistically significant superior performance of the proposed MSST-EEGNet model over the existing models.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Cognitive Neurodynamics
Cognitive Neurodynamics 医学-神经科学
CiteScore
6.90
自引率
18.90%
发文量
140
审稿时长
12 months
期刊介绍: Cognitive Neurodynamics provides a unique forum of communication and cooperation for scientists and engineers working in the field of cognitive neurodynamics, intelligent science and applications, bridging the gap between theory and application, without any preference for pure theoretical, experimental or computational models. The emphasis is to publish original models of cognitive neurodynamics, novel computational theories and experimental results. In particular, intelligent science inspired by cognitive neuroscience and neurodynamics is also very welcome. The scope of Cognitive Neurodynamics covers cognitive neuroscience, neural computation based on dynamics, computer science, intelligent science as well as their interdisciplinary applications in the natural and engineering sciences. Papers that are appropriate for non-specialist readers are encouraged. 1. There is no page limit for manuscripts submitted to Cognitive Neurodynamics. Research papers should clearly represent an important advance of especially broad interest to researchers and technologists in neuroscience, biophysics, BCI, neural computer and intelligent robotics. 2. Cognitive Neurodynamics also welcomes brief communications: short papers reporting results that are of genuinely broad interest but that for one reason and another do not make a sufficiently complete story to justify a full article publication. Brief Communications should consist of approximately four manuscript pages. 3. Cognitive Neurodynamics publishes review articles in which a specific field is reviewed through an exhaustive literature survey. There are no restrictions on the number of pages. Review articles are usually invited, but submitted reviews will also be considered.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信