EfficientNetV2_S-AbiLSTM: A novel cross-modal lightweight transfer learning framework for seizure prediction using EEG spectrograms

IF 4.9 2区医学 Q1 ENGINEERING, BIOMEDICAL

Biomedical Signal Processing and Control Pub Date : 2025-10-10 DOI:10.1016/j.bspc.2025.108867

Hanbo Zhang , Jincan Zhang , Wenna Chen , Ganqin Du , Qizhi Fu , Hongwei Jiang

{"title":"EfficientNetV2_S-AbiLSTM: A novel cross-modal lightweight transfer learning framework for seizure prediction using EEG spectrograms","authors":"Hanbo Zhang , Jincan Zhang , Wenna Chen , Ganqin Du , Qizhi Fu , Hongwei Jiang","doi":"10.1016/j.bspc.2025.108867","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Epilepsy is a common chronic neurological disorder that demands accurate diagnosis and prediction. EEG is the primary tool for monitoring, yet direct application of deep learning to EEG faces challenges such as limited data, poor signal adaptability, and model complexity. We propose EfficientNetV2_S-AbiLSTM, a lightweight model that leverages cross-modal transfer learning to bridge image classification and EEG processing for enhanced seizure prediction.</div></div><div><h3>Methods</h3><div>EEG signals are converted into spectrograms using the Short-Time Fourier Transform and reduced from 22 to 3 channels via group averaging, mimicking RGB images. EfficientNetV2_S with pre-trained ImageNet weights extracts time-–frequency features that are fed into an attention-enhanced bidirectional LSTM (AbiLSTM) for pattern recognition. A fully connected layer produces the final classification. The model is trained and validated on the CHB-MIT dataset using five-fold cross-validation.</div></div><div><h3>Results</h3><div>Our framework achieves 97.20 % accuracy, 97.26 % precision, 97.05 % recall, and an AUC of 0.9709. Ablation studies confirm that cross-modal transfer learning improves accuracy by 1.99 %, while EfficientNetV2_S outperforms ResNet, ResNeXt, and other EfficientNet variants with reduced training time.</div></div><div><h3>Conclusion</h3><div>Incorporating pre-trained image classification models through cross-modal transfer learning significantly enhances seizure prediction. The EfficientNetV2_S-AbiLSTM model shows promising potential in medical signal processing.</div></div>","PeriodicalId":55362,"journal":{"name":"Biomedical Signal Processing and Control","volume":"112 ","pages":"Article 108867"},"PeriodicalIF":4.9000,"publicationDate":"2025-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Signal Processing and Control","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1746809425013783","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}

引用次数: 0

Abstract

Background

Epilepsy is a common chronic neurological disorder that demands accurate diagnosis and prediction. EEG is the primary tool for monitoring, yet direct application of deep learning to EEG faces challenges such as limited data, poor signal adaptability, and model complexity. We propose EfficientNetV2_S-AbiLSTM, a lightweight model that leverages cross-modal transfer learning to bridge image classification and EEG processing for enhanced seizure prediction.

Methods

EEG signals are converted into spectrograms using the Short-Time Fourier Transform and reduced from 22 to 3 channels via group averaging, mimicking RGB images. EfficientNetV2_S with pre-trained ImageNet weights extracts time-–frequency features that are fed into an attention-enhanced bidirectional LSTM (AbiLSTM) for pattern recognition. A fully connected layer produces the final classification. The model is trained and validated on the CHB-MIT dataset using five-fold cross-validation.

Results

Our framework achieves 97.20 % accuracy, 97.26 % precision, 97.05 % recall, and an AUC of 0.9709. Ablation studies confirm that cross-modal transfer learning improves accuracy by 1.99 %, while EfficientNetV2_S outperforms ResNet, ResNeXt, and other EfficientNet variants with reduced training time.

Conclusion

Incorporating pre-trained image classification models through cross-modal transfer learning significantly enhances seizure prediction. The EfficientNetV2_S-AbiLSTM model shows promising potential in medical signal processing.

查看原文本刊更多论文

高效netv2_s - abilstm：一种新的跨模态轻量级迁移学习框架，用于使用脑电图图预测癫痫发作

癫痫是一种常见的慢性神经系统疾病，需要准确的诊断和预测。脑电图是监测的主要工具，但将深度学习直接应用于脑电图面临着数据有限、信号适应性差、模型复杂性等挑战。我们提出了高效netv2_s - abilstm，这是一个轻量级模型，利用跨模态迁移学习来桥接图像分类和脑电图处理，以增强癫痫发作预测。方法采用短时傅里叶变换将seeg信号转换成频谱图，并通过组平均将22通道减少到3通道，模拟RGB图像。effentnetv2_s使用预训练的ImageNet权重提取时间-频率特征，并将其输入到注意力增强的双向LSTM （AbiLSTM）中进行模式识别。一个完全连接的层产生最终的分类。该模型在CHB-MIT数据集上使用五倍交叉验证进行训练和验证。结果框架的准确率为97.20%，精密度为97.26%，召回率为97.05%，AUC为0.9709。研究证实，跨模态迁移学习的准确率提高了1.99%，而效率网络v2_s在减少训练时间的情况下优于ResNet、ResNeXt和其他效率网络变体。结论通过跨模态迁移学习结合预训练的图像分类模型可显著提高癫痫发作的预测能力。高效率netv2_s - abilstm模型在医疗信号处理中显示出良好的潜力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Biomedical Signal Processing and Control 工程技术-工程：生物医学

CiteScore

9.80

自引率

13.70%

发文量

822

审稿时长

4 months

期刊介绍： Biomedical Signal Processing and Control aims to provide a cross-disciplinary international forum for the interchange of information on research in the measurement and analysis of signals and images in clinical medicine and the biological sciences. Emphasis is placed on contributions dealing with the practical, applications-led research on the use of methods and devices in clinical diagnosis, patient monitoring and management. Biomedical Signal Processing and Control reflects the main areas in which these methods are being used and developed at the interface of both engineering and clinical science. The scope of the journal is defined to include relevant review papers, technical notes, short communications and letters. Tutorial papers and special issues will also be published.