Hanbo Zhang , Jincan Zhang , Wenna Chen , Ganqin Du , Qizhi Fu , Hongwei Jiang
{"title":"EfficientNetV2_S-AbiLSTM: A novel cross-modal lightweight transfer learning framework for seizure prediction using EEG spectrograms","authors":"Hanbo Zhang , Jincan Zhang , Wenna Chen , Ganqin Du , Qizhi Fu , Hongwei Jiang","doi":"10.1016/j.bspc.2025.108867","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Epilepsy is a common chronic neurological disorder that demands accurate diagnosis and prediction. EEG is the primary tool for monitoring, yet direct application of deep learning to EEG faces challenges such as limited data, poor signal adaptability, and model complexity. We propose EfficientNetV2_S-AbiLSTM, a lightweight model that leverages cross-modal transfer learning to bridge image classification and EEG processing for enhanced seizure prediction.</div></div><div><h3>Methods</h3><div>EEG signals are converted into spectrograms using the Short-Time Fourier Transform and reduced from 22 to 3 channels via group averaging, mimicking RGB images. EfficientNetV2_S with pre-trained ImageNet weights extracts time-–frequency features that are fed into an attention-enhanced bidirectional LSTM (AbiLSTM) for pattern recognition. A fully connected layer produces the final classification. The model is trained and validated on the CHB-MIT dataset using five-fold cross-validation.</div></div><div><h3>Results</h3><div>Our framework achieves 97.20 % accuracy, 97.26 % precision, 97.05 % recall, and an AUC of 0.9709. Ablation studies confirm that cross-modal transfer learning improves accuracy by 1.99 %, while EfficientNetV2_S outperforms ResNet, ResNeXt, and other EfficientNet variants with reduced training time.</div></div><div><h3>Conclusion</h3><div>Incorporating pre-trained image classification models through cross-modal transfer learning significantly enhances seizure prediction. The EfficientNetV2_S-AbiLSTM model shows promising potential in medical signal processing.</div></div>","PeriodicalId":55362,"journal":{"name":"Biomedical Signal Processing and Control","volume":"112 ","pages":"Article 108867"},"PeriodicalIF":4.9000,"publicationDate":"2025-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Signal Processing and Control","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1746809425013783","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Background
Epilepsy is a common chronic neurological disorder that demands accurate diagnosis and prediction. EEG is the primary tool for monitoring, yet direct application of deep learning to EEG faces challenges such as limited data, poor signal adaptability, and model complexity. We propose EfficientNetV2_S-AbiLSTM, a lightweight model that leverages cross-modal transfer learning to bridge image classification and EEG processing for enhanced seizure prediction.
Methods
EEG signals are converted into spectrograms using the Short-Time Fourier Transform and reduced from 22 to 3 channels via group averaging, mimicking RGB images. EfficientNetV2_S with pre-trained ImageNet weights extracts time-–frequency features that are fed into an attention-enhanced bidirectional LSTM (AbiLSTM) for pattern recognition. A fully connected layer produces the final classification. The model is trained and validated on the CHB-MIT dataset using five-fold cross-validation.
Results
Our framework achieves 97.20 % accuracy, 97.26 % precision, 97.05 % recall, and an AUC of 0.9709. Ablation studies confirm that cross-modal transfer learning improves accuracy by 1.99 %, while EfficientNetV2_S outperforms ResNet, ResNeXt, and other EfficientNet variants with reduced training time.
Conclusion
Incorporating pre-trained image classification models through cross-modal transfer learning significantly enhances seizure prediction. The EfficientNetV2_S-AbiLSTM model shows promising potential in medical signal processing.
期刊介绍:
Biomedical Signal Processing and Control aims to provide a cross-disciplinary international forum for the interchange of information on research in the measurement and analysis of signals and images in clinical medicine and the biological sciences. Emphasis is placed on contributions dealing with the practical, applications-led research on the use of methods and devices in clinical diagnosis, patient monitoring and management.
Biomedical Signal Processing and Control reflects the main areas in which these methods are being used and developed at the interface of both engineering and clinical science. The scope of the journal is defined to include relevant review papers, technical notes, short communications and letters. Tutorial papers and special issues will also be published.