Md Raihan Khan, Airin Akter Tania, Mohiuddin Ahmad
{"title":"A comparative study of time–frequency features based spatio-temporal analysis with varying multiscale kernels for emotion recognition from EEG","authors":"Md Raihan Khan, Airin Akter Tania, Mohiuddin Ahmad","doi":"10.1016/j.bspc.2025.107826","DOIUrl":null,"url":null,"abstract":"<div><div>EEG-based emotion recognition has become recognized as a crucial field of study, utilizing brainwave patterns for understanding human emotional states. To identify emotions from EEG data, this study examines several time–frequency features utilizing spatiotemporal analysis using the DEAP dataset. Down sampling, bandpass filtering, segmentation, trimming, labeling, and common reference averaging are all part of the data pre-processing pipeline. Continuous Wavelet Transform (CWT) with Morlet wavelets was used to extract features, followed by the computation of differential entropy, wavelet energy, cross-correlation, and phase locking value (PLV). The distribution of features, participant-specific changes, and associations were examined through an exploratory feature analysis. Subsequently, final representations functioning in a spatiotemporal manner were constructed. For classification, a 3D Convolutional Neural Network with different kernel sizes (3×3×3, 5×5×5, and 7×7×7) was employed. Training accuracies reached up to 98.86% for arousal and 98.97% for valence, demonstrating robust generalization of particular feature-kernel combinations. The results also show that the 5×5×5 kernel size achieved the highest test accuracy for arousal (96.13% for differential entropy) and valence (96.19% for wavelet energy). In order to show the model’s universality, it was also validated using the popular SEED dataset. With PLV (kernel 7×7×7), the arousal validation accuracy was 97.42%, and with WE (kernel 3×3×3), the valence validation accuracy was 97.50%. This work sheds light on the choice of features and kernels for emotion identification models based on EEG.</div></div>","PeriodicalId":55362,"journal":{"name":"Biomedical Signal Processing and Control","volume":"107 ","pages":"Article 107826"},"PeriodicalIF":4.9000,"publicationDate":"2025-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Signal Processing and Control","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1746809425003374","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0
Abstract
EEG-based emotion recognition has become recognized as a crucial field of study, utilizing brainwave patterns for understanding human emotional states. To identify emotions from EEG data, this study examines several time–frequency features utilizing spatiotemporal analysis using the DEAP dataset. Down sampling, bandpass filtering, segmentation, trimming, labeling, and common reference averaging are all part of the data pre-processing pipeline. Continuous Wavelet Transform (CWT) with Morlet wavelets was used to extract features, followed by the computation of differential entropy, wavelet energy, cross-correlation, and phase locking value (PLV). The distribution of features, participant-specific changes, and associations were examined through an exploratory feature analysis. Subsequently, final representations functioning in a spatiotemporal manner were constructed. For classification, a 3D Convolutional Neural Network with different kernel sizes (3×3×3, 5×5×5, and 7×7×7) was employed. Training accuracies reached up to 98.86% for arousal and 98.97% for valence, demonstrating robust generalization of particular feature-kernel combinations. The results also show that the 5×5×5 kernel size achieved the highest test accuracy for arousal (96.13% for differential entropy) and valence (96.19% for wavelet energy). In order to show the model’s universality, it was also validated using the popular SEED dataset. With PLV (kernel 7×7×7), the arousal validation accuracy was 97.42%, and with WE (kernel 3×3×3), the valence validation accuracy was 97.50%. This work sheds light on the choice of features and kernels for emotion identification models based on EEG.
期刊介绍:
Biomedical Signal Processing and Control aims to provide a cross-disciplinary international forum for the interchange of information on research in the measurement and analysis of signals and images in clinical medicine and the biological sciences. Emphasis is placed on contributions dealing with the practical, applications-led research on the use of methods and devices in clinical diagnosis, patient monitoring and management.
Biomedical Signal Processing and Control reflects the main areas in which these methods are being used and developed at the interface of both engineering and clinical science. The scope of the journal is defined to include relevant review papers, technical notes, short communications and letters. Tutorial papers and special issues will also be published.