A comparative study of time–frequency features based spatio-temporal analysis with varying multiscale kernels for emotion recognition from EEG

IF 4.9 2区医学 Q1 ENGINEERING, BIOMEDICAL

Biomedical Signal Processing and Control Pub Date : 2025-03-25 DOI:10.1016/j.bspc.2025.107826

Md Raihan Khan, Airin Akter Tania, Mohiuddin Ahmad

{"title":"A comparative study of time–frequency features based spatio-temporal analysis with varying multiscale kernels for emotion recognition from EEG","authors":"Md Raihan Khan, Airin Akter Tania, Mohiuddin Ahmad","doi":"10.1016/j.bspc.2025.107826","DOIUrl":null,"url":null,"abstract":"<div><div>EEG-based emotion recognition has become recognized as a crucial field of study, utilizing brainwave patterns for understanding human emotional states. To identify emotions from EEG data, this study examines several time–frequency features utilizing spatiotemporal analysis using the DEAP dataset. Down sampling, bandpass filtering, segmentation, trimming, labeling, and common reference averaging are all part of the data pre-processing pipeline. Continuous Wavelet Transform (CWT) with Morlet wavelets was used to extract features, followed by the computation of differential entropy, wavelet energy, cross-correlation, and phase locking value (PLV). The distribution of features, participant-specific changes, and associations were examined through an exploratory feature analysis. Subsequently, final representations functioning in a spatiotemporal manner were constructed. For classification, a 3D Convolutional Neural Network with different kernel sizes (3×3×3, 5×5×5, and 7×7×7) was employed. Training accuracies reached up to 98.86% for arousal and 98.97% for valence, demonstrating robust generalization of particular feature-kernel combinations. The results also show that the 5×5×5 kernel size achieved the highest test accuracy for arousal (96.13% for differential entropy) and valence (96.19% for wavelet energy). In order to show the model’s universality, it was also validated using the popular SEED dataset. With PLV (kernel 7×7×7), the arousal validation accuracy was 97.42%, and with WE (kernel 3×3×3), the valence validation accuracy was 97.50%. This work sheds light on the choice of features and kernels for emotion identification models based on EEG.</div></div>","PeriodicalId":55362,"journal":{"name":"Biomedical Signal Processing and Control","volume":"107 ","pages":"Article 107826"},"PeriodicalIF":4.9000,"publicationDate":"2025-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Signal Processing and Control","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1746809425003374","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}

引用次数: 0

Abstract

EEG-based emotion recognition has become recognized as a crucial field of study, utilizing brainwave patterns for understanding human emotional states. To identify emotions from EEG data, this study examines several time–frequency features utilizing spatiotemporal analysis using the DEAP dataset. Down sampling, bandpass filtering, segmentation, trimming, labeling, and common reference averaging are all part of the data pre-processing pipeline. Continuous Wavelet Transform (CWT) with Morlet wavelets was used to extract features, followed by the computation of differential entropy, wavelet energy, cross-correlation, and phase locking value (PLV). The distribution of features, participant-specific changes, and associations were examined through an exploratory feature analysis. Subsequently, final representations functioning in a spatiotemporal manner were constructed. For classification, a 3D Convolutional Neural Network with different kernel sizes (3×3×3, 5×5×5, and 7×7×7) was employed. Training accuracies reached up to 98.86% for arousal and 98.97% for valence, demonstrating robust generalization of particular feature-kernel combinations. The results also show that the 5×5×5 kernel size achieved the highest test accuracy for arousal (96.13% for differential entropy) and valence (96.19% for wavelet energy). In order to show the model’s universality, it was also validated using the popular SEED dataset. With PLV (kernel 7×7×7), the arousal validation accuracy was 97.42%, and with WE (kernel 3×3×3), the valence validation accuracy was 97.50%. This work sheds light on the choice of features and kernels for emotion identification models based on EEG.

Abstract Image

查看原文本刊更多论文

求助全文

约1分钟内获得全文求助全文

来源期刊

Biomedical Signal Processing and Control 工程技术-工程：生物医学

CiteScore

9.80

自引率

13.70%

发文量

822

审稿时长

4 months

期刊介绍： Biomedical Signal Processing and Control aims to provide a cross-disciplinary international forum for the interchange of information on research in the measurement and analysis of signals and images in clinical medicine and the biological sciences. Emphasis is placed on contributions dealing with the practical, applications-led research on the use of methods and devices in clinical diagnosis, patient monitoring and management. Biomedical Signal Processing and Control reflects the main areas in which these methods are being used and developed at the interface of both engineering and clinical science. The scope of the journal is defined to include relevant review papers, technical notes, short communications and letters. Tutorial papers and special issues will also be published.