Machine learning based classification of imagined speech electroencephalogram data from the amplitude and phase spectrum of frequency domain EEG signal.

IF 1.6 Q3 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING

Biomedical Physics & Engineering Express Pub Date : 2025-09-26 DOI:10.1088/2057-1976/ae04ee

Meenakshi Bisla, Radhey Shyam Anand

{"title":"Machine learning based classification of imagined speech electroencephalogram data from the amplitude and phase spectrum of frequency domain EEG signal.","authors":"Meenakshi Bisla, Radhey Shyam Anand","doi":"10.1088/2057-1976/ae04ee","DOIUrl":null,"url":null,"abstract":"<p><p>Imagined speech classification involves decoding brain signals to recognize verbalized thoughts or intentions without actual speech production. This technology has significant implications for individuals with speech impairments, offering a means to communicate through neural signals. The prime objective of this work is to propose an innovative machine learning (ML) based classification methodology that combines electroencephalogram (EEG) data augmentation using a sliding window technique with statistical feature extraction from the amplitude and phase spectrum of frequency domain EEG segments. This work uses an EEG dataset recorded from a 64-channel device during the imagination of long words, short words, and vowels with 15 human subjects. First, the raw EEG data is filtered between 1 Hz and 100 Hz, then segmented using a sliding window-based data augmentation technique with a window size of 100 and 50% overlap. The Fourier Transform is applied to each windowed segment to compute the amplitude and phase spectrum of the signal at each frequency point. The next step is to extract 50 statistical features from the amplitude and phase spectrum of frequency domain segments. Out of these, the 25 most statistically significant features are selected by applying the Kruskal-Walli's test. The extracted feature vectors are classified using six different machine learning based classifiers named support vector machine (SVM), K nearest neighbor (KNN), Random Forest (RF), XGBoost, LightGBM, and CatBoost. The CatBoost classifier outperforms other machine learning classifiers by achieving the highest accuracy of 91.72 ± 1.52% for long words classification, 91.68 ± 1.54% for long versus short word classification, 88.05 ± 3.07% for short word classification, and 88.89 ± 1.97% for vowel classification. The performance of the proposed model is assessed using five performance evaluation metrics: accuracy, F1-score, precision, recall, and Cohen's kappa. Compared to the existing literature, this study has achieved a 5%-7% improvement with the CatBoost classifier and extracted feature matrix.</p>","PeriodicalId":8896,"journal":{"name":"Biomedical Physics & Engineering Express","volume":" ","pages":""},"PeriodicalIF":1.6000,"publicationDate":"2025-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Physics & Engineering Express","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1088/2057-1976/ae04ee","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}

引用次数: 0

Abstract

Imagined speech classification involves decoding brain signals to recognize verbalized thoughts or intentions without actual speech production. This technology has significant implications for individuals with speech impairments, offering a means to communicate through neural signals. The prime objective of this work is to propose an innovative machine learning (ML) based classification methodology that combines electroencephalogram (EEG) data augmentation using a sliding window technique with statistical feature extraction from the amplitude and phase spectrum of frequency domain EEG segments. This work uses an EEG dataset recorded from a 64-channel device during the imagination of long words, short words, and vowels with 15 human subjects. First, the raw EEG data is filtered between 1 Hz and 100 Hz, then segmented using a sliding window-based data augmentation technique with a window size of 100 and 50% overlap. The Fourier Transform is applied to each windowed segment to compute the amplitude and phase spectrum of the signal at each frequency point. The next step is to extract 50 statistical features from the amplitude and phase spectrum of frequency domain segments. Out of these, the 25 most statistically significant features are selected by applying the Kruskal-Walli's test. The extracted feature vectors are classified using six different machine learning based classifiers named support vector machine (SVM), K nearest neighbor (KNN), Random Forest (RF), XGBoost, LightGBM, and CatBoost. The CatBoost classifier outperforms other machine learning classifiers by achieving the highest accuracy of 91.72 ± 1.52% for long words classification, 91.68 ± 1.54% for long versus short word classification, 88.05 ± 3.07% for short word classification, and 88.89 ± 1.97% for vowel classification. The performance of the proposed model is assessed using five performance evaluation metrics: accuracy, F1-score, precision, recall, and Cohen's kappa. Compared to the existing literature, this study has achieved a 5%-7% improvement with the CatBoost classifier and extracted feature matrix.

查看原文本刊更多论文

基于机器学习的基于频域脑电信号振幅和相位谱的想象语音脑电图数据分类。

想象语音分类包括解码大脑信号，在没有实际语音产生的情况下识别语言化的想法或意图。这项技术对有语言障碍的人来说意义重大，它提供了一种通过神经信号进行交流的方法。这项工作的主要目标是提出一种创新的基于机器学习（ML）的分类方法，该方法将使用滑动窗口技术的脑电图（EEG）数据增强与从频域EEG片段的幅度和相位谱中提取统计特征相结合。这项工作使用了一个来自64通道设备的脑电图数据集，该数据集记录了15名人类受试者对长词、短词和元音的想象。首先，对原始EEG数据在1 Hz到100 Hz之间进行滤波，然后使用基于滑动窗口的数据增强技术进行分割，窗口大小为100和50%重叠。对每个加窗段进行傅里叶变换，计算信号在每个频率点的幅值和相位谱。下一步从频域段的幅相谱中提取50个统计特征。其中，25个最具统计意义的特征是通过应用Kruskal-Wallis测试选择的。提取的特征向量使用六种不同的基于机器学习的分类器进行分类，分别是支持向量机（SVM）、K近邻（KNN）、随机森林（RF）、XGBoost、LightGBM和CatBoost。CatBoost分类器优于其他机器学习分类器，在长词分类方面达到了最高的准确率：91.72±1.52%，长词与短词分类方面达到了91.68±1.54%，短词分类方面达到了88.05±3.07%，元音分类方面达到了88.89±1.97%。使用五个性能评估指标评估所提出的模型的性能：准确性，f1分数，精度，召回率和科恩的kappa。与现有文献相比，本研究使用CatBoost分类器和提取的特征矩阵实现了5%- 7%的改进。。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Biomedical Physics & Engineering Express RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING-

CiteScore

2.80

自引率

0.00%

发文量

153

期刊介绍： BPEX is an inclusive, international, multidisciplinary journal devoted to publishing new research on any application of physics and/or engineering in medicine and/or biology. Characterized by a broad geographical coverage and a fast-track peer-review process, relevant topics include all aspects of biophysics, medical physics and biomedical engineering. Papers that are almost entirely clinical or biological in their focus are not suitable. The journal has an emphasis on publishing interdisciplinary work and bringing research fields together, encompassing experimental, theoretical and computational work.