{"title":"评估婴儿脑电图中伪影检测的机器和深度学习方法:分类器性能、确定性和训练大小效应。","authors":"R Kemmerich, A Wienke, U Frischen, B Mathes","doi":"10.1088/2057-1976/add740","DOIUrl":null,"url":null,"abstract":"<p><p>Electroencephalography (EEG) is essential for studying infant brain activity but is highly susceptible to artifacts due to infants' movements and physiological variability. Manual artifact detection is labor-intensive and subjective, underscoring the need for automated methods. This study evaluates the performance of three machine learning classifiers - Random Forest (RF), Support Vector Machine (SVM), and a deep learning (DL) model - in detecting artifacts in infant EEG data without prior feature extraction. EEG data were collected from 294 infants (mean age 8.34 months) as part of the Bremen Initiative to Foster Early Childhood Development (BRISE). After preprocessing and manual annotation by an expert, a total of 66,851 epochs were analyzed, with 45% labeled as artifacts. The classifiers were trained on filtered EEG data without further feature extraction to directly handle the complex and noisy signals characteristic of infant EEG. Results. indicated that both the RF classifier and the DL model achieved high balanced accuracy scores (.873 and .881, respectively), substantially outperforming the SVM (.756). Further analysis showed that increasing classifier certainty improved accuracy but reduced the amount of data classified, offering a trade-off between precision and data coverage. Additionally, the RF classifier outperformed the DL model with smaller training datasets, while the DL model required larger datasets to achieve optimal performance. These findings demonstrate that RF and DL classifiers can effectively automate artifact detection in infant EEG data, reducing preprocessing time and enhancing consistency across studies. Implementing such automated methods could facilitate the inclusion of EEG in large-scale developmental research and improve reproducibility by standardizing preprocessing pipelines.</p>","PeriodicalId":8896,"journal":{"name":"Biomedical Physics & Engineering Express","volume":" ","pages":""},"PeriodicalIF":1.3000,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Evaluating machine- and deep learning approaches for artifact detection in infant EEG: classifier performance, certainty, and training size effects.\",\"authors\":\"R Kemmerich, A Wienke, U Frischen, B Mathes\",\"doi\":\"10.1088/2057-1976/add740\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Electroencephalography (EEG) is essential for studying infant brain activity but is highly susceptible to artifacts due to infants' movements and physiological variability. Manual artifact detection is labor-intensive and subjective, underscoring the need for automated methods. This study evaluates the performance of three machine learning classifiers - Random Forest (RF), Support Vector Machine (SVM), and a deep learning (DL) model - in detecting artifacts in infant EEG data without prior feature extraction. EEG data were collected from 294 infants (mean age 8.34 months) as part of the Bremen Initiative to Foster Early Childhood Development (BRISE). After preprocessing and manual annotation by an expert, a total of 66,851 epochs were analyzed, with 45% labeled as artifacts. The classifiers were trained on filtered EEG data without further feature extraction to directly handle the complex and noisy signals characteristic of infant EEG. Results. indicated that both the RF classifier and the DL model achieved high balanced accuracy scores (.873 and .881, respectively), substantially outperforming the SVM (.756). Further analysis showed that increasing classifier certainty improved accuracy but reduced the amount of data classified, offering a trade-off between precision and data coverage. Additionally, the RF classifier outperformed the DL model with smaller training datasets, while the DL model required larger datasets to achieve optimal performance. These findings demonstrate that RF and DL classifiers can effectively automate artifact detection in infant EEG data, reducing preprocessing time and enhancing consistency across studies. Implementing such automated methods could facilitate the inclusion of EEG in large-scale developmental research and improve reproducibility by standardizing preprocessing pipelines.</p>\",\"PeriodicalId\":8896,\"journal\":{\"name\":\"Biomedical Physics & Engineering Express\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":1.3000,\"publicationDate\":\"2025-05-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Biomedical Physics & Engineering Express\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1088/2057-1976/add740\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Physics & Engineering Express","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1088/2057-1976/add740","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
Evaluating machine- and deep learning approaches for artifact detection in infant EEG: classifier performance, certainty, and training size effects.
Electroencephalography (EEG) is essential for studying infant brain activity but is highly susceptible to artifacts due to infants' movements and physiological variability. Manual artifact detection is labor-intensive and subjective, underscoring the need for automated methods. This study evaluates the performance of three machine learning classifiers - Random Forest (RF), Support Vector Machine (SVM), and a deep learning (DL) model - in detecting artifacts in infant EEG data without prior feature extraction. EEG data were collected from 294 infants (mean age 8.34 months) as part of the Bremen Initiative to Foster Early Childhood Development (BRISE). After preprocessing and manual annotation by an expert, a total of 66,851 epochs were analyzed, with 45% labeled as artifacts. The classifiers were trained on filtered EEG data without further feature extraction to directly handle the complex and noisy signals characteristic of infant EEG. Results. indicated that both the RF classifier and the DL model achieved high balanced accuracy scores (.873 and .881, respectively), substantially outperforming the SVM (.756). Further analysis showed that increasing classifier certainty improved accuracy but reduced the amount of data classified, offering a trade-off between precision and data coverage. Additionally, the RF classifier outperformed the DL model with smaller training datasets, while the DL model required larger datasets to achieve optimal performance. These findings demonstrate that RF and DL classifiers can effectively automate artifact detection in infant EEG data, reducing preprocessing time and enhancing consistency across studies. Implementing such automated methods could facilitate the inclusion of EEG in large-scale developmental research and improve reproducibility by standardizing preprocessing pipelines.
期刊介绍:
BPEX is an inclusive, international, multidisciplinary journal devoted to publishing new research on any application of physics and/or engineering in medicine and/or biology. Characterized by a broad geographical coverage and a fast-track peer-review process, relevant topics include all aspects of biophysics, medical physics and biomedical engineering. Papers that are almost entirely clinical or biological in their focus are not suitable. The journal has an emphasis on publishing interdisciplinary work and bringing research fields together, encompassing experimental, theoretical and computational work.