Özkan Aslan , Naim Karasekreter , Caner Balım , Süleyman Yaman , Süleyman Şahin
{"title":"基于土耳其脑电图报告的儿科癫痫发作类型分类","authors":"Özkan Aslan , Naim Karasekreter , Caner Balım , Süleyman Yaman , Süleyman Şahin","doi":"10.1016/j.eplepsyres.2025.107593","DOIUrl":null,"url":null,"abstract":"<div><div>This study focuses on the binary classification of pediatric epilepsy seizure types as focal or generalized using Turkish electroencephalography (EEG) reports, leveraging natural language processing (NLP) and machine learning methodologies. A novel dataset comprising 130 Turkish EEG reports was developed and publicly released, addressing the scarcity of resources in this domain. The study employed various text representation models, including TF-IDF, FastText, ElectraTR, XLM, and BERTurk, along with classifiers such as Logistic Regression, Support Vector Machines, and CatBoost. The highest classification performance was achieved using BERTurk embeddings combined with Logistic Regression, yielding an accuracy of 96.6 %. This work is significant for being the first to explore focal versus generalized seizure classification from text-based EEG reports in Turkish. It underscores the critical role of contextual embeddings in handling morphologically rich languages and demonstrates the potential of NLP techniques in advancing pediatric epilepsy diagnostics. The findings pave the way for automating diagnostic processes and improving efficiency in clinical settings. Future research aims to expand the dataset, incorporate EEG signal data, and refine the models for broader applicability.</div></div>","PeriodicalId":11914,"journal":{"name":"Epilepsy Research","volume":"215 ","pages":"Article 107593"},"PeriodicalIF":2.0000,"publicationDate":"2025-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Classification of epilepsy seizure types in pediatrics based on Turkish EEG reports\",\"authors\":\"Özkan Aslan , Naim Karasekreter , Caner Balım , Süleyman Yaman , Süleyman Şahin\",\"doi\":\"10.1016/j.eplepsyres.2025.107593\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>This study focuses on the binary classification of pediatric epilepsy seizure types as focal or generalized using Turkish electroencephalography (EEG) reports, leveraging natural language processing (NLP) and machine learning methodologies. A novel dataset comprising 130 Turkish EEG reports was developed and publicly released, addressing the scarcity of resources in this domain. The study employed various text representation models, including TF-IDF, FastText, ElectraTR, XLM, and BERTurk, along with classifiers such as Logistic Regression, Support Vector Machines, and CatBoost. The highest classification performance was achieved using BERTurk embeddings combined with Logistic Regression, yielding an accuracy of 96.6 %. This work is significant for being the first to explore focal versus generalized seizure classification from text-based EEG reports in Turkish. It underscores the critical role of contextual embeddings in handling morphologically rich languages and demonstrates the potential of NLP techniques in advancing pediatric epilepsy diagnostics. The findings pave the way for automating diagnostic processes and improving efficiency in clinical settings. Future research aims to expand the dataset, incorporate EEG signal data, and refine the models for broader applicability.</div></div>\",\"PeriodicalId\":11914,\"journal\":{\"name\":\"Epilepsy Research\",\"volume\":\"215 \",\"pages\":\"Article 107593\"},\"PeriodicalIF\":2.0000,\"publicationDate\":\"2025-05-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Epilepsy Research\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0920121125000944\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"CLINICAL NEUROLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Epilepsy Research","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0920121125000944","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"CLINICAL NEUROLOGY","Score":null,"Total":0}
Classification of epilepsy seizure types in pediatrics based on Turkish EEG reports
This study focuses on the binary classification of pediatric epilepsy seizure types as focal or generalized using Turkish electroencephalography (EEG) reports, leveraging natural language processing (NLP) and machine learning methodologies. A novel dataset comprising 130 Turkish EEG reports was developed and publicly released, addressing the scarcity of resources in this domain. The study employed various text representation models, including TF-IDF, FastText, ElectraTR, XLM, and BERTurk, along with classifiers such as Logistic Regression, Support Vector Machines, and CatBoost. The highest classification performance was achieved using BERTurk embeddings combined with Logistic Regression, yielding an accuracy of 96.6 %. This work is significant for being the first to explore focal versus generalized seizure classification from text-based EEG reports in Turkish. It underscores the critical role of contextual embeddings in handling morphologically rich languages and demonstrates the potential of NLP techniques in advancing pediatric epilepsy diagnostics. The findings pave the way for automating diagnostic processes and improving efficiency in clinical settings. Future research aims to expand the dataset, incorporate EEG signal data, and refine the models for broader applicability.
期刊介绍:
Epilepsy Research provides for publication of high quality articles in both basic and clinical epilepsy research, with a special emphasis on translational research that ultimately relates to epilepsy as a human condition. The journal is intended to provide a forum for reporting the best and most rigorous epilepsy research from all disciplines ranging from biophysics and molecular biology to epidemiological and psychosocial research. As such the journal will publish original papers relevant to epilepsy from any scientific discipline and also studies of a multidisciplinary nature. Clinical and experimental research papers adopting fresh conceptual approaches to the study of epilepsy and its treatment are encouraged. The overriding criteria for publication are novelty, significant clinical or experimental relevance, and interest to a multidisciplinary audience in the broad arena of epilepsy. Review articles focused on any topic of epilepsy research will also be considered, but only if they present an exceptionally clear synthesis of current knowledge and future directions of a research area, based on a critical assessment of the available data or on hypotheses that are likely to stimulate more critical thinking and further advances in an area of epilepsy research.