Luís Carlos Afonso, João Rafael Almeida, José Luís Oliveira
{"title":"Combining Statistical and Deep Learning Models for Insomnia Detection.","authors":"Luís Carlos Afonso, João Rafael Almeida, José Luís Oliveira","doi":"10.3233/SHTI251525","DOIUrl":null,"url":null,"abstract":"<p><p>Insomnia is a common but often underdiagnosed condition in clinical settings, where relevant information is typically buried in unstructured free-text notes. Automated tools that can identify both the presence of insomnia and the supporting evidence are essential to improve diagnosis and enable large-scale studies. However, existing models often prioritize accuracy at the cost of interpretability, which is critical for clinical adoption. To address this, we explore a hybrid approach that balances performance with explainability. Our method combines Finite Context Models (FCMs) for character-level classification of insomnia status with a BERT-based token classification model for extracting textual evidence, using structured annotations from the MIMIC-III dataset. This complementary setup enables both accurate prediction and transparent decision-making in clinical text analysis.</p>","PeriodicalId":94357,"journal":{"name":"Studies in health technology and informatics","volume":"332 ","pages":"195-199"},"PeriodicalIF":0.0000,"publicationDate":"2025-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Studies in health technology and informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3233/SHTI251525","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Insomnia is a common but often underdiagnosed condition in clinical settings, where relevant information is typically buried in unstructured free-text notes. Automated tools that can identify both the presence of insomnia and the supporting evidence are essential to improve diagnosis and enable large-scale studies. However, existing models often prioritize accuracy at the cost of interpretability, which is critical for clinical adoption. To address this, we explore a hybrid approach that balances performance with explainability. Our method combines Finite Context Models (FCMs) for character-level classification of insomnia status with a BERT-based token classification model for extracting textual evidence, using structured annotations from the MIMIC-III dataset. This complementary setup enables both accurate prediction and transparent decision-making in clinical text analysis.