Identification of Risk Group for Root Caries and Analysis of Associated Factors in Older Adults Using Unsupervised Machine Learning Clustering.

IF 3.5 3区 医学 Q2 GERIATRICS & GERONTOLOGY
Clinical Interventions in Aging Pub Date : 2025-04-24 eCollection Date: 2025-01-01 DOI:10.2147/CIA.S520229
Linxin Jiang, Shaohong Huang, Daniel R Reissmann, Gerhard Schmalz, Jianbo Li
{"title":"Identification of Risk Group for Root Caries and Analysis of Associated Factors in Older Adults Using Unsupervised Machine Learning Clustering.","authors":"Linxin Jiang, Shaohong Huang, Daniel R Reissmann, Gerhard Schmalz, Jianbo Li","doi":"10.2147/CIA.S520229","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>This study aimed to identify the high-risk group for root caries using unsupervised machine learning and to explore the associated factors.</p><p><strong>Patients and methods: </strong>This cross-sectional study included 423 adults aged 65 to 74 years, surveyed in 2021. Clusters representing root caries risk were identified using k-prototypes clustering, with the optimal number of clusters determined by the maximum silhouette index. The confusion matrix and alluvial diagram were used to visualize the predictive accuracy and composition of the clustering results. Binary logistic regression models further analyzed the associated factors, while ROC (receiver operating characteristic) curves and the random forest model visualized the predictive performance and the most important associated factors.</p><p><strong>Results: </strong>Two clusters were identified: cluster 1, with low root caries risk (21.5% with and 78.5% without root caries), and cluster 2, with high root caries risk (83.7% with and 16.3% without root caries). The clustering results predicted root caries with an accuracy of 0.81, sensitivity of 0.79, and specificity of 0.83. Overlapping results from binary logistic regression and the random forest model indicated that older age, more periodontal pockets, more attachment loss, female, a history of systemic diseases, presence of xerostomia, and presence of unrestored tooth loss were positively associated with cluster 2. Brushing tooth ≥2 times per day and a high level of oral health knowledge were negatively associated with cluster 2. The ROC curve for the binary logistic regression model showed an AUC (area under the curve) of 0.84.</p><p><strong>Conclusion: </strong>Individuals who are older, female, with poorer oral and systemic health status, suboptimal oral hygiene behaviors, and lower oral health knowledge levels are more likely to be identified as high-risk group. The identified factors, revealed through unsupervised machine learning, can facilitate personalized prevention and management strategies for root caries in older adults.</p>","PeriodicalId":48841,"journal":{"name":"Clinical Interventions in Aging","volume":"20 ","pages":"483-493"},"PeriodicalIF":3.5000,"publicationDate":"2025-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12036594/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical Interventions in Aging","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2147/CIA.S520229","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"GERIATRICS & GERONTOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Purpose: This study aimed to identify the high-risk group for root caries using unsupervised machine learning and to explore the associated factors.

Patients and methods: This cross-sectional study included 423 adults aged 65 to 74 years, surveyed in 2021. Clusters representing root caries risk were identified using k-prototypes clustering, with the optimal number of clusters determined by the maximum silhouette index. The confusion matrix and alluvial diagram were used to visualize the predictive accuracy and composition of the clustering results. Binary logistic regression models further analyzed the associated factors, while ROC (receiver operating characteristic) curves and the random forest model visualized the predictive performance and the most important associated factors.

Results: Two clusters were identified: cluster 1, with low root caries risk (21.5% with and 78.5% without root caries), and cluster 2, with high root caries risk (83.7% with and 16.3% without root caries). The clustering results predicted root caries with an accuracy of 0.81, sensitivity of 0.79, and specificity of 0.83. Overlapping results from binary logistic regression and the random forest model indicated that older age, more periodontal pockets, more attachment loss, female, a history of systemic diseases, presence of xerostomia, and presence of unrestored tooth loss were positively associated with cluster 2. Brushing tooth ≥2 times per day and a high level of oral health knowledge were negatively associated with cluster 2. The ROC curve for the binary logistic regression model showed an AUC (area under the curve) of 0.84.

Conclusion: Individuals who are older, female, with poorer oral and systemic health status, suboptimal oral hygiene behaviors, and lower oral health knowledge levels are more likely to be identified as high-risk group. The identified factors, revealed through unsupervised machine learning, can facilitate personalized prevention and management strategies for root caries in older adults.

使用无监督机器学习聚类识别老年人牙根龋风险群体及相关因素分析。
目的:本研究旨在利用无监督机器学习识别牙根龋的高危人群,并探讨相关因素。患者和方法:这项横断面研究包括423名65至74岁的成年人,于2021年接受调查。采用k-原型聚类方法确定代表根龋风险的聚类,最佳聚类数由最大剪影指数确定。使用混淆矩阵和冲积图来可视化聚类结果的预测精度和组成。二元logistic回归模型进一步分析相关因素,ROC (receiver operating characteristic)曲线和随机森林模型可视化预测性能和最重要的相关因素。结果:共鉴定出两组龋病患者:第1组龋病风险低(21.5%有龋,78.5%无龋),第2组龋病风险高(83.7%有龋,16.3%无龋)。聚类结果预测牙根龋的准确性为0.81,敏感性为0.79,特异性为0.83。二元logistic回归和随机森林模型的重叠结果表明,年龄较大、牙周袋较多、附着物丢失较多、女性、系统性疾病史、存在口干症和存在未修复的牙齿脱落与聚类2呈正相关。每天刷牙≥2次和口腔健康知识水平高与聚类2呈负相关。二元logistic回归模型的ROC曲线显示AUC(曲线下面积)为0.84。结论:年龄较大、女性、口腔及全身健康状况较差、口腔卫生行为不佳、口腔卫生知识水平较低的人群更容易被确定为高危人群。通过无监督机器学习揭示确定的因素,可以促进老年人牙根龋的个性化预防和管理策略。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Clinical Interventions in Aging
Clinical Interventions in Aging GERIATRICS & GERONTOLOGY-
CiteScore
6.80
自引率
2.80%
发文量
193
审稿时长
6-12 weeks
期刊介绍: Clinical Interventions in Aging, is an online, peer reviewed, open access journal focusing on concise rapid reporting of original research and reviews in aging. Special attention will be given to papers reporting on actual or potential clinical applications leading to improved prevention or treatment of disease or a greater understanding of pathological processes that result from maladaptive changes in the body associated with aging. This journal is directed at a wide array of scientists, engineers, pharmacists, pharmacologists and clinical specialists wishing to maintain an up to date knowledge of this exciting and emerging field.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信