Linxin Jiang, Shaohong Huang, Daniel R Reissmann, Gerhard Schmalz, Jianbo Li
{"title":"Identification of Risk Group for Root Caries and Analysis of Associated Factors in Older Adults Using Unsupervised Machine Learning Clustering.","authors":"Linxin Jiang, Shaohong Huang, Daniel R Reissmann, Gerhard Schmalz, Jianbo Li","doi":"10.2147/CIA.S520229","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>This study aimed to identify the high-risk group for root caries using unsupervised machine learning and to explore the associated factors.</p><p><strong>Patients and methods: </strong>This cross-sectional study included 423 adults aged 65 to 74 years, surveyed in 2021. Clusters representing root caries risk were identified using k-prototypes clustering, with the optimal number of clusters determined by the maximum silhouette index. The confusion matrix and alluvial diagram were used to visualize the predictive accuracy and composition of the clustering results. Binary logistic regression models further analyzed the associated factors, while ROC (receiver operating characteristic) curves and the random forest model visualized the predictive performance and the most important associated factors.</p><p><strong>Results: </strong>Two clusters were identified: cluster 1, with low root caries risk (21.5% with and 78.5% without root caries), and cluster 2, with high root caries risk (83.7% with and 16.3% without root caries). The clustering results predicted root caries with an accuracy of 0.81, sensitivity of 0.79, and specificity of 0.83. Overlapping results from binary logistic regression and the random forest model indicated that older age, more periodontal pockets, more attachment loss, female, a history of systemic diseases, presence of xerostomia, and presence of unrestored tooth loss were positively associated with cluster 2. Brushing tooth ≥2 times per day and a high level of oral health knowledge were negatively associated with cluster 2. The ROC curve for the binary logistic regression model showed an AUC (area under the curve) of 0.84.</p><p><strong>Conclusion: </strong>Individuals who are older, female, with poorer oral and systemic health status, suboptimal oral hygiene behaviors, and lower oral health knowledge levels are more likely to be identified as high-risk group. The identified factors, revealed through unsupervised machine learning, can facilitate personalized prevention and management strategies for root caries in older adults.</p>","PeriodicalId":48841,"journal":{"name":"Clinical Interventions in Aging","volume":"20 ","pages":"483-493"},"PeriodicalIF":3.5000,"publicationDate":"2025-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12036594/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical Interventions in Aging","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2147/CIA.S520229","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"GERIATRICS & GERONTOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Purpose: This study aimed to identify the high-risk group for root caries using unsupervised machine learning and to explore the associated factors.
Patients and methods: This cross-sectional study included 423 adults aged 65 to 74 years, surveyed in 2021. Clusters representing root caries risk were identified using k-prototypes clustering, with the optimal number of clusters determined by the maximum silhouette index. The confusion matrix and alluvial diagram were used to visualize the predictive accuracy and composition of the clustering results. Binary logistic regression models further analyzed the associated factors, while ROC (receiver operating characteristic) curves and the random forest model visualized the predictive performance and the most important associated factors.
Results: Two clusters were identified: cluster 1, with low root caries risk (21.5% with and 78.5% without root caries), and cluster 2, with high root caries risk (83.7% with and 16.3% without root caries). The clustering results predicted root caries with an accuracy of 0.81, sensitivity of 0.79, and specificity of 0.83. Overlapping results from binary logistic regression and the random forest model indicated that older age, more periodontal pockets, more attachment loss, female, a history of systemic diseases, presence of xerostomia, and presence of unrestored tooth loss were positively associated with cluster 2. Brushing tooth ≥2 times per day and a high level of oral health knowledge were negatively associated with cluster 2. The ROC curve for the binary logistic regression model showed an AUC (area under the curve) of 0.84.
Conclusion: Individuals who are older, female, with poorer oral and systemic health status, suboptimal oral hygiene behaviors, and lower oral health knowledge levels are more likely to be identified as high-risk group. The identified factors, revealed through unsupervised machine learning, can facilitate personalized prevention and management strategies for root caries in older adults.
期刊介绍:
Clinical Interventions in Aging, is an online, peer reviewed, open access journal focusing on concise rapid reporting of original research and reviews in aging. Special attention will be given to papers reporting on actual or potential clinical applications leading to improved prevention or treatment of disease or a greater understanding of pathological processes that result from maladaptive changes in the body associated with aging. This journal is directed at a wide array of scientists, engineers, pharmacists, pharmacologists and clinical specialists wishing to maintain an up to date knowledge of this exciting and emerging field.