Artificial Intelligence Models for Diagnosis of Periodontitis Using Non-Invasive Biological Markers: A Systematic Review and Meta-Analysis of Patient-Based Studies.
Carlos M Ardila, Anny M Vivares-Builes, Pradeep Kumar Yadalam
{"title":"Artificial Intelligence Models for Diagnosis of Periodontitis Using Non-Invasive Biological Markers: A Systematic Review and Meta-Analysis of Patient-Based Studies.","authors":"Carlos M Ardila, Anny M Vivares-Builes, Pradeep Kumar Yadalam","doi":"10.3390/medsci13030159","DOIUrl":null,"url":null,"abstract":"<p><p><b>Background/Objectives</b>: Early diagnosis of periodontitis remains challenging using traditional clinical methods. This systematic review and meta-analysis evaluated the diagnostic accuracy of artificial intelligence (AI) models trained on non-invasive or minimally invasive biomarkers-including saliva, gingival crevicular fluid (GCF), and immunologic profiles-for diagnosing and classifying periodontitis in human subjects. <b>Methods</b>: A comprehensive search of PubMed/MEDLINE, Scopus, Web of Science, EMBASE, and Cochrane CENTRAL was conducted from database inception to June 2025. Eligible studies used AI or machine learning models with patient-derived biomarker data and reported diagnostic performance metrics. <b>Results</b>: Seven studies were included, employing various AI models such as random forest, artificial neural networks, and gradient boosting. Biomarkers were derived from saliva (n = 4), saliva-derived biomarkers from oral rinse (n = 1), immunologic profiles (n = 1), and tissue-based gene expression (n = 1). Reported area under the receiver operating characteristic (ROC) curve (AUC) ranged from 0.83 to 0.96. Meta-analysis of studies with comparable outcomes showed a pooled sensitivity of 0.89 (95% CI: 0.84-0.93), a specificity of 0.87 (95% CI: 0.80-0.92), and a summary AUC of 0.92. Subgroup analysis revealed that models using salivary biomarkers achieved a higher pooled AUC (0.94) than those using GCF or immunologic markers (AUC: 0.89). Sensitivity analyses excluding studies with unclear bias did not significantly alter pooled estimates, affirming robustness. The overall certainty of evidence was rated as moderate to high. <b>Conclusions</b>: AI-based diagnostic models utilizing salivary, microbiome, or immunologic biomarkers demonstrated quantitatively high accuracy; however, the overall certainty of evidence was rated as moderate to high due to limitations in study design and validation.</p>","PeriodicalId":74152,"journal":{"name":"Medical sciences (Basel, Switzerland)","volume":"13 3","pages":""},"PeriodicalIF":4.4000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12452651/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical sciences (Basel, Switzerland)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/medsci13030159","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0
Abstract
Background/Objectives: Early diagnosis of periodontitis remains challenging using traditional clinical methods. This systematic review and meta-analysis evaluated the diagnostic accuracy of artificial intelligence (AI) models trained on non-invasive or minimally invasive biomarkers-including saliva, gingival crevicular fluid (GCF), and immunologic profiles-for diagnosing and classifying periodontitis in human subjects. Methods: A comprehensive search of PubMed/MEDLINE, Scopus, Web of Science, EMBASE, and Cochrane CENTRAL was conducted from database inception to June 2025. Eligible studies used AI or machine learning models with patient-derived biomarker data and reported diagnostic performance metrics. Results: Seven studies were included, employing various AI models such as random forest, artificial neural networks, and gradient boosting. Biomarkers were derived from saliva (n = 4), saliva-derived biomarkers from oral rinse (n = 1), immunologic profiles (n = 1), and tissue-based gene expression (n = 1). Reported area under the receiver operating characteristic (ROC) curve (AUC) ranged from 0.83 to 0.96. Meta-analysis of studies with comparable outcomes showed a pooled sensitivity of 0.89 (95% CI: 0.84-0.93), a specificity of 0.87 (95% CI: 0.80-0.92), and a summary AUC of 0.92. Subgroup analysis revealed that models using salivary biomarkers achieved a higher pooled AUC (0.94) than those using GCF or immunologic markers (AUC: 0.89). Sensitivity analyses excluding studies with unclear bias did not significantly alter pooled estimates, affirming robustness. The overall certainty of evidence was rated as moderate to high. Conclusions: AI-based diagnostic models utilizing salivary, microbiome, or immunologic biomarkers demonstrated quantitatively high accuracy; however, the overall certainty of evidence was rated as moderate to high due to limitations in study design and validation.