Accuracy Assessment of Human and Artificial Intelligence-Assisted Bitewing Radiography and Near-Infrared Reflectance Imaging-Based Methods for Interproximal Caries Detection: A Histological Validation.
Nicole Rodrigues, Francisco Martinez-Rus, Alicia Miguel-Calvo, Guillermo Pradíes, Maria Paz Salido
{"title":"Accuracy Assessment of Human and Artificial Intelligence-Assisted Bitewing Radiography and Near-Infrared Reflectance Imaging-Based Methods for Interproximal Caries Detection: A Histological Validation.","authors":"Nicole Rodrigues, Francisco Martinez-Rus, Alicia Miguel-Calvo, Guillermo Pradíes, Maria Paz Salido","doi":"10.1159/000546644","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>This study compared the diagnostic accuracy of interproximal caries detection using intraoral bitewing radiographs, assessed by both human operators and an artificial intelligence (AI) program, a near-infrared reflectance imaging (NIRI) system with operator-conducted assessment, and histological evaluation as the reference.</p><p><strong>Methods: </strong>100 posterior teeth with or without caries were mounted on 10 typodonts. Initially, 180 surfaces were examined, but eight were excluded for different reasons. Therefore, 171 proximal surfaces were analyzed. NIRI imaging was performed using the iTero Element 5D®, and radiographs were analyzed by operators and an AI program, Denti.AI. Results were compared with histology, assessing sensitivity (Se), specificity (Sp), positive (PPV) and negative (NPV) predictive values, F1-score, areas under receiver operating characteristic curves (AUCs), and the Fleiss Kappa coefficient (k).</p><p><strong>Results: </strong>The statistical analysis results for each diagnostic test were as follows: examiner radiographic assessment (Se = 52%, Sp = 84.6%, PPV = 71.6%, NPV = 70.3%, F1-score = 60%, AUC = 0.684, k = 0.459); NIRI (Se = 37%, Sp = 98.9%, PPV = 96.4%, NPV = 67.8%, F1-score = 52%, AUC = 0.673, k = 0.475); AI-guided radiographic assessment (Se = 13.7%, Sp = 95.9%, PPV = 71%, NPV = 59.8%, F1-score = 23%, AUC = 0.548). McNemar's test showed a p < 0.05 for all diagnostic tests.</p><p><strong>Conclusion: </strong>Both the operator-conducted NIRI system and examiner radiographic assessment demonstrated superior detection capabilities compared to the AI program. Among these methods, the examiner radiographic assessment yielded the best results, followed by the NIRI system, demonstrating its potential for clinical use.</p>","PeriodicalId":9620,"journal":{"name":"Caries Research","volume":" ","pages":"1-12"},"PeriodicalIF":2.6000,"publicationDate":"2025-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Caries Research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1159/000546644","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"DENTISTRY, ORAL SURGERY & MEDICINE","Score":null,"Total":0}
引用次数: 0
Abstract
Introduction: This study compared the diagnostic accuracy of interproximal caries detection using intraoral bitewing radiographs, assessed by both human operators and an artificial intelligence (AI) program, a near-infrared reflectance imaging (NIRI) system with operator-conducted assessment, and histological evaluation as the reference.
Methods: 100 posterior teeth with or without caries were mounted on 10 typodonts. Initially, 180 surfaces were examined, but eight were excluded for different reasons. Therefore, 171 proximal surfaces were analyzed. NIRI imaging was performed using the iTero Element 5D®, and radiographs were analyzed by operators and an AI program, Denti.AI. Results were compared with histology, assessing sensitivity (Se), specificity (Sp), positive (PPV) and negative (NPV) predictive values, F1-score, areas under receiver operating characteristic curves (AUCs), and the Fleiss Kappa coefficient (k).
Results: The statistical analysis results for each diagnostic test were as follows: examiner radiographic assessment (Se = 52%, Sp = 84.6%, PPV = 71.6%, NPV = 70.3%, F1-score = 60%, AUC = 0.684, k = 0.459); NIRI (Se = 37%, Sp = 98.9%, PPV = 96.4%, NPV = 67.8%, F1-score = 52%, AUC = 0.673, k = 0.475); AI-guided radiographic assessment (Se = 13.7%, Sp = 95.9%, PPV = 71%, NPV = 59.8%, F1-score = 23%, AUC = 0.548). McNemar's test showed a p < 0.05 for all diagnostic tests.
Conclusion: Both the operator-conducted NIRI system and examiner radiographic assessment demonstrated superior detection capabilities compared to the AI program. Among these methods, the examiner radiographic assessment yielded the best results, followed by the NIRI system, demonstrating its potential for clinical use.
期刊介绍:
''Caries Research'' publishes epidemiological, clinical and laboratory studies in dental caries, erosion and related dental diseases. Some studies build on the considerable advances already made in caries prevention, e.g. through fluoride application. Some aim to improve understanding of the increasingly important problem of dental erosion and the associated tooth wear process. Others monitor the changing pattern of caries in different populations, explore improved methods of diagnosis or evaluate methods of prevention or treatment. The broad coverage of current research has given the journal an international reputation as an indispensable source for both basic scientists and clinicians engaged in understanding, investigating and preventing dental disease.