Marcos Paulo Maia-Lima, Livian Isabel de Medeiros Carvalho, Eduarda Gomes Onofre de Araújo, Hélder Domiciano Dantas Martins, Renato Assis Machado, Livia Maria Ferreira Sobrinho, Hercílio Martelli-Júnior, Paulo Rogério Ferreti Bonan
{"title":"Performance of a virtual assistant based on ChatGPT-4 in the diagnosis of syndromes with orofacial manifestations.","authors":"Marcos Paulo Maia-Lima, Livian Isabel de Medeiros Carvalho, Eduarda Gomes Onofre de Araújo, Hélder Domiciano Dantas Martins, Renato Assis Machado, Livia Maria Ferreira Sobrinho, Hercílio Martelli-Júnior, Paulo Rogério Ferreti Bonan","doi":"10.1016/j.oooo.2025.04.002","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>To evaluate the performance of the virtual assistant \"Syndromic Diseases and Orofacial Features\" (SDOF), developed based on the Generative Pre-trained Transformer 4 model, in formulating diagnostic hypotheses and recommendations for syndromes with orofacial manifestations.</p><p><strong>Study design: </strong>Twenty-six anonymized, previously diagnosed clinical cases, including clinical features and images, were selected. The assistant was trained using scientific references and configured to generate diagnostic hypotheses and suggest complementary exams. The responses were evaluated by two oral diagnosis specialists based on criteria such as accuracy, completeness, relevance, and comprehensibility. Statistical analysis was performed using RStudio software to calculate means and standard deviations.</p><p><strong>Results: </strong>The SDOF correctly identified 96.2% of the cases, with 80.8% being the first diagnostic hypothesis and 15.4% being the second. In only one case (3.8%), the correct diagnosis was presented as the third hypothesis. The assistant performed best in the criteria \"Relevance,\" \"Practicality,\" and \"Readability,\" while \"Completeness\" and \"Up-to-dateness\" scored the lowest. Despite the high accuracy rate, the assistant failed to mention all diagnostic steps in 7.69% of the cases.</p><p><strong>Conclusions: </strong>The SDOF demonstrated significant potential to assist in the diagnosis of orofacial syndromes, with promising accuracy rates. However, the tool still requires professional supervision and improvements in completeness and up-to-dateness.</p>","PeriodicalId":49010,"journal":{"name":"Oral Surgery Oral Medicine Oral Pathology Oral Radiology","volume":" ","pages":""},"PeriodicalIF":2.0000,"publicationDate":"2025-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Oral Surgery Oral Medicine Oral Pathology Oral Radiology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.oooo.2025.04.002","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"DENTISTRY, ORAL SURGERY & MEDICINE","Score":null,"Total":0}
引用次数: 0
Abstract
Objective: To evaluate the performance of the virtual assistant "Syndromic Diseases and Orofacial Features" (SDOF), developed based on the Generative Pre-trained Transformer 4 model, in formulating diagnostic hypotheses and recommendations for syndromes with orofacial manifestations.
Study design: Twenty-six anonymized, previously diagnosed clinical cases, including clinical features and images, were selected. The assistant was trained using scientific references and configured to generate diagnostic hypotheses and suggest complementary exams. The responses were evaluated by two oral diagnosis specialists based on criteria such as accuracy, completeness, relevance, and comprehensibility. Statistical analysis was performed using RStudio software to calculate means and standard deviations.
Results: The SDOF correctly identified 96.2% of the cases, with 80.8% being the first diagnostic hypothesis and 15.4% being the second. In only one case (3.8%), the correct diagnosis was presented as the third hypothesis. The assistant performed best in the criteria "Relevance," "Practicality," and "Readability," while "Completeness" and "Up-to-dateness" scored the lowest. Despite the high accuracy rate, the assistant failed to mention all diagnostic steps in 7.69% of the cases.
Conclusions: The SDOF demonstrated significant potential to assist in the diagnosis of orofacial syndromes, with promising accuracy rates. However, the tool still requires professional supervision and improvements in completeness and up-to-dateness.
期刊介绍:
Oral Surgery, Oral Medicine, Oral Pathology and Oral Radiology is required reading for anyone in the fields of oral surgery, oral medicine, oral pathology, oral radiology or advanced general practice dentistry. It is the only major dental journal that provides a practical and complete overview of the medical and surgical techniques of dental practice in four areas. Topics covered include such current issues as dental implants, treatment of HIV-infected patients, and evaluation and treatment of TMJ disorders. The official publication for nine societies, the Journal is recommended for initial purchase in the Brandon Hill study, Selected List of Books and Journals for the Small Medical Library.