{"title":"聊天生成预训练Transformer-4.0在评估颈椎和手腕成熟阶段的准确性:一项回顾性研究。","authors":"Meryem Akpınar, Farhad Salmanpour","doi":"10.1016/j.ajodo.2025.08.010","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>This study aimed to evaluate the diagnostic accuracy of Chat Generative Pretrained Transformer version 4.0 (ChatGPT-4.0) in determining cervical vertebrae and hand-wrist maturation stages using cephalometric and hand-wrist radiographic films.</p><p><strong>Methods: </strong>A retrospective analysis was conducted on 238 subjects who had cephalometric and hand-wrist radiographs taken on the same day. Each hand-wrist maturation stage was independently evaluated by 3 orthodontists using the method described by Björk and Helm, whereas cervical vertebrae maturation stages were assessed following the methodology proposed by Bacetti and coworkers. These evaluations served as the reference standard for measuring the performance of ChatGPT-4.0. The hand-wrist and cephalometric radiographs were analyzed by ChatGPT-4.0, and the results were recorded by the primary researcher.</p><p><strong>Results: </strong>The model achieved its highest performance in the hand-wrist maturation stages during the RU stage, with an area under the curve (AUC) value of 0.89. However, despite high precision values in the PP3U and MP3U stages, the model exhibited low recall values, indicating that certain positive instances were missed. In other stages, particularly DP3U and MP3CAP, low precision and recall values limited classification accuracy. Regarding cervical vertebral maturation stages (CVS), the model performed best in CVS1 (AUC, 0.82; precision, 0.806), with relatively favorable AUC values observed in CVS2 (AUC, 0.77). However, its predictive performance in CVS3 and CVS6 stages was suboptimal (AUC <0.67).</p><p><strong>Conclusions: </strong>ChatGPT-4.0 demonstrated accurate predictions in the RU and CVS1 stages. However, its overall performance was found to be inferior to that of other artificial intelligence models.</p>","PeriodicalId":50806,"journal":{"name":"American Journal of Orthodontics and Dentofacial Orthopedics","volume":" ","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2025-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Chat Generative Pretrained Transformer-4.0's accuracy in assessing cervical vertebrae and hand-wrist maturation stages: A retrospective study.\",\"authors\":\"Meryem Akpınar, Farhad Salmanpour\",\"doi\":\"10.1016/j.ajodo.2025.08.010\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Introduction: </strong>This study aimed to evaluate the diagnostic accuracy of Chat Generative Pretrained Transformer version 4.0 (ChatGPT-4.0) in determining cervical vertebrae and hand-wrist maturation stages using cephalometric and hand-wrist radiographic films.</p><p><strong>Methods: </strong>A retrospective analysis was conducted on 238 subjects who had cephalometric and hand-wrist radiographs taken on the same day. Each hand-wrist maturation stage was independently evaluated by 3 orthodontists using the method described by Björk and Helm, whereas cervical vertebrae maturation stages were assessed following the methodology proposed by Bacetti and coworkers. These evaluations served as the reference standard for measuring the performance of ChatGPT-4.0. The hand-wrist and cephalometric radiographs were analyzed by ChatGPT-4.0, and the results were recorded by the primary researcher.</p><p><strong>Results: </strong>The model achieved its highest performance in the hand-wrist maturation stages during the RU stage, with an area under the curve (AUC) value of 0.89. However, despite high precision values in the PP3U and MP3U stages, the model exhibited low recall values, indicating that certain positive instances were missed. In other stages, particularly DP3U and MP3CAP, low precision and recall values limited classification accuracy. Regarding cervical vertebral maturation stages (CVS), the model performed best in CVS1 (AUC, 0.82; precision, 0.806), with relatively favorable AUC values observed in CVS2 (AUC, 0.77). However, its predictive performance in CVS3 and CVS6 stages was suboptimal (AUC <0.67).</p><p><strong>Conclusions: </strong>ChatGPT-4.0 demonstrated accurate predictions in the RU and CVS1 stages. However, its overall performance was found to be inferior to that of other artificial intelligence models.</p>\",\"PeriodicalId\":50806,\"journal\":{\"name\":\"American Journal of Orthodontics and Dentofacial Orthopedics\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2025-09-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"American Journal of Orthodontics and Dentofacial Orthopedics\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1016/j.ajodo.2025.08.010\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"DENTISTRY, ORAL SURGERY & MEDICINE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"American Journal of Orthodontics and Dentofacial Orthopedics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.ajodo.2025.08.010","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"DENTISTRY, ORAL SURGERY & MEDICINE","Score":null,"Total":0}
Chat Generative Pretrained Transformer-4.0's accuracy in assessing cervical vertebrae and hand-wrist maturation stages: A retrospective study.
Introduction: This study aimed to evaluate the diagnostic accuracy of Chat Generative Pretrained Transformer version 4.0 (ChatGPT-4.0) in determining cervical vertebrae and hand-wrist maturation stages using cephalometric and hand-wrist radiographic films.
Methods: A retrospective analysis was conducted on 238 subjects who had cephalometric and hand-wrist radiographs taken on the same day. Each hand-wrist maturation stage was independently evaluated by 3 orthodontists using the method described by Björk and Helm, whereas cervical vertebrae maturation stages were assessed following the methodology proposed by Bacetti and coworkers. These evaluations served as the reference standard for measuring the performance of ChatGPT-4.0. The hand-wrist and cephalometric radiographs were analyzed by ChatGPT-4.0, and the results were recorded by the primary researcher.
Results: The model achieved its highest performance in the hand-wrist maturation stages during the RU stage, with an area under the curve (AUC) value of 0.89. However, despite high precision values in the PP3U and MP3U stages, the model exhibited low recall values, indicating that certain positive instances were missed. In other stages, particularly DP3U and MP3CAP, low precision and recall values limited classification accuracy. Regarding cervical vertebral maturation stages (CVS), the model performed best in CVS1 (AUC, 0.82; precision, 0.806), with relatively favorable AUC values observed in CVS2 (AUC, 0.77). However, its predictive performance in CVS3 and CVS6 stages was suboptimal (AUC <0.67).
Conclusions: ChatGPT-4.0 demonstrated accurate predictions in the RU and CVS1 stages. However, its overall performance was found to be inferior to that of other artificial intelligence models.
期刊介绍:
Published for more than 100 years, the American Journal of Orthodontics and Dentofacial Orthopedics remains the leading orthodontic resource. It is the official publication of the American Association of Orthodontists, its constituent societies, the American Board of Orthodontics, and the College of Diplomates of the American Board of Orthodontics. Each month its readers have access to original peer-reviewed articles that examine all phases of orthodontic treatment. Illustrated throughout, the publication includes tables, color photographs, and statistical data. Coverage includes successful diagnostic procedures, imaging techniques, bracket and archwire materials, extraction and impaction concerns, orthognathic surgery, TMJ disorders, removable appliances, and adult therapy.