基于人工智能的骨龄评估工具与Greulich-Pyle和Tanner-Whitehouse version 2方法在儿科队列中的概念验证比较

IF 2.3 3区 医学 Q2 PEDIATRICS
Luca Marinelli, Antonio Lo Mastro, Francesca Grassi, Daniela Berritto, Anna Russo, Vittorio Patanè, Anna Festa, Enrico Grassi, Anna Grandone, Luigi Aurelio Nasto, Enrico Pola, Alfonso Reginelli
{"title":"基于人工智能的骨龄评估工具与Greulich-Pyle和Tanner-Whitehouse version 2方法在儿科队列中的概念验证比较","authors":"Luca Marinelli, Antonio Lo Mastro, Francesca Grassi, Daniela Berritto, Anna Russo, Vittorio Patanè, Anna Festa, Enrico Grassi, Anna Grandone, Luigi Aurelio Nasto, Enrico Pola, Alfonso Reginelli","doi":"10.1007/s00247-025-06405-0","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Bone age assessment is essential in evaluating pediatric growth disorders. Artificial intelligence (AI) systems offer potential improvements in accuracy and reproducibility compared to traditional methods.</p><p><strong>Objective: </strong>To compare the performance of a commercially available artificial intelligence-based software (BoneView BoneAge, Gleamer, Paris, France) against two human-assessed methods-the Greulich-Pyle (GP) atlas and Tanner-Whitehouse version 2 (TW2)-in a pediatric population.</p><p><strong>Materials and methods: </strong>This proof-of-concept study included 203 pediatric patients (mean age, 9.0 years; range, 2.0-17.0 years) who underwent hand and wrist radiographs for suspected endocrine or growth-related conditions. After excluding technically inadequate images, 157 cases were analyzed using AI and GP-assessed methods. A subset of 35 patients was also evaluated using the TW2 method by a pediatric endocrinologist. Performance was measured using mean absolute error (MAE), root mean square error (RMSE), bias, and Pearson's correlation coefficient, using chronological age as reference.</p><p><strong>Results: </strong>The AI model achieved a MAE of 1.38 years, comparable to the radiologist's GP-based estimate (MAE, 1.30 years), and superior to TW2 (MAE, 2.86 years). RMSE values were 1.75 years, 1.80 years, and 3.88 years, respectively. AI showed minimal bias (-0.05 years), while TW2-based assessments systematically underestimated bone age (bias, -2.63 years). Strong correlations with chronological age were observed for AI (r=0.857) and GP (r=0.894), but not for TW2 (r=0.490).</p><p><strong>Conclusion: </strong>BoneView demonstrated comparable accuracy to radiologist-assessed GP method and outperformed TW2 assessments in this cohort. AI-based systems may enhance consistency in pediatric bone age estimation but require careful validation, especially in ethnically diverse populations.</p>","PeriodicalId":19755,"journal":{"name":"Pediatric Radiology","volume":" ","pages":""},"PeriodicalIF":2.3000,"publicationDate":"2025-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Proof-of-concept comparison of an artificial intelligence-based bone age assessment tool with Greulich-Pyle and Tanner-Whitehouse version 2 methods in a pediatric cohort.\",\"authors\":\"Luca Marinelli, Antonio Lo Mastro, Francesca Grassi, Daniela Berritto, Anna Russo, Vittorio Patanè, Anna Festa, Enrico Grassi, Anna Grandone, Luigi Aurelio Nasto, Enrico Pola, Alfonso Reginelli\",\"doi\":\"10.1007/s00247-025-06405-0\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Bone age assessment is essential in evaluating pediatric growth disorders. Artificial intelligence (AI) systems offer potential improvements in accuracy and reproducibility compared to traditional methods.</p><p><strong>Objective: </strong>To compare the performance of a commercially available artificial intelligence-based software (BoneView BoneAge, Gleamer, Paris, France) against two human-assessed methods-the Greulich-Pyle (GP) atlas and Tanner-Whitehouse version 2 (TW2)-in a pediatric population.</p><p><strong>Materials and methods: </strong>This proof-of-concept study included 203 pediatric patients (mean age, 9.0 years; range, 2.0-17.0 years) who underwent hand and wrist radiographs for suspected endocrine or growth-related conditions. After excluding technically inadequate images, 157 cases were analyzed using AI and GP-assessed methods. A subset of 35 patients was also evaluated using the TW2 method by a pediatric endocrinologist. Performance was measured using mean absolute error (MAE), root mean square error (RMSE), bias, and Pearson's correlation coefficient, using chronological age as reference.</p><p><strong>Results: </strong>The AI model achieved a MAE of 1.38 years, comparable to the radiologist's GP-based estimate (MAE, 1.30 years), and superior to TW2 (MAE, 2.86 years). RMSE values were 1.75 years, 1.80 years, and 3.88 years, respectively. AI showed minimal bias (-0.05 years), while TW2-based assessments systematically underestimated bone age (bias, -2.63 years). Strong correlations with chronological age were observed for AI (r=0.857) and GP (r=0.894), but not for TW2 (r=0.490).</p><p><strong>Conclusion: </strong>BoneView demonstrated comparable accuracy to radiologist-assessed GP method and outperformed TW2 assessments in this cohort. AI-based systems may enhance consistency in pediatric bone age estimation but require careful validation, especially in ethnically diverse populations.</p>\",\"PeriodicalId\":19755,\"journal\":{\"name\":\"Pediatric Radiology\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2025-09-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Pediatric Radiology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1007/s00247-025-06405-0\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"PEDIATRICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pediatric Radiology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s00247-025-06405-0","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PEDIATRICS","Score":null,"Total":0}
引用次数: 0

摘要

背景:骨龄评估是评估儿童生长障碍的必要条件。与传统方法相比,人工智能(AI)系统在准确性和可重复性方面提供了潜在的改进。目的:比较市售人工智能软件(BoneView BoneAge, Gleamer, Paris, France)与两种人类评估方法(Greulich-Pyle (GP)图谱和Tanner-Whitehouse version 2 (TW2))在儿科人群中的表现。材料和方法:这项概念验证研究包括203名儿童患者(平均年龄9.0岁,范围2.0-17.0岁),他们因疑似内分泌或生长相关疾病接受了手腕部x线片检查。在排除了技术上不充分的图像后,使用AI和gp评估方法分析了157例病例。一名儿科内分泌学家也使用TW2方法对35名患者进行了评估。以实际年龄为参考,采用平均绝对误差(MAE)、均方根误差(RMSE)、偏倚和Pearson相关系数来衡量绩效。结果:AI模型的MAE为1.38年,与放射科医生基于gp的估计(MAE, 1.30年)相当,优于TW2 (MAE, 2.86年)。RMSE值分别为1.75年、1.80年和3.88年。AI显示最小偏差(-0.05年),而基于tw2的评估系统地低估了骨龄(偏差,-2.63年)。AI (r=0.857)和GP (r=0.894)与实足年龄密切相关,而TW2 (r=0.490)与实足年龄无关。结论:BoneView显示出与放射科医师评估的GP方法相当的准确性,并且在该队列中优于TW2评估。基于人工智能的系统可以提高儿童骨龄估计的一致性,但需要仔细验证,特别是在不同种族的人群中。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Proof-of-concept comparison of an artificial intelligence-based bone age assessment tool with Greulich-Pyle and Tanner-Whitehouse version 2 methods in a pediatric cohort.

Background: Bone age assessment is essential in evaluating pediatric growth disorders. Artificial intelligence (AI) systems offer potential improvements in accuracy and reproducibility compared to traditional methods.

Objective: To compare the performance of a commercially available artificial intelligence-based software (BoneView BoneAge, Gleamer, Paris, France) against two human-assessed methods-the Greulich-Pyle (GP) atlas and Tanner-Whitehouse version 2 (TW2)-in a pediatric population.

Materials and methods: This proof-of-concept study included 203 pediatric patients (mean age, 9.0 years; range, 2.0-17.0 years) who underwent hand and wrist radiographs for suspected endocrine or growth-related conditions. After excluding technically inadequate images, 157 cases were analyzed using AI and GP-assessed methods. A subset of 35 patients was also evaluated using the TW2 method by a pediatric endocrinologist. Performance was measured using mean absolute error (MAE), root mean square error (RMSE), bias, and Pearson's correlation coefficient, using chronological age as reference.

Results: The AI model achieved a MAE of 1.38 years, comparable to the radiologist's GP-based estimate (MAE, 1.30 years), and superior to TW2 (MAE, 2.86 years). RMSE values were 1.75 years, 1.80 years, and 3.88 years, respectively. AI showed minimal bias (-0.05 years), while TW2-based assessments systematically underestimated bone age (bias, -2.63 years). Strong correlations with chronological age were observed for AI (r=0.857) and GP (r=0.894), but not for TW2 (r=0.490).

Conclusion: BoneView demonstrated comparable accuracy to radiologist-assessed GP method and outperformed TW2 assessments in this cohort. AI-based systems may enhance consistency in pediatric bone age estimation but require careful validation, especially in ethnically diverse populations.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Pediatric Radiology
Pediatric Radiology 医学-核医学
CiteScore
4.40
自引率
17.40%
发文量
300
审稿时长
3-6 weeks
期刊介绍: Official Journal of the European Society of Pediatric Radiology, the Society for Pediatric Radiology and the Asian and Oceanic Society for Pediatric Radiology Pediatric Radiology informs its readers of new findings and progress in all areas of pediatric imaging and in related fields. This is achieved by a blend of original papers, complemented by reviews that set out the present state of knowledge in a particular area of the specialty or summarize specific topics in which discussion has led to clear conclusions. Advances in technology, methodology, apparatus and auxiliary equipment are presented, and modifications of standard techniques are described. Manuscripts submitted for publication must contain a statement to the effect that all human studies have been reviewed by the appropriate ethics committee and have therefore been performed in accordance with the ethical standards laid down in an appropriate version of the 1964 Declaration of Helsinki. It should also be stated clearly in the text that all persons gave their informed consent prior to their inclusion in the study. Details that might disclose the identity of the subjects under study should be omitted.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信