Paulina Vargova, Matej Varga, Beatriz Izquierdo-Hernández, Cristina Gutierrez-Alonso, Ainara Gonazlez-Esgueda, Maria Victoria Cobos-Hernández, Rafael Fernandez-Atuan, Yurema Gonzalez-Ruiz, Paolo Bragagnini-Rodriguez, María Del-Peral-Samaniego, Carolina Corona-Bellostas
{"title":"与临床专家相比,人工智能提高了对比灌肠对先天性巨结肠疾病的诊断准确性。","authors":"Paulina Vargova, Matej Varga, Beatriz Izquierdo-Hernández, Cristina Gutierrez-Alonso, Ainara Gonazlez-Esgueda, Maria Victoria Cobos-Hernández, Rafael Fernandez-Atuan, Yurema Gonzalez-Ruiz, Paolo Bragagnini-Rodriguez, María Del-Peral-Samaniego, Carolina Corona-Bellostas","doi":"10.1055/a-2646-2052","DOIUrl":null,"url":null,"abstract":"<p><p>Contrast enema (CE) is widely used in the evaluation of suspected Hirschsprung disease (HD). Deep learning is a promising tool to standardize image assessment and support clinical decision-making. This study assesses the diagnostic performance of a deep neural network (DNN), with and without clinical data, and compares its interpretation with that of pediatric surgeons and radiologists.In this retrospective study, 1,471 CE images from patients <15 years were analyzed, with 218 images used for testing. A DNN, pediatric radiologists, and surgeons independently reviewed the testing set, with and without clinical data. Diagnostic performance was assessed using ROC and PR curves, and interobserver agreement was evaluated using Fleiss' kappa. Rectal biopsy served as the reference standard.The DNN achieved high diagnostic accuracy (area under the receiver operating characteristic curve [AUC-ROC] = 0.87) in CE interpretation, with improved performance when combining anteroposterior and lateral images (AUC-ROC = 0.92). Clinical data integration further enhanced model sensitivity and negative predictive value. The super-surgeon (majority voting of colorectal surgeons) outperformed most individual clinicians (sensitivity 81.8%, specificity 79.1%), while the super-radiologist (majority voting of radiologists) showed moderate accuracy. Interobserver analysis revealed strong agreement between the model and surgeons (Cohen's kappa = 0.73), and overall consistency among experts and the model (Fleiss' kappa = 0.62).Artificial intelligence-assisted CE interpretation achieved higher specificity and comparable sensitivity to that of the clinicians. Its consistent performance and substantial agreement with experts support its potential role in improving CE assessment in HD.</p>","PeriodicalId":56316,"journal":{"name":"European Journal of Pediatric Surgery","volume":" ","pages":""},"PeriodicalIF":1.4000,"publicationDate":"2025-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Artificial Intelligence Enhances Diagnostic Accuracy of Contrast Enemas in Hirschsprung Disease Compared to Clinical Experts.\",\"authors\":\"Paulina Vargova, Matej Varga, Beatriz Izquierdo-Hernández, Cristina Gutierrez-Alonso, Ainara Gonazlez-Esgueda, Maria Victoria Cobos-Hernández, Rafael Fernandez-Atuan, Yurema Gonzalez-Ruiz, Paolo Bragagnini-Rodriguez, María Del-Peral-Samaniego, Carolina Corona-Bellostas\",\"doi\":\"10.1055/a-2646-2052\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Contrast enema (CE) is widely used in the evaluation of suspected Hirschsprung disease (HD). Deep learning is a promising tool to standardize image assessment and support clinical decision-making. This study assesses the diagnostic performance of a deep neural network (DNN), with and without clinical data, and compares its interpretation with that of pediatric surgeons and radiologists.In this retrospective study, 1,471 CE images from patients <15 years were analyzed, with 218 images used for testing. A DNN, pediatric radiologists, and surgeons independently reviewed the testing set, with and without clinical data. Diagnostic performance was assessed using ROC and PR curves, and interobserver agreement was evaluated using Fleiss' kappa. Rectal biopsy served as the reference standard.The DNN achieved high diagnostic accuracy (area under the receiver operating characteristic curve [AUC-ROC] = 0.87) in CE interpretation, with improved performance when combining anteroposterior and lateral images (AUC-ROC = 0.92). Clinical data integration further enhanced model sensitivity and negative predictive value. The super-surgeon (majority voting of colorectal surgeons) outperformed most individual clinicians (sensitivity 81.8%, specificity 79.1%), while the super-radiologist (majority voting of radiologists) showed moderate accuracy. Interobserver analysis revealed strong agreement between the model and surgeons (Cohen's kappa = 0.73), and overall consistency among experts and the model (Fleiss' kappa = 0.62).Artificial intelligence-assisted CE interpretation achieved higher specificity and comparable sensitivity to that of the clinicians. Its consistent performance and substantial agreement with experts support its potential role in improving CE assessment in HD.</p>\",\"PeriodicalId\":56316,\"journal\":{\"name\":\"European Journal of Pediatric Surgery\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":1.4000,\"publicationDate\":\"2025-07-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"European Journal of Pediatric Surgery\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1055/a-2646-2052\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"PEDIATRICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Pediatric Surgery","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1055/a-2646-2052","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PEDIATRICS","Score":null,"Total":0}
Artificial Intelligence Enhances Diagnostic Accuracy of Contrast Enemas in Hirschsprung Disease Compared to Clinical Experts.
Contrast enema (CE) is widely used in the evaluation of suspected Hirschsprung disease (HD). Deep learning is a promising tool to standardize image assessment and support clinical decision-making. This study assesses the diagnostic performance of a deep neural network (DNN), with and without clinical data, and compares its interpretation with that of pediatric surgeons and radiologists.In this retrospective study, 1,471 CE images from patients <15 years were analyzed, with 218 images used for testing. A DNN, pediatric radiologists, and surgeons independently reviewed the testing set, with and without clinical data. Diagnostic performance was assessed using ROC and PR curves, and interobserver agreement was evaluated using Fleiss' kappa. Rectal biopsy served as the reference standard.The DNN achieved high diagnostic accuracy (area under the receiver operating characteristic curve [AUC-ROC] = 0.87) in CE interpretation, with improved performance when combining anteroposterior and lateral images (AUC-ROC = 0.92). Clinical data integration further enhanced model sensitivity and negative predictive value. The super-surgeon (majority voting of colorectal surgeons) outperformed most individual clinicians (sensitivity 81.8%, specificity 79.1%), while the super-radiologist (majority voting of radiologists) showed moderate accuracy. Interobserver analysis revealed strong agreement between the model and surgeons (Cohen's kappa = 0.73), and overall consistency among experts and the model (Fleiss' kappa = 0.62).Artificial intelligence-assisted CE interpretation achieved higher specificity and comparable sensitivity to that of the clinicians. Its consistent performance and substantial agreement with experts support its potential role in improving CE assessment in HD.
期刊介绍:
This broad-based international journal updates you on vital developments in pediatric surgery through original articles, abstracts of the literature, and meeting announcements.
You will find state-of-the-art information on:
abdominal and thoracic surgery
neurosurgery
urology
gynecology
oncology
orthopaedics
traumatology
anesthesiology
child pathology
embryology
morphology
Written by surgeons, physicians, anesthesiologists, radiologists, and others involved in the surgical care of neonates, infants, and children, the EJPS is an indispensable resource for all specialists.