Daiwei Lu,Ekamjit S Deol,Tatsuki Koyama,Ipek Oguz,Nicholas L Kavoussi
{"title":"用于自动肾结石分割的计算机视觉模型及其与外科医生的性能评估。","authors":"Daiwei Lu,Ekamjit S Deol,Tatsuki Koyama,Ipek Oguz,Nicholas L Kavoussi","doi":"10.1111/bju.70001","DOIUrl":null,"url":null,"abstract":"OBJECTIVES\r\nTo develop a computer vision model that segments stones to improve visualisation during ureteroscopy (URS) and to compare model performance to that of experts.\r\n\r\nMATERIALS AND METHODS\r\nWe collected 136 videos of URS for intrarenal kidney stone treatment. Frames were extracted at 3 frames per second (FPS) and manually annotated. The video dataset was split into training (75%), validation (5%) and testing (20%) subsets. Model performance was evaluated for stone localisation, laser ablation, and final evaluation of remaining fragments based on area under the receiver-operating curve, binary cross-entropy loss and Dice similarity coefficient (DSC). Model performance was compared to the manual annotations of five board-certified urologists through pairwise comparison of frame-by-frame segmentation accuracy.\r\n\r\nRESULTS\r\nThe final dataset consisted of 21 718 frames from 38 fibreoptic and 98 digital videos. Overall, the model showed excellent performance: DSC 0.97 (interquartile range [IQR] 0.91, 0.99) and could segment at 30 FPS. Performance was similar for both fibreoptic (0.97 [IQR 0.91, 0.99]) and digital scopes (0.97 [IQR 0.92, 0.99]). Additionally, the model demonstrated good performance during stone localisation (0.98 [IQR 0.93, 0.99]) and stone laser ablation (0.96 [IQR 0.89, 0.97]), with slightly worse performance during evaluation of residual fragments (0.91 [IQR 0.50, 0.97]). Model performance was comparable to the five expert surgeons overall. In a head-to-head comparison, the model significantly outperformed three of the five experts and performed similarly to the other two.\r\n\r\nCONCLUSION\r\nThe computer vision model demonstrates good performance for task-specific stone segmentation evaluation during URS. The segmentation performance of the model was similar to the segmentation performance of expert surgeons, demonstrating the feasibility of its real-time intra-operative utilisation.","PeriodicalId":8985,"journal":{"name":"BJU International","volume":"41 1","pages":""},"PeriodicalIF":4.4000,"publicationDate":"2025-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A computer vision model for automated kidney stone segmentation and evaluation of its performance vs surgeons.\",\"authors\":\"Daiwei Lu,Ekamjit S Deol,Tatsuki Koyama,Ipek Oguz,Nicholas L Kavoussi\",\"doi\":\"10.1111/bju.70001\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"OBJECTIVES\\r\\nTo develop a computer vision model that segments stones to improve visualisation during ureteroscopy (URS) and to compare model performance to that of experts.\\r\\n\\r\\nMATERIALS AND METHODS\\r\\nWe collected 136 videos of URS for intrarenal kidney stone treatment. Frames were extracted at 3 frames per second (FPS) and manually annotated. The video dataset was split into training (75%), validation (5%) and testing (20%) subsets. Model performance was evaluated for stone localisation, laser ablation, and final evaluation of remaining fragments based on area under the receiver-operating curve, binary cross-entropy loss and Dice similarity coefficient (DSC). Model performance was compared to the manual annotations of five board-certified urologists through pairwise comparison of frame-by-frame segmentation accuracy.\\r\\n\\r\\nRESULTS\\r\\nThe final dataset consisted of 21 718 frames from 38 fibreoptic and 98 digital videos. Overall, the model showed excellent performance: DSC 0.97 (interquartile range [IQR] 0.91, 0.99) and could segment at 30 FPS. Performance was similar for both fibreoptic (0.97 [IQR 0.91, 0.99]) and digital scopes (0.97 [IQR 0.92, 0.99]). Additionally, the model demonstrated good performance during stone localisation (0.98 [IQR 0.93, 0.99]) and stone laser ablation (0.96 [IQR 0.89, 0.97]), with slightly worse performance during evaluation of residual fragments (0.91 [IQR 0.50, 0.97]). Model performance was comparable to the five expert surgeons overall. In a head-to-head comparison, the model significantly outperformed three of the five experts and performed similarly to the other two.\\r\\n\\r\\nCONCLUSION\\r\\nThe computer vision model demonstrates good performance for task-specific stone segmentation evaluation during URS. The segmentation performance of the model was similar to the segmentation performance of expert surgeons, demonstrating the feasibility of its real-time intra-operative utilisation.\",\"PeriodicalId\":8985,\"journal\":{\"name\":\"BJU International\",\"volume\":\"41 1\",\"pages\":\"\"},\"PeriodicalIF\":4.4000,\"publicationDate\":\"2025-09-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BJU International\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1111/bju.70001\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"UROLOGY & NEPHROLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BJU International","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1111/bju.70001","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"UROLOGY & NEPHROLOGY","Score":null,"Total":0}
A computer vision model for automated kidney stone segmentation and evaluation of its performance vs surgeons.
OBJECTIVES
To develop a computer vision model that segments stones to improve visualisation during ureteroscopy (URS) and to compare model performance to that of experts.
MATERIALS AND METHODS
We collected 136 videos of URS for intrarenal kidney stone treatment. Frames were extracted at 3 frames per second (FPS) and manually annotated. The video dataset was split into training (75%), validation (5%) and testing (20%) subsets. Model performance was evaluated for stone localisation, laser ablation, and final evaluation of remaining fragments based on area under the receiver-operating curve, binary cross-entropy loss and Dice similarity coefficient (DSC). Model performance was compared to the manual annotations of five board-certified urologists through pairwise comparison of frame-by-frame segmentation accuracy.
RESULTS
The final dataset consisted of 21 718 frames from 38 fibreoptic and 98 digital videos. Overall, the model showed excellent performance: DSC 0.97 (interquartile range [IQR] 0.91, 0.99) and could segment at 30 FPS. Performance was similar for both fibreoptic (0.97 [IQR 0.91, 0.99]) and digital scopes (0.97 [IQR 0.92, 0.99]). Additionally, the model demonstrated good performance during stone localisation (0.98 [IQR 0.93, 0.99]) and stone laser ablation (0.96 [IQR 0.89, 0.97]), with slightly worse performance during evaluation of residual fragments (0.91 [IQR 0.50, 0.97]). Model performance was comparable to the five expert surgeons overall. In a head-to-head comparison, the model significantly outperformed three of the five experts and performed similarly to the other two.
CONCLUSION
The computer vision model demonstrates good performance for task-specific stone segmentation evaluation during URS. The segmentation performance of the model was similar to the segmentation performance of expert surgeons, demonstrating the feasibility of its real-time intra-operative utilisation.
期刊介绍:
BJUI is one of the most highly respected medical journals in the world, with a truly international range of published papers and appeal. Every issue gives invaluable practical information in the form of original articles, reviews, comments, surgical education articles, and translational science articles in the field of urology. BJUI employs topical sections, and is in full colour, making it easier to browse or search for something specific.