Roger D Soberanis-Mukul, Rohit Shankar, Lalithkumar Seenivasan, Jose L Porras, Masaru Ishii, Mathias Unberath
{"title":"可解释的手术时间信息:可解释的手术时间完成预测。","authors":"Roger D Soberanis-Mukul, Rohit Shankar, Lalithkumar Seenivasan, Jose L Porras, Masaru Ishii, Mathias Unberath","doi":"10.1007/s11548-025-03448-4","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>Predicting surgical time completion helps streamline surgical workflow and OR utilization, enhancing hospital efficacy. When time prediction is based on interventional video of the surgical site, time predictions may correlate with technical proficiency of the surgeon because skill is a useful proxy of completion time. To understand features that are predictive of surgical time in surgical site video, we develop prototype-like visual explanations, making them applicable to video sequences.</p><p><strong>Methods: </strong>We introduce an interpretable method for predicting surgical duration by identifying prototype-like patterns within egocentric video of the surgical site. Unlike conventional image-based prototype models that generate patch-based prototypes, our method extracts video-based explanations tied to segments of surgical videos with similar time deviation patterns. We achieve this by comparing the principal components of feature representation differences at various time points in the predictions. To effectively capture long-range dependencies in the prediction task, we employ an informer as the primary predictive model.</p><p><strong>Results: </strong>This model is applied to a dataset of 42 point-of-view craniotomy videos, collected under an approved IRB protocol. On average, our interpretable model performs better than the baseline models in surgical time completion.</p><p><strong>Conclusion: </strong>Our approach not only contributes to the interpretability of surgical time predictions but also takes full advantage of the detailed information provided by surgical video data.</p>","PeriodicalId":51251,"journal":{"name":"International Journal of Computer Assisted Radiology and Surgery","volume":" ","pages":""},"PeriodicalIF":2.3000,"publicationDate":"2025-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"The interpretable surgical temporal informer: explainable surgical time completion prediction.\",\"authors\":\"Roger D Soberanis-Mukul, Rohit Shankar, Lalithkumar Seenivasan, Jose L Porras, Masaru Ishii, Mathias Unberath\",\"doi\":\"10.1007/s11548-025-03448-4\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Purpose: </strong>Predicting surgical time completion helps streamline surgical workflow and OR utilization, enhancing hospital efficacy. When time prediction is based on interventional video of the surgical site, time predictions may correlate with technical proficiency of the surgeon because skill is a useful proxy of completion time. To understand features that are predictive of surgical time in surgical site video, we develop prototype-like visual explanations, making them applicable to video sequences.</p><p><strong>Methods: </strong>We introduce an interpretable method for predicting surgical duration by identifying prototype-like patterns within egocentric video of the surgical site. Unlike conventional image-based prototype models that generate patch-based prototypes, our method extracts video-based explanations tied to segments of surgical videos with similar time deviation patterns. We achieve this by comparing the principal components of feature representation differences at various time points in the predictions. To effectively capture long-range dependencies in the prediction task, we employ an informer as the primary predictive model.</p><p><strong>Results: </strong>This model is applied to a dataset of 42 point-of-view craniotomy videos, collected under an approved IRB protocol. On average, our interpretable model performs better than the baseline models in surgical time completion.</p><p><strong>Conclusion: </strong>Our approach not only contributes to the interpretability of surgical time predictions but also takes full advantage of the detailed information provided by surgical video data.</p>\",\"PeriodicalId\":51251,\"journal\":{\"name\":\"International Journal of Computer Assisted Radiology and Surgery\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2025-08-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Computer Assisted Radiology and Surgery\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1007/s11548-025-03448-4\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ENGINEERING, BIOMEDICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Computer Assisted Radiology and Surgery","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1007/s11548-025-03448-4","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
The interpretable surgical temporal informer: explainable surgical time completion prediction.
Purpose: Predicting surgical time completion helps streamline surgical workflow and OR utilization, enhancing hospital efficacy. When time prediction is based on interventional video of the surgical site, time predictions may correlate with technical proficiency of the surgeon because skill is a useful proxy of completion time. To understand features that are predictive of surgical time in surgical site video, we develop prototype-like visual explanations, making them applicable to video sequences.
Methods: We introduce an interpretable method for predicting surgical duration by identifying prototype-like patterns within egocentric video of the surgical site. Unlike conventional image-based prototype models that generate patch-based prototypes, our method extracts video-based explanations tied to segments of surgical videos with similar time deviation patterns. We achieve this by comparing the principal components of feature representation differences at various time points in the predictions. To effectively capture long-range dependencies in the prediction task, we employ an informer as the primary predictive model.
Results: This model is applied to a dataset of 42 point-of-view craniotomy videos, collected under an approved IRB protocol. On average, our interpretable model performs better than the baseline models in surgical time completion.
Conclusion: Our approach not only contributes to the interpretability of surgical time predictions but also takes full advantage of the detailed information provided by surgical video data.
期刊介绍:
The International Journal for Computer Assisted Radiology and Surgery (IJCARS) is a peer-reviewed journal that provides a platform for closing the gap between medical and technical disciplines, and encourages interdisciplinary research and development activities in an international environment.