Deep hashing for global registration of preoperative CT and video images for laparoscopic liver surgery.

IF 2.3 3区医学 Q3 ENGINEERING, BIOMEDICAL

International Journal of Computer Assisted Radiology and Surgery Pub Date : 2025-05-23 DOI:10.1007/s11548-025-03418-w

Hanyuan Zhang, Sandun Bulathsinhala, Brian R Davidson, Matthew J Clarkson, João Ramalhinho

{"title":"Deep hashing for global registration of preoperative CT and video images for laparoscopic liver surgery.","authors":"Hanyuan Zhang, Sandun Bulathsinhala, Brian R Davidson, Matthew J Clarkson, João Ramalhinho","doi":"10.1007/s11548-025-03418-w","DOIUrl":null,"url":null,"abstract":"Purpose: Registration of computed tomography (CT) to laparoscopic video images is vital to enable augmented reality (AR), a technology that holds the promise of minimising the risk of complications during laparoscopic liver surgery. Although several solutions have been presented in the literature, they always rely on an accurate initialisation of the registration that is either obtained manually or automatically estimated on very specific views of the liver. These limitations pose a challenge to the clinical translation of AR.Methods: We propose the use of a content-based image retrieval (CBIR) framework to obtain an automatic robust initialisation to the registration. Instead of directly registering video and CT, we render a dense set of possible views of the liver from CT and extract liver contour features. To reduce feature maps to lower dimension vectors, we use a deep hashing (DH) network that is trained in a triplet scheme. Registration is obtained by matching the intra-operative image hashing encoding to the closest encodings found in the pre-operative renderings.Results: We validate our method on synthetic and real data from a phantom and real patient data from eight surgeries. Phantom experiments show that registration errors acceptable for an initial registration are obtained if sufficient pre-operative solutions are considered. In seven out of eight patients, the method is able to obtain a clinically relevant alignment.Conclusion: We present the first work to adapt DH to the CT to video registration problem. Our results indicate that this framework can effectively replace manual initialisations in multiple views, potentially increasing the translation of these techniques.","PeriodicalId":51251,"journal":{"name":"International Journal of Computer Assisted Radiology and Surgery","volume":" ","pages":""},"PeriodicalIF":2.3000,"publicationDate":"2025-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Computer Assisted Radiology and Surgery","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1007/s11548-025-03418-w","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}

引用次数: 0

Abstract

Purpose: Registration of computed tomography (CT) to laparoscopic video images is vital to enable augmented reality (AR), a technology that holds the promise of minimising the risk of complications during laparoscopic liver surgery. Although several solutions have been presented in the literature, they always rely on an accurate initialisation of the registration that is either obtained manually or automatically estimated on very specific views of the liver. These limitations pose a challenge to the clinical translation of AR.

Methods: We propose the use of a content-based image retrieval (CBIR) framework to obtain an automatic robust initialisation to the registration. Instead of directly registering video and CT, we render a dense set of possible views of the liver from CT and extract liver contour features. To reduce feature maps to lower dimension vectors, we use a deep hashing (DH) network that is trained in a triplet scheme. Registration is obtained by matching the intra-operative image hashing encoding to the closest encodings found in the pre-operative renderings.

Results: We validate our method on synthetic and real data from a phantom and real patient data from eight surgeries. Phantom experiments show that registration errors acceptable for an initial registration are obtained if sufficient pre-operative solutions are considered. In seven out of eight patients, the method is able to obtain a clinically relevant alignment.

Conclusion: We present the first work to adapt DH to the CT to video registration problem. Our results indicate that this framework can effectively replace manual initialisations in multiple views, potentially increasing the translation of these techniques.

查看原文本刊更多论文

基于深度散列的腹腔镜肝脏手术术前CT和视频图像全局配准。

目的：将计算机断层扫描（CT）配准到腹腔镜视频图像对于实现增强现实（AR）至关重要，增强现实技术有望最大限度地降低腹腔镜肝脏手术期间并发症的风险。虽然文献中提出了几种解决方案，但它们总是依赖于精确的初始化注册，这种初始化注册要么是手动获得的，要么是根据非常具体的肝脏视图自动估计的。这些限制对ar的临床翻译提出了挑战。方法：我们建议使用基于内容的图像检索（CBIR）框架来获得自动鲁棒初始化注册。我们不是直接注册视频和CT，而是从CT中绘制密集的肝脏可能视图集，并提取肝脏轮廓特征。为了将特征映射减少到低维向量，我们使用了一个在三元组方案中训练的深度哈希（DH）网络。通过将术中图像哈希编码与术前渲染中找到的最接近的编码匹配来获得配准。结果：我们用8例手术的模拟数据和真实患者数据验证了我们的方法。模拟实验表明，如果考虑足够的术前解决方案，可以获得初始注册可接受的注册误差。在7 / 8的患者中，该方法能够获得临床相关的对齐。结论：本文首次提出了将DH应用于CT的视频配准问题。我们的结果表明，该框架可以有效地取代多个视图中的手动初始化，潜在地增加这些技术的转换。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Journal of Computer Assisted Radiology and Surgery ENGINEERING, BIOMEDICAL-RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING

CiteScore

5.90

自引率

6.70%

发文量

243

审稿时长

6-12 weeks

期刊介绍： The International Journal for Computer Assisted Radiology and Surgery (IJCARS) is a peer-reviewed journal that provides a platform for closing the gap between medical and technical disciplines, and encourages interdisciplinary research and development activities in an international environment.