虚拟胶囊内窥镜合成数据集

IF 11.8 1区医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Medical image analysis Pub Date : 2025-06-28 DOI:10.1016/j.media.2025.103706

Sarita Singh, Basabi Bhaumik, Shouri Chatterjee

{"title":"虚拟胶囊内窥镜合成数据集","authors":"Sarita Singh, Basabi Bhaumik, Shouri Chatterjee","doi":"10.1016/j.media.2025.103706","DOIUrl":null,"url":null,"abstract":"<div><div>The absence of accurately position-annotated datasets of the human gastrointestinal (GI) tract limits the efficient learning of deep learning-based models and their effective performance evaluation for depth and pose estimation. The currently available synthetic datasets for the GI tract lack the intrinsic anatomical features and the associated textural characteristics. In this work, we have developed a method to generate virtual models of the small and large intestines of human gastrointestinal system integrated with a virtual capsule endoscope, that generate position-annotated image dataset (SimIntestine) along with ground truth depth maps. The virtual intestines incorporate the distinctive anatomical characteristics of the real intestines, such as plicae circulares, villi, haustral folds, realistic textures; and the physiological processes such as peristalsis. The virtual endoscope navigates through the virtual intestine analogous to a real capsule endoscope and generates images that closely approximate the visual characteristics of those captured by a real endoscope. The framework additionally provides information on the camera’s orientation and position inside the virtual intestine; along with the depth information for each image pixel. The proposed framework provides a comprehensive and physically realistic annotated synthetic dataset benchmark of intestines which can be used to improve endoscopic video analysis, specifically in the domain of pose estimation and simultaneous localization and mapping which is challenging to obtain using real endoscope unannotated dataset. The SimIntestine dataset is utilized to evaluate the established benchmark techniques for depth and ego-motion estimation - Endo-SfMLearner and Monodepth2, and their results are discussed. The dataset has also been evaluated against other existing datasets, and its efficacy has been quantitatively affirmed by the enhanced performance metrics.</div></div>","PeriodicalId":18328,"journal":{"name":"Medical image analysis","volume":"105 ","pages":"Article 103706"},"PeriodicalIF":11.8000,"publicationDate":"2025-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"SimIntestine: A synthetic dataset from virtual capsule endoscope\",\"authors\":\"Sarita Singh, Basabi Bhaumik, Shouri Chatterjee\",\"doi\":\"10.1016/j.media.2025.103706\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The absence of accurately position-annotated datasets of the human gastrointestinal (GI) tract limits the efficient learning of deep learning-based models and their effective performance evaluation for depth and pose estimation. The currently available synthetic datasets for the GI tract lack the intrinsic anatomical features and the associated textural characteristics. In this work, we have developed a method to generate virtual models of the small and large intestines of human gastrointestinal system integrated with a virtual capsule endoscope, that generate position-annotated image dataset (SimIntestine) along with ground truth depth maps. The virtual intestines incorporate the distinctive anatomical characteristics of the real intestines, such as plicae circulares, villi, haustral folds, realistic textures; and the physiological processes such as peristalsis. The virtual endoscope navigates through the virtual intestine analogous to a real capsule endoscope and generates images that closely approximate the visual characteristics of those captured by a real endoscope. The framework additionally provides information on the camera’s orientation and position inside the virtual intestine; along with the depth information for each image pixel. The proposed framework provides a comprehensive and physically realistic annotated synthetic dataset benchmark of intestines which can be used to improve endoscopic video analysis, specifically in the domain of pose estimation and simultaneous localization and mapping which is challenging to obtain using real endoscope unannotated dataset. The SimIntestine dataset is utilized to evaluate the established benchmark techniques for depth and ego-motion estimation - Endo-SfMLearner and Monodepth2, and their results are discussed. The dataset has also been evaluated against other existing datasets, and its efficacy has been quantitatively affirmed by the enhanced performance metrics.</div></div>\",\"PeriodicalId\":18328,\"journal\":{\"name\":\"Medical image analysis\",\"volume\":\"105 \",\"pages\":\"Article 103706\"},\"PeriodicalIF\":11.8000,\"publicationDate\":\"2025-06-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Medical image analysis\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1361841525002531\",\"RegionNum\":1,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical image analysis","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1361841525002531","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

缺乏准确的人体胃肠道位置标注数据集限制了基于深度学习的模型的高效学习及其在深度和姿态估计方面的有效性能评估。目前可用的胃肠道合成数据集缺乏固有的解剖特征和相关的结构特征。在这项工作中，我们开发了一种方法来生成与虚拟胶囊内窥镜集成的人类胃肠道系统的小肠和大肠的虚拟模型，该模型生成位置注释图像数据集（simmintestine）以及地面真实深度图。虚拟肠道结合了真实肠道的独特解剖特征，如皱襞环、绒毛、端部褶皱、逼真的纹理；以及蠕动等生理过程。虚拟内窥镜在虚拟肠道中导航，类似于真实的胶囊内窥镜，并生成与真实内窥镜捕获的视觉特征非常接近的图像。所述框架还提供关于所述摄像机在虚拟肠道内的方向和位置的信息；以及每个图像像素的深度信息。该框架提供了一个全面且物理逼真的肠道注释合成数据集基准，可用于改进内镜视频分析，特别是在姿态估计和同步定位和映射领域，这是使用真实内镜未注释数据集难以获得的。SimIntestine数据集被用来评估深度和自我运动估计的基准技术- ento - sfmlearner和Monodepth2，并讨论了它们的结果。该数据集还与其他现有数据集进行了评估，并通过增强的性能指标定量地确认了其有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

SimIntestine: A synthetic dataset from virtual capsule endoscope

查看原文本刊更多论文

SimIntestine: A synthetic dataset from virtual capsule endoscope

The absence of accurately position-annotated datasets of the human gastrointestinal (GI) tract limits the efficient learning of deep learning-based models and their effective performance evaluation for depth and pose estimation. The currently available synthetic datasets for the GI tract lack the intrinsic anatomical features and the associated textural characteristics. In this work, we have developed a method to generate virtual models of the small and large intestines of human gastrointestinal system integrated with a virtual capsule endoscope, that generate position-annotated image dataset (SimIntestine) along with ground truth depth maps. The virtual intestines incorporate the distinctive anatomical characteristics of the real intestines, such as plicae circulares, villi, haustral folds, realistic textures; and the physiological processes such as peristalsis. The virtual endoscope navigates through the virtual intestine analogous to a real capsule endoscope and generates images that closely approximate the visual characteristics of those captured by a real endoscope. The framework additionally provides information on the camera’s orientation and position inside the virtual intestine; along with the depth information for each image pixel. The proposed framework provides a comprehensive and physically realistic annotated synthetic dataset benchmark of intestines which can be used to improve endoscopic video analysis, specifically in the domain of pose estimation and simultaneous localization and mapping which is challenging to obtain using real endoscope unannotated dataset. The SimIntestine dataset is utilized to evaluate the established benchmark techniques for depth and ego-motion estimation - Endo-SfMLearner and Monodepth2, and their results are discussed. The dataset has also been evaluated against other existing datasets, and its efficacy has been quantitatively affirmed by the enhanced performance metrics.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Medical image analysis 工程技术-工程：生物医学

CiteScore

22.10

自引率

6.40%

发文量

309

审稿时长

6.6 months

期刊介绍： Medical Image Analysis serves as a platform for sharing new research findings in the realm of medical and biological image analysis, with a focus on applications of computer vision, virtual reality, and robotics to biomedical imaging challenges. The journal prioritizes the publication of high-quality, original papers contributing to the fundamental science of processing, analyzing, and utilizing medical and biological images. It welcomes approaches utilizing biomedical image datasets across all spatial scales, from molecular/cellular imaging to tissue/organ imaging.