{"title":"An innovative transfer learning-polyp detection from wireless capsule endoscopy videos with optimal key frame selection and depth estimation","authors":"Madhura Prakash M, Krishnamurthy G.N","doi":"10.1016/j.bspc.2025.107963","DOIUrl":null,"url":null,"abstract":"<div><div>In modern technology, the Wireless Capsule Endoscopy (WCE) is used to analyze and diagnose the small intestine part of the human body in a non-invasive manner. It consists of similar and non-modified information regarding texture and color, the absence of shot boundary makes the conventional keyframe mining and shot detection technique ineffective for this purpose. To resolve this issue, a deep learning-oriented keyframe mining method for extracting the keyframes from the captured video using the WCE procedure is suggested in this research work. With the help of benchmark databases, the endoscopic videos are collected. The gathered video is composed of numerous images frame in order to estimate the depth of the video. Here, the transfer learning techniques are adopted here for estimating the depths. The MobileNetV2 model’s encoder-decoder unit is attached to the UNet model of TransUNet + system to improve the rate of estimation of the depths. The deployed transfer learning model is named as TransUnet + with MobileNetv2 (TU-MNetv2). A new heuristic algorithm called the Improved Fitness-based American Zebra Optimization Algorithm (IF-AZOA) is implemented to select the ideal keyframes in terms of constraints like entropy information, moment of the images in depth estimated frame, key points and edge density. The estimated depth results are compared with the results obtained from several conventional classifiers and heuristic algorithms in order to prove the better performance obtained by the implemented depth estimation technique.</div></div>","PeriodicalId":55362,"journal":{"name":"Biomedical Signal Processing and Control","volume":"108 ","pages":"Article 107963"},"PeriodicalIF":4.9000,"publicationDate":"2025-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Signal Processing and Control","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1746809425004744","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0
Abstract
In modern technology, the Wireless Capsule Endoscopy (WCE) is used to analyze and diagnose the small intestine part of the human body in a non-invasive manner. It consists of similar and non-modified information regarding texture and color, the absence of shot boundary makes the conventional keyframe mining and shot detection technique ineffective for this purpose. To resolve this issue, a deep learning-oriented keyframe mining method for extracting the keyframes from the captured video using the WCE procedure is suggested in this research work. With the help of benchmark databases, the endoscopic videos are collected. The gathered video is composed of numerous images frame in order to estimate the depth of the video. Here, the transfer learning techniques are adopted here for estimating the depths. The MobileNetV2 model’s encoder-decoder unit is attached to the UNet model of TransUNet + system to improve the rate of estimation of the depths. The deployed transfer learning model is named as TransUnet + with MobileNetv2 (TU-MNetv2). A new heuristic algorithm called the Improved Fitness-based American Zebra Optimization Algorithm (IF-AZOA) is implemented to select the ideal keyframes in terms of constraints like entropy information, moment of the images in depth estimated frame, key points and edge density. The estimated depth results are compared with the results obtained from several conventional classifiers and heuristic algorithms in order to prove the better performance obtained by the implemented depth estimation technique.
期刊介绍:
Biomedical Signal Processing and Control aims to provide a cross-disciplinary international forum for the interchange of information on research in the measurement and analysis of signals and images in clinical medicine and the biological sciences. Emphasis is placed on contributions dealing with the practical, applications-led research on the use of methods and devices in clinical diagnosis, patient monitoring and management.
Biomedical Signal Processing and Control reflects the main areas in which these methods are being used and developed at the interface of both engineering and clinical science. The scope of the journal is defined to include relevant review papers, technical notes, short communications and letters. Tutorial papers and special issues will also be published.