{"title":"Research on the Diagnosis Method of Pancreatic Lesions by Endoscopic Ultrasound Based on Twin Network Structure","authors":"Xiao Xin, Huang Danping, Hu Shanshan, Shen Yang","doi":"10.1049/ipr2.70063","DOIUrl":null,"url":null,"abstract":"<p>To tackle the endoscopic ultrasound (EUS) pancreatic visual information, we propose a novel twin diagnostic network architecture (TDN) which consists of two identical feature extraction network structures. Model 1 is used to distinguish the categories of pancreatic visual information. Model 2 distinguishes whether there is cancerous information in pancreatic visual information. If cancerous information is included, gradient-weighted class activation mapping (Grad-CAM) is employed to calculate the activation heat map of visual information to present the specific location of the cancerous area in the visual information. To effectively integrate detailed texture information with abstract semantic information, we find the optimal proportion relationship required for feature fusion in each stage output feature vector dimension. The experimental results show that the classification accuracy of the TDN network can reach 98.344% for the pancreatic part and 99.471% for the specific part of the pancreas whether canceration occurs.</p>","PeriodicalId":56303,"journal":{"name":"IET Image Processing","volume":"19 1","pages":""},"PeriodicalIF":2.0000,"publicationDate":"2025-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/ipr2.70063","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IET Image Processing","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/ipr2.70063","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
To tackle the endoscopic ultrasound (EUS) pancreatic visual information, we propose a novel twin diagnostic network architecture (TDN) which consists of two identical feature extraction network structures. Model 1 is used to distinguish the categories of pancreatic visual information. Model 2 distinguishes whether there is cancerous information in pancreatic visual information. If cancerous information is included, gradient-weighted class activation mapping (Grad-CAM) is employed to calculate the activation heat map of visual information to present the specific location of the cancerous area in the visual information. To effectively integrate detailed texture information with abstract semantic information, we find the optimal proportion relationship required for feature fusion in each stage output feature vector dimension. The experimental results show that the classification accuracy of the TDN network can reach 98.344% for the pancreatic part and 99.471% for the specific part of the pancreas whether canceration occurs.
为了处理内镜超声胰腺视觉信息,我们提出了一种新的双诊断网络结构(TDN),该结构由两个相同的特征提取网络结构组成。模型1用于区分胰腺视觉信息的类别。模型2区分胰腺视觉信息中是否存在癌性信息。如果包含癌性信息,则采用梯度加权类激活映射(gradient-weighted class activation mapping, Grad-CAM)计算视觉信息的激活热图,以呈现癌性区域在视觉信息中的具体位置。为了将纹理细节信息与抽象语义信息有效融合,我们在每一阶段输出的特征向量维度中找到特征融合所需的最优比例关系。实验结果表明,TDN网络对胰腺部位的分类准确率可达98.344%,对胰腺特定部位是否发生癌变的分类准确率可达99.471%。
期刊介绍:
The IET Image Processing journal encompasses research areas related to the generation, processing and communication of visual information. The focus of the journal is the coverage of the latest research results in image and video processing, including image generation and display, enhancement and restoration, segmentation, colour and texture analysis, coding and communication, implementations and architectures as well as innovative applications.
Principal topics include:
Generation and Display - Imaging sensors and acquisition systems, illumination, sampling and scanning, quantization, colour reproduction, image rendering, display and printing systems, evaluation of image quality.
Processing and Analysis - Image enhancement, restoration, segmentation, registration, multispectral, colour and texture processing, multiresolution processing and wavelets, morphological operations, stereoscopic and 3-D processing, motion detection and estimation, video and image sequence processing.
Implementations and Architectures - Image and video processing hardware and software, design and construction, architectures and software, neural, adaptive, and fuzzy processing.
Coding and Transmission - Image and video compression and coding, compression standards, noise modelling, visual information networks, streamed video.
Retrieval and Multimedia - Storage of images and video, database design, image retrieval, video annotation and editing, mixed media incorporating visual information, multimedia systems and applications, image and video watermarking, steganography.
Applications - Innovative application of image and video processing technologies to any field, including life sciences, earth sciences, astronomy, document processing and security.
Current Special Issue Call for Papers:
Evolutionary Computation for Image Processing - https://digital-library.theiet.org/files/IET_IPR_CFP_EC.pdf
AI-Powered 3D Vision - https://digital-library.theiet.org/files/IET_IPR_CFP_AIPV.pdf
Multidisciplinary advancement of Imaging Technologies: From Medical Diagnostics and Genomics to Cognitive Machine Vision, and Artificial Intelligence - https://digital-library.theiet.org/files/IET_IPR_CFP_IST.pdf
Deep Learning for 3D Reconstruction - https://digital-library.theiet.org/files/IET_IPR_CFP_DLR.pdf