{"title":"DivGI:深入研究消化内镜图像分类。","authors":"Qi He, Sophia Bano, Danail Stoyanov, Siyang Zuo","doi":"10.1007/s11548-025-03441-x","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>Gastrointestinal (GI) endoscopic imaging involves capturing routine anatomical landmarks and suspected lesions during endoscopic procedures for the clinical diagnosis of GI diseases. These images present three key challenges compared to typical scene images: significant class imbalance, a lack of distinctive features, and high similarity between some categories. While existing research has addressed the issue of image quantity imbalance, the challenges posed by indistinct features and inter-category similarity remain unresolved. This study proposes a unified image classification framework designed to tackle all three of these challenges comprehensively.</p><p><strong>Methods: </strong>We present a novel network architecture, DivGI, which integrates three essential strategies-balanced sampling, fine-grained classification, and multi-label classification-within a single framework. The balanced sampling strategy is implemented via resampling and mix-up techniques, fine-grained classification is enabled through multi-granularity feature learning, and multi-label classification is achieved using hierarchical label joint learning. The performance of our method is validated using three publicly available datasets.</p><p><strong>Results: </strong>Extensive experimental results demonstrate that DivGI significantly improves classification accuracy compared to existing approaches, with Matthews correlation coefficients (MCC) of 91.31% on the HyperKvasir dataset, 86.72% on the Upper GI dataset, and 82.88% on the GastroVision dataset. These results highlight that DivGI is more effective and efficient compared to existing methods.</p><p><strong>Conclusion: </strong>The proposed GI classification network, which incorporates multiple strategies, effectively classifies both routine landmark and suspected lesion images, aiming to facilitate better clinical diagnostics in gastrointestinal endoscopy. The code and data are publicly available at https://github.com/howardchina/DivGI.</p>","PeriodicalId":51251,"journal":{"name":"International Journal of Computer Assisted Radiology and Surgery","volume":" ","pages":""},"PeriodicalIF":2.3000,"publicationDate":"2025-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"DivGI: delve into digestive endoscopy image classification.\",\"authors\":\"Qi He, Sophia Bano, Danail Stoyanov, Siyang Zuo\",\"doi\":\"10.1007/s11548-025-03441-x\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Purpose: </strong>Gastrointestinal (GI) endoscopic imaging involves capturing routine anatomical landmarks and suspected lesions during endoscopic procedures for the clinical diagnosis of GI diseases. These images present three key challenges compared to typical scene images: significant class imbalance, a lack of distinctive features, and high similarity between some categories. While existing research has addressed the issue of image quantity imbalance, the challenges posed by indistinct features and inter-category similarity remain unresolved. This study proposes a unified image classification framework designed to tackle all three of these challenges comprehensively.</p><p><strong>Methods: </strong>We present a novel network architecture, DivGI, which integrates three essential strategies-balanced sampling, fine-grained classification, and multi-label classification-within a single framework. The balanced sampling strategy is implemented via resampling and mix-up techniques, fine-grained classification is enabled through multi-granularity feature learning, and multi-label classification is achieved using hierarchical label joint learning. The performance of our method is validated using three publicly available datasets.</p><p><strong>Results: </strong>Extensive experimental results demonstrate that DivGI significantly improves classification accuracy compared to existing approaches, with Matthews correlation coefficients (MCC) of 91.31% on the HyperKvasir dataset, 86.72% on the Upper GI dataset, and 82.88% on the GastroVision dataset. These results highlight that DivGI is more effective and efficient compared to existing methods.</p><p><strong>Conclusion: </strong>The proposed GI classification network, which incorporates multiple strategies, effectively classifies both routine landmark and suspected lesion images, aiming to facilitate better clinical diagnostics in gastrointestinal endoscopy. The code and data are publicly available at https://github.com/howardchina/DivGI.</p>\",\"PeriodicalId\":51251,\"journal\":{\"name\":\"International Journal of Computer Assisted Radiology and Surgery\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2025-06-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Computer Assisted Radiology and Surgery\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1007/s11548-025-03441-x\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ENGINEERING, BIOMEDICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Computer Assisted Radiology and Surgery","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1007/s11548-025-03441-x","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
DivGI: delve into digestive endoscopy image classification.
Purpose: Gastrointestinal (GI) endoscopic imaging involves capturing routine anatomical landmarks and suspected lesions during endoscopic procedures for the clinical diagnosis of GI diseases. These images present three key challenges compared to typical scene images: significant class imbalance, a lack of distinctive features, and high similarity between some categories. While existing research has addressed the issue of image quantity imbalance, the challenges posed by indistinct features and inter-category similarity remain unresolved. This study proposes a unified image classification framework designed to tackle all three of these challenges comprehensively.
Methods: We present a novel network architecture, DivGI, which integrates three essential strategies-balanced sampling, fine-grained classification, and multi-label classification-within a single framework. The balanced sampling strategy is implemented via resampling and mix-up techniques, fine-grained classification is enabled through multi-granularity feature learning, and multi-label classification is achieved using hierarchical label joint learning. The performance of our method is validated using three publicly available datasets.
Results: Extensive experimental results demonstrate that DivGI significantly improves classification accuracy compared to existing approaches, with Matthews correlation coefficients (MCC) of 91.31% on the HyperKvasir dataset, 86.72% on the Upper GI dataset, and 82.88% on the GastroVision dataset. These results highlight that DivGI is more effective and efficient compared to existing methods.
Conclusion: The proposed GI classification network, which incorporates multiple strategies, effectively classifies both routine landmark and suspected lesion images, aiming to facilitate better clinical diagnostics in gastrointestinal endoscopy. The code and data are publicly available at https://github.com/howardchina/DivGI.
期刊介绍:
The International Journal for Computer Assisted Radiology and Surgery (IJCARS) is a peer-reviewed journal that provides a platform for closing the gap between medical and technical disciplines, and encourages interdisciplinary research and development activities in an international environment.