Jiahui Yu , Xuna Wang , Weiming Fan , Yuping Guo , Junfen Fu , Yingke Xu
{"title":"Exploring Complexity-Calibrated morphological distribution for whole slide image classification and difficulty-grading","authors":"Jiahui Yu , Xuna Wang , Weiming Fan , Yuping Guo , Junfen Fu , Yingke Xu","doi":"10.1016/j.media.2025.103707","DOIUrl":null,"url":null,"abstract":"<div><div>Multiple Instance Learning (MIL) is essential for accurate pathological image classification under limited annotations. Global-local morphological modeling approaches have shown promise in whole slide image (WSI) analysis by aligning patches with spatial positions. However, these methods fail to differentiate samples by complexity during morphological distribution construction, treating all samples equally important for model training. This oversight disregards the impact of difficult-to-recognize samples, leading to a morphological fitting bottleneck that hinders the clinical application of deep learning across centers, subtypes, and imaging standards. To address this, we propose Complexity-Calibrated MIL (CoCaMIL) for WSI classification and difficulty grading. CoCaMIL emphasizes the synergistic effects between morphological distribution and key complexity factors, including blur, tumor size, coloring style, brightness, and stain. Specifically, we developed an image–text contrastive pretraining framework to jointly learn multiple complexity factors, enhancing morphological distribution fitting. Additionally, to reduce the tendency to focus on difficult samples overly, we introduce a complexity calibration method, which forms a distance-prioritized feature distribution by incorporating objective factors during training. CoCaMIL achieved top classification performance across three large benchmarks and established a reliable system for grading sample difficulty. To our knowledge, CoCaMIL is the first approach to construct WSI morphological representations based on the collaborative integration of complexity factors, offering a new perspective to broaden the clinical use of deep learning in digital pathology. The code is available at <span><span>https://github.com/sm8754/cocamil</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":18328,"journal":{"name":"Medical image analysis","volume":"105 ","pages":"Article 103707"},"PeriodicalIF":11.8000,"publicationDate":"2025-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical image analysis","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1361841525002543","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Multiple Instance Learning (MIL) is essential for accurate pathological image classification under limited annotations. Global-local morphological modeling approaches have shown promise in whole slide image (WSI) analysis by aligning patches with spatial positions. However, these methods fail to differentiate samples by complexity during morphological distribution construction, treating all samples equally important for model training. This oversight disregards the impact of difficult-to-recognize samples, leading to a morphological fitting bottleneck that hinders the clinical application of deep learning across centers, subtypes, and imaging standards. To address this, we propose Complexity-Calibrated MIL (CoCaMIL) for WSI classification and difficulty grading. CoCaMIL emphasizes the synergistic effects between morphological distribution and key complexity factors, including blur, tumor size, coloring style, brightness, and stain. Specifically, we developed an image–text contrastive pretraining framework to jointly learn multiple complexity factors, enhancing morphological distribution fitting. Additionally, to reduce the tendency to focus on difficult samples overly, we introduce a complexity calibration method, which forms a distance-prioritized feature distribution by incorporating objective factors during training. CoCaMIL achieved top classification performance across three large benchmarks and established a reliable system for grading sample difficulty. To our knowledge, CoCaMIL is the first approach to construct WSI morphological representations based on the collaborative integration of complexity factors, offering a new perspective to broaden the clinical use of deep learning in digital pathology. The code is available at https://github.com/sm8754/cocamil.
期刊介绍:
Medical Image Analysis serves as a platform for sharing new research findings in the realm of medical and biological image analysis, with a focus on applications of computer vision, virtual reality, and robotics to biomedical imaging challenges. The journal prioritizes the publication of high-quality, original papers contributing to the fundamental science of processing, analyzing, and utilizing medical and biological images. It welcomes approaches utilizing biomedical image datasets across all spatial scales, from molecular/cellular imaging to tissue/organ imaging.