Yuren Hu , Zengqiang Yan , Zhuo Kuang , Xianbo Deng , Li Yu
{"title":"Transformer-based weakly supervised intracerebral hemorrhage segmentation using image-level labels","authors":"Yuren Hu , Zengqiang Yan , Zhuo Kuang , Xianbo Deng , Li Yu","doi":"10.1016/j.hest.2025.04.003","DOIUrl":null,"url":null,"abstract":"<div><h3>Objective</h3><div>The segmentation of intracerebral hemorrhage (ICH) lesions in brain CT scans is of paramount importance for the diagnosis and treatment of stroke. Given the tremendous challenge of pixel-wise annotation in intracerebral hemorrhage, weakly supervised segmentation for ICH based on image-level labels has drawn great attention. Typical methods constructed based on convolutional neural networks often suffer from insufficient global perception, making it difficult to address ICH lesion diversity. Therefore, vision transformer, building pair-wise global dependency, becomes a popular alternative. Unfortunately, the data-hungry nature of vision transformer hinders its full exploitation given relatively limited medical imaging data, resulting in over-smoothing issue.</div></div><div><h3>Methods</h3><div>In this paper, based on the observation that most patches/tokens tend to build pair-wise dependency with intracerebral hemorrhage lesion, we propose weighted attention fusion (WAF) to fully utilize over-smoothing attention maps produced by ViT under conditions of limited training data. Compared to existing research, no additional parameters or computational complexity is introduced by WAF when incorporating target-relevant information. In addition, to recall low-confidence/-salient regions in segmentation, a patch-erasing re-activation mechanism is proposed by forcing the model to explore more class-specific regions.</div></div><div><h3>Results</h3><div>Experimental results on three datasets, <em>i.e.</em>, INSTANCE2022, LocalBrainCT and BraTS2021 demonstrates the effectiveness of the proposed ICH weakly supervised segmentation framework. Compared to the previous works on the weakly supervised sementation, the proposed architecture obtains the state-of-the-art performance on intracerebral hemorrhage segmentation (Dice of 72.39).</div></div><div><h3>Conclusion</h3><div>This study focus on weakly supervised intracerebral hemorrhage segmentation, and propose a transformer-based framework with weighted attention fusion module and patch-erasing re-activation mechanism. It achives superior performance than previous methods under various settings.</div></div>","PeriodicalId":33969,"journal":{"name":"Brain Hemorrhages","volume":"6 5","pages":"Pages 195-205"},"PeriodicalIF":1.3000,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Brain Hemorrhages","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2589238X25000361","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"CLINICAL NEUROLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Objective
The segmentation of intracerebral hemorrhage (ICH) lesions in brain CT scans is of paramount importance for the diagnosis and treatment of stroke. Given the tremendous challenge of pixel-wise annotation in intracerebral hemorrhage, weakly supervised segmentation for ICH based on image-level labels has drawn great attention. Typical methods constructed based on convolutional neural networks often suffer from insufficient global perception, making it difficult to address ICH lesion diversity. Therefore, vision transformer, building pair-wise global dependency, becomes a popular alternative. Unfortunately, the data-hungry nature of vision transformer hinders its full exploitation given relatively limited medical imaging data, resulting in over-smoothing issue.
Methods
In this paper, based on the observation that most patches/tokens tend to build pair-wise dependency with intracerebral hemorrhage lesion, we propose weighted attention fusion (WAF) to fully utilize over-smoothing attention maps produced by ViT under conditions of limited training data. Compared to existing research, no additional parameters or computational complexity is introduced by WAF when incorporating target-relevant information. In addition, to recall low-confidence/-salient regions in segmentation, a patch-erasing re-activation mechanism is proposed by forcing the model to explore more class-specific regions.
Results
Experimental results on three datasets, i.e., INSTANCE2022, LocalBrainCT and BraTS2021 demonstrates the effectiveness of the proposed ICH weakly supervised segmentation framework. Compared to the previous works on the weakly supervised sementation, the proposed architecture obtains the state-of-the-art performance on intracerebral hemorrhage segmentation (Dice of 72.39).
Conclusion
This study focus on weakly supervised intracerebral hemorrhage segmentation, and propose a transformer-based framework with weighted attention fusion module and patch-erasing re-activation mechanism. It achives superior performance than previous methods under various settings.