一种新的轻量级深度注意网络用于组织病理图像的自动核分割

IF 6.5 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Neurocomputing Pub Date : 2025-06-27 DOI:10.1016/j.neucom.2025.130797

Rashika Bagri, Ankit Rajpal, Naveen Kumar

{"title":"一种新的轻量级深度注意网络用于组织病理图像的自动核分割","authors":"Rashika Bagri, Ankit Rajpal, Naveen Kumar","doi":"10.1016/j.neucom.2025.130797","DOIUrl":null,"url":null,"abstract":"<div><div>Cell nuclei segmentation in histopathological images is essential for developing computer-aided diagnostic (CAD) systems for cancer diagnosis and prognosis. However, this task remains challenging due to poor staining, low contrast, irregular shapes, and overlapping nuclei in histology tissue images. Traditional encoder–decoder-based architectures often struggle to capture fine-grained spatial details while also being computationally expensive. To address these challenges, we propose a novel lightweight deep attention network comprising three residual blocks in the encoder, a bottleneck block, and three wavelet-driven attention blocks in the decoder. The residual blocks used in the encoder effectively extract high-level features, while the bottleneck block captures global multi-resolution features. A newly introduced wavelet-driven attention block in the decoder leverage high-frequency two-dimensional discrete wavelet transform coefficients, that captures the finer edge-level details that are often lost during the encoding process. Evaluated on two publicly available datasets, PanNuke and TNBC, the proposed architecture achieved five-fold cross-validation Jaccard Index scores of <span><math><mrow><mn>74</mn><mo>.</mo><mn>47</mn><mo>±</mo><mn>0</mn><mo>.</mo><mn>74</mn></mrow></math></span> and <span><math><mrow><mn>71</mn><mo>.</mo><mn>21</mn><mo>±</mo><mn>0</mn><mo>.</mo><mn>82</mn></mrow></math></span>, respectively, at a 95% confidence level. The proposed architecture has significantly fewer trainable parameters and a smaller model size than existing architectures without compromising its performance. To further validate its efficacy, the model was tested on the MonuSeg dataset as an independent cohort, achieving a Jaccard Index score of <span><math><mrow><mn>64</mn><mo>.</mo><mn>17</mn><mo>±</mo><mn>1</mn><mo>.</mo><mn>75</mn></mrow></math></span>. Wilcoxon signed-rank and Scott-Knott ESD tests confirmed that the proposed architecture is statistically superior to existing models. Finally, Grad-CAM heatmaps revealed its superior focus on nuclei regions compared to conventional designs.</div></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":"649 ","pages":"Article 130797"},"PeriodicalIF":6.5000,"publicationDate":"2025-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A novel lightweight deep attention network for automated nuclei segmentation in histopathology images\",\"authors\":\"Rashika Bagri, Ankit Rajpal, Naveen Kumar\",\"doi\":\"10.1016/j.neucom.2025.130797\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Cell nuclei segmentation in histopathological images is essential for developing computer-aided diagnostic (CAD) systems for cancer diagnosis and prognosis. However, this task remains challenging due to poor staining, low contrast, irregular shapes, and overlapping nuclei in histology tissue images. Traditional encoder–decoder-based architectures often struggle to capture fine-grained spatial details while also being computationally expensive. To address these challenges, we propose a novel lightweight deep attention network comprising three residual blocks in the encoder, a bottleneck block, and three wavelet-driven attention blocks in the decoder. The residual blocks used in the encoder effectively extract high-level features, while the bottleneck block captures global multi-resolution features. A newly introduced wavelet-driven attention block in the decoder leverage high-frequency two-dimensional discrete wavelet transform coefficients, that captures the finer edge-level details that are often lost during the encoding process. Evaluated on two publicly available datasets, PanNuke and TNBC, the proposed architecture achieved five-fold cross-validation Jaccard Index scores of <span><math><mrow><mn>74</mn><mo>.</mo><mn>47</mn><mo>±</mo><mn>0</mn><mo>.</mo><mn>74</mn></mrow></math></span> and <span><math><mrow><mn>71</mn><mo>.</mo><mn>21</mn><mo>±</mo><mn>0</mn><mo>.</mo><mn>82</mn></mrow></math></span>, respectively, at a 95% confidence level. The proposed architecture has significantly fewer trainable parameters and a smaller model size than existing architectures without compromising its performance. To further validate its efficacy, the model was tested on the MonuSeg dataset as an independent cohort, achieving a Jaccard Index score of <span><math><mrow><mn>64</mn><mo>.</mo><mn>17</mn><mo>±</mo><mn>1</mn><mo>.</mo><mn>75</mn></mrow></math></span>. Wilcoxon signed-rank and Scott-Knott ESD tests confirmed that the proposed architecture is statistically superior to existing models. Finally, Grad-CAM heatmaps revealed its superior focus on nuclei regions compared to conventional designs.</div></div>\",\"PeriodicalId\":19268,\"journal\":{\"name\":\"Neurocomputing\",\"volume\":\"649 \",\"pages\":\"Article 130797\"},\"PeriodicalIF\":6.5000,\"publicationDate\":\"2025-06-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Neurocomputing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0925231225014699\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurocomputing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0925231225014699","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

组织病理学图像中的细胞核分割对于开发用于癌症诊断和预后的计算机辅助诊断（CAD）系统至关重要。然而，由于组织学组织图像中染色差、对比度低、形状不规则和细胞核重叠，这项任务仍然具有挑战性。传统的基于编码器-解码器的架构通常难以捕获细粒度的空间细节，同时计算成本也很高。为了解决这些挑战，我们提出了一种新的轻量级深度注意力网络，该网络由编码器中的三个残差块、解码器中的一个瓶颈块和三个小波驱动的注意力块组成。编码器中使用的残差块有效地提取高级特征，而瓶颈块捕获全局多分辨率特征。解码器中新引入的小波驱动的注意力块利用高频二维离散小波变换系数，捕获在编码过程中经常丢失的更精细的边缘级细节。在PanNuke和TNBC两个公开可用的数据集上进行评估，该架构在95%置信水平下分别获得了74.47±0.74和71.21±0.82的五倍交叉验证Jaccard指数得分。所提出的体系结构具有比现有体系结构更少的可训练参数和更小的模型尺寸，而不会影响其性能。为了进一步验证其有效性，将该模型作为独立队列在MonuSeg数据集上进行测试，其Jaccard Index得分为64.17±1.75。Wilcoxon sign -rank和Scott-Knott ESD测试证实，所提出的体系结构在统计上优于现有模型。最后，与传统设计相比，Grad-CAM热图显示了其对核区域的优越关注。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A novel lightweight deep attention network for automated nuclei segmentation in histopathology images

Cell nuclei segmentation in histopathological images is essential for developing computer-aided diagnostic (CAD) systems for cancer diagnosis and prognosis. However, this task remains challenging due to poor staining, low contrast, irregular shapes, and overlapping nuclei in histology tissue images. Traditional encoder–decoder-based architectures often struggle to capture fine-grained spatial details while also being computationally expensive. To address these challenges, we propose a novel lightweight deep attention network comprising three residual blocks in the encoder, a bottleneck block, and three wavelet-driven attention blocks in the decoder. The residual blocks used in the encoder effectively extract high-level features, while the bottleneck block captures global multi-resolution features. A newly introduced wavelet-driven attention block in the decoder leverage high-frequency two-dimensional discrete wavelet transform coefficients, that captures the finer edge-level details that are often lost during the encoding process. Evaluated on two publicly available datasets, PanNuke and TNBC, the proposed architecture achieved five-fold cross-validation Jaccard Index scores of

74.47 \pm 0.74

and

71.21 \pm 0.82

, respectively, at a 95% confidence level. The proposed architecture has significantly fewer trainable parameters and a smaller model size than existing architectures without compromising its performance. To further validate its efficacy, the model was tested on the MonuSeg dataset as an independent cohort, achieving a Jaccard Index score of

64.17 \pm 1.75

. Wilcoxon signed-rank and Scott-Knott ESD tests confirmed that the proposed architecture is statistically superior to existing models. Finally, Grad-CAM heatmaps revealed its superior focus on nuclei regions compared to conventional designs.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Neurocomputing 工程技术-计算机：人工智能

CiteScore

13.10

自引率

10.00%

发文量

1382

审稿时长

70 days

期刊介绍： Neurocomputing publishes articles describing recent fundamental contributions in the field of neurocomputing. Neurocomputing theory, practice and applications are the essential topics being covered.