{"title":"一种新的轻量级深度注意网络用于组织病理图像的自动核分割","authors":"Rashika Bagri, Ankit Rajpal, Naveen Kumar","doi":"10.1016/j.neucom.2025.130797","DOIUrl":null,"url":null,"abstract":"<div><div>Cell nuclei segmentation in histopathological images is essential for developing computer-aided diagnostic (CAD) systems for cancer diagnosis and prognosis. However, this task remains challenging due to poor staining, low contrast, irregular shapes, and overlapping nuclei in histology tissue images. Traditional encoder–decoder-based architectures often struggle to capture fine-grained spatial details while also being computationally expensive. To address these challenges, we propose a novel lightweight deep attention network comprising three residual blocks in the encoder, a bottleneck block, and three wavelet-driven attention blocks in the decoder. The residual blocks used in the encoder effectively extract high-level features, while the bottleneck block captures global multi-resolution features. A newly introduced wavelet-driven attention block in the decoder leverage high-frequency two-dimensional discrete wavelet transform coefficients, that captures the finer edge-level details that are often lost during the encoding process. Evaluated on two publicly available datasets, PanNuke and TNBC, the proposed architecture achieved five-fold cross-validation Jaccard Index scores of <span><math><mrow><mn>74</mn><mo>.</mo><mn>47</mn><mo>±</mo><mn>0</mn><mo>.</mo><mn>74</mn></mrow></math></span> and <span><math><mrow><mn>71</mn><mo>.</mo><mn>21</mn><mo>±</mo><mn>0</mn><mo>.</mo><mn>82</mn></mrow></math></span>, respectively, at a 95% confidence level. The proposed architecture has significantly fewer trainable parameters and a smaller model size than existing architectures without compromising its performance. To further validate its efficacy, the model was tested on the MonuSeg dataset as an independent cohort, achieving a Jaccard Index score of <span><math><mrow><mn>64</mn><mo>.</mo><mn>17</mn><mo>±</mo><mn>1</mn><mo>.</mo><mn>75</mn></mrow></math></span>. Wilcoxon signed-rank and Scott-Knott ESD tests confirmed that the proposed architecture is statistically superior to existing models. Finally, Grad-CAM heatmaps revealed its superior focus on nuclei regions compared to conventional designs.</div></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":"649 ","pages":"Article 130797"},"PeriodicalIF":6.5000,"publicationDate":"2025-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A novel lightweight deep attention network for automated nuclei segmentation in histopathology images\",\"authors\":\"Rashika Bagri, Ankit Rajpal, Naveen Kumar\",\"doi\":\"10.1016/j.neucom.2025.130797\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Cell nuclei segmentation in histopathological images is essential for developing computer-aided diagnostic (CAD) systems for cancer diagnosis and prognosis. However, this task remains challenging due to poor staining, low contrast, irregular shapes, and overlapping nuclei in histology tissue images. Traditional encoder–decoder-based architectures often struggle to capture fine-grained spatial details while also being computationally expensive. To address these challenges, we propose a novel lightweight deep attention network comprising three residual blocks in the encoder, a bottleneck block, and three wavelet-driven attention blocks in the decoder. The residual blocks used in the encoder effectively extract high-level features, while the bottleneck block captures global multi-resolution features. A newly introduced wavelet-driven attention block in the decoder leverage high-frequency two-dimensional discrete wavelet transform coefficients, that captures the finer edge-level details that are often lost during the encoding process. Evaluated on two publicly available datasets, PanNuke and TNBC, the proposed architecture achieved five-fold cross-validation Jaccard Index scores of <span><math><mrow><mn>74</mn><mo>.</mo><mn>47</mn><mo>±</mo><mn>0</mn><mo>.</mo><mn>74</mn></mrow></math></span> and <span><math><mrow><mn>71</mn><mo>.</mo><mn>21</mn><mo>±</mo><mn>0</mn><mo>.</mo><mn>82</mn></mrow></math></span>, respectively, at a 95% confidence level. The proposed architecture has significantly fewer trainable parameters and a smaller model size than existing architectures without compromising its performance. To further validate its efficacy, the model was tested on the MonuSeg dataset as an independent cohort, achieving a Jaccard Index score of <span><math><mrow><mn>64</mn><mo>.</mo><mn>17</mn><mo>±</mo><mn>1</mn><mo>.</mo><mn>75</mn></mrow></math></span>. Wilcoxon signed-rank and Scott-Knott ESD tests confirmed that the proposed architecture is statistically superior to existing models. Finally, Grad-CAM heatmaps revealed its superior focus on nuclei regions compared to conventional designs.</div></div>\",\"PeriodicalId\":19268,\"journal\":{\"name\":\"Neurocomputing\",\"volume\":\"649 \",\"pages\":\"Article 130797\"},\"PeriodicalIF\":6.5000,\"publicationDate\":\"2025-06-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Neurocomputing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0925231225014699\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurocomputing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0925231225014699","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
A novel lightweight deep attention network for automated nuclei segmentation in histopathology images
Cell nuclei segmentation in histopathological images is essential for developing computer-aided diagnostic (CAD) systems for cancer diagnosis and prognosis. However, this task remains challenging due to poor staining, low contrast, irregular shapes, and overlapping nuclei in histology tissue images. Traditional encoder–decoder-based architectures often struggle to capture fine-grained spatial details while also being computationally expensive. To address these challenges, we propose a novel lightweight deep attention network comprising three residual blocks in the encoder, a bottleneck block, and three wavelet-driven attention blocks in the decoder. The residual blocks used in the encoder effectively extract high-level features, while the bottleneck block captures global multi-resolution features. A newly introduced wavelet-driven attention block in the decoder leverage high-frequency two-dimensional discrete wavelet transform coefficients, that captures the finer edge-level details that are often lost during the encoding process. Evaluated on two publicly available datasets, PanNuke and TNBC, the proposed architecture achieved five-fold cross-validation Jaccard Index scores of and , respectively, at a 95% confidence level. The proposed architecture has significantly fewer trainable parameters and a smaller model size than existing architectures without compromising its performance. To further validate its efficacy, the model was tested on the MonuSeg dataset as an independent cohort, achieving a Jaccard Index score of . Wilcoxon signed-rank and Scott-Knott ESD tests confirmed that the proposed architecture is statistically superior to existing models. Finally, Grad-CAM heatmaps revealed its superior focus on nuclei regions compared to conventional designs.
期刊介绍:
Neurocomputing publishes articles describing recent fundamental contributions in the field of neurocomputing. Neurocomputing theory, practice and applications are the essential topics being covered.