{"title":"MAPFUNet: Multi-attention Perception-Fusion U-Net for Liver Tumor Segmentation","authors":"Junding Sun, Biao Wang, Xiaosheng Wu, Chaosheng Tang, Shuihua Wang, Yudong Zhang","doi":"10.1007/s42235-024-00562-y","DOIUrl":null,"url":null,"abstract":"<div><p>The second-leading cause of cancer-related deaths globally is liver cancer. The treatment of liver cancers depends heavily on the accurate segmentation of liver tumors from CT scans. The improved method based on U-Net has achieved good performance for liver tumor segmentation, but these methods can still be improved. To deal with the problems of poor performance from the original U-Net framework in the segmentation of small-sized liver tumors and the position information of tumors that is seriously lost in the down-sampling process, we propose the Multi-attention Perception-fusion U-Net (MAPFUNet). We propose the Position ResBlock (PResBlock) in the encoder stage to promote the feature extraction capability of MAPFUNet while retaining the position information regarding liver tumors. A Dual-branch Attention Module (DWAM) is proposed in the skip connections, which narrows the semantic gap between the encoder's and decoder's features and enables the network to utilize the encoder's multi-stage and multi-scale features. We propose the Channel-wise ASPP with Attention (CAA) module at the bottleneck, which can be combined with multi-scale features and contributes to the recovery of micro-tumor feature information. Finally, we evaluated MAPFUNet on the LITS2017 dataset and the 3DIRCADB-01 dataset, with Dice values of 85.81 and 83.84% for liver tumor segmentation, which were 2.89 and 7.89% higher than the baseline model, respectively. The experiment results show that MAPFUNet is superior to other networks with better tumor feature representation and higher accuracy of liver tumor segmentation. We also extended MAPFUNet to brain tumor segmentation on the BraTS2019 dataset. The results indicate that MAPFUNet performs well on the brain tumor segmentation task, and its Dice values on the three tumor regions are 83.27% (WT), 84.77% (TC), and 76.98% (ET), respectively.</p></div>","PeriodicalId":614,"journal":{"name":"Journal of Bionic Engineering","volume":"21 5","pages":"2515 - 2539"},"PeriodicalIF":4.9000,"publicationDate":"2024-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Bionic Engineering","FirstCategoryId":"94","ListUrlMain":"https://link.springer.com/article/10.1007/s42235-024-00562-y","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
The second-leading cause of cancer-related deaths globally is liver cancer. The treatment of liver cancers depends heavily on the accurate segmentation of liver tumors from CT scans. The improved method based on U-Net has achieved good performance for liver tumor segmentation, but these methods can still be improved. To deal with the problems of poor performance from the original U-Net framework in the segmentation of small-sized liver tumors and the position information of tumors that is seriously lost in the down-sampling process, we propose the Multi-attention Perception-fusion U-Net (MAPFUNet). We propose the Position ResBlock (PResBlock) in the encoder stage to promote the feature extraction capability of MAPFUNet while retaining the position information regarding liver tumors. A Dual-branch Attention Module (DWAM) is proposed in the skip connections, which narrows the semantic gap between the encoder's and decoder's features and enables the network to utilize the encoder's multi-stage and multi-scale features. We propose the Channel-wise ASPP with Attention (CAA) module at the bottleneck, which can be combined with multi-scale features and contributes to the recovery of micro-tumor feature information. Finally, we evaluated MAPFUNet on the LITS2017 dataset and the 3DIRCADB-01 dataset, with Dice values of 85.81 and 83.84% for liver tumor segmentation, which were 2.89 and 7.89% higher than the baseline model, respectively. The experiment results show that MAPFUNet is superior to other networks with better tumor feature representation and higher accuracy of liver tumor segmentation. We also extended MAPFUNet to brain tumor segmentation on the BraTS2019 dataset. The results indicate that MAPFUNet performs well on the brain tumor segmentation task, and its Dice values on the three tumor regions are 83.27% (WT), 84.77% (TC), and 76.98% (ET), respectively.
期刊介绍:
The Journal of Bionic Engineering (JBE) is a peer-reviewed journal that publishes original research papers and reviews that apply the knowledge learned from nature and biological systems to solve concrete engineering problems. The topics that JBE covers include but are not limited to:
Mechanisms, kinematical mechanics and control of animal locomotion, development of mobile robots with walking (running and crawling), swimming or flying abilities inspired by animal locomotion.
Structures, morphologies, composition and physical properties of natural and biomaterials; fabrication of new materials mimicking the properties and functions of natural and biomaterials.
Biomedical materials, artificial organs and tissue engineering for medical applications; rehabilitation equipment and devices.
Development of bioinspired computation methods and artificial intelligence for engineering applications.