V. Nguyen, Do-Hai-Ninh Nham, Van-Truong Pham, Thi-Thao Tran
{"title":"U-shape-based architecture with Adjusted Bilateral Guided Aggregation Layer for nuclei image segmentation","authors":"V. Nguyen, Do-Hai-Ninh Nham, Van-Truong Pham, Thi-Thao Tran","doi":"10.1109/ICCE55644.2022.9852083","DOIUrl":null,"url":null,"abstract":"Medical image segmentation with AI models has been showing its incredible improvements recently and one of the most significant building blocks is U-shape architecture. With that being said, a big issue of encoder-decoder based models is the lack of semantic information during the decoding process despite the backing up of the skip connection layer. In this paper, we address this challenge with our proposed model. Inheriting the idea of spatial path and semantic path from BiSeNetV2, we replaced the skip connection between encoder and decoder blocks with Adjusted Bilateral Guided Aggregation (ABGA) layer. In addition, we also leveraged the efficiency and elegance from EffecientNet as the model’s encoder. Besides that, a new loss function based on Kulczycki 2 coefficient is introduced.","PeriodicalId":388547,"journal":{"name":"2022 IEEE Ninth International Conference on Communications and Electronics (ICCE)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE Ninth International Conference on Communications and Electronics (ICCE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCE55644.2022.9852083","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Medical image segmentation with AI models has been showing its incredible improvements recently and one of the most significant building blocks is U-shape architecture. With that being said, a big issue of encoder-decoder based models is the lack of semantic information during the decoding process despite the backing up of the skip connection layer. In this paper, we address this challenge with our proposed model. Inheriting the idea of spatial path and semantic path from BiSeNetV2, we replaced the skip connection between encoder and decoder blocks with Adjusted Bilateral Guided Aggregation (ABGA) layer. In addition, we also leveraged the efficiency and elegance from EffecientNet as the model’s encoder. Besides that, a new loss function based on Kulczycki 2 coefficient is introduced.