Muhammad Zubair Khan, Yugyung Lee, M. Khan, Arslan Munir
Towards Long-Range Pixels Connection for Context-Aware Semantic Segmentation
2022 IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI), published 2022-09-27
DOI: 10.1109/BHI56158.2022.9926855 (https://doi.org/10.1109/BHI56158.2022.9926855)
Abstract
Semantic segmentation is one of the challenging tasks in computer vision. Before the advent of deep learning, hand-crafted features were used to extract the region of interest (ROI) semantically. Deep learning has recently attracted enormous attention in semantic image segmentation. Previously developed U-Net-inspired architectures apply successive strided and pooling operations, which leads to a loss of spatial information. These methods also fail to establish long-range pixel connections that would preserve contextual knowledge and reduce spatial loss in the prediction. This article develops an encoder-decoder architecture with a sequential block embedded in the long skip connections and with densely connected convolutional blocks. The network non-linearly combines the feature maps across the encoder and decoder paths to find dependencies and correlations between image pixels. Additionally, the densely connected convolutional blocks are placed in the final encoding layer to reuse features and prevent redundant data sharing. The method applies batch normalization to reduce internal covariate shift in the data distributions. We use the LUNA, ISIC2018, and DRIVE datasets, which reflect three different segmentation problems (lung nodules, skin lesions, and vessels), to evaluate the effectiveness of the proposed architecture. The network is also compared with other techniques designed to address similar problems. The empirical results show that our method performs promisingly in comparison with other segmentation techniques.
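Two of the ideas in the abstract can be illustrated with simple bookkeeping: how repeated pooling shrinks the spatial resolution, and how a densely connected block reuses earlier features. The sketch below is not the authors' implementation. It assumes stride-2 pooling and DenseNet-style concatenation, and the function names, growth rate, layer counts, and image size are hypothetical values chosen only for illustration.

```python
def spatial_size_after_pooling(size, num_pools, stride=2):
    """Side length of a feature map after repeated pooling.

    Assumes each pooling step divides the side length by `stride`
    (floor division), as with non-overlapping 2x2 max pooling.
    """
    for _ in range(num_pools):
        size //= stride
    return size


def dense_block_channels(in_channels, growth_rate, num_layers):
    """Channel counts inside a densely connected block.

    Assumes DenseNet-style connectivity: layer i receives the
    concatenation of the block input and the outputs of all earlier
    layers, and every layer emits `growth_rate` new channels.
    Returns (input channels seen by each layer, block output channels).
    """
    layer_inputs = [in_channels + i * growth_rate for i in range(num_layers)]
    out_channels = in_channels + num_layers * growth_rate
    return layer_inputs, out_channels


# Hypothetical numbers, not taken from the paper.
side = spatial_size_after_pooling(256, num_pools=3)  # 256 -> 128 -> 64 -> 32
layer_inputs, block_out = dense_block_channels(256, growth_rate=64, num_layers=3)
print(side, layer_inputs, block_out)
```

Under these assumptions, three pooling steps reduce a 256-pixel side to 32 pixels, which is the kind of spatial loss the long skip connections are meant to compensate for, while each dense layer sees all earlier feature maps without recomputing them.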