{"title":"Improving the Performance of Human Part Segmentation Based on Swin Transformer","authors":"Juan Du, Tao Yang","doi":"10.3103/S1060992X23020030","DOIUrl":null,"url":null,"abstract":"<p>One of the current challenges in deep learning is semantic segmentation. Moreover, human part segmentation is a sub-task in image segmentation, which differs from traditional segmentation to understand the human body’s intrinsic connections. Convolutional Neural Network (CNN) has always been a standard feature extraction network in human part segmentation. Recently, the proposed Swin Transformer surpasses CNN for many image applications. However, few articles have explored the performance of Swin Transformer in human part segmentation compared to CNN. In this paper, we make a comparison experiment on this issue, and the experimental results prove that even in the area of human part segmentation and without any additional trick, the Swin Transformer has good results compared with CNN. At the same time, this paper also combines the Edge Perceiving Module (EPM) currently commonly used in CNN with Swin Transformer to prove that Swin Transformer can see the intrinsic connection of segmented parts. This research demonstrates the feasibility of applying Swin Transformer to the part segmentation of images, which is conducive to advancing image segmentation technology in the future.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"32 2","pages":"101 - 107"},"PeriodicalIF":1.0000,"publicationDate":"2023-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Optical Memory and Neural Networks","FirstCategoryId":"1085","ListUrlMain":"https://link.springer.com/article/10.3103/S1060992X23020030","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"OPTICS","Score":null,"Total":0}
引用次数: 0
Abstract
One of the current challenges in deep learning is semantic segmentation. Moreover, human part segmentation is a sub-task in image segmentation, which differs from traditional segmentation to understand the human body’s intrinsic connections. Convolutional Neural Network (CNN) has always been a standard feature extraction network in human part segmentation. Recently, the proposed Swin Transformer surpasses CNN for many image applications. However, few articles have explored the performance of Swin Transformer in human part segmentation compared to CNN. In this paper, we make a comparison experiment on this issue, and the experimental results prove that even in the area of human part segmentation and without any additional trick, the Swin Transformer has good results compared with CNN. At the same time, this paper also combines the Edge Perceiving Module (EPM) currently commonly used in CNN with Swin Transformer to prove that Swin Transformer can see the intrinsic connection of segmented parts. This research demonstrates the feasibility of applying Swin Transformer to the part segmentation of images, which is conducive to advancing image segmentation technology in the future.
期刊介绍:
The journal covers a wide range of issues in information optics such as optical memory, mechanisms for optical data recording and processing, photosensitive materials, optical, optoelectronic and holographic nanostructures, and many other related topics. Papers on memory systems using holographic and biological structures and concepts of brain operation are also included. The journal pays particular attention to research in the field of neural net systems that may lead to a new generation of computional technologies by endowing them with intelligence.