{"title":"An Image Semantic Communication System Based on GAN and Relative Position-Encoding","authors":"Ziqi Zhang;Weijie Tan;Chunguo Li","doi":"10.1109/LCOMM.2025.3559202","DOIUrl":null,"url":null,"abstract":"In image semantic communication, the complex wireless channel environment leads to the loss of image details and performance degradation during transmission. To address this issue, we propose an image semantic communication system based on Generative Adversarial Networks (GAN) and Relative Position-Encoding (RPE), named GAN-RpeSC. Specifically, in the system design, we introduce the GAN network at the output of the semantic communication framework to enhance the image restoration. We integrate the RPE module into the Visual Transformer (ViT) module, adding the RPE module to its Multihead Self Attention (MHSA), so that the model can better capture the spatial location information of the image. Experimental results show that the proposed GAN-RpeSC has significant advantages in improving image transmission quality, with a 0.3 dB improvement in peak signal-to-noise ratio (PSNR) compared to the ViT module alone.","PeriodicalId":13197,"journal":{"name":"IEEE Communications Letters","volume":"29 6","pages":"1280-1284"},"PeriodicalIF":4.4000,"publicationDate":"2025-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Communications Letters","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10960394/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"TELECOMMUNICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
In image semantic communication, the complex wireless channel environment leads to the loss of image details and performance degradation during transmission. To address this issue, we propose an image semantic communication system based on Generative Adversarial Networks (GAN) and Relative Position-Encoding (RPE), named GAN-RpeSC. Specifically, in the system design, we introduce the GAN network at the output of the semantic communication framework to enhance the image restoration. We integrate the RPE module into the Visual Transformer (ViT) module, adding the RPE module to its Multihead Self Attention (MHSA), so that the model can better capture the spatial location information of the image. Experimental results show that the proposed GAN-RpeSC has significant advantages in improving image transmission quality, with a 0.3 dB improvement in peak signal-to-noise ratio (PSNR) compared to the ViT module alone.
期刊介绍:
The IEEE Communications Letters publishes short papers in a rapid publication cycle on advances in the state-of-the-art of communication over different media and channels including wire, underground, waveguide, optical fiber, and storage channels. Both theoretical contributions (including new techniques, concepts, and analyses) and practical contributions (including system experiments and prototypes, and new applications) are encouraged. This journal focuses on the physical layer and the link layer of communication systems.