{"title":"采用非对称架构的 GAI 增强型稳健语义通信","authors":"Pengfei Ren;Jingjing Wang;Xiangwang Hou;Jianrui Chen;Chunxiao Jiang","doi":"10.1109/TCCN.2024.3520132","DOIUrl":null,"url":null,"abstract":"Semantic communication (SC), regarded as a next-generation communication architecture that breaks through the Shannon paradigm, is considered a key technology for realizing future sixth-generation wireless networks and cognitive communications. Instead of focusing on the bit error rate, SC is dedicated to extracting abstract semantic information from original data to enhance communication efficiency for specific tasks. However, current SC systems mostly rely on symmetric architectures based on convolutional neural networks, which not only severely limits the capacity of the network but also leads to a high degree of coupling between the encoder and decoder. Additionally, it also lacks robustness in noise reference. The emergence of generative artificial intelligence (GAI) breaks this bottleneck. In this paper, we propose an asymmetric end-to-end SC architecture based on GAI, named masked joint source-channel coding (M-JSCC). In our model, the encoder serves as a universal semantic extractor, while the decoder is tailored to specific tasks. During the model training, we introduce a masking mechanism that improves the performance of M-JSCC to extract semantic information and enhances the robustness under various channel conditions. Moreover, it also endows M-JSCC with remarkable data generation abilities. Benefiting from the asymmetric architecture, the decoder no longer depends on the encoder, which allows it to be switched according to the specific requirements to better adapt to different task-oriented scenarios. Finally, comprehensive experiments demonstrate the excellent semantic understanding and communication robustness of M-JSCC.","PeriodicalId":13069,"journal":{"name":"IEEE Transactions on Cognitive Communications and Networking","volume":"11 2","pages":"700-711"},"PeriodicalIF":7.0000,"publicationDate":"2024-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"GAI-Enhanced Robust Semantic Communication With Asymmetric Architecture\",\"authors\":\"Pengfei Ren;Jingjing Wang;Xiangwang Hou;Jianrui Chen;Chunxiao Jiang\",\"doi\":\"10.1109/TCCN.2024.3520132\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Semantic communication (SC), regarded as a next-generation communication architecture that breaks through the Shannon paradigm, is considered a key technology for realizing future sixth-generation wireless networks and cognitive communications. Instead of focusing on the bit error rate, SC is dedicated to extracting abstract semantic information from original data to enhance communication efficiency for specific tasks. However, current SC systems mostly rely on symmetric architectures based on convolutional neural networks, which not only severely limits the capacity of the network but also leads to a high degree of coupling between the encoder and decoder. Additionally, it also lacks robustness in noise reference. The emergence of generative artificial intelligence (GAI) breaks this bottleneck. In this paper, we propose an asymmetric end-to-end SC architecture based on GAI, named masked joint source-channel coding (M-JSCC). In our model, the encoder serves as a universal semantic extractor, while the decoder is tailored to specific tasks. During the model training, we introduce a masking mechanism that improves the performance of M-JSCC to extract semantic information and enhances the robustness under various channel conditions. Moreover, it also endows M-JSCC with remarkable data generation abilities. Benefiting from the asymmetric architecture, the decoder no longer depends on the encoder, which allows it to be switched according to the specific requirements to better adapt to different task-oriented scenarios. Finally, comprehensive experiments demonstrate the excellent semantic understanding and communication robustness of M-JSCC.\",\"PeriodicalId\":13069,\"journal\":{\"name\":\"IEEE Transactions on Cognitive Communications and Networking\",\"volume\":\"11 2\",\"pages\":\"700-711\"},\"PeriodicalIF\":7.0000,\"publicationDate\":\"2024-12-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Cognitive Communications and Networking\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10807235/\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"TELECOMMUNICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Cognitive Communications and Networking","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10807235/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"TELECOMMUNICATIONS","Score":null,"Total":0}
GAI-Enhanced Robust Semantic Communication With Asymmetric Architecture
Semantic communication (SC), regarded as a next-generation communication architecture that breaks through the Shannon paradigm, is considered a key technology for realizing future sixth-generation wireless networks and cognitive communications. Instead of focusing on the bit error rate, SC is dedicated to extracting abstract semantic information from original data to enhance communication efficiency for specific tasks. However, current SC systems mostly rely on symmetric architectures based on convolutional neural networks, which not only severely limits the capacity of the network but also leads to a high degree of coupling between the encoder and decoder. Additionally, it also lacks robustness in noise reference. The emergence of generative artificial intelligence (GAI) breaks this bottleneck. In this paper, we propose an asymmetric end-to-end SC architecture based on GAI, named masked joint source-channel coding (M-JSCC). In our model, the encoder serves as a universal semantic extractor, while the decoder is tailored to specific tasks. During the model training, we introduce a masking mechanism that improves the performance of M-JSCC to extract semantic information and enhances the robustness under various channel conditions. Moreover, it also endows M-JSCC with remarkable data generation abilities. Benefiting from the asymmetric architecture, the decoder no longer depends on the encoder, which allows it to be switched according to the specific requirements to better adapt to different task-oriented scenarios. Finally, comprehensive experiments demonstrate the excellent semantic understanding and communication robustness of M-JSCC.
期刊介绍:
The IEEE Transactions on Cognitive Communications and Networking (TCCN) aims to publish high-quality manuscripts that push the boundaries of cognitive communications and networking research. Cognitive, in this context, refers to the application of perception, learning, reasoning, memory, and adaptive approaches in communication system design. The transactions welcome submissions that explore various aspects of cognitive communications and networks, focusing on innovative and holistic approaches to complex system design. Key topics covered include architecture, protocols, cross-layer design, and cognition cycle design for cognitive networks. Additionally, research on machine learning, artificial intelligence, end-to-end and distributed intelligence, software-defined networking, cognitive radios, spectrum sharing, and security and privacy issues in cognitive networks are of interest. The publication also encourages papers addressing novel services and applications enabled by these cognitive concepts.