Guangyao Cheng;Zhengchuan Chen;Rui She;Min Liu;Tony Q. S. Quek
{"title":"从单语到多语:深度学习增强的多语文本语义交流系统","authors":"Guangyao Cheng;Zhengchuan Chen;Rui She;Min Liu;Tony Q. S. Quek","doi":"10.1109/LCOMM.2025.3565692","DOIUrl":null,"url":null,"abstract":"While semantic communication systems outperform traditional ones, most current research focuses on a single language, overlooking multilingual contexts. This letter proposes two approaches to extend the Text Semantic Communication System (TSC) to a Multilingual Text Semantic Communication System (MTSC). The first employs centralized learning on a hybrid dataset, processing multilingual texts with composite word-sequence indices. The second utilizes federated learning to aggregate linguistic features while preserving user data privacy. To assess the MTSC system, we introduce the Multi-Bilingual Evaluation Understudy (MBLEU) score. Experimental results show that the MTSC can extend the TSC without increasing model size, with federated learning achieving superior multilingual performance while protecting data privacy.","PeriodicalId":13197,"journal":{"name":"IEEE Communications Letters","volume":"29 6","pages":"1461-1465"},"PeriodicalIF":3.7000,"publicationDate":"2025-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"From Monolingualism to Multilingualism: Deep Learning-Enhanced Multilingual Text Semantic Communication System\",\"authors\":\"Guangyao Cheng;Zhengchuan Chen;Rui She;Min Liu;Tony Q. S. Quek\",\"doi\":\"10.1109/LCOMM.2025.3565692\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"While semantic communication systems outperform traditional ones, most current research focuses on a single language, overlooking multilingual contexts. This letter proposes two approaches to extend the Text Semantic Communication System (TSC) to a Multilingual Text Semantic Communication System (MTSC). The first employs centralized learning on a hybrid dataset, processing multilingual texts with composite word-sequence indices. The second utilizes federated learning to aggregate linguistic features while preserving user data privacy. To assess the MTSC system, we introduce the Multi-Bilingual Evaluation Understudy (MBLEU) score. Experimental results show that the MTSC can extend the TSC without increasing model size, with federated learning achieving superior multilingual performance while protecting data privacy.\",\"PeriodicalId\":13197,\"journal\":{\"name\":\"IEEE Communications Letters\",\"volume\":\"29 6\",\"pages\":\"1461-1465\"},\"PeriodicalIF\":3.7000,\"publicationDate\":\"2025-04-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Communications Letters\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10980276/\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"TELECOMMUNICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Communications Letters","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10980276/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"TELECOMMUNICATIONS","Score":null,"Total":0}
From Monolingualism to Multilingualism: Deep Learning-Enhanced Multilingual Text Semantic Communication System
While semantic communication systems outperform traditional ones, most current research focuses on a single language, overlooking multilingual contexts. This letter proposes two approaches to extend the Text Semantic Communication System (TSC) to a Multilingual Text Semantic Communication System (MTSC). The first employs centralized learning on a hybrid dataset, processing multilingual texts with composite word-sequence indices. The second utilizes federated learning to aggregate linguistic features while preserving user data privacy. To assess the MTSC system, we introduce the Multi-Bilingual Evaluation Understudy (MBLEU) score. Experimental results show that the MTSC can extend the TSC without increasing model size, with federated learning achieving superior multilingual performance while protecting data privacy.
期刊介绍:
The IEEE Communications Letters publishes short papers in a rapid publication cycle on advances in the state-of-the-art of communication over different media and channels including wire, underground, waveguide, optical fiber, and storage channels. Both theoretical contributions (including new techniques, concepts, and analyses) and practical contributions (including system experiments and prototypes, and new applications) are encouraged. This journal focuses on the physical layer and the link layer of communication systems.