Unsupervised Sentiment and Style Transfer from Massive Texts
Xianjie Shen, Wei Chen, Shuren Xu
Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering
Published: 2022-10-21
DOI: 10.1145/3573428.3573464 (https://doi.org/10.1145/3573428.3573464)
Citation count: 0
Abstract
Unsupervised style transfer aims to change the intrinsic style of a text while preserving its content, without relying on parallel datasets. Many sophisticated methods based on reinforcement learning and neural networks have been developed to address this problem, but their performance remains far from ideal. We observe that, given massive unpaired texts, there exist high-quality sentence pairs that share similar style-independent content but differ in their style words. Inspired by this observation, we propose a simple yet effective method that uses no neural network. Specifically, we consider both embedding similarity and BLEU score to locate similar sentences of different styles and construct a pseudo-parallel dataset. From this pseudo-parallel dataset, we distill the style words and align them into pairs based on statistical signals. We then refine the pseudo-parallel dataset by ignoring the identified style words during similarity calculation. Once the style word pairs have converged, we assemble them into a lookup table that is used to recognize and replace style words for style transfer. Extensive experiments demonstrate that our method is effective in different style transfer settings, such as sentiment and formality, outperforming state-of-the-art methods.
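The pipeline described in the abstract (mine similar sentence pairs, align the differing words, re-mine while ignoring known style words, then replace by lookup) can be illustrated with a minimal sketch. This is not the authors' implementation. As assumptions made purely for illustration, a Jaccard token overlap stands in for the combination of embedding similarity and BLEU score, simple co-occurrence counts stand in for the paper's statistical signals, and all function names, the threshold, and the toy sentences are hypothetical.

```python
from collections import Counter


def tokens(sentence):
    """Lowercase whitespace tokenization (a deliberately crude assumption)."""
    return sentence.lower().split()


def overlap(a, b, ignore=frozenset()):
    """Jaccard overlap of two token lists, skipping known style words.

    Stand-in for the paper's embedding similarity + BLEU score.
    """
    sa, sb = set(a) - ignore, set(b) - ignore
    if not sa or not sb:
        return 0.0
    return len(sa & sb) / len(sa | sb)


def mine_pairs(src, tgt, threshold=0.5, ignore=frozenset()):
    """For each source sentence, keep its best target match if similar enough."""
    pairs = []
    for s in src:
        best = max(tgt, key=lambda t: overlap(tokens(s), tokens(t), ignore))
        if overlap(tokens(s), tokens(best), ignore) >= threshold:
            pairs.append((s, best))
    return pairs


def align_style_words(pairs):
    """Count word pairs from sentence pairs that differ in exactly one word each."""
    counts = Counter()
    for s, t in pairs:
        only_s = set(tokens(s)) - set(tokens(t))
        only_t = set(tokens(t)) - set(tokens(s))
        if len(only_s) == 1 and len(only_t) == 1:
            counts[(only_s.pop(), only_t.pop())] += 1
    table = {}
    for (src_word, tgt_word), _ in counts.most_common():
        table.setdefault(src_word, tgt_word)  # keep the most frequent partner
    return table


def build_table(src, tgt, threshold=0.5, max_rounds=5):
    """Alternate mining and alignment until the lookup table stops changing."""
    table = {}
    for _ in range(max_rounds):
        ignore = frozenset(table) | frozenset(table.values())
        new_table = align_style_words(mine_pairs(src, tgt, threshold, ignore))
        if new_table == table:
            break
        table = new_table
    return table


def transfer(sentence, table):
    """Replace every recognized style word using the lookup table."""
    return " ".join(table.get(w, w) for w in tokens(sentence))


# Toy unpaired corpora (hypothetical data, for illustration only).
negative = ["the food was terrible", "the service was slow"]
positive = ["the food was delicious", "the service was fast", "i love this place"]

table = build_table(negative, positive)
result = transfer("the pizza was terrible", table)
print(result)  # the pizza was delicious
```

On this toy data the table converges after one refinement round, since ignoring the identified style words only raises the similarity of the pairs already found. A realistic setting would need a far larger corpus, a proper tokenizer, and the actual similarity measures named in the abstract.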