Junli Wang , Chenyang Zhang , Dongyu Zhang , Haibo Tong , Chungang Yan , Changjun Jiang
{"title":"A recent survey on controllable text generation: A causal perspective","authors":"Junli Wang , Chenyang Zhang , Dongyu Zhang , Haibo Tong , Chungang Yan , Changjun Jiang","doi":"10.1016/j.fmre.2024.01.001","DOIUrl":null,"url":null,"abstract":"<div><div>As an important subject of natural language generation, Controllable Text Generation (CTG) focuses on integrating additional constraints and controls while generating texts and has attracted a lot of attention. Existing controllable text generation approaches mainly capture the statistical association implied within training texts, but generated texts lack causality consideration. This paper intends to review recent CTG approaches from a causal perspective. Firstly, according to previous research on basic types of CTG models, it is discovered that their essence is to obtain the association, and then four kinds of challenges caused by absence of causality are introduced. Next, this paper reviews the improvements to address these challenges from four aspects, namely representation disentanglement, causal inference, knowledge enhancement and multi-aspect CTG respectively. Additionally, this paper inspects existing evaluations of CTG, especially evaluations for causality of CTG. Finally, this review discusses some future research directions for the causality improvement of CTG and makes a conclusion.</div></div>","PeriodicalId":34602,"journal":{"name":"Fundamental Research","volume":"5 3","pages":"Pages 1194-1203"},"PeriodicalIF":6.2000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Fundamental Research","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2667325824000062","RegionNum":3,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Multidisciplinary","Score":null,"Total":0}
引用次数: 0
Abstract
As an important subject of natural language generation, Controllable Text Generation (CTG) focuses on integrating additional constraints and controls while generating texts and has attracted a lot of attention. Existing controllable text generation approaches mainly capture the statistical association implied within training texts, but generated texts lack causality consideration. This paper intends to review recent CTG approaches from a causal perspective. Firstly, according to previous research on basic types of CTG models, it is discovered that their essence is to obtain the association, and then four kinds of challenges caused by absence of causality are introduced. Next, this paper reviews the improvements to address these challenges from four aspects, namely representation disentanglement, causal inference, knowledge enhancement and multi-aspect CTG respectively. Additionally, this paper inspects existing evaluations of CTG, especially evaluations for causality of CTG. Finally, this review discusses some future research directions for the causality improvement of CTG and makes a conclusion.
可控文本生成(controlled Text generation, CTG)是自然语言生成的一个重要课题,其重点是在生成文本的同时集成额外的约束和控制,引起了人们的广泛关注。现有的可控文本生成方法主要捕获训练文本中隐含的统计关联,但生成的文本缺乏因果关系考虑。本文拟从因果关系的角度对近年来的CTG研究方法进行综述。首先,根据前人对CTG模型基本类型的研究,发现其本质是获取关联,然后介绍了由于缺乏因果关系而带来的四种挑战。接下来,本文分别从表征解纠缠、因果推理、知识增强和多向CTG四个方面综述了应对这些挑战的改进措施。此外,本文还考察了现有的对CTG的评价,特别是对CTG因果关系的评价。最后,本文对今后的研究方向进行了展望,并给出了结论。