{"title":"Diffulex:具有混合吸收状态和约束平衡的基于扩散的词汇约束文本生成","authors":"Fengrui Kang, Xianying Huang, Bingyu Li","doi":"10.1016/j.eswa.2025.127614","DOIUrl":null,"url":null,"abstract":"<div><div>Lexically constrained text generation aims to generate complete text with given keywords, which can be applied in many fields, such as dialogue systems, automatic summarization, and story generation. However, the current methods often find it difficult to strike a balance between generation quality, constraint ability, and generation speed, and most of them can only focus on one aspect, which seriously limits their applications. To solve this problem, we propose Diffulex, a lexically constrained text generation model based on the diffusion model, which achieves faster generation speed and higher flexibility. In response to the characteristics of lexically constrained text generation tasks, Diffulex employs a forward process of mixed absorbing states, converting tokens into different types of [MASK] tags to capture the semantic relationship between constraints and tokens. Through the constraint balance of the reverse process, more attention will be paid to the prediction tokens that meet the constraint conditions and promote the dynamic fusion of the hidden state constraint information, achieving the balance between the generation quality and the constraint ability. We compared Diffulex with advanced work in the field and popular large language models as baselines, and our results on multiple datasets show that Diffulex outperforms the baseline in various aspects. Our code is available on <span><span>https://github.com/Kenfree0/Diffulex</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"281 ","pages":"Article 127614"},"PeriodicalIF":7.5000,"publicationDate":"2025-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Diffulex: Diffusion based lexically constrained text generation with mixed absorbing state and constraint balance\",\"authors\":\"Fengrui Kang, Xianying Huang, Bingyu Li\",\"doi\":\"10.1016/j.eswa.2025.127614\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Lexically constrained text generation aims to generate complete text with given keywords, which can be applied in many fields, such as dialogue systems, automatic summarization, and story generation. However, the current methods often find it difficult to strike a balance between generation quality, constraint ability, and generation speed, and most of them can only focus on one aspect, which seriously limits their applications. To solve this problem, we propose Diffulex, a lexically constrained text generation model based on the diffusion model, which achieves faster generation speed and higher flexibility. In response to the characteristics of lexically constrained text generation tasks, Diffulex employs a forward process of mixed absorbing states, converting tokens into different types of [MASK] tags to capture the semantic relationship between constraints and tokens. Through the constraint balance of the reverse process, more attention will be paid to the prediction tokens that meet the constraint conditions and promote the dynamic fusion of the hidden state constraint information, achieving the balance between the generation quality and the constraint ability. We compared Diffulex with advanced work in the field and popular large language models as baselines, and our results on multiple datasets show that Diffulex outperforms the baseline in various aspects. Our code is available on <span><span>https://github.com/Kenfree0/Diffulex</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":50461,\"journal\":{\"name\":\"Expert Systems with Applications\",\"volume\":\"281 \",\"pages\":\"Article 127614\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2025-04-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Expert Systems with Applications\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0957417425012369\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425012369","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Diffulex: Diffusion based lexically constrained text generation with mixed absorbing state and constraint balance
Lexically constrained text generation aims to generate complete text with given keywords, which can be applied in many fields, such as dialogue systems, automatic summarization, and story generation. However, the current methods often find it difficult to strike a balance between generation quality, constraint ability, and generation speed, and most of them can only focus on one aspect, which seriously limits their applications. To solve this problem, we propose Diffulex, a lexically constrained text generation model based on the diffusion model, which achieves faster generation speed and higher flexibility. In response to the characteristics of lexically constrained text generation tasks, Diffulex employs a forward process of mixed absorbing states, converting tokens into different types of [MASK] tags to capture the semantic relationship between constraints and tokens. Through the constraint balance of the reverse process, more attention will be paid to the prediction tokens that meet the constraint conditions and promote the dynamic fusion of the hidden state constraint information, achieving the balance between the generation quality and the constraint ability. We compared Diffulex with advanced work in the field and popular large language models as baselines, and our results on multiple datasets show that Diffulex outperforms the baseline in various aspects. Our code is available on https://github.com/Kenfree0/Diffulex.
期刊介绍:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.