An Improved mT5 Model for Chinese Text Summary Generation

Fuping Ren, Jian Chen, Defu Zhang
{"title":"An Improved mT5 Model for Chinese Text Summary Generation","authors":"Fuping Ren, Jian Chen, Defu Zhang","doi":"10.5121/csit.2024.140214","DOIUrl":null,"url":null,"abstract":"Understanding complex policy documents can be challenging, highlighting the need for intelligent interpretation of Chinese policies. To enhance Chinese text summarization, this study utilized the mT5 model as the core framework and initial weights. Additionally, it reduced model size through parameter clipping, employed the Gap Sentence Generation (GSG) method as an unsupervised technique, and enhanced the Chinese tokenizer. After training on a meticulously processed 30GB Chinese training corpus, the study developed the enhanced mT5- GSG model. When fine-tuning on Chinese policy texts, it adopted the \"Dropout Twice\" approach and ingeniously merged the probability distribution of the two dropouts using the Wasserstein distance. Experimental results indicate that the proposed model achieved Rouge-1, Rouge-2, and Rouge-L scores of 56.13%, 45.76%, and 56.41% respectively on the Chinese policy text summarization dataset.","PeriodicalId":104179,"journal":{"name":"AI, Machine Learning and Applications","volume":"86 9-10","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"AI, Machine Learning and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5121/csit.2024.140214","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Understanding complex policy documents can be challenging, highlighting the need for intelligent interpretation of Chinese policies. To enhance Chinese text summarization, this study utilized the mT5 model as the core framework and initial weights. Additionally, it reduced model size through parameter clipping, employed the Gap Sentence Generation (GSG) method as an unsupervised technique, and enhanced the Chinese tokenizer. After training on a meticulously processed 30GB Chinese training corpus, the study developed the enhanced mT5- GSG model. When fine-tuning on Chinese policy texts, it adopted the "Dropout Twice" approach and ingeniously merged the probability distribution of the two dropouts using the Wasserstein distance. Experimental results indicate that the proposed model achieved Rouge-1, Rouge-2, and Rouge-L scores of 56.13%, 45.76%, and 56.41% respectively on the Chinese policy text summarization dataset.
用于生成中文文本摘要的改进型 mT5 模型
理解复杂的政策文件可能具有挑战性,因此需要对中文政策进行智能解读。为加强中文文本摘要,本研究利用 mT5 模型作为核心框架和初始权重。此外,它还通过参数裁剪缩小了模型大小,采用了间隙句生成(GSG)方法作为无监督技术,并增强了中文标记符。在对经过精心处理的 30GB 中文训练语料进行训练后,该研究开发出了增强型 mT5- GSG 模型。在对中文政策文本进行微调时,研究采用了 "Dropout Twice "方法,并巧妙地利用 Wasserstein 距离合并了两次 dropout 的概率分布。实验结果表明,在中文政策文本摘要数据集上,所提出的模型分别获得了 56.13%、45.76% 和 56.41% 的 Rouge-1、Rouge-2 和 Rouge-L 分数。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信