Improving Text Summarization Quality by Combining T5-Based Models and Convolutional Seq2Seq Models

Journal of Applied Engineering and Technological Science (JAETS) Pub Date : 2023-12-10 DOI:10.37385/jaets.v5i1.2503

Arif Ridho Lubis, Habibi Ramdani Safitri, I. Irvan, M. Lubis, A. Al-Khowarizmi

引用次数: 0

Abstract

In the natural language processing field, there are several sub-fields that are very closely related to information retrieval, such as the automatic text summarization sub-field. obtained from the convolutional T5 and Seq2Seq models in summarizing text on hugging faces found features that can affect text summary such as upper- and lower-case letters which have an impact on changing the understanding of the text of the document. This study uses a combination of parameters such as layer dimensions, learning rate, batch size, and the use of Dropout to avoid model overfitting. The results can be seen by evaluating metrics using ROUGE. This study produces a value of ROUGE-1 on 4 documents that are tested which produces an average of 0.8 which is the optimal value, for ROUGE-2 on 4 documents that are tested which results in an average of 0.83 which is an optimal value while ROUGE-L on 4 documents conducted tests that produce an average of 0.8 which is the optimal value for the summary model.

查看原文本刊更多论文

结合基于 T5 的模型和卷积 Seq2Seq 模型提高文本摘要质量

在自然语言处理领域，有几个子领域与信息检索关系非常密切，例如自动文本摘要子领域。本研究综合使用了层维度、学习率、批量大小等参数，并使用了 Dropout 来避免模型过拟合。结果可以通过使用 ROUGE 评估指标来体现。本研究对 4 篇文档进行了 ROUGE-1 值测试，得出的平均值为 0.8，这是最佳值；对 4 篇文档进行了 ROUGE-2 值测试，得出的平均值为 0.83，这是最佳值；对 4 篇文档进行了 ROUGE-L 值测试，得出的平均值为 0.8，这是摘要模型的最佳值。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Applied Engineering and Technological Science (JAETS)

自引率

0.00%

发文量