Information Extraction and Sentence Ordering in Multi-Document Summarization using Preference Learning

2022 International Conference on Edge Computing and Applications (ICECAA) Pub Date : 2022-10-13 DOI:10.1109/ICECAA55415.2022.9936218

Anuj Kumar, Atul Kumar Uttam

引用次数: 0

Abstract

Multi-document summarizing is a process that automatically extracts information from many texts that are related to the same subject. For the purpose of information extraction, a technique that uses multi-document summarization which is based on phrase frequency is used. As a result of the phrases being picked from the documents depending on how important they are, the summary loses their coherence and the sequence in which the information is presented, which reduces the readability of the summary. A method of sentence ordering that is predicated on the chronological order of the phrases has been used in order to address this issue. According to the findings of this research, a multi-document summarizer that is based on a word frequency approach performs very well when it comes to the process of extracting relevant content units and increasing the readability of the summary via sentence sequencing.

查看原文本刊更多论文

基于偏好学习的多文档摘要信息提取与句子排序

多文档摘要是一种从与同一主题相关的许多文本中自动提取信息的过程。为了达到信息提取的目的，采用了一种基于短语频率的多文档摘要技术。由于根据短语的重要性从文档中选择短语，摘要失去了其连贯性和信息呈现的顺序，从而降低了摘要的可读性。为了解决这个问题，一种基于短语时间顺序的句子排序方法已经被使用。根据本研究的结果，基于词频方法的多文档摘要器在提取相关内容单元和通过句子排序提高摘要的可读性方面表现良好。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 International Conference on Edge Computing and Applications (ICECAA)

自引率

0.00%

发文量