Survey on Graph and Cluster Based approaches in Multi-document Text Summarization

Y. Meena, Ashish Jain, D. Gopalani
{"title":"Survey on Graph and Cluster Based approaches in Multi-document Text Summarization","authors":"Y. Meena, Ashish Jain, D. Gopalani","doi":"10.1109/ICRAIE.2014.6909126","DOIUrl":null,"url":null,"abstract":"In today's era of World Wide Web, on-line information is increasing exponentially day by day. So there is a need to condense corpus of documents into useful information automatically. Automatic Text summarization plays an important role to extract salient feature from corpus of documents, which helps user to get useful information in short time and less effort. Summarization reduces the complexity of a document while retaining its important features. Recently, most researchers have transferred their efforts from single to multi document summarization but they have to be aware of the issues of redundancy, sentence ordering, fluency, etc. There are wide varieties of approaches in Multi-document Text Summarization like Graph Based, Cluster Based, Time Based and Term frequency -Inverse document frequency Based etc. The survey starts introducing Multi-document text Summarization (MDS) and then discusses various methods of MDS which fall under the Graph and Cluster Based methods. In this paper, we have analysed Graph and Cluster Based methods proposed by various researchers in the field and we sort out some of the problems in applied procedures and also pin out advantages, which would help future researchers working in the area, to get significant instruction for further analysis. Using this information one can generate new or even hybrid methods in Multi-document summarization.","PeriodicalId":355706,"journal":{"name":"International Conference on Recent Advances and Innovations in Engineering (ICRAIE-2014)","volume":"63 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"26","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Recent Advances and Innovations in Engineering (ICRAIE-2014)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICRAIE.2014.6909126","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 26

Abstract

In today's era of World Wide Web, on-line information is increasing exponentially day by day. So there is a need to condense corpus of documents into useful information automatically. Automatic Text summarization plays an important role to extract salient feature from corpus of documents, which helps user to get useful information in short time and less effort. Summarization reduces the complexity of a document while retaining its important features. Recently, most researchers have transferred their efforts from single to multi document summarization but they have to be aware of the issues of redundancy, sentence ordering, fluency, etc. There are wide varieties of approaches in Multi-document Text Summarization like Graph Based, Cluster Based, Time Based and Term frequency -Inverse document frequency Based etc. The survey starts introducing Multi-document text Summarization (MDS) and then discusses various methods of MDS which fall under the Graph and Cluster Based methods. In this paper, we have analysed Graph and Cluster Based methods proposed by various researchers in the field and we sort out some of the problems in applied procedures and also pin out advantages, which would help future researchers working in the area, to get significant instruction for further analysis. Using this information one can generate new or even hybrid methods in Multi-document summarization.
基于图和聚类的多文档文本摘要方法研究
在当今的万维网时代,在线信息日益呈指数级增长。因此,有必要将语料库自动压缩成有用的信息。文本自动摘要对于从文档语料库中提取显著特征起着重要作用,可以帮助用户在短时间内更省力地获取有用的信息。摘要可以在保留文档重要特性的同时降低文档的复杂性。近年来,大多数研究人员的工作已经从单文档摘要转向多文档摘要,但他们必须意识到冗余、句子顺序、流畅性等问题。多文档文本摘要有多种方法,如基于图的、基于聚类的、基于时间的、基于词频-逆文档频率的等。本文首先介绍了多文档文本摘要(MDS),然后讨论了基于图和聚类的多文档文本摘要方法。在本文中,我们分析了不同领域的研究者提出的基于图和聚类的方法,并对应用过程中存在的一些问题进行了梳理,同时也指出了它们的优点,这将有助于未来在该领域工作的研究者,为进一步的分析提供有意义的指导。使用这些信息可以在多文档摘要中生成新的甚至是混合的方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信