Text Summarization: An Overview

Samrat Ashok Babar, M. Tech-Cse, Rit
{"title":"Text Summarization: An Overview","authors":"Samrat Ashok Babar, M. Tech-Cse, Rit","doi":"10.14233/ajomc.2021.ajomc-pp162-165","DOIUrl":null,"url":null,"abstract":"1.Abstract: In this new era,where tremondous information is available on the internet,it is most important to provide the improved mechanism to extract the information quickly and most efficiently. It is very difficult for human beings to manually extract the summary of a large documents of text. There are plenty of text material available on the internet. So there is a problem of searching for relevant documents from the number of documents available, and absorbing relevant information from it.In order to solve the above two problems, the automatic text summarization is very much necessary.Text summarization is the process of identifying the most important meaningful information in a document or set of related documents and compressing them into a shorter version preserving its overall meanings. 2.Introduction: Before going to the Text summarization, first we, have to know that what a summary is. A summary is a text that is produced from one or more texts, that conveys important information in the original text, and it is of a shorter form. The goal of automatic text summarization is presenting the source text into a shorter version with semantics.The most important advantage of using a summary is ,it reduces the reading time. Text Summarization methods can be classified into extractive and abstractive summarization. An extractive summarization method consists of selecting important sentences, paragraphs etc. from the original document and concatenating them into shorter form. An Abstractive summarization is an understanding of the main concepts in a document and then express those concepts in clear natural language. There are two different groups of text summarization : indicative and informative.Inductive summarization only represent the main idea of the text to the user. The typical length of this type of summarization is 5 to 10 percent of the main text.On the other hand, the informative summarization systems gives concise information of the main text .The length of informative summary is 20 to 30 percent of the main text. 3.Main steps for text summarization: There are three main steps for summarizing documents.These are topic identification, interpretation and summary generation. 3.1. Topic Identificatio:The most prominent information in the text is identified .There are different techniques for topic identification are used which are Position, Cue Phrases, word frequency.Methods which are based on the position of phrases are the most useful methods for topic identification. 3.2. Interpretation :Abstract summaries need to go through interpretation step. In This step, different subjects are …","PeriodicalId":349092,"journal":{"name":"Special Issue III","volume":"47 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Special Issue III","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.14233/ajomc.2021.ajomc-pp162-165","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

Abstract

1.Abstract: In this new era,where tremondous information is available on the internet,it is most important to provide the improved mechanism to extract the information quickly and most efficiently. It is very difficult for human beings to manually extract the summary of a large documents of text. There are plenty of text material available on the internet. So there is a problem of searching for relevant documents from the number of documents available, and absorbing relevant information from it.In order to solve the above two problems, the automatic text summarization is very much necessary.Text summarization is the process of identifying the most important meaningful information in a document or set of related documents and compressing them into a shorter version preserving its overall meanings. 2.Introduction: Before going to the Text summarization, first we, have to know that what a summary is. A summary is a text that is produced from one or more texts, that conveys important information in the original text, and it is of a shorter form. The goal of automatic text summarization is presenting the source text into a shorter version with semantics.The most important advantage of using a summary is ,it reduces the reading time. Text Summarization methods can be classified into extractive and abstractive summarization. An extractive summarization method consists of selecting important sentences, paragraphs etc. from the original document and concatenating them into shorter form. An Abstractive summarization is an understanding of the main concepts in a document and then express those concepts in clear natural language. There are two different groups of text summarization : indicative and informative.Inductive summarization only represent the main idea of the text to the user. The typical length of this type of summarization is 5 to 10 percent of the main text.On the other hand, the informative summarization systems gives concise information of the main text .The length of informative summary is 20 to 30 percent of the main text. 3.Main steps for text summarization: There are three main steps for summarizing documents.These are topic identification, interpretation and summary generation. 3.1. Topic Identificatio:The most prominent information in the text is identified .There are different techniques for topic identification are used which are Position, Cue Phrases, word frequency.Methods which are based on the position of phrases are the most useful methods for topic identification. 3.2. Interpretation :Abstract summaries need to go through interpretation step. In This step, different subjects are …
文本摘要:概述
1.摘要:在互联网上海量信息的新时代,提供一种快速高效的信息提取机制显得尤为重要。人工提取大型文本文档的摘要是非常困难的。网上有大量的文字资料。因此,从现有的文献中搜索相关文献,并从中吸收相关信息是一个问题。为了解决以上两个问题,自动文本摘要是非常有必要的。文本摘要是识别一个文档或一组相关文档中最重要的有意义的信息,并将其压缩成一个更短的版本,保留其总体含义的过程。2.引言:在进行文本摘要之前,首先我们要知道什么是摘要。摘要是由一个或多个文本生成的文本,它传达了原始文本中的重要信息,并且形式较短。自动文本摘要的目标是将源文本呈现为具有语义的较短版本。使用摘要最重要的好处是,它减少了阅读时间。文本摘要方法可分为抽取式摘要和抽象式摘要。摘要提取法是指从原始文献中选择重要的句子、段落等,并将其串联成较短的形式。抽象摘要是对文档中主要概念的理解,然后用清晰的自然语言表达这些概念。有两种不同的文本摘要:指示性和信息性。归纳式摘要只向读者表达文章的主要思想。这种类型的摘要的典型长度是正文的5%到10%。另一方面,信息性摘要系统给出了正文的简明信息,信息性摘要的长度是正文的20%到30%。3.文本摘要的主要步骤:摘要文档有三个主要步骤。这些是主题识别、解释和总结生成。3.1. 主题识别:识别文本中最重要的信息。主题识别使用了不同的技术,包括位置,提示短语,词频。基于短语位置的方法是最有用的主题识别方法。3.2. 解读:摘要需要经过解读步骤。在这一步中,不同的主题是……
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信