Nguyen Quang Uy, P. Anh, Truong Cong Doan, N. X. Hoai
{"title":"A Study on the Use of Genetic Programming for Automatic Text Summarization","authors":"Nguyen Quang Uy, P. Anh, Truong Cong Doan, N. X. Hoai","doi":"10.1109/KSE.2012.10","DOIUrl":null,"url":null,"abstract":"Text Summarization is the process of identifying and extracting the most vital information in a document. It has been seen as an effective method for dealing with increasing amount of information on the Internet nowadays. In this paper, we present an application of Genetic Programming to the problem of Automatic Text Summarization. Genetic Programming was used to evolve the function that ranks the sentences in a document based on their importance. The summary was extracted by selecting the sentences that have the highest rankings. The experiment was conducted on a number of Vietnamese news documents. The result showed that the summaries created by Genetic Programming are better than those created by a number of statistic based methods and even by human (non-experts).","PeriodicalId":122680,"journal":{"name":"2012 Fourth International Conference on Knowledge and Systems Engineering","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-08-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 Fourth International Conference on Knowledge and Systems Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/KSE.2012.10","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7
Abstract
Text Summarization is the process of identifying and extracting the most vital information in a document. It has been seen as an effective method for dealing with increasing amount of information on the Internet nowadays. In this paper, we present an application of Genetic Programming to the problem of Automatic Text Summarization. Genetic Programming was used to evolve the function that ranks the sentences in a document based on their importance. The summary was extracted by selecting the sentences that have the highest rankings. The experiment was conducted on a number of Vietnamese news documents. The result showed that the summaries created by Genetic Programming are better than those created by a number of statistic based methods and even by human (non-experts).