{"title":"基于遗传算法的单文档提取文本摘要","authors":"N. Chatterjee, A. Mittal, S. Goyal","doi":"10.1109/EAIT.2012.6407852","DOIUrl":null,"url":null,"abstract":"This paper presents an extraction based single document text summarization technique using Genetic Algorithms. A given document is represented as a weighted Directed Acyclic Graph. A fitness function is defined to mathematically express the quality of a summary in terms of some desired properties of a summary, such as, topic relation, cohesion and readability. Genetic Algorithm is designed to maximize this fitness function, and get the corresponding summary by extracting the most important sentences. Results are compared with a couple of other existing text summarization methods keeping the DUC2002 data as benchmark, and using the precision-recall evaluation technique. The initial results obtained seem promising and encouraging for future work in this area.","PeriodicalId":194103,"journal":{"name":"2012 Third International Conference on Emerging Applications of Information Technology","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"27","resultStr":"{\"title\":\"Single document extractive text summarization using Genetic Algorithms\",\"authors\":\"N. Chatterjee, A. Mittal, S. Goyal\",\"doi\":\"10.1109/EAIT.2012.6407852\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents an extraction based single document text summarization technique using Genetic Algorithms. A given document is represented as a weighted Directed Acyclic Graph. A fitness function is defined to mathematically express the quality of a summary in terms of some desired properties of a summary, such as, topic relation, cohesion and readability. Genetic Algorithm is designed to maximize this fitness function, and get the corresponding summary by extracting the most important sentences. Results are compared with a couple of other existing text summarization methods keeping the DUC2002 data as benchmark, and using the precision-recall evaluation technique. The initial results obtained seem promising and encouraging for future work in this area.\",\"PeriodicalId\":194103,\"journal\":{\"name\":\"2012 Third International Conference on Emerging Applications of Information Technology\",\"volume\":\"25 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"27\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 Third International Conference on Emerging Applications of Information Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/EAIT.2012.6407852\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 Third International Conference on Emerging Applications of Information Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EAIT.2012.6407852","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Single document extractive text summarization using Genetic Algorithms
This paper presents an extraction based single document text summarization technique using Genetic Algorithms. A given document is represented as a weighted Directed Acyclic Graph. A fitness function is defined to mathematically express the quality of a summary in terms of some desired properties of a summary, such as, topic relation, cohesion and readability. Genetic Algorithm is designed to maximize this fitness function, and get the corresponding summary by extracting the most important sentences. Results are compared with a couple of other existing text summarization methods keeping the DUC2002 data as benchmark, and using the precision-recall evaluation technique. The initial results obtained seem promising and encouraging for future work in this area.