{"title":"基于遗传算法的文本自动摘要提取","authors":"Abdullah Ammar Karcioglu, Ahmet Cahit Yaşa","doi":"10.1109/SIU49456.2020.9302205","DOIUrl":null,"url":null,"abstract":"Automatic text summarization is one of the applications of natural language processing that has been studied for a long time. The increase in the amount of information in web resources has increased the need for automatic text summarization methods. It is difficult to design a system to produce abstracts created by human hands. For this reason, many researchers have focused on extracting sentences or paragraphs, which is a kind of summary. In this study, we introduce a method that was created using genetic algorithms to generate such summaries. After the texts are preprocessed, vocabulary is created and given as input to the proposed method. The sentence selection based on Genetic Algorithm is used to summarize and after that the summary is created, it is evaluated using the fitness function. In our first model, the fitness function is based on the frequency of each word and the word pair frequencies. The results of the applied model are discussed using the same dataset in another method based on tf-idf, with precision, recall, fscore and Rouge metrics.","PeriodicalId":312627,"journal":{"name":"2020 28th Signal Processing and Communications Applications Conference (SIU)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Automatic Summary Extraction in Texts Using Genetic Algorithms\",\"authors\":\"Abdullah Ammar Karcioglu, Ahmet Cahit Yaşa\",\"doi\":\"10.1109/SIU49456.2020.9302205\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Automatic text summarization is one of the applications of natural language processing that has been studied for a long time. The increase in the amount of information in web resources has increased the need for automatic text summarization methods. It is difficult to design a system to produce abstracts created by human hands. For this reason, many researchers have focused on extracting sentences or paragraphs, which is a kind of summary. In this study, we introduce a method that was created using genetic algorithms to generate such summaries. After the texts are preprocessed, vocabulary is created and given as input to the proposed method. The sentence selection based on Genetic Algorithm is used to summarize and after that the summary is created, it is evaluated using the fitness function. In our first model, the fitness function is based on the frequency of each word and the word pair frequencies. The results of the applied model are discussed using the same dataset in another method based on tf-idf, with precision, recall, fscore and Rouge metrics.\",\"PeriodicalId\":312627,\"journal\":{\"name\":\"2020 28th Signal Processing and Communications Applications Conference (SIU)\",\"volume\":\"17 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-10-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 28th Signal Processing and Communications Applications Conference (SIU)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SIU49456.2020.9302205\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 28th Signal Processing and Communications Applications Conference (SIU)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SIU49456.2020.9302205","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Automatic Summary Extraction in Texts Using Genetic Algorithms
Automatic text summarization is one of the applications of natural language processing that has been studied for a long time. The increase in the amount of information in web resources has increased the need for automatic text summarization methods. It is difficult to design a system to produce abstracts created by human hands. For this reason, many researchers have focused on extracting sentences or paragraphs, which is a kind of summary. In this study, we introduce a method that was created using genetic algorithms to generate such summaries. After the texts are preprocessed, vocabulary is created and given as input to the proposed method. The sentence selection based on Genetic Algorithm is used to summarize and after that the summary is created, it is evaluated using the fitness function. In our first model, the fitness function is based on the frequency of each word and the word pair frequencies. The results of the applied model are discussed using the same dataset in another method based on tf-idf, with precision, recall, fscore and Rouge metrics.