Information Extraction and Sentence Ordering in Multi-Document Summarization using Preference Learning
Anuj Kumar, Atul Kumar Uttam
2022 International Conference on Edge Computing and Applications (ICECAA), 2022-10-13
DOI: 10.1109/ICECAA55415.2022.9936218 (https://doi.org/10.1109/ICECAA55415.2022.9936218)
Citations: 0
Abstract
Multi-document summarization automatically extracts information from multiple texts on the same subject. For information extraction, a multi-document summarization technique based on phrase frequency is used. Because phrases are selected from the documents according to their importance, the summary loses coherence and the order in which the information was originally presented, which reduces its readability. To address this issue, a sentence-ordering method based on the chronological order of the phrases is used. The findings indicate that a multi-document summarizer based on a word-frequency approach performs very well at extracting relevant content units, and that sentence ordering increases the readability of the summary.
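
The two stages described in the abstract, frequency-based extraction followed by chronological ordering, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the scoring rule (mean pooled word frequency per sentence), the small stopword list, the use of a document publication date as the chronological signal, and the names `tokenize` and `summarize` are all assumptions made for illustration.

```python
import re
from collections import Counter
from datetime import date

# Tiny illustrative stopword list (an assumption, not taken from the paper).
STOPWORDS = {"the", "a", "an", "of", "in", "on", "and", "to", "is", "was", "for"}


def tokenize(text):
    """Return lowercase word tokens with stopwords removed."""
    return [w for w in re.findall(r"[a-z']+", text.lower()) if w not in STOPWORDS]


def summarize(documents, k=3):
    """Pick k sentences by word frequency, then order them chronologically.

    documents: list of (publication_date, [sentence, ...]) pairs.
    """
    # Word frequencies are pooled over every document in the cluster.
    freq = Counter(w for _, sents in documents for s in sents for w in tokenize(s))

    candidates = []
    for doc_date, sents in documents:
        for pos, sent in enumerate(sents):
            words = tokenize(sent)
            if not words:
                continue
            # Assumed scoring rule: mean frequency of the sentence's words.
            score = sum(freq[w] for w in words) / len(words)
            candidates.append((score, doc_date, pos, sent))

    # Extraction: keep the k highest-scoring sentences.
    top = sorted(candidates, key=lambda c: c[0], reverse=True)[:k]
    # Ordering: earliest document first, then original position within it.
    top.sort(key=lambda c: (c[1], c[2]))
    return [sent for _, _, _, sent in top]
```

Selecting by score alone would return sentences in importance order, which is the coherence problem the abstract points out; the final sort restores a chronological sequence using only metadata the caller supplies with each document.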