{"title":"An Approach of Categorization and Summarization of News using Topic Modeling","authors":"Uttara Behera, Sumita Gupta","doi":"10.1109/confluence52989.2022.9734216","DOIUrl":null,"url":null,"abstract":"The increasing demand/availability of online content has triggered intensive research in the automatic text summarization. Text summarization is the process of removing less useful text from the document to find the required news quickly. Likewise, text summarization, news summarization is also the process of picking the news content which is most important in perspective of online readers and gives the clear idea of the proposed news. Various traditional algorithms are available that can be used to summarize the text. In this paper, an automatic text summarizer using extractive summarization approach is proposed and implemented by considering topic modelling for categorization of news content and text rank algorithm for summarization. In order to evaluate accuracy, F-measure and recall of the produced summary, various machine learning algorithms are applied. The result produced 99.8% of accuracy using topic modelling over K-means clustering.","PeriodicalId":261941,"journal":{"name":"2022 12th International Conference on Cloud Computing, Data Science & Engineering (Confluence)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 12th International Conference on Cloud Computing, Data Science & Engineering (Confluence)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/confluence52989.2022.9734216","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
The increasing demand/availability of online content has triggered intensive research in the automatic text summarization. Text summarization is the process of removing less useful text from the document to find the required news quickly. Likewise, text summarization, news summarization is also the process of picking the news content which is most important in perspective of online readers and gives the clear idea of the proposed news. Various traditional algorithms are available that can be used to summarize the text. In this paper, an automatic text summarizer using extractive summarization approach is proposed and implemented by considering topic modelling for categorization of news content and text rank algorithm for summarization. In order to evaluate accuracy, F-measure and recall of the produced summary, various machine learning algorithms are applied. The result produced 99.8% of accuracy using topic modelling over K-means clustering.