Jai Joshi, Aayush Kawathekar, Veda G. Gaonkar, Nikahat Mulla
{"title":"Decluttering the Internet with Article Summarization, Classification, and Recommendation using Natural Language Processing","authors":"Jai Joshi, Aayush Kawathekar, Veda G. Gaonkar, Nikahat Mulla","doi":"10.1109/APSIT58554.2023.10201665","DOIUrl":null,"url":null,"abstract":"The explosion of information available on the internet has led to an overwhelming amount of data that is available to users. The ability to efficiently consume and comprehend this information is crucial for productivity and knowledge expansion. However, only a few of researches focus on providing e-readers not only with comprehensive classification but also relevant recommendations based on the reader's history. The paper has been implemented in pursuit of facilitating the consumption of information with increased coherency and efficiency, leading to a proliferation in productivity in the fields of both professional and informal research using an easy-to-use web-based dashboard, the project works in 3 broad tenets. BERT Algorithm (Bidirectional Encoder Representations from Transformers) for information summarization, followed by Latent Dirichlet allocation (LDA) Algorithm for text classification and Collaborative filtering for recommending further articles for the user's knowledge expansion.","PeriodicalId":170044,"journal":{"name":"2023 International Conference in Advances in Power, Signal, and Information Technology (APSIT)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference in Advances in Power, Signal, and Information Technology (APSIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/APSIT58554.2023.10201665","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The explosion of information available on the internet has led to an overwhelming amount of data that is available to users. The ability to efficiently consume and comprehend this information is crucial for productivity and knowledge expansion. However, only a few of researches focus on providing e-readers not only with comprehensive classification but also relevant recommendations based on the reader's history. The paper has been implemented in pursuit of facilitating the consumption of information with increased coherency and efficiency, leading to a proliferation in productivity in the fields of both professional and informal research using an easy-to-use web-based dashboard, the project works in 3 broad tenets. BERT Algorithm (Bidirectional Encoder Representations from Transformers) for information summarization, followed by Latent Dirichlet allocation (LDA) Algorithm for text classification and Collaborative filtering for recommending further articles for the user's knowledge expansion.