Wenxing Hong, Xiaoqing Zheng, Jianwei Qi, Weiwei Wang, Yang Weng
{"title":"Project Rank: An Internet Topic Evaluation Model Based on Latent Dirichlet Allocation","authors":"Wenxing Hong, Xiaoqing Zheng, Jianwei Qi, Weiwei Wang, Yang Weng","doi":"10.1109/ICCSE.2018.8468739","DOIUrl":null,"url":null,"abstract":"With the rapid development of Internet-related industries, the Internet news is containing more and more information as well as being paid more attention by human beings. In order to explore the intrinsic links among the news and extracted effective topics, we put forward ProjectRank model based on latent Dirichlet allocation. Large-scale Chinese text sets serves as initial input of this model. By combining LDA topic model with distance between probability distributions, we figure out scores of topics so as to quantify the result of topic evaluation. Moreover, this model proves to be useful to track trends of hot news in different time periods. This paper applies ProjectRank model to real Internet news data obtained from several websites and achieves good experiment results.","PeriodicalId":228760,"journal":{"name":"2018 13th International Conference on Computer Science & Education (ICCSE)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 13th International Conference on Computer Science & Education (ICCSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCSE.2018.8468739","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
With the rapid development of Internet-related industries, the Internet news is containing more and more information as well as being paid more attention by human beings. In order to explore the intrinsic links among the news and extracted effective topics, we put forward ProjectRank model based on latent Dirichlet allocation. Large-scale Chinese text sets serves as initial input of this model. By combining LDA topic model with distance between probability distributions, we figure out scores of topics so as to quantify the result of topic evaluation. Moreover, this model proves to be useful to track trends of hot news in different time periods. This paper applies ProjectRank model to real Internet news data obtained from several websites and achieves good experiment results.