Popular Topic Detection in Chinese Micro-Blog Based on the Modified LDA Model
Yuzhong Chen, Wanhua Li, Wenzhong Guo, Kun Guo
2015 12th Web Information System and Application Conference (WISA), 2015-09-11
DOI: 10.1109/WISA.2015.58 (https://doi.org/10.1109/WISA.2015.58)
Citation count: 10
Abstract
Micro-blogging has become emblematic of new social media, and its rapid growth over a short period has drawn considerable research interest. We use the Latent Dirichlet Allocation (LDA) model, which offers strong dimensionality reduction and can extract latent semantics from text, to discover popular topics. We extend the original LDA model into the FSC-LDA model by combining text clustering and feature selection methods, so that the number of topics is identified adaptively. The FSC-LDA model better preserves the features of short micro-blog texts and yields more stable results. Experiments on a real Chinese micro-blog text dataset show that the FSC-LDA model performs well on the custom evaluation and finds more accurate popular topics.
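The abstract does not specify how feature selection and clustering are combined, so the following is only an illustrative sketch of the general idea, not the authors' FSC-LDA procedure. It assumes a document-frequency filter as the feature selection step and single-pass cosine clustering as the way to estimate the number of topics; the function names, the `min_df` and `threshold` parameters, and the toy English tokens are all hypothetical.

```python
import math
from collections import Counter


def select_features(docs, min_df=2):
    """Assumed feature selection: keep terms found in at least `min_df` documents."""
    df = Counter(term for doc in docs for term in set(doc))
    vocab = {term for term, count in df.items() if count >= min_df}
    return [[t for t in doc if t in vocab] for doc in docs]


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def estimate_num_topics(docs, threshold=0.3):
    """Assumed clustering step: single-pass clustering, cluster count = topic estimate."""
    centroids = []  # one summed term-count vector per cluster
    for doc in docs:
        vec = Counter(doc)
        if not vec:
            continue  # document lost all of its terms during feature selection
        sims = [cosine(vec, c) for c in centroids]
        if sims and max(sims) >= threshold:
            # Merge the document into the most similar existing cluster.
            centroids[sims.index(max(sims))].update(vec)
        else:
            # No cluster is similar enough: start a new one.
            centroids.append(vec)
    return len(centroids)


# Toy, already-tokenized posts (hypothetical data, for illustration only).
docs = [
    ["earthquake", "rescue", "sichuan"],
    ["earthquake", "sichuan", "donation"],
    ["football", "match", "goal"],
    ["football", "goal", "league"],
]
filtered = select_features(docs, min_df=2)
num_topics = estimate_num_topics(filtered, threshold=0.3)
```

Under these assumptions, `num_topics` would then be supplied as the topic count to an LDA implementation (for example, the `n_components` parameter of scikit-learn's `LatentDirichletAllocation`) instead of being fixed by hand.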