{"title":"在中国微博热门话题检测的基础上修改LDA模型","authors":"Yuzhong Chen, Wanhua Li, Wenzhong Guo, Kun Guo","doi":"10.1109/WISA.2015.58","DOIUrl":null,"url":null,"abstract":"Micro-blog has become a symbol of the novel social media, and because of its rapid development in such a short time, many research researchers are full of enthusiasm about it. We take use of Latent Dirichlet Allocation (LDA) Model which has excellent dimension reduction capability and can excavate latent semantic from texts to discover popular topics. We improve the original LDA model to FSC-LDA model by combining the text clustering methods and feature selection methods, which can identify the number of topics adaptively. FSC-LDA model can keep short micro-blog texts features better, and make the result more stable. The result of the experiments on real Chinese microblog text dataset shows that FSC-LDA model can perform well on the custom evaluation and find more accurate popular topics.","PeriodicalId":198938,"journal":{"name":"2015 12th Web Information System and Application Conference (WISA)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":"{\"title\":\"Popular Topic Detection in Chinese Micro-Blog Based on the Modified LDA Model\",\"authors\":\"Yuzhong Chen, Wanhua Li, Wenzhong Guo, Kun Guo\",\"doi\":\"10.1109/WISA.2015.58\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Micro-blog has become a symbol of the novel social media, and because of its rapid development in such a short time, many research researchers are full of enthusiasm about it. We take use of Latent Dirichlet Allocation (LDA) Model which has excellent dimension reduction capability and can excavate latent semantic from texts to discover popular topics. We improve the original LDA model to FSC-LDA model by combining the text clustering methods and feature selection methods, which can identify the number of topics adaptively. FSC-LDA model can keep short micro-blog texts features better, and make the result more stable. The result of the experiments on real Chinese microblog text dataset shows that FSC-LDA model can perform well on the custom evaluation and find more accurate popular topics.\",\"PeriodicalId\":198938,\"journal\":{\"name\":\"2015 12th Web Information System and Application Conference (WISA)\",\"volume\":\"32 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-09-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 12th Web Information System and Application Conference (WISA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/WISA.2015.58\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 12th Web Information System and Application Conference (WISA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WISA.2015.58","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Popular Topic Detection in Chinese Micro-Blog Based on the Modified LDA Model
Micro-blog has become a symbol of the novel social media, and because of its rapid development in such a short time, many research researchers are full of enthusiasm about it. We take use of Latent Dirichlet Allocation (LDA) Model which has excellent dimension reduction capability and can excavate latent semantic from texts to discover popular topics. We improve the original LDA model to FSC-LDA model by combining the text clustering methods and feature selection methods, which can identify the number of topics adaptively. FSC-LDA model can keep short micro-blog texts features better, and make the result more stable. The result of the experiments on real Chinese microblog text dataset shows that FSC-LDA model can perform well on the custom evaluation and find more accurate popular topics.