Dandan Zhu, Yusuke Fukazawa, Eleftherios Karapetsas, J. Ota
{"title":"基于活动的主题发现","authors":"Dandan Zhu, Yusuke Fukazawa, Eleftherios Karapetsas, J. Ota","doi":"10.3233/WIA-140292","DOIUrl":null,"url":null,"abstract":"A topic model capable of assigning word pairs to associated topics is developed to explore people's activities. Considering that the form of word pairs led by verbs is a more effective way to express people's activities than separate words, we incorporate the word-connection model into the smoothed Latent Dirichlet Allocation LDA to ensure that the words are well paired and assigned to the associated topics. To quantitatively and qualitatively evaluate the proposed model, two datasets were built using Twitter posts as data sources: the wish-related and the geographical information-related datasets. The experiment results using the wish-related dataset indicate that the relatedness of words plays a key role in forming reasonable pairs, and the proposed model, word-pair generative Latent Dirichlet Allocation wpLDA, performs well in clustering. Results obtained using the geographical information-related dataset demonstrate that the proposed model works well for discovering people's activities, in which the activities are understandably represented with an intuitive character.","PeriodicalId":263450,"journal":{"name":"Web Intell. Agent Syst.","volume":"69 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Activity-based topic discovery\",\"authors\":\"Dandan Zhu, Yusuke Fukazawa, Eleftherios Karapetsas, J. Ota\",\"doi\":\"10.3233/WIA-140292\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A topic model capable of assigning word pairs to associated topics is developed to explore people's activities. Considering that the form of word pairs led by verbs is a more effective way to express people's activities than separate words, we incorporate the word-connection model into the smoothed Latent Dirichlet Allocation LDA to ensure that the words are well paired and assigned to the associated topics. To quantitatively and qualitatively evaluate the proposed model, two datasets were built using Twitter posts as data sources: the wish-related and the geographical information-related datasets. The experiment results using the wish-related dataset indicate that the relatedness of words plays a key role in forming reasonable pairs, and the proposed model, word-pair generative Latent Dirichlet Allocation wpLDA, performs well in clustering. Results obtained using the geographical information-related dataset demonstrate that the proposed model works well for discovering people's activities, in which the activities are understandably represented with an intuitive character.\",\"PeriodicalId\":263450,\"journal\":{\"name\":\"Web Intell. Agent Syst.\",\"volume\":\"69 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Web Intell. Agent Syst.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3233/WIA-140292\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Web Intell. Agent Syst.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3233/WIA-140292","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A topic model capable of assigning word pairs to associated topics is developed to explore people's activities. Considering that the form of word pairs led by verbs is a more effective way to express people's activities than separate words, we incorporate the word-connection model into the smoothed Latent Dirichlet Allocation LDA to ensure that the words are well paired and assigned to the associated topics. To quantitatively and qualitatively evaluate the proposed model, two datasets were built using Twitter posts as data sources: the wish-related and the geographical information-related datasets. The experiment results using the wish-related dataset indicate that the relatedness of words plays a key role in forming reasonable pairs, and the proposed model, word-pair generative Latent Dirichlet Allocation wpLDA, performs well in clustering. Results obtained using the geographical information-related dataset demonstrate that the proposed model works well for discovering people's activities, in which the activities are understandably represented with an intuitive character.