Zhibin Zhao, Yanfeng Jia, Lan Yao, Ge Yu, Xiangyang Li
IEEE WISA. DOI: 10.1109/WISA.2013.52 (https://doi.org/10.1109/WISA.2013.52)
5WTAG: Detecting the Topics of Chinese Microblogs Based on 5W Model
Hash tags are important metadata in microblogs, used to mark topics and index messages. However, statistics show that most microblogs carry no hash tags, which poses great challenges for the retrieval and analysis of these tagless microblogs. In this paper, we summarize the similarities between microblogs and short message news, and then propose an algorithm named 5WTAG for detecting microblog topics based on the 5W (When, Where, Who, What, How) model. Since the 5W attributes are the core components of an event description, 5WTAG is theoretically guaranteed to extract the semantics of microblogs properly. We describe the procedure of 5WTAG in detail, including candidate hash tag construction and recommendation computation. Finally, we verify the semantic correctness of the candidate hash tags, as well as the effectiveness of the recommendation computation, using a real data set from Sina Weibo.
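The abstract names two steps, candidate hash tag construction and recommendation computation, but does not give their rules. The following is only an illustrative sketch of how such a pipeline might be shaped, not the authors' method. The `FiveW` container, the tag-combination rule in `candidate_tags`, and the frequency-based score in `recommend` are all assumptions made for this example, and the 5W attributes are assumed to have been extracted already.

```python
from collections import Counter
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class FiveW:
    """Hypothetical container for the 5W attributes of one microblog."""
    when: Optional[str] = None
    where: Optional[str] = None
    who: Optional[str] = None
    what: Optional[str] = None
    how: Optional[str] = None


def candidate_tags(attrs: FiveW) -> List[str]:
    """Build candidate hash tags from the attributes that are present.

    Assumption: a candidate is the 'what' alone, or 'who' / 'where'
    prefixed to 'what'. The paper's actual rules are not in the abstract.
    """
    if attrs.what is None:
        return []  # without a 'what' there is no topic to name
    tags = [attrs.what]
    for prefix in (attrs.who, attrs.where):
        if prefix is not None:
            tags.append(prefix + attrs.what)
    return tags


def recommend(corpus: List[FiveW], top_k: int = 3) -> List[str]:
    """Rank candidates by how often the corpus produces them (assumed score)."""
    counts = Counter(tag for attrs in corpus for tag in candidate_tags(attrs))
    return [tag for tag, _ in counts.most_common(top_k)]
```

Under these assumptions, calling `recommend` on a list of `FiveW` records returns the candidate tags that occur most often across the corpus; any real system would replace the counting step with the recommendation computation described in the paper.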