基于质量和活跃度发现新浪微博海量高价值用户

Guangzhi Zhang, R. Bie
{"title":"基于质量和活跃度发现新浪微博海量高价值用户","authors":"Guangzhi Zhang, R. Bie","doi":"10.1109/CyberC.2013.42","DOIUrl":null,"url":null,"abstract":"This paper proposes a method to discover the high-value users who are both high-quality and high-activity. First, a Trust Transfer Model is introduced to capture users with high-quality in Sina Weibo that is a social web in China. Then, analysis and discussion are shown to identify the users are high-quality. Next, fresh users who are busy reposting are captured into the dataset filled with the high-quality users. Considering that reposting stands for the high-activity, hence the four degrees including reposting are proposed to judge one user's quality and activity. We discuss the effects of the four degrees. And then, the method called \"WeiboRank\" based on the four degrees is proposed to mine the high-value users. Finally, testing for the degree of coverage based on the \"Top10 Hot Microblog\" in Weibo presents a relatively high credibility of our dataset. The evaluation indicates that the users mined in this paper are quite high-quality and high-value. In conclusion, we will continue efforts to discover the high-value users in microblog and believe that discovering and maintaining a dataset filled with the high-value users which is not too big is quite significant for both academic research and business applications.","PeriodicalId":133756,"journal":{"name":"2013 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery","volume":"48 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Discovering Massive High-Value Users from Sina Weibo Based on Quality and Activity\",\"authors\":\"Guangzhi Zhang, R. Bie\",\"doi\":\"10.1109/CyberC.2013.42\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper proposes a method to discover the high-value users who are both high-quality and high-activity. First, a Trust Transfer Model is introduced to capture users with high-quality in Sina Weibo that is a social web in China. Then, analysis and discussion are shown to identify the users are high-quality. Next, fresh users who are busy reposting are captured into the dataset filled with the high-quality users. Considering that reposting stands for the high-activity, hence the four degrees including reposting are proposed to judge one user's quality and activity. We discuss the effects of the four degrees. And then, the method called \\\"WeiboRank\\\" based on the four degrees is proposed to mine the high-value users. Finally, testing for the degree of coverage based on the \\\"Top10 Hot Microblog\\\" in Weibo presents a relatively high credibility of our dataset. The evaluation indicates that the users mined in this paper are quite high-quality and high-value. In conclusion, we will continue efforts to discover the high-value users in microblog and believe that discovering and maintaining a dataset filled with the high-value users which is not too big is quite significant for both academic research and business applications.\",\"PeriodicalId\":133756,\"journal\":{\"name\":\"2013 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery\",\"volume\":\"48 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-10-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2013 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CyberC.2013.42\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CyberC.2013.42","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

摘要

本文提出了一种发现高质量、高活跃度高价值用户的方法。首先,引入信任转移模型来捕获中国社交网络新浪微博的高质量用户。然后,通过分析和讨论来识别优质用户。接下来,将忙于转发的新用户捕获到充满高质量用户的数据集中。考虑到转发代表高活跃度,因此提出了包含转发在内的四个度来判断一个用户的质量和活跃度。我们讨论了四度的影响。在此基础上,提出了基于四度的“微博银行”挖掘高价值用户的方法。最后,基于微博“热门微博Top10”的覆盖度测试表明,我们的数据集具有较高的可信度。评价表明,本文所挖掘的用户具有较高的质量和价值。综上所述,我们将继续努力发现微博中的高价值用户,并认为发现和维护一个不太大的高价值用户数据集对于学术研究和商业应用都是非常重要的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Discovering Massive High-Value Users from Sina Weibo Based on Quality and Activity
This paper proposes a method to discover the high-value users who are both high-quality and high-activity. First, a Trust Transfer Model is introduced to capture users with high-quality in Sina Weibo that is a social web in China. Then, analysis and discussion are shown to identify the users are high-quality. Next, fresh users who are busy reposting are captured into the dataset filled with the high-quality users. Considering that reposting stands for the high-activity, hence the four degrees including reposting are proposed to judge one user's quality and activity. We discuss the effects of the four degrees. And then, the method called "WeiboRank" based on the four degrees is proposed to mine the high-value users. Finally, testing for the degree of coverage based on the "Top10 Hot Microblog" in Weibo presents a relatively high credibility of our dataset. The evaluation indicates that the users mined in this paper are quite high-quality and high-value. In conclusion, we will continue efforts to discover the high-value users in microblog and believe that discovering and maintaining a dataset filled with the high-value users which is not too big is quite significant for both academic research and business applications.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信