Shuhei Yamamoto, Kei Wakabayashi, N. Kando, T. Satoh
{"title":"Who are growth users?: analyzing and predicting intended Twitter user growth","authors":"Shuhei Yamamoto, Kei Wakabayashi, N. Kando, T. Satoh","doi":"10.1145/3011141.3011145","DOIUrl":null,"url":null,"abstract":"Twitter reflects events and trends in users' real lives because many of them post tweets related to their experiences. Many studies have succeeded in detecting events such as earthquakes and influenza epidemics, along with real-life information from a large amount of tweets, by assuming users as social sensors. On the other hand, inactive users who don't engage in posting activity, are increasing according as time progresses. To collect a large amount of tweets based on specific users for successful Twitter studies, we have to know the characteristics of users who are active over long periods of time. In this paper, we clarify the characteristics of growth users over a long time to strategically collect a large amount of specific users' tweets. We explore the status of users who were active in 2012, and classify users into three statuses of Dead, Lock, and Alive. Based on the differences between the numbers of tweets in 2012 and 2016, we further classify alive users into three types of Eraser, Slumber, and Growth. We analyze the characteristic feature values observed in each user behavior and provide interesting findings with each status/type based on GMM clustering and point-wise mutual information. Finally, we propose a growth user prediction method by a simple formula consisting of feature values and evaluate the effectiveness. We found that active users more easily dropped out than inactive users, and users who engaged in reciprocal communications by replies and retweets often became Growth type.","PeriodicalId":247823,"journal":{"name":"Proceedings of the 18th International Conference on Information Integration and Web-based Applications and Services","volume":"114 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 18th International Conference on Information Integration and Web-based Applications and Services","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3011141.3011145","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Twitter reflects events and trends in users' real lives because many of them post tweets related to their experiences. Many studies have succeeded in detecting events such as earthquakes and influenza epidemics, along with real-life information from a large amount of tweets, by assuming users as social sensors. On the other hand, inactive users who don't engage in posting activity, are increasing according as time progresses. To collect a large amount of tweets based on specific users for successful Twitter studies, we have to know the characteristics of users who are active over long periods of time. In this paper, we clarify the characteristics of growth users over a long time to strategically collect a large amount of specific users' tweets. We explore the status of users who were active in 2012, and classify users into three statuses of Dead, Lock, and Alive. Based on the differences between the numbers of tweets in 2012 and 2016, we further classify alive users into three types of Eraser, Slumber, and Growth. We analyze the characteristic feature values observed in each user behavior and provide interesting findings with each status/type based on GMM clustering and point-wise mutual information. Finally, we propose a growth user prediction method by a simple formula consisting of feature values and evaluate the effectiveness. We found that active users more easily dropped out than inactive users, and users who engaged in reciprocal communications by replies and retweets often became Growth type.