Wendel Silva, Á. Santana, F. Lobato, Márcia Pinheiro
{"title":"一种Twitter社区检测方法","authors":"Wendel Silva, Á. Santana, F. Lobato, Márcia Pinheiro","doi":"10.1145/3106426.3117760","DOIUrl":null,"url":null,"abstract":"The microblogging service Twitter is one of the world's most popular online social networks and assembles a huge amount of data produced by interactions between users. A careful analysis of this data allows identifying groups of users who share similar traits, opinions, and preferences. We call community detection the process of user group identification, which grants valuable insights not available upfront. In order to extract useful knowledge from Twitter data many methodologies have been proposed, which define the attributes to be used in community detection problems by manual and empirical criteria - oftentimes guided by the aimed type of community and what the researcher attaches importance to. However, such approach cannot be generalized because it is well known that the task of finding out an appropriate set of attributes leans on context, domain, and data set. Aiming to the advance of community detection domain, reduce computational cost and improve the quality of related researches, this paper proposes a standard methodology for community detection in Twitter using feature selection methods. Results of the present research directly affect the way community detection methodologies have been applied to Twitter and quality of outcomes produced.","PeriodicalId":20685,"journal":{"name":"Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics","volume":"116 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2017-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"33","resultStr":"{\"title\":\"A methodology for community detection in Twitter\",\"authors\":\"Wendel Silva, Á. Santana, F. Lobato, Márcia Pinheiro\",\"doi\":\"10.1145/3106426.3117760\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The microblogging service Twitter is one of the world's most popular online social networks and assembles a huge amount of data produced by interactions between users. A careful analysis of this data allows identifying groups of users who share similar traits, opinions, and preferences. We call community detection the process of user group identification, which grants valuable insights not available upfront. In order to extract useful knowledge from Twitter data many methodologies have been proposed, which define the attributes to be used in community detection problems by manual and empirical criteria - oftentimes guided by the aimed type of community and what the researcher attaches importance to. However, such approach cannot be generalized because it is well known that the task of finding out an appropriate set of attributes leans on context, domain, and data set. Aiming to the advance of community detection domain, reduce computational cost and improve the quality of related researches, this paper proposes a standard methodology for community detection in Twitter using feature selection methods. Results of the present research directly affect the way community detection methodologies have been applied to Twitter and quality of outcomes produced.\",\"PeriodicalId\":20685,\"journal\":{\"name\":\"Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics\",\"volume\":\"116 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-08-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"33\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3106426.3117760\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3106426.3117760","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
The microblogging service Twitter is one of the world's most popular online social networks and assembles a huge amount of data produced by interactions between users. A careful analysis of this data allows identifying groups of users who share similar traits, opinions, and preferences. We call community detection the process of user group identification, which grants valuable insights not available upfront. In order to extract useful knowledge from Twitter data many methodologies have been proposed, which define the attributes to be used in community detection problems by manual and empirical criteria - oftentimes guided by the aimed type of community and what the researcher attaches importance to. However, such approach cannot be generalized because it is well known that the task of finding out an appropriate set of attributes leans on context, domain, and data set. Aiming to the advance of community detection domain, reduce computational cost and improve the quality of related researches, this paper proposes a standard methodology for community detection in Twitter using feature selection methods. Results of the present research directly affect the way community detection methodologies have been applied to Twitter and quality of outcomes produced.