Seok-Ho Yoon, S. Song, Sang-Wook Kim, Jiwoon Ha, Junghoon Lee, Hanil Kim
{"title":"Linkclus在博客圈中的应用","authors":"Seok-Ho Yoon, S. Song, Sang-Wook Kim, Jiwoon Ha, Junghoon Lee, Hanil Kim","doi":"10.1109/ICNIDC.2010.5657906","DOIUrl":null,"url":null,"abstract":"This paper addresses clustering of blog users and posts in blogosphere. First, we model blogosphere as a bipartite graph where blog users and posts correspond to nodes of two types and actions on posts performed by blog users corresponds to links. Next, for clustering in blogosphere, we employ LinkClus, a link-based algorithm that finds clusters of nodes in a network effectively and efficiently. For more accurate clustering, we propose two refinements: (1) change of granularity from blog users to folders, and (2) removal of blog users and posts being highly likely to incur noises. Finally, we verify the effectiveness of the proposed approach by showing how the posts and blog users in the same cluster are similar to one another in terms of their contents.","PeriodicalId":348778,"journal":{"name":"2010 2nd IEEE InternationalConference on Network Infrastructure and Digital Content","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Application of Linkclus in blogosphere\",\"authors\":\"Seok-Ho Yoon, S. Song, Sang-Wook Kim, Jiwoon Ha, Junghoon Lee, Hanil Kim\",\"doi\":\"10.1109/ICNIDC.2010.5657906\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper addresses clustering of blog users and posts in blogosphere. First, we model blogosphere as a bipartite graph where blog users and posts correspond to nodes of two types and actions on posts performed by blog users corresponds to links. Next, for clustering in blogosphere, we employ LinkClus, a link-based algorithm that finds clusters of nodes in a network effectively and efficiently. For more accurate clustering, we propose two refinements: (1) change of granularity from blog users to folders, and (2) removal of blog users and posts being highly likely to incur noises. Finally, we verify the effectiveness of the proposed approach by showing how the posts and blog users in the same cluster are similar to one another in terms of their contents.\",\"PeriodicalId\":348778,\"journal\":{\"name\":\"2010 2nd IEEE InternationalConference on Network Infrastructure and Digital Content\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-12-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 2nd IEEE InternationalConference on Network Infrastructure and Digital Content\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICNIDC.2010.5657906\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 2nd IEEE InternationalConference on Network Infrastructure and Digital Content","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICNIDC.2010.5657906","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
This paper addresses clustering of blog users and posts in blogosphere. First, we model blogosphere as a bipartite graph where blog users and posts correspond to nodes of two types and actions on posts performed by blog users corresponds to links. Next, for clustering in blogosphere, we employ LinkClus, a link-based algorithm that finds clusters of nodes in a network effectively and efficiently. For more accurate clustering, we propose two refinements: (1) change of granularity from blog users to folders, and (2) removal of blog users and posts being highly likely to incur noises. Finally, we verify the effectiveness of the proposed approach by showing how the posts and blog users in the same cluster are similar to one another in terms of their contents.