{"title":"A user activity-based measurement study characterizing and classifying Stack Exchange communities across multiple domains","authors":"Akshit Trehan, S. Khurana, A. Bagchi","doi":"10.1145/3041823.3041834","DOIUrl":null,"url":null,"abstract":"In recent times, Question-Answer communities have engaged much user attention and have become a major platform for knowledge sharing and discussion. Stack Exchange (SE) is one such successful community which is a collection of various domain-specific forums, each acting as an independent community in itself. In this paper, we undertake a comparative measurement study across a large number of these domain-specific forums within Stack Exchange. We analyse a number of user activity-based features of each forum and try to cluster different forums based on their similarities on this feature set. For our study, we model Stack Exchange as an Across \"Forum Graph\" based on inter-forum similarity, and its individual forums as: (a) A user-to-user graph (question asker-answerer) (b) A bipartite graph between questions and answerers, and (c) A bipartite graph between questions and answers. Through these graphs we present a measurement study of Stack Exchange which focuses on the similarities and differences between various forums based on the patterns of user activity on them. The clusters obtained give a high level idea of similar forums based on common users and content. We observe that communities can be classified as \"discussion-based\" and \"fact-based\" and further we classify forums on the basis of question answering patterns.","PeriodicalId":173593,"journal":{"name":"Proceedings of the 4th ACM IKDD Conferences on Data Sciences","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-03-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 4th ACM IKDD Conferences on Data Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3041823.3041834","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In recent times, Question-Answer communities have engaged much user attention and have become a major platform for knowledge sharing and discussion. Stack Exchange (SE) is one such successful community which is a collection of various domain-specific forums, each acting as an independent community in itself. In this paper, we undertake a comparative measurement study across a large number of these domain-specific forums within Stack Exchange. We analyse a number of user activity-based features of each forum and try to cluster different forums based on their similarities on this feature set. For our study, we model Stack Exchange as an Across "Forum Graph" based on inter-forum similarity, and its individual forums as: (a) A user-to-user graph (question asker-answerer) (b) A bipartite graph between questions and answerers, and (c) A bipartite graph between questions and answers. Through these graphs we present a measurement study of Stack Exchange which focuses on the similarities and differences between various forums based on the patterns of user activity on them. The clusters obtained give a high level idea of similar forums based on common users and content. We observe that communities can be classified as "discussion-based" and "fact-based" and further we classify forums on the basis of question answering patterns.