用偏径分析驯服一群重尾动物

J. Introne, S. Goggins
{"title":"用偏径分析驯服一群重尾动物","authors":"J. Introne, S. Goggins","doi":"10.1145/2786451.2786484","DOIUrl":null,"url":null,"abstract":"The discovery of stable, heavy-tailed distributions of activity on the web has inspired many researchers to search for simple mechanisms that can cut through the complexity of countless social interactions to yield powerful new theories about human behavior. A dominant mode of investigation involves fitting a mathematical model to an observed distribution, and then inferring the behaviors that generate the modeled distribution. Yet, distributions of activity are not always stable, and the process of fitting a mathematical model to empirical distributions can be highly uncertain, especially for smaller and highly variable datasets. In this paper, we introduce an approach called skew-path analysis, which measures how concentrated information production is along different dimensions in community-generated data. The approach scales from small to large datasets, and is suitable for investigating the dynamics of online behavior. We offer a preliminary demonstration of the approach by using it to analyze six years of data from an online health community, and show that the technique offers interesting insights into the dynamics of information production. In particular, we find evidence for two distinct point attractors within a subset of the forums analyzed, demonstrating the utility of the approach.","PeriodicalId":93136,"journal":{"name":"Proceedings of the ... ACM Web Science Conference. ACM Web Science Conference","volume":"13 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2015-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Taming a Menagerie of Heavy Tails with Skew Path Analysis\",\"authors\":\"J. Introne, S. Goggins\",\"doi\":\"10.1145/2786451.2786484\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The discovery of stable, heavy-tailed distributions of activity on the web has inspired many researchers to search for simple mechanisms that can cut through the complexity of countless social interactions to yield powerful new theories about human behavior. A dominant mode of investigation involves fitting a mathematical model to an observed distribution, and then inferring the behaviors that generate the modeled distribution. Yet, distributions of activity are not always stable, and the process of fitting a mathematical model to empirical distributions can be highly uncertain, especially for smaller and highly variable datasets. In this paper, we introduce an approach called skew-path analysis, which measures how concentrated information production is along different dimensions in community-generated data. The approach scales from small to large datasets, and is suitable for investigating the dynamics of online behavior. We offer a preliminary demonstration of the approach by using it to analyze six years of data from an online health community, and show that the technique offers interesting insights into the dynamics of information production. In particular, we find evidence for two distinct point attractors within a subset of the forums analyzed, demonstrating the utility of the approach.\",\"PeriodicalId\":93136,\"journal\":{\"name\":\"Proceedings of the ... ACM Web Science Conference. ACM Web Science Conference\",\"volume\":\"13 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-06-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the ... ACM Web Science Conference. ACM Web Science Conference\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/2786451.2786484\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the ... ACM Web Science Conference. ACM Web Science Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2786451.2786484","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

网络上活动的稳定、重尾分布的发现,激发了许多研究人员寻找简单的机制,可以从无数复杂的社会互动中割据出来,产生关于人类行为的有力新理论。一种主要的调查模式包括将数学模型拟合到观察到的分布,然后推断产生模型分布的行为。然而,活动的分布并不总是稳定的,将数学模型拟合到经验分布的过程可能是高度不确定的,特别是对于较小和高度可变的数据集。在本文中,我们介绍了一种称为倾斜路径分析的方法,该方法测量了社区生成数据中不同维度上信息生产的集中程度。该方法可以从小型数据集扩展到大型数据集,并且适合于调查在线行为的动态。我们通过使用该方法分析一个在线健康社区6年的数据,对该方法进行了初步演示,并表明该技术为信息生产的动态提供了有趣的见解。特别是,我们在分析的论坛子集中发现了两个不同的点吸引子的证据,证明了该方法的实用性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Taming a Menagerie of Heavy Tails with Skew Path Analysis
The discovery of stable, heavy-tailed distributions of activity on the web has inspired many researchers to search for simple mechanisms that can cut through the complexity of countless social interactions to yield powerful new theories about human behavior. A dominant mode of investigation involves fitting a mathematical model to an observed distribution, and then inferring the behaviors that generate the modeled distribution. Yet, distributions of activity are not always stable, and the process of fitting a mathematical model to empirical distributions can be highly uncertain, especially for smaller and highly variable datasets. In this paper, we introduce an approach called skew-path analysis, which measures how concentrated information production is along different dimensions in community-generated data. The approach scales from small to large datasets, and is suitable for investigating the dynamics of online behavior. We offer a preliminary demonstration of the approach by using it to analyze six years of data from an online health community, and show that the technique offers interesting insights into the dynamics of information production. In particular, we find evidence for two distinct point attractors within a subset of the forums analyzed, demonstrating the utility of the approach.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信