信息流估计：推特新闻研究

Q1 Social Sciences

Online Social Networks and Media Pub Date : 2022-09-01 DOI:10.1016/j.osnem.2022.100231

Tobin South , Bridget Smart , Matthew Roughan , Lewis Mitchell

{"title":"信息流估计：推特新闻研究","authors":"Tobin South , Bridget Smart , Matthew Roughan , Lewis Mitchell","doi":"10.1016/j.osnem.2022.100231","DOIUrl":null,"url":null,"abstract":"<div><p>News media has long been an ecosystem of creation, reproduction, and critique, where news outlets report on current events and add commentary to ongoing stories. Understanding the dynamics of news information creation and dispersion is important to accurately ascribe credit to influential work and understand how societal narratives develop. These dynamics can be modelled through a combination of information-theoretic natural language processing and networks; and can be parameterised using large quantities of textual data. However, it is challenging to see “the wood for the trees”, <em>i.e.,</em> to detect small but important flows of information in a sea of noise. Here we develop new comparative techniques to estimate temporal information flow between pairs of text producers. Using both simulated and real text data we compare the reliability and sensitivity of methods for estimating textual information flow, showing that a metric that normalises by local neighbourhood structure provides a robust estimate of information flow in large networks. We apply this metric to a large corpus of news organisations on Twitter and demonstrate its usefulness in identifying influence within an information ecosystem, finding that average information contribution to the network is not correlated with the number of followers or the number of tweets. This suggests that small local organisations and right-wing organisations which have lower average follower counts still contribute significant information to the ecosystem. Further, the methods are applied to smaller full-text datasets of specific news events across news sites and Russian troll accounts on Twitter. The information flow estimation reveals and quantifies features of how these events develop and the role of groups of trolls in setting disinformation narratives. In summary, this work provides a new methodology for examining the information transmitted between content producers in any connected system of natural language, a toolkit with applications to the many networked discourses of our online world.</p></div>","PeriodicalId":52228,"journal":{"name":"Online Social Networks and Media","volume":"31 ","pages":"Article 100231"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2468696422000337/pdfft?md5=4198fd7ff22efcd01f7d1b4875be626a&pid=1-s2.0-S2468696422000337-main.pdf","citationCount":"4","resultStr":"{\"title\":\"Information flow estimation: A study of news on Twitter\",\"authors\":\"Tobin South , Bridget Smart , Matthew Roughan , Lewis Mitchell\",\"doi\":\"10.1016/j.osnem.2022.100231\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>News media has long been an ecosystem of creation, reproduction, and critique, where news outlets report on current events and add commentary to ongoing stories. Understanding the dynamics of news information creation and dispersion is important to accurately ascribe credit to influential work and understand how societal narratives develop. These dynamics can be modelled through a combination of information-theoretic natural language processing and networks; and can be parameterised using large quantities of textual data. However, it is challenging to see “the wood for the trees”, <em>i.e.,</em> to detect small but important flows of information in a sea of noise. Here we develop new comparative techniques to estimate temporal information flow between pairs of text producers. Using both simulated and real text data we compare the reliability and sensitivity of methods for estimating textual information flow, showing that a metric that normalises by local neighbourhood structure provides a robust estimate of information flow in large networks. We apply this metric to a large corpus of news organisations on Twitter and demonstrate its usefulness in identifying influence within an information ecosystem, finding that average information contribution to the network is not correlated with the number of followers or the number of tweets. This suggests that small local organisations and right-wing organisations which have lower average follower counts still contribute significant information to the ecosystem. Further, the methods are applied to smaller full-text datasets of specific news events across news sites and Russian troll accounts on Twitter. The information flow estimation reveals and quantifies features of how these events develop and the role of groups of trolls in setting disinformation narratives. In summary, this work provides a new methodology for examining the information transmitted between content producers in any connected system of natural language, a toolkit with applications to the many networked discourses of our online world.</p></div>\",\"PeriodicalId\":52228,\"journal\":{\"name\":\"Online Social Networks and Media\",\"volume\":\"31 \",\"pages\":\"Article 100231\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2468696422000337/pdfft?md5=4198fd7ff22efcd01f7d1b4875be626a&pid=1-s2.0-S2468696422000337-main.pdf\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Online Social Networks and Media\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2468696422000337\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"Social Sciences\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Online Social Networks and Media","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2468696422000337","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Social Sciences","Score":null,"Total":0}

引用次数: 4

摘要

长期以来，新闻媒体一直是一个创作、复制和评论的生态系统，新闻媒体在这里报道时事，并为正在进行的故事添加评论。了解新闻信息创造和传播的动态对于准确地将功劳归于有影响力的作品和理解社会叙事是如何发展的很重要。这些动态可以通过信息论自然语言处理和网络的结合来建模；并且可以使用大量的文本数据来参数化。然而，要看到“以木换树”，即在噪音的海洋中检测微小但重要的信息流，是一项挑战。在这里，我们开发了新的比较技术来估计文本生产者对之间的时间信息流。使用模拟和真实文本数据，我们比较了估计文本信息流的方法的可靠性和敏感性，表明通过局部邻域结构归一化的度量提供了对大型网络中信息流的稳健估计。我们将这一指标应用于推特上的大量新闻机构，并证明了它在识别信息生态系统中的影响力方面的有用性，发现对网络的平均信息贡献与关注者数量或推文数量无关。这表明，平均追随者数量较低的小型地方组织和右翼组织仍然为生态系统贡献了重要信息。此外，这些方法还应用于新闻网站和推特上俄罗斯巨魔账户中特定新闻事件的较小全文数据集。信息流估计揭示并量化了这些事件如何发展的特征，以及巨魔群体在设置虚假信息叙事中的作用。总之，这项工作提供了一种新的方法来检查任何连接的自然语言系统中内容生产者之间传输的信息，这是一个应用于我们网络世界的许多网络话语的工具包。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Information flow estimation: A study of news on Twitter

News media has long been an ecosystem of creation, reproduction, and critique, where news outlets report on current events and add commentary to ongoing stories. Understanding the dynamics of news information creation and dispersion is important to accurately ascribe credit to influential work and understand how societal narratives develop. These dynamics can be modelled through a combination of information-theoretic natural language processing and networks; and can be parameterised using large quantities of textual data. However, it is challenging to see “the wood for the trees”, i.e., to detect small but important flows of information in a sea of noise. Here we develop new comparative techniques to estimate temporal information flow between pairs of text producers. Using both simulated and real text data we compare the reliability and sensitivity of methods for estimating textual information flow, showing that a metric that normalises by local neighbourhood structure provides a robust estimate of information flow in large networks. We apply this metric to a large corpus of news organisations on Twitter and demonstrate its usefulness in identifying influence within an information ecosystem, finding that average information contribution to the network is not correlated with the number of followers or the number of tweets. This suggests that small local organisations and right-wing organisations which have lower average follower counts still contribute significant information to the ecosystem. Further, the methods are applied to smaller full-text datasets of specific news events across news sites and Russian troll accounts on Twitter. The information flow estimation reveals and quantifies features of how these events develop and the role of groups of trolls in setting disinformation narratives. In summary, this work provides a new methodology for examining the information transmitted between content producers in any connected system of natural language, a toolkit with applications to the many networked discourses of our online world.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Online Social Networks and Media Social Sciences-Communication

CiteScore

10.60

自引率

0.00%

发文量

审稿时长

44 days