鲁棒网络放大方法估计

IF 2.4 2区 社会学 Q1 ANTHROPOLOGY
Sergio Díaz-Aranda , Juan Marcos Ramírez , Jose Aguilar , Rosa E. Lillo , Antonio Fernández Anta
{"title":"鲁棒网络放大方法估计","authors":"Sergio Díaz-Aranda ,&nbsp;Juan Marcos Ramírez ,&nbsp;Jose Aguilar ,&nbsp;Rosa E. Lillo ,&nbsp;Antonio Fernández Anta","doi":"10.1016/j.socnet.2025.08.002","DOIUrl":null,"url":null,"abstract":"<div><div>The Network Scale-up Method (NSUM) is a relatively recent statistical approach for estimating the prevalence of unknown populations through indirect surveys utilizing information about the respondents’ social circles. The popularity of NSUM has increased in recent years due to its ability to uphold privacy and cost-effectiveness. However, the NSUM is not exempt from biases resulting from participants’ behavior. In addition, the simpler and most popular NSUM estimators are based on averages, making them sensitive to deviations in the samples, which may cause significant errors. This work aims to study how robust procedures can overcome misreporting, contamination, and deviation due to conditions such as barrier effects, prevalence, skewness, and tail length. Specifically, the central objective of the article is to analyze the statistical robustness of NSUM methods, studying whether these methods are affected by outliers or unusual data. We employ eight robust proposals for each of the two classical NSUM estimators. We analyze robust estimators through simulation experiments using synthetic random networks such as Erdős–Rényi, Scale Free, and Stochastic Block Model structures to model different degree distributions and community structures with different prevalence levels in contaminated and uncontaminated scenarios. We compare the results of the simulations with real data on COVID-19 indicators in the United Kingdom and voting intention in the Spanish General Elections of 2023. This article shows that the classical NSUM estimators perform poorly in contaminated scenarios, while most of the robust proposals are not considerably affected. However, the performance of some robust NSUM estimators decreases under barrier effects. In addition, we observe that distortions created by small prevalence play an important role in selecting the most suitable robust NSUM estimator. Particularly, the robustification of the Mean of Ratios (<span><math><mi>MoR</mi></math></span>) estimator based on the Myriad operator typically exhibits the best performance (for MoR methods) across the various social network structures for different prevalence levels, reducing the estimation error regarding the non-robust methods by up to three orders of magnitude in contaminated scenarios.</div></div>","PeriodicalId":48353,"journal":{"name":"Social Networks","volume":"84 ","pages":"Pages 46-61"},"PeriodicalIF":2.4000,"publicationDate":"2025-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Robust network scale-up method estimators\",\"authors\":\"Sergio Díaz-Aranda ,&nbsp;Juan Marcos Ramírez ,&nbsp;Jose Aguilar ,&nbsp;Rosa E. Lillo ,&nbsp;Antonio Fernández Anta\",\"doi\":\"10.1016/j.socnet.2025.08.002\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The Network Scale-up Method (NSUM) is a relatively recent statistical approach for estimating the prevalence of unknown populations through indirect surveys utilizing information about the respondents’ social circles. The popularity of NSUM has increased in recent years due to its ability to uphold privacy and cost-effectiveness. However, the NSUM is not exempt from biases resulting from participants’ behavior. In addition, the simpler and most popular NSUM estimators are based on averages, making them sensitive to deviations in the samples, which may cause significant errors. This work aims to study how robust procedures can overcome misreporting, contamination, and deviation due to conditions such as barrier effects, prevalence, skewness, and tail length. Specifically, the central objective of the article is to analyze the statistical robustness of NSUM methods, studying whether these methods are affected by outliers or unusual data. We employ eight robust proposals for each of the two classical NSUM estimators. We analyze robust estimators through simulation experiments using synthetic random networks such as Erdős–Rényi, Scale Free, and Stochastic Block Model structures to model different degree distributions and community structures with different prevalence levels in contaminated and uncontaminated scenarios. We compare the results of the simulations with real data on COVID-19 indicators in the United Kingdom and voting intention in the Spanish General Elections of 2023. This article shows that the classical NSUM estimators perform poorly in contaminated scenarios, while most of the robust proposals are not considerably affected. However, the performance of some robust NSUM estimators decreases under barrier effects. In addition, we observe that distortions created by small prevalence play an important role in selecting the most suitable robust NSUM estimator. Particularly, the robustification of the Mean of Ratios (<span><math><mi>MoR</mi></math></span>) estimator based on the Myriad operator typically exhibits the best performance (for MoR methods) across the various social network structures for different prevalence levels, reducing the estimation error regarding the non-robust methods by up to three orders of magnitude in contaminated scenarios.</div></div>\",\"PeriodicalId\":48353,\"journal\":{\"name\":\"Social Networks\",\"volume\":\"84 \",\"pages\":\"Pages 46-61\"},\"PeriodicalIF\":2.4000,\"publicationDate\":\"2025-08-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Social Networks\",\"FirstCategoryId\":\"90\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0378873325000590\",\"RegionNum\":2,\"RegionCategory\":\"社会学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ANTHROPOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Social Networks","FirstCategoryId":"90","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0378873325000590","RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ANTHROPOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

网络放大法(Network Scale-up Method, NSUM)是一种相对较新的统计方法,通过间接调查利用被调查者的社交圈信息来估计未知人群的患病率。近年来,由于其保护隐私和成本效益的能力,NSUM越来越受欢迎。然而,NSUM也不能避免参与者行为造成的偏差。此外,最简单和最流行的NSUM估计器是基于平均值的,这使得它们对样本中的偏差很敏感,这可能会导致显著的误差。这项工作旨在研究稳健的程序如何克服由于屏障效应、流行率、偏度和尾长等条件造成的误报、污染和偏差。具体来说,本文的中心目标是分析NSUM方法的统计稳健性,研究这些方法是否受到异常值或异常数据的影响。我们对两个经典的NSUM估计分别采用了8个稳健的建议。我们通过模拟实验分析了鲁棒性估计,使用合成随机网络如Erdős-Rényi、Scale Free和随机块模型结构来模拟污染和未污染情景中不同程度分布和不同患病率水平的社区结构。我们将模拟结果与英国COVID-19指标和西班牙2023年大选投票意向的真实数据进行了比较。本文表明,经典的NSUM估计器在污染情况下表现不佳,而大多数鲁棒性建议没有受到很大影响。然而,一些鲁棒NSUM估计器在屏障效应下性能下降。此外,我们观察到小流行率产生的扭曲在选择最合适的稳健NSUM估计器中起着重要作用。特别是,基于Myriad算子的比率均值(MoR)估计器的鲁棒性通常在不同流行水平的各种社会网络结构中表现出最佳性能(对于MoR方法),在污染场景中将非鲁棒方法的估计误差减少了多达三个数量级。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Robust network scale-up method estimators
The Network Scale-up Method (NSUM) is a relatively recent statistical approach for estimating the prevalence of unknown populations through indirect surveys utilizing information about the respondents’ social circles. The popularity of NSUM has increased in recent years due to its ability to uphold privacy and cost-effectiveness. However, the NSUM is not exempt from biases resulting from participants’ behavior. In addition, the simpler and most popular NSUM estimators are based on averages, making them sensitive to deviations in the samples, which may cause significant errors. This work aims to study how robust procedures can overcome misreporting, contamination, and deviation due to conditions such as barrier effects, prevalence, skewness, and tail length. Specifically, the central objective of the article is to analyze the statistical robustness of NSUM methods, studying whether these methods are affected by outliers or unusual data. We employ eight robust proposals for each of the two classical NSUM estimators. We analyze robust estimators through simulation experiments using synthetic random networks such as Erdős–Rényi, Scale Free, and Stochastic Block Model structures to model different degree distributions and community structures with different prevalence levels in contaminated and uncontaminated scenarios. We compare the results of the simulations with real data on COVID-19 indicators in the United Kingdom and voting intention in the Spanish General Elections of 2023. This article shows that the classical NSUM estimators perform poorly in contaminated scenarios, while most of the robust proposals are not considerably affected. However, the performance of some robust NSUM estimators decreases under barrier effects. In addition, we observe that distortions created by small prevalence play an important role in selecting the most suitable robust NSUM estimator. Particularly, the robustification of the Mean of Ratios (MoR) estimator based on the Myriad operator typically exhibits the best performance (for MoR methods) across the various social network structures for different prevalence levels, reducing the estimation error regarding the non-robust methods by up to three orders of magnitude in contaminated scenarios.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Social Networks
Social Networks Multiple-
CiteScore
5.90
自引率
12.90%
发文量
118
期刊介绍: Social Networks is an interdisciplinary and international quarterly. It provides a common forum for representatives of anthropology, sociology, history, social psychology, political science, human geography, biology, economics, communications science and other disciplines who share an interest in the study of the empirical structure of social relations and associations that may be expressed in network form. It publishes both theoretical and substantive papers. Critical reviews of major theoretical or methodological approaches using the notion of networks in the analysis of social behaviour are also included, as are reviews of recent books dealing with social networks and social structure.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信