Mining socioenvironmental drivers of bias in social media rescue data for fairness-aware modeling: A case study of Hurricane Harvey

IF 4.2 1区 地球科学 Q1 GEOSCIENCES, MULTIDISCIPLINARY
Kejin Wang , Volodymyr Mihunov , Nina S.N. Lam , Mingxuan Sun
{"title":"Mining socioenvironmental drivers of bias in social media rescue data for fairness-aware modeling: A case study of Hurricane Harvey","authors":"Kejin Wang ,&nbsp;Volodymyr Mihunov ,&nbsp;Nina S.N. Lam ,&nbsp;Mingxuan Sun","doi":"10.1016/j.ijdrr.2025.105717","DOIUrl":null,"url":null,"abstract":"<div><div>Researchers now use AI to extract valuable insights from social media data for rapid rescue and emergency management. However, social media users are not sampled in a way that represents the entire population. This study establishes baseline Expected Rescue Request frequencies using social media data to address two critical knowledge gaps: (1) whether systematic differences in the expected rescue request frequencies during disasters exist among communities of different social-environmental characteristics, and (2) which of these characteristics reflect data representation biases and should be treated as sensitive attributes in fairness-aware AI models. Using 35 geographical, socioeconomic, and digital access variables alongside rescue request tweets from the 2017 Hurricane Harvey, a novel fairness measurement index (Rescue Request Difference) was developed and analyzed through regression and Random Forest modeling to balance predictive power, interoperability and transparency. Results showed that overserved communities exhibited higher proportions of physically and financially vulnerable populations, flood-prone geography, limited digital access, and single-family housing. Key attributes such as \"% Minority\" and \"Road density\" were identified as sensitive features requiring fairness rectification in predictive models. Limitations include reliance on a single case study and social media data biases. Future work should integrate multi-source data (e.g., 911 calls, volunteer reports) to strengthen the fairness-aware modeling framework.</div></div>","PeriodicalId":13915,"journal":{"name":"International journal of disaster risk reduction","volume":"128 ","pages":"Article 105717"},"PeriodicalIF":4.2000,"publicationDate":"2025-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International journal of disaster risk reduction","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2212420925005412","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Researchers now use AI to extract valuable insights from social media data for rapid rescue and emergency management. However, social media users are not sampled in a way that represents the entire population. This study establishes baseline Expected Rescue Request frequencies using social media data to address two critical knowledge gaps: (1) whether systematic differences in the expected rescue request frequencies during disasters exist among communities of different social-environmental characteristics, and (2) which of these characteristics reflect data representation biases and should be treated as sensitive attributes in fairness-aware AI models. Using 35 geographical, socioeconomic, and digital access variables alongside rescue request tweets from the 2017 Hurricane Harvey, a novel fairness measurement index (Rescue Request Difference) was developed and analyzed through regression and Random Forest modeling to balance predictive power, interoperability and transparency. Results showed that overserved communities exhibited higher proportions of physically and financially vulnerable populations, flood-prone geography, limited digital access, and single-family housing. Key attributes such as "% Minority" and "Road density" were identified as sensitive features requiring fairness rectification in predictive models. Limitations include reliance on a single case study and social media data biases. Future work should integrate multi-source data (e.g., 911 calls, volunteer reports) to strengthen the fairness-aware modeling framework.

Abstract Image

挖掘社会媒体救援数据中偏见的社会环境驱动因素,以实现公平意识建模:以飓风哈维为例
研究人员现在使用人工智能从社交媒体数据中提取有价值的见解,用于快速救援和应急管理。然而,社交媒体用户的抽样并不能代表整个人口。本研究利用社交媒体数据建立了基线预期救援请求频率,以解决两个关键的知识缺口:(1)灾害期间不同社会环境特征的社区之间是否存在预期救援请求频率的系统性差异,以及(2)这些特征中哪些反映了数据表示偏差,应被视为公平感知AI模型中的敏感属性。利用35个地理、社会经济和数字访问变量以及2017年飓风哈维的救援请求推文,开发了一种新的公平衡量指数(救援请求差异),并通过回归和随机森林建模进行了分析,以平衡预测能力、互操作性和透明度。结果表明,服务过剩的社区表现出更高比例的物质和经济脆弱人口、易受洪水影响的地理位置、有限的数字访问和单户住房。关键属性,如“% Minority”和“Road density”被确定为需要在预测模型中公平纠正的敏感特征。局限性包括对单一案例研究的依赖和社交媒体数据的偏见。未来的工作应该整合多源数据(例如,911呼叫,志愿者报告),以加强公平意识建模框架。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
International journal of disaster risk reduction
International journal of disaster risk reduction GEOSCIENCES, MULTIDISCIPLINARYMETEOROLOGY-METEOROLOGY & ATMOSPHERIC SCIENCES
CiteScore
8.70
自引率
18.00%
发文量
688
审稿时长
79 days
期刊介绍: The International Journal of Disaster Risk Reduction (IJDRR) is the journal for researchers, policymakers and practitioners across diverse disciplines: earth sciences and their implications; environmental sciences; engineering; urban studies; geography; and the social sciences. IJDRR publishes fundamental and applied research, critical reviews, policy papers and case studies with a particular focus on multi-disciplinary research that aims to reduce the impact of natural, technological, social and intentional disasters. IJDRR stimulates exchange of ideas and knowledge transfer on disaster research, mitigation, adaptation, prevention and risk reduction at all geographical scales: local, national and international. Key topics:- -multifaceted disaster and cascading disasters -the development of disaster risk reduction strategies and techniques -discussion and development of effective warning and educational systems for risk management at all levels -disasters associated with climate change -vulnerability analysis and vulnerability trends -emerging risks -resilience against disasters. The journal particularly encourages papers that approach risk from a multi-disciplinary perspective.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信