nlp驱动的城市洪水监测众包:来自孟买的见解

IF 12 1区 工程技术 Q1 CONSTRUCTION & BUILDING TECHNOLOGY
Aniket Navalkar , Shrabani S. Tripathy , Mayank Gupta , Sanghita Basu , Puja Tripathy , Archismita Banerjee , Sheeba Sekharan , Raghu Murtugudde , Subimal Ghosh
{"title":"nlp驱动的城市洪水监测众包:来自孟买的见解","authors":"Aniket Navalkar ,&nbsp;Shrabani S. Tripathy ,&nbsp;Mayank Gupta ,&nbsp;Sanghita Basu ,&nbsp;Puja Tripathy ,&nbsp;Archismita Banerjee ,&nbsp;Sheeba Sekharan ,&nbsp;Raghu Murtugudde ,&nbsp;Subimal Ghosh","doi":"10.1016/j.scs.2025.106795","DOIUrl":null,"url":null,"abstract":"<div><div>With increasing extreme rainfall events, densely populated megacities face recurrent flooding, necessitating real-time flood monitoring systems for citizens. Social media platforms like Twitter (now X) can complement conventional sensor-based flood monitoring. However, their utility is limited by low geotagging and a data stream cluttered with extraneous content. We take the case study of Mumbai to demonstrate how Natural Language Processing (NLP) can be leveraged to identify flood-relevant tweets. Applying NLP to a historical Twitter dataset (2017 – 2022), we match the text patterns in tweets against lexical keyword datasets to geocode them and classify their sentiment as ‘positive’ or ‘negative’. We observe a decline in the daily positivity ratio of tweets with increasing daily average rainfall for all years. We find that negative tweets are concentrated in areas of orographic rainfall, higher population density, and lower elevations. Further, we analysed extreme rainfall impacts on urban transportation using a network theory-based global efficiency loss (GEL) metric. For this, we configured Q-NEAT, a GIS-based network model, to perform time-optimized routing by integrating geolocated negative tweets alongside hotspots identified by the municipal corporation. We found that Twitter explained an additional 1.13 %, 20.96 %, and 67 % of the total GEL detected on the extreme rainfall days, 16th July 2021 (199.87 mm), 2nd July 2019 (200.12 mm), and 25th June 2018 (153.14 mm). Such impacts are not fully detected by the municipal corporation’s thumb rule for identifying waterlogging based on precipitation amounts and field surveys. Our findings underscore the advantages of crowdsourced data for enhancing urban flood monitoring and impact assessment.</div></div>","PeriodicalId":48659,"journal":{"name":"Sustainable Cities and Society","volume":"132 ","pages":"Article 106795"},"PeriodicalIF":12.0000,"publicationDate":"2025-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"NLP-driven crowdsourcing for urban flood monitoring: insights from mumbai\",\"authors\":\"Aniket Navalkar ,&nbsp;Shrabani S. Tripathy ,&nbsp;Mayank Gupta ,&nbsp;Sanghita Basu ,&nbsp;Puja Tripathy ,&nbsp;Archismita Banerjee ,&nbsp;Sheeba Sekharan ,&nbsp;Raghu Murtugudde ,&nbsp;Subimal Ghosh\",\"doi\":\"10.1016/j.scs.2025.106795\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>With increasing extreme rainfall events, densely populated megacities face recurrent flooding, necessitating real-time flood monitoring systems for citizens. Social media platforms like Twitter (now X) can complement conventional sensor-based flood monitoring. However, their utility is limited by low geotagging and a data stream cluttered with extraneous content. We take the case study of Mumbai to demonstrate how Natural Language Processing (NLP) can be leveraged to identify flood-relevant tweets. Applying NLP to a historical Twitter dataset (2017 – 2022), we match the text patterns in tweets against lexical keyword datasets to geocode them and classify their sentiment as ‘positive’ or ‘negative’. We observe a decline in the daily positivity ratio of tweets with increasing daily average rainfall for all years. We find that negative tweets are concentrated in areas of orographic rainfall, higher population density, and lower elevations. Further, we analysed extreme rainfall impacts on urban transportation using a network theory-based global efficiency loss (GEL) metric. For this, we configured Q-NEAT, a GIS-based network model, to perform time-optimized routing by integrating geolocated negative tweets alongside hotspots identified by the municipal corporation. We found that Twitter explained an additional 1.13 %, 20.96 %, and 67 % of the total GEL detected on the extreme rainfall days, 16th July 2021 (199.87 mm), 2nd July 2019 (200.12 mm), and 25th June 2018 (153.14 mm). Such impacts are not fully detected by the municipal corporation’s thumb rule for identifying waterlogging based on precipitation amounts and field surveys. Our findings underscore the advantages of crowdsourced data for enhancing urban flood monitoring and impact assessment.</div></div>\",\"PeriodicalId\":48659,\"journal\":{\"name\":\"Sustainable Cities and Society\",\"volume\":\"132 \",\"pages\":\"Article 106795\"},\"PeriodicalIF\":12.0000,\"publicationDate\":\"2025-09-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Sustainable Cities and Society\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2210670725006699\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CONSTRUCTION & BUILDING TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Sustainable Cities and Society","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2210670725006699","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CONSTRUCTION & BUILDING TECHNOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

随着极端降雨事件的增加,人口密集的特大城市面临经常性的洪水,需要为市民提供实时洪水监测系统。像Twitter(现在是X)这样的社交媒体平台可以补充传统的基于传感器的洪水监测。然而,它们的实用性受到低地理标记和混杂着无关内容的数据流的限制。我们以孟买的案例研究为例,展示如何利用自然语言处理(NLP)来识别与洪水相关的推文。将NLP应用于历史推特数据集(2017 - 2022),我们将推文中的文本模式与词汇关键字数据集进行匹配,对其进行地理编码,并将其情绪分类为“积极”或“消极”。我们观察到,随着所有年份的日平均降雨量的增加,推文的日积极率有所下降。我们发现,负面推文主要集中在地形降雨、人口密度较高和海拔较低的地区。此外,我们使用基于网络理论的全球效率损失(GEL)指标分析了极端降雨对城市交通的影响。为此,我们配置了Q-NEAT,一个基于gis的网络模型,通过将地理位置的负面推文与市政公司确定的热点集成在一起,来执行时间优化路由。我们发现,在极端降雨日(2021年7月16日(199.87毫米)、2019年7月2日(200.12毫米)和2018年6月25日(153.14毫米)中,Twitter解释了额外的1.13%、20.96%和67%的GEL。市政公司根据降雨量和实地调查确定内涝的经验法则并没有完全检测到这种影响。我们的研究结果强调了众包数据在加强城市洪水监测和影响评估方面的优势。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
NLP-driven crowdsourcing for urban flood monitoring: insights from mumbai
With increasing extreme rainfall events, densely populated megacities face recurrent flooding, necessitating real-time flood monitoring systems for citizens. Social media platforms like Twitter (now X) can complement conventional sensor-based flood monitoring. However, their utility is limited by low geotagging and a data stream cluttered with extraneous content. We take the case study of Mumbai to demonstrate how Natural Language Processing (NLP) can be leveraged to identify flood-relevant tweets. Applying NLP to a historical Twitter dataset (2017 – 2022), we match the text patterns in tweets against lexical keyword datasets to geocode them and classify their sentiment as ‘positive’ or ‘negative’. We observe a decline in the daily positivity ratio of tweets with increasing daily average rainfall for all years. We find that negative tweets are concentrated in areas of orographic rainfall, higher population density, and lower elevations. Further, we analysed extreme rainfall impacts on urban transportation using a network theory-based global efficiency loss (GEL) metric. For this, we configured Q-NEAT, a GIS-based network model, to perform time-optimized routing by integrating geolocated negative tweets alongside hotspots identified by the municipal corporation. We found that Twitter explained an additional 1.13 %, 20.96 %, and 67 % of the total GEL detected on the extreme rainfall days, 16th July 2021 (199.87 mm), 2nd July 2019 (200.12 mm), and 25th June 2018 (153.14 mm). Such impacts are not fully detected by the municipal corporation’s thumb rule for identifying waterlogging based on precipitation amounts and field surveys. Our findings underscore the advantages of crowdsourced data for enhancing urban flood monitoring and impact assessment.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Sustainable Cities and Society
Sustainable Cities and Society Social Sciences-Geography, Planning and Development
CiteScore
22.00
自引率
13.70%
发文量
810
审稿时长
27 days
期刊介绍: Sustainable Cities and Society (SCS) is an international journal that focuses on fundamental and applied research to promote environmentally sustainable and socially resilient cities. The journal welcomes cross-cutting, multi-disciplinary research in various areas, including: 1. Smart cities and resilient environments; 2. Alternative/clean energy sources, energy distribution, distributed energy generation, and energy demand reduction/management; 3. Monitoring and improving air quality in built environment and cities (e.g., healthy built environment and air quality management); 4. Energy efficient, low/zero carbon, and green buildings/communities; 5. Climate change mitigation and adaptation in urban environments; 6. Green infrastructure and BMPs; 7. Environmental Footprint accounting and management; 8. Urban agriculture and forestry; 9. ICT, smart grid and intelligent infrastructure; 10. Urban design/planning, regulations, legislation, certification, economics, and policy; 11. Social aspects, impacts and resiliency of cities; 12. Behavior monitoring, analysis and change within urban communities; 13. Health monitoring and improvement; 14. Nexus issues related to sustainable cities and societies; 15. Smart city governance; 16. Decision Support Systems for trade-off and uncertainty analysis for improved management of cities and society; 17. Big data, machine learning, and artificial intelligence applications and case studies; 18. Critical infrastructure protection, including security, privacy, forensics, and reliability issues of cyber-physical systems. 19. Water footprint reduction and urban water distribution, harvesting, treatment, reuse and management; 20. Waste reduction and recycling; 21. Wastewater collection, treatment and recycling; 22. Smart, clean and healthy transportation systems and infrastructure;
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信