Situational awareness extraction: a comprehensive review of social media data classification during natural hazards

IF 2.7 Q1 GEOGRAPHY

Annals of GIS Pub Date : 2020-10-09 DOI:10.1080/19475683.2020.1817146

Jirapa Vongkusolkit, Qunying Huang

{"title":"Situational awareness extraction: a comprehensive review of social media data classification during natural hazards","authors":"Jirapa Vongkusolkit, Qunying Huang","doi":"10.1080/19475683.2020.1817146","DOIUrl":null,"url":null,"abstract":"ABSTRACT Social media (e.g., Twitter and Facebook) can be regarded as vital sources of information during disasters to improve situational awareness (SA) and disaster management since they play a significant role in the rapid spread of information in the event of a disaster. Due to the volume of data is far beyond the capabilities of manual examination, existing works utilize natural language processing methods based on keywords, or classification models relying on features derived from text and other metadata (e.g., user profiles) to extract social media data contributing to SA and automatically categorize them into the relevant classes (e.g., damage and donation). However, the design of the classification schema and the associated information extraction methods are far less than straightforward and highly depend on: (1) the event type, (2) the study or analysis purpose, and (3) the social media platform used. To this end, this paper reviews the literature for extracting social media data and provides an overview of classification schemas that have been used to assess SA in events involving natural hazards from five different analytical perspectives (content, temporal, user, sentiment, and spatiotemporal) by discussing the prevalent topic categories, disaster event types, purpose of studies, and platforms utilized from each schema. Finally, this paper summarizes classification methods, and platforms that are most commonly used for each disaster event type, and outlines a research agenda with recommendations for future innovations.","PeriodicalId":46270,"journal":{"name":"Annals of GIS","volume":"1 1","pages":"5 - 28"},"PeriodicalIF":2.7000,"publicationDate":"2020-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"17","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annals of GIS","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/19475683.2020.1817146","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOGRAPHY","Score":null,"Total":0}

引用次数: 17

Abstract

ABSTRACT Social media (e.g., Twitter and Facebook) can be regarded as vital sources of information during disasters to improve situational awareness (SA) and disaster management since they play a significant role in the rapid spread of information in the event of a disaster. Due to the volume of data is far beyond the capabilities of manual examination, existing works utilize natural language processing methods based on keywords, or classification models relying on features derived from text and other metadata (e.g., user profiles) to extract social media data contributing to SA and automatically categorize them into the relevant classes (e.g., damage and donation). However, the design of the classification schema and the associated information extraction methods are far less than straightforward and highly depend on: (1) the event type, (2) the study or analysis purpose, and (3) the social media platform used. To this end, this paper reviews the literature for extracting social media data and provides an overview of classification schemas that have been used to assess SA in events involving natural hazards from five different analytical perspectives (content, temporal, user, sentiment, and spatiotemporal) by discussing the prevalent topic categories, disaster event types, purpose of studies, and platforms utilized from each schema. Finally, this paper summarizes classification methods, and platforms that are most commonly used for each disaster event type, and outlines a research agenda with recommendations for future innovations.

查看原文本刊更多论文

情景感知提取:自然灾害期间社会媒体数据分类的综合审查

社交媒体(如Twitter和Facebook)可以被视为灾害期间提高态势感知(SA)和灾害管理的重要信息来源，因为它们在灾害发生时信息的快速传播中发挥了重要作用。由于数据量远远超出了人工检查的能力，现有的作品采用基于关键词的自然语言处理方法，或者基于文本和其他元数据(如用户配置文件)衍生的特征的分类模型，提取有助于SA的社交媒体数据，并将其自动分类到相关的类中(如损坏和捐赠)。然而，分类模式的设计和相关的信息提取方法远不是那么简单，并且高度依赖于:(1)事件类型，(2)研究或分析目的，(3)使用的社交媒体平台。为此，本文回顾了提取社交媒体数据的文献，并从五个不同的分析角度(内容、时间、用户、情感和时空)概述了用于评估自然灾害事件中SA的分类模式，讨论了流行的主题类别、灾害事件类型、研究目的和每个模式使用的平台。最后，本文总结了每种灾害事件类型最常用的分类方法和平台，并概述了研究议程和未来创新的建议。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Annals of GIS Multiple-

CiteScore

8.30

自引率

2.00%

发文量