Six classes named entity recognition for mapping location of Indonesia natural disasters from twitter data

IF 2.2 Q3 COMPUTER SCIENCE, CYBERNETICS
A. S. Girsang, Bima Krisna Noveta
{"title":"Six classes named entity recognition for mapping location of Indonesia natural disasters from twitter data","authors":"A. S. Girsang, Bima Krisna Noveta","doi":"10.1108/ijicc-09-2023-0251","DOIUrl":null,"url":null,"abstract":"PurposeThe purpose of this study is to provide the location of natural disasters that are poured into maps by extracting Twitter data. The Twitter text is extracted by using named entity recognition (NER) with six classes hierarchy location in Indonesia. Moreover, the tweet then is classified into eight classes of natural disasters using the support vector machine (SVM). Overall, the system is able to classify tweet and mapping the position of the content tweet.Design/methodology/approachThis research builds a model to map the geolocation of tweet data using NER. This research uses six classes of NER which is based on region Indonesia. This data is then classified into eight classes of natural disasters using the SVM.FindingsExperiment results demonstrate that the proposed NER with six special classes based on the regional level in Indonesia is able to map the location of the disaster based on data Twitter. The results also show good performance in geocoding such as match rate, match score and match type. Moreover, with SVM, this study can also classify tweet into eight classes of types of natural disasters specifically for the Indonesian region, which originate from the tweets collected.Research limitations/implicationsThis study implements in Indonesia region.Originality/value(a)NER with six classes is used to create a location classification model with StanfordNER and ArcGIS tools. The use of six location classes is based on the Indonesia regional which has the large area. Hence, it has many levels in its regional location, such as province, district/city, sub-district, village, road and place names. (b) SVM is used to classify natural disasters. Classification of types of natural disasters is divided into eight: floods, earthquakes, landslides, tsunamis, hurricanes, forest fires, droughts and volcanic eruptions.","PeriodicalId":45291,"journal":{"name":"International Journal of Intelligent Computing and Cybernetics","volume":"122 11","pages":""},"PeriodicalIF":2.2000,"publicationDate":"2024-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Intelligent Computing and Cybernetics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1108/ijicc-09-2023-0251","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, CYBERNETICS","Score":null,"Total":0}
引用次数: 0

Abstract

PurposeThe purpose of this study is to provide the location of natural disasters that are poured into maps by extracting Twitter data. The Twitter text is extracted by using named entity recognition (NER) with six classes hierarchy location in Indonesia. Moreover, the tweet then is classified into eight classes of natural disasters using the support vector machine (SVM). Overall, the system is able to classify tweet and mapping the position of the content tweet.Design/methodology/approachThis research builds a model to map the geolocation of tweet data using NER. This research uses six classes of NER which is based on region Indonesia. This data is then classified into eight classes of natural disasters using the SVM.FindingsExperiment results demonstrate that the proposed NER with six special classes based on the regional level in Indonesia is able to map the location of the disaster based on data Twitter. The results also show good performance in geocoding such as match rate, match score and match type. Moreover, with SVM, this study can also classify tweet into eight classes of types of natural disasters specifically for the Indonesian region, which originate from the tweets collected.Research limitations/implicationsThis study implements in Indonesia region.Originality/value(a)NER with six classes is used to create a location classification model with StanfordNER and ArcGIS tools. The use of six location classes is based on the Indonesia regional which has the large area. Hence, it has many levels in its regional location, such as province, district/city, sub-district, village, road and place names. (b) SVM is used to classify natural disasters. Classification of types of natural disasters is divided into eight: floods, earthquakes, landslides, tsunamis, hurricanes, forest fires, droughts and volcanic eruptions.
利用 twitter 数据绘制印尼自然灾害位置图的六类命名实体识别技术
目的本研究的目的是通过提取 Twitter 数据,将自然灾害的位置信息绘制到地图中。Twitter 文本是通过命名实体识别(NER)提取的,其中包含印度尼西亚的六个等级。然后,使用支持向量机(SVM)将推特分为八类自然灾害。总之,该系统能够对推文进行分类,并映射出内容推文的位置。设计/方法/途径本研究利用命名实体识别(NER)建立了一个映射推文数据地理位置的模型。本研究使用基于印度尼西亚地区的六类 NER。实验结果实验结果表明,所提出的基于印度尼西亚地区级别的六个特殊类别的 NER 能够根据 Twitter 数据绘制灾害位置图。实验结果还显示了地理编码的良好性能,如匹配率、匹配分数和匹配类型。此外,通过 SVM,本研究还可以将推文分为八类,具体针对印度尼西亚地区的自然灾害类型,这些类型均源自所收集的推文。使用六个位置类别是基于印尼地区,因为该地区面积较大。因此,其区域位置有许多级别,如省、区/市、县、村、道路和地名。(b) 利用 SVM 对自然灾害进行分类。自然灾害类型的分类分为八种:洪水、地震、山体滑坡、海啸、飓风、森林火灾、干旱和火山爆发。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
6.80
自引率
4.70%
发文量
26
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信