Ontology-Based Traffic Accident Information Extraction on Twitter In Indonesia

Nur Aini Rakhmawati, Yasin Awwab, Ahmad Choirun Najib, A. Irsyad
{"title":"Ontology-Based Traffic Accident Information Extraction on Twitter In Indonesia","authors":"Nur Aini Rakhmawati, Yasin Awwab, Ahmad Choirun Najib, A. Irsyad","doi":"10.4114/intartif.vol25iss70pp1-12","DOIUrl":null,"url":null,"abstract":"Traffic accidents become one of the events that often occur in Indonesia. From the three-monthly report by the Indonesian National Police Traffic Police, there are about 25,000 traffic accidents. Many social media users, especially Twitter, share information about traffic accidents. Twitter has various information regarding traffic accidents. Therefore, this study aims to process and map information about traffic accidents contained on Twitter in Indonesia language.  We use the domain ontology and Named-Entity Recognition for the data extraction process. Named-Entity Recognition is used for obtaining keywords from a tweet based on class categories such as actor, time, location, and information on the cause of the accident. This research generates a Named Entity Recognition (NER) model that can provide a reasonably accurate level of accuracy. Also, we create an ontology that can categorize the causes of traffic accidents based on the Directorate General of the Land Transportation Office, Indonesia. We found that the traffic accidents are generally caused by inadequate vehicle conditions with the main problem in the vehicle caused by brake failure, while environmental factors rarely cause traffic accidents. Moreover, the vehicle is the subclass that mostly appears in the tweets, where car is the most popular actor, followed by truck and motorcycle.","PeriodicalId":176050,"journal":{"name":"Inteligencia Artif.","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Inteligencia Artif.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4114/intartif.vol25iss70pp1-12","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Traffic accidents become one of the events that often occur in Indonesia. From the three-monthly report by the Indonesian National Police Traffic Police, there are about 25,000 traffic accidents. Many social media users, especially Twitter, share information about traffic accidents. Twitter has various information regarding traffic accidents. Therefore, this study aims to process and map information about traffic accidents contained on Twitter in Indonesia language.  We use the domain ontology and Named-Entity Recognition for the data extraction process. Named-Entity Recognition is used for obtaining keywords from a tweet based on class categories such as actor, time, location, and information on the cause of the accident. This research generates a Named Entity Recognition (NER) model that can provide a reasonably accurate level of accuracy. Also, we create an ontology that can categorize the causes of traffic accidents based on the Directorate General of the Land Transportation Office, Indonesia. We found that the traffic accidents are generally caused by inadequate vehicle conditions with the main problem in the vehicle caused by brake failure, while environmental factors rarely cause traffic accidents. Moreover, the vehicle is the subclass that mostly appears in the tweets, where car is the most popular actor, followed by truck and motorcycle.
基于本体的印尼Twitter交通事故信息提取
交通事故成为印尼经常发生的事件之一。从印尼国家警察交通警察的三个月报告来看,大约有2.5万起交通事故。许多社交媒体用户,尤其是推特,分享有关交通事故的信息。Twitter上有各种关于交通事故的信息。因此,本研究旨在处理和绘制印尼语Twitter上包含的交通事故信息。我们使用领域本体和命名实体识别来进行数据提取。命名实体识别用于根据角色、时间、地点和事故原因信息等类别从tweet中获取关键字。本研究生成了一个命名实体识别(NER)模型,该模型可以提供相当精确的精度水平。此外,我们创建了一个本体,可以根据印度尼西亚陆地运输办公室的总局对交通事故的原因进行分类。我们发现交通事故一般是由车辆条件不充分引起的,车辆的主要问题是刹车失灵,而环境因素很少引起交通事故。此外,车辆是最常出现在tweet中的子类,其中汽车是最受欢迎的演员,其次是卡车和摩托车。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信