Classification of Potential Tsunami Disaster Due to Earthquakes in Indonesia Based on Machine Learning

Eri Mardiani, Nur Rahmansyah, Sari Ningsih, Dhieka Avrilia Lantana, Nabila Puspita Wulandana, Azzaleya Agashi Lombu, Sisca Budyarti
{"title":"Classification of Potential Tsunami Disaster Due to Earthquakes in Indonesia Based on Machine Learning","authors":"Eri Mardiani, Nur Rahmansyah, Sari Ningsih, Dhieka Avrilia Lantana, Nabila Puspita Wulandana, Azzaleya Agashi Lombu, Sisca Budyarti","doi":"10.35870/ijsecs.v4i1.2084","DOIUrl":null,"url":null,"abstract":"Earthquakes and tsunamis pose significant threats to Indonesia due to its unique geological positioning at the convergence of four tectonic plates. This study focuses on classifying the potential occurrence of tsunami disasters following earthquakes using various data mining methods, including k-Nearest Neighbor (kNN), Naïve Bayes, Decision Tree and Ensemble Method, and Linear Regression. The research employs a qualitative approach to systematically understand and describe the context of natural disasters, utilizing both primary and secondary data collection techniques. Performance evaluation metrics such as Area Under the Curve (AUC), Classification Accuracy (CA), F1 Score, Precision, and Recall are utilized to assess the effectiveness of each method in predicting potential tsunami events. The findings reveal that the kNN method exhibits the highest performance, with an AUC of 94.4% and a precision of 82.8%, indicating robust predictive capabilities. However, misclassifications were observed, emphasizing the need for further refinement. Naïve Bayes also shows promising results with an AUC of 84.5% and precision of 78.6%. Decision Tree and Ensemble Method models, such as Random Forest and AdaBoost, demonstrate reasonable performance, with Random Forest achieving the highest AUC of 71.9%. Linear Regression is employed to explore the correlation between earthquake attributes and tsunami occurrence, revealing a weak relationship. Further research integrating advanced modeling approaches and additional earthquake attributes is recommended to enhance the predictive capabilities of tsunami risk assessment models. The study underscores the importance of employing diverse machine learning techniques and evaluating their performance metrics to refine the accuracy of tsunami prediction models, ultimately contributing to practical disaster preparedness and mitigation strategies.","PeriodicalId":189392,"journal":{"name":"International Journal Software Engineering and Computer Science (IJSECS)","volume":"137 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal Software Engineering and Computer Science (IJSECS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.35870/ijsecs.v4i1.2084","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Earthquakes and tsunamis pose significant threats to Indonesia due to its unique geological positioning at the convergence of four tectonic plates. This study focuses on classifying the potential occurrence of tsunami disasters following earthquakes using various data mining methods, including k-Nearest Neighbor (kNN), Naïve Bayes, Decision Tree and Ensemble Method, and Linear Regression. The research employs a qualitative approach to systematically understand and describe the context of natural disasters, utilizing both primary and secondary data collection techniques. Performance evaluation metrics such as Area Under the Curve (AUC), Classification Accuracy (CA), F1 Score, Precision, and Recall are utilized to assess the effectiveness of each method in predicting potential tsunami events. The findings reveal that the kNN method exhibits the highest performance, with an AUC of 94.4% and a precision of 82.8%, indicating robust predictive capabilities. However, misclassifications were observed, emphasizing the need for further refinement. Naïve Bayes also shows promising results with an AUC of 84.5% and precision of 78.6%. Decision Tree and Ensemble Method models, such as Random Forest and AdaBoost, demonstrate reasonable performance, with Random Forest achieving the highest AUC of 71.9%. Linear Regression is employed to explore the correlation between earthquake attributes and tsunami occurrence, revealing a weak relationship. Further research integrating advanced modeling approaches and additional earthquake attributes is recommended to enhance the predictive capabilities of tsunami risk assessment models. The study underscores the importance of employing diverse machine learning techniques and evaluating their performance metrics to refine the accuracy of tsunami prediction models, ultimately contributing to practical disaster preparedness and mitigation strategies.
基于机器学习的印度尼西亚地震潜在海啸灾害分类
由于印度尼西亚地处四大板块交汇处的独特地质位置,地震和海啸对其构成了重大威胁。本研究的重点是利用各种数据挖掘方法,包括 k-近邻法(kNN)、奈夫贝叶斯法(Naïve Bayes)、决策树和集合法以及线性回归法,对地震后可能发生的海啸灾害进行分类。研究采用定性方法,利用原始数据和二手数据收集技术,系统地了解和描述自然灾害的背景。曲线下面积(AUC)、分类准确率(CA)、F1 分数、精确度和召回率等性能评估指标被用来评估每种方法在预测潜在海啸事件中的有效性。研究结果表明,kNN 方法的性能最高,AUC 为 94.4%,精确度为 82.8%,显示出强大的预测能力。不过,也发现了分类错误,这说明需要进一步改进。Naïve Bayes 也取得了不错的结果,AUC 为 84.5%,精确度为 78.6%。决策树和集合方法模型,如随机森林和 AdaBoost,表现出合理的性能,其中随机森林的 AUC 最高,达到 71.9%。线性回归用于探索地震属性与海啸发生之间的相关性,结果显示两者之间的关系较弱。建议进一步研究先进的建模方法和更多的地震属性,以提高海啸风险评估模型的预测能力。这项研究强调了采用多种机器学习技术并评估其性能指标以提高海啸预测模型准确性的重要性,最终有助于制定切实可行的备灾和减灾战略。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信