Forecasting the Level and Types of North Korea’s Provocations with Text Mining

Daehan san'eob gonghag hoeji Pub Date : 2023-10-31 DOI:10.7232/jkiie.2023.49.5.441

Sunkyo Cha, Bongkyoo Yoon

{"title":"Forecasting the Level and Types of North Korea’s Provocations with Text Mining","authors":"Sunkyo Cha, Bongkyoo Yoon","doi":"10.7232/jkiie.2023.49.5.441","DOIUrl":null,"url":null,"abstract":"Research into the feasibility of predicting specific events using Text Mining techniques has been actively pursued in conjunction with the advancement of Machine Learning. Consequently, the potential for predicting North Korea’s provocations utilizing Text Mining methods has emerged. However, the field lags behind other domains due to challenges in acquiring high-quality training data and the complexity associated with event classification. This study addresses these limitations by leveraging a Pre-trained BERT model to establish a comprehensive classification framework for North Korea’s provocative behavior, moving beyond binary classifications (provocation or peace) used in previous research. Original data from the Korean Central News Agency (KCNA) and domestic media sources were gathered and analyzed as training data. Notably, the findings demonstrated that employing original data from the KCNA increased prediction accuracy compared to utilizing data from domestic media. This study offers a way to enhance the informational value of North Korea’s provocations through scientific predictions, ultimately bolstering the reliability of qualitative expert judgments.","PeriodicalId":488346,"journal":{"name":"Daehan san'eob gonghag hoeji","volume":"98 3","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Daehan san'eob gonghag hoeji","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.7232/jkiie.2023.49.5.441","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Research into the feasibility of predicting specific events using Text Mining techniques has been actively pursued in conjunction with the advancement of Machine Learning. Consequently, the potential for predicting North Korea’s provocations utilizing Text Mining methods has emerged. However, the field lags behind other domains due to challenges in acquiring high-quality training data and the complexity associated with event classification. This study addresses these limitations by leveraging a Pre-trained BERT model to establish a comprehensive classification framework for North Korea’s provocative behavior, moving beyond binary classifications (provocation or peace) used in previous research. Original data from the Korean Central News Agency (KCNA) and domestic media sources were gathered and analyzed as training data. Notably, the findings demonstrated that employing original data from the KCNA increased prediction accuracy compared to utilizing data from domestic media. This study offers a way to enhance the informational value of North Korea’s provocations through scientific predictions, ultimately bolstering the reliability of qualitative expert judgments.

查看原文本刊更多论文

用文本挖掘预测朝鲜挑衅的程度和类型

利用文本挖掘技术预测特定事件的可行性研究一直在积极地与机器学习的进步相结合。因此，利用文本挖掘方法预测朝鲜挑衅的潜力已经出现。然而，由于难以获得高质量的训练数据和事件分类的复杂性，该领域落后于其他领域。本研究通过利用预训练的BERT模型来建立朝鲜挑衅行为的综合分类框架，超越了以前研究中使用的二元分类(挑衅或和平)，从而解决了这些局限性。从朝鲜中央通讯社(KCNA)和国内媒体来源收集和分析原始数据作为训练数据。值得注意的是，研究结果表明，与利用国内媒体的数据相比，使用朝中社的原始数据提高了预测准确性。本研究提供了一种通过科学预测提高朝鲜挑衅信息价值的方法，最终提高定性专家判断的可靠性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Daehan san'eob gonghag hoeji

自引率

0.00%

发文量