Information Extraction from Unstructured Data on Microplastics through Text Mining

Wuseong Jeong, JungJin Kim, Hanseok Jeong
{"title":"Information Extraction from Unstructured Data on Microplastics through Text Mining","authors":"Wuseong Jeong, JungJin Kim, Hanseok Jeong","doi":"10.4491/ksee.2023.45.1.34","DOIUrl":null,"url":null,"abstract":"Objectives:In this study, we seek to provide a thorough insight into how people perceive microplastics and uncover issues and hidden trends about the significant microplastic pollution problems by analyzing unstructured data on microplastics.Methods:Environmental news articles related to microplastics were collected. Text mining techniques including data pre-processing, word cloud, TF-IDF weight-based trend analysis, and LDA topic modeling were used to analyze the amount of textual data.Results and Discussion:The public's interest in microplastics is consistently growing, according to an analysis of all environmental news and the keyword ‘microplastic’ from 2014 to 2021 conducted via BIGKinds. The keyword 'trash' was the overwhelmingly enormous weight among words. The top 5 keywords connected to microplastics did not fade away and continued appearing even though the socially noticeable keywords during the study period varied yearly. This indicates that the primary issue with microplastics related to keywords has not yet been solved. Our study has a limitation of subject diversity because we only focused on microplastic news. The results, however, presented all processes from plastic pollution emergence to treatment, such as microplastic pollution sources, microplastic detection, and prevention methods against microplastics.Conclusion:Text mining analysis was performed on microplastics in environmental news and provided issues and trends on microplastic pollution. This study presents a new methodology for environmental and social problem analysis, suggesting that it could enable a multidimensional understanding of environmental problems and help establish environmental policies.","PeriodicalId":52756,"journal":{"name":"daehanhwangyeonggonghaghoeji","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"daehanhwangyeonggonghaghoeji","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4491/ksee.2023.45.1.34","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Objectives:In this study, we seek to provide a thorough insight into how people perceive microplastics and uncover issues and hidden trends about the significant microplastic pollution problems by analyzing unstructured data on microplastics.Methods:Environmental news articles related to microplastics were collected. Text mining techniques including data pre-processing, word cloud, TF-IDF weight-based trend analysis, and LDA topic modeling were used to analyze the amount of textual data.Results and Discussion:The public's interest in microplastics is consistently growing, according to an analysis of all environmental news and the keyword ‘microplastic’ from 2014 to 2021 conducted via BIGKinds. The keyword 'trash' was the overwhelmingly enormous weight among words. The top 5 keywords connected to microplastics did not fade away and continued appearing even though the socially noticeable keywords during the study period varied yearly. This indicates that the primary issue with microplastics related to keywords has not yet been solved. Our study has a limitation of subject diversity because we only focused on microplastic news. The results, however, presented all processes from plastic pollution emergence to treatment, such as microplastic pollution sources, microplastic detection, and prevention methods against microplastics.Conclusion:Text mining analysis was performed on microplastics in environmental news and provided issues and trends on microplastic pollution. This study presents a new methodology for environmental and social problem analysis, suggesting that it could enable a multidimensional understanding of environmental problems and help establish environmental policies.
基于文本挖掘的微塑料非结构化数据信息提取
目的:在这项研究中,我们试图通过分析微塑料的非结构化数据,深入了解人们对微塑料的看法,并揭示重大微塑料污染问题的问题和隐藏趋势。方法:收集与微塑料相关的环境新闻报道。文本挖掘技术包括数据预处理、单词云、基于TF-IDF权重的趋势分析和LDA主题建模,用于分析文本数据量。结果和讨论:根据BIGKinds对2014年至2021年所有环境新闻和关键词“微塑料”的分析,公众对微塑料的兴趣持续增长。关键词“trash”在单词中占据了压倒性的巨大权重。与微塑料相关的前5个关键词并没有消失,而是继续出现,尽管在研究期间,社会关注的关键词每年都在变化。这表明与关键词相关的微塑料的主要问题尚未解决。我们的研究存在主题多样性的局限性,因为我们只关注微塑料新闻。然而,研究结果展示了从塑料污染出现到处理的所有过程,如微塑料污染源、微塑料检测和预防微塑料的方法。结论:对环境新闻中的微塑料进行了文本挖掘分析,提供了微塑料污染的问题和趋势。这项研究提出了一种新的环境和社会问题分析方法,表明它可以实现对环境问题的多维理解,并有助于制定环境政策。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
38
审稿时长
8 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信