{"title":"基于文本挖掘的微塑料非结构化数据信息提取","authors":"Wuseong Jeong, JungJin Kim, Hanseok Jeong","doi":"10.4491/ksee.2023.45.1.34","DOIUrl":null,"url":null,"abstract":"Objectives:In this study, we seek to provide a thorough insight into how people perceive microplastics and uncover issues and hidden trends about the significant microplastic pollution problems by analyzing unstructured data on microplastics.Methods:Environmental news articles related to microplastics were collected. Text mining techniques including data pre-processing, word cloud, TF-IDF weight-based trend analysis, and LDA topic modeling were used to analyze the amount of textual data.Results and Discussion:The public's interest in microplastics is consistently growing, according to an analysis of all environmental news and the keyword ‘microplastic’ from 2014 to 2021 conducted via BIGKinds. The keyword 'trash' was the overwhelmingly enormous weight among words. The top 5 keywords connected to microplastics did not fade away and continued appearing even though the socially noticeable keywords during the study period varied yearly. This indicates that the primary issue with microplastics related to keywords has not yet been solved. Our study has a limitation of subject diversity because we only focused on microplastic news. The results, however, presented all processes from plastic pollution emergence to treatment, such as microplastic pollution sources, microplastic detection, and prevention methods against microplastics.Conclusion:Text mining analysis was performed on microplastics in environmental news and provided issues and trends on microplastic pollution. This study presents a new methodology for environmental and social problem analysis, suggesting that it could enable a multidimensional understanding of environmental problems and help establish environmental policies.","PeriodicalId":52756,"journal":{"name":"daehanhwangyeonggonghaghoeji","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Information Extraction from Unstructured Data on Microplastics through Text Mining\",\"authors\":\"Wuseong Jeong, JungJin Kim, Hanseok Jeong\",\"doi\":\"10.4491/ksee.2023.45.1.34\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Objectives:In this study, we seek to provide a thorough insight into how people perceive microplastics and uncover issues and hidden trends about the significant microplastic pollution problems by analyzing unstructured data on microplastics.Methods:Environmental news articles related to microplastics were collected. Text mining techniques including data pre-processing, word cloud, TF-IDF weight-based trend analysis, and LDA topic modeling were used to analyze the amount of textual data.Results and Discussion:The public's interest in microplastics is consistently growing, according to an analysis of all environmental news and the keyword ‘microplastic’ from 2014 to 2021 conducted via BIGKinds. The keyword 'trash' was the overwhelmingly enormous weight among words. The top 5 keywords connected to microplastics did not fade away and continued appearing even though the socially noticeable keywords during the study period varied yearly. This indicates that the primary issue with microplastics related to keywords has not yet been solved. Our study has a limitation of subject diversity because we only focused on microplastic news. The results, however, presented all processes from plastic pollution emergence to treatment, such as microplastic pollution sources, microplastic detection, and prevention methods against microplastics.Conclusion:Text mining analysis was performed on microplastics in environmental news and provided issues and trends on microplastic pollution. This study presents a new methodology for environmental and social problem analysis, suggesting that it could enable a multidimensional understanding of environmental problems and help establish environmental policies.\",\"PeriodicalId\":52756,\"journal\":{\"name\":\"daehanhwangyeonggonghaghoeji\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-01-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"daehanhwangyeonggonghaghoeji\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.4491/ksee.2023.45.1.34\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"daehanhwangyeonggonghaghoeji","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4491/ksee.2023.45.1.34","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Information Extraction from Unstructured Data on Microplastics through Text Mining
Objectives:In this study, we seek to provide a thorough insight into how people perceive microplastics and uncover issues and hidden trends about the significant microplastic pollution problems by analyzing unstructured data on microplastics.Methods:Environmental news articles related to microplastics were collected. Text mining techniques including data pre-processing, word cloud, TF-IDF weight-based trend analysis, and LDA topic modeling were used to analyze the amount of textual data.Results and Discussion:The public's interest in microplastics is consistently growing, according to an analysis of all environmental news and the keyword ‘microplastic’ from 2014 to 2021 conducted via BIGKinds. The keyword 'trash' was the overwhelmingly enormous weight among words. The top 5 keywords connected to microplastics did not fade away and continued appearing even though the socially noticeable keywords during the study period varied yearly. This indicates that the primary issue with microplastics related to keywords has not yet been solved. Our study has a limitation of subject diversity because we only focused on microplastic news. The results, however, presented all processes from plastic pollution emergence to treatment, such as microplastic pollution sources, microplastic detection, and prevention methods against microplastics.Conclusion:Text mining analysis was performed on microplastics in environmental news and provided issues and trends on microplastic pollution. This study presents a new methodology for environmental and social problem analysis, suggesting that it could enable a multidimensional understanding of environmental problems and help establish environmental policies.