基于k均值数据挖掘方法的社交媒体大数据分类分析COVID-19信息

Lukas Umbu Zogara, A. Sururi, Leny Tritanto Ningrum
{"title":"基于k均值数据挖掘方法的社交媒体大数据分类分析COVID-19信息","authors":"Lukas Umbu Zogara, A. Sururi, Leny Tritanto Ningrum","doi":"10.38101/sisfotek.v12i1.448","DOIUrl":null,"url":null,"abstract":"This Covid-19 began to infect almost all countries in early 2020, including in Indonesia, Covid-19 spread widely throughout the world and was declared as a global pandemic by the World Health Organization (WHO). In the current era of Big Data, large amounts of data have been generated and collected from a variety of rich data sources. Big Data is useful information and valuable knowledge. In this study, the method that will be used for data analysis is the K-Means algorithm with orange tools as a tool to display the results of data classification. One of the information that can be generated is Sentiment Analysis. The purpose of this research was to determining the information such as public sentiment on social media towards government policies in handling COVID-19. In this research, 2000 tweets were used. The keyword used related to government policies are sourced from several online media. The tools used to analyze this twitter data is using Orange Application. The selected keywords are covid19, lockdown, PSBB, and isolation. This keyword is used as a reference to retrieve tweet data from twitter. From each of these keywords, a sentiment classification process will be carried out automatically so that data or tweets are obtained and grouped into positive, negative and neutral sentiment classes. From the result of research conducted, public sentiment on social media towards government policies in handling this virus outbreak tends to be positive.","PeriodicalId":378682,"journal":{"name":"JURNAL SISFOTEK GLOBAL","volume":"265 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Analysis of COVID-19 Information Based on Social Media Big Data Classification Using the K-Means Data Mining Method\",\"authors\":\"Lukas Umbu Zogara, A. Sururi, Leny Tritanto Ningrum\",\"doi\":\"10.38101/sisfotek.v12i1.448\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This Covid-19 began to infect almost all countries in early 2020, including in Indonesia, Covid-19 spread widely throughout the world and was declared as a global pandemic by the World Health Organization (WHO). In the current era of Big Data, large amounts of data have been generated and collected from a variety of rich data sources. Big Data is useful information and valuable knowledge. In this study, the method that will be used for data analysis is the K-Means algorithm with orange tools as a tool to display the results of data classification. One of the information that can be generated is Sentiment Analysis. The purpose of this research was to determining the information such as public sentiment on social media towards government policies in handling COVID-19. In this research, 2000 tweets were used. The keyword used related to government policies are sourced from several online media. The tools used to analyze this twitter data is using Orange Application. The selected keywords are covid19, lockdown, PSBB, and isolation. This keyword is used as a reference to retrieve tweet data from twitter. From each of these keywords, a sentiment classification process will be carried out automatically so that data or tweets are obtained and grouped into positive, negative and neutral sentiment classes. From the result of research conducted, public sentiment on social media towards government policies in handling this virus outbreak tends to be positive.\",\"PeriodicalId\":378682,\"journal\":{\"name\":\"JURNAL SISFOTEK GLOBAL\",\"volume\":\"265 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-03-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"JURNAL SISFOTEK GLOBAL\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.38101/sisfotek.v12i1.448\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"JURNAL SISFOTEK GLOBAL","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.38101/sisfotek.v12i1.448","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

2020年初,包括印度尼西亚在内的几乎所有国家都开始感染新冠肺炎,新冠肺炎在世界各地广泛传播,并被世界卫生组织(世卫组织)宣布为全球大流行。在当前的大数据时代,从各种丰富的数据源中产生和收集了大量的数据。大数据是有用的信息和有价值的知识。在本研究中,将用于数据分析的方法是K-Means算法,以橙色工具作为显示数据分类结果的工具。可以生成的信息之一是情感分析。此次调查的目的是为了确定在社交媒体上对政府应对新冠肺炎政策的舆论等信息。在这项研究中,使用了2000条推文。与政府政策相关的关键词来自多家网络媒体。用于分析这些twitter数据的工具是使用Orange Application。选择的关键词是covid - 19、锁定、PSBB和隔离。此关键字用作从twitter检索tweet数据的引用。从每个关键词中,自动进行情绪分类过程,从而获得数据或tweet,并将其分为积极,消极和中性情绪类。从调查结果来看,在社交媒体上,公众对政府应对新冠疫情的政策的看法趋于积极。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Analysis of COVID-19 Information Based on Social Media Big Data Classification Using the K-Means Data Mining Method
This Covid-19 began to infect almost all countries in early 2020, including in Indonesia, Covid-19 spread widely throughout the world and was declared as a global pandemic by the World Health Organization (WHO). In the current era of Big Data, large amounts of data have been generated and collected from a variety of rich data sources. Big Data is useful information and valuable knowledge. In this study, the method that will be used for data analysis is the K-Means algorithm with orange tools as a tool to display the results of data classification. One of the information that can be generated is Sentiment Analysis. The purpose of this research was to determining the information such as public sentiment on social media towards government policies in handling COVID-19. In this research, 2000 tweets were used. The keyword used related to government policies are sourced from several online media. The tools used to analyze this twitter data is using Orange Application. The selected keywords are covid19, lockdown, PSBB, and isolation. This keyword is used as a reference to retrieve tweet data from twitter. From each of these keywords, a sentiment classification process will be carried out automatically so that data or tweets are obtained and grouped into positive, negative and neutral sentiment classes. From the result of research conducted, public sentiment on social media towards government policies in handling this virus outbreak tends to be positive.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信