{"title":"基于K-means方法的印度尼西亚Covid-19聚类分析","authors":"Claudia Larasvaty, S. Khomsah, R. Sa","doi":"10.20895/dinda.v3i1.822","DOIUrl":null,"url":null,"abstract":"These days technology are rapidly increasing and developing in various fields, especially data storage. The information that has been stored in a database usually called a dataset. Covid-19 is a new type of respiratory disease that attacks the respiratory system with rapid transmission, followed by the increasing number of Covid-19 cases that continues to increase every day in all provinces in Indonesia. This study aims to cluster the spread of Covid-19 in every province in Indonesia by using the data that obtained from the website named kaggle with many data variables. The method used in this research is K-Means. From many variables in the data, for this study only 3 variables were taken, which are: Number of Recovery, Number of Deaths, and Number of total Cases in Covid-19 in Indonesia. These 3 variables then will be applied using the K-Means method and formed 3 provincial groups. By using the clustering method and the K-means algorithm, this research can be carried out to find the characteristics of the distribution in each province in Indonesia by looking at the best clusters.","PeriodicalId":419119,"journal":{"name":"Journal of Dinda : Data Science, Information Technology, and Data Analytics","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Cluster Analysis of Covid-19 in Indonesia Using K-means Method\",\"authors\":\"Claudia Larasvaty, S. Khomsah, R. Sa\",\"doi\":\"10.20895/dinda.v3i1.822\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"These days technology are rapidly increasing and developing in various fields, especially data storage. The information that has been stored in a database usually called a dataset. Covid-19 is a new type of respiratory disease that attacks the respiratory system with rapid transmission, followed by the increasing number of Covid-19 cases that continues to increase every day in all provinces in Indonesia. This study aims to cluster the spread of Covid-19 in every province in Indonesia by using the data that obtained from the website named kaggle with many data variables. The method used in this research is K-Means. From many variables in the data, for this study only 3 variables were taken, which are: Number of Recovery, Number of Deaths, and Number of total Cases in Covid-19 in Indonesia. These 3 variables then will be applied using the K-Means method and formed 3 provincial groups. By using the clustering method and the K-means algorithm, this research can be carried out to find the characteristics of the distribution in each province in Indonesia by looking at the best clusters.\",\"PeriodicalId\":419119,\"journal\":{\"name\":\"Journal of Dinda : Data Science, Information Technology, and Data Analytics\",\"volume\":\"24 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-02-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Dinda : Data Science, Information Technology, and Data Analytics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.20895/dinda.v3i1.822\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Dinda : Data Science, Information Technology, and Data Analytics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.20895/dinda.v3i1.822","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Cluster Analysis of Covid-19 in Indonesia Using K-means Method
These days technology are rapidly increasing and developing in various fields, especially data storage. The information that has been stored in a database usually called a dataset. Covid-19 is a new type of respiratory disease that attacks the respiratory system with rapid transmission, followed by the increasing number of Covid-19 cases that continues to increase every day in all provinces in Indonesia. This study aims to cluster the spread of Covid-19 in every province in Indonesia by using the data that obtained from the website named kaggle with many data variables. The method used in this research is K-Means. From many variables in the data, for this study only 3 variables were taken, which are: Number of Recovery, Number of Deaths, and Number of total Cases in Covid-19 in Indonesia. These 3 variables then will be applied using the K-Means method and formed 3 provincial groups. By using the clustering method and the K-means algorithm, this research can be carried out to find the characteristics of the distribution in each province in Indonesia by looking at the best clusters.