Comparison of K-Means and Two-Step Cluster Analysis Methods for Clustering COVID-19 Data

Sawitree  Pansayta; Wirapong  Chansanam

doi:10.15379/ijmst.v10i2.1203

Authors

Sawitree Pansayta Department of Information Science, Faculty of Humanities and Social Sciences, Khon Kaen University, Khon Kaen, Thailand
Wirapong Chansanam Department of Information Science, Faculty of Humanities and Social Sciences, Khon Kaen University, Khon Kaen, Thailand

DOI:

https://doi.org/10.15379/ijmst.v10i2.1203

Keywords:

K-Means, Two-step cluster analysis, clustering, COVID-19, Thailand

Abstract

This study compares the K-Means and two-step cluster analysis methods for clustering COVID-19 data. The dataset had 1,893,941 cumulative cases from January 2020 to October 2021. K-means clustering resulted in eight clusters, while two-step cluster analysis clustering resulted in three grouped cases by nationality, occupation, patient type, and risk group. These clusters were categorized based on age, gender, nationality, occupation, and region of infection. Group 1 had 5,883 workers infected in community settings, Group 2 had 7,420 foreign migrant workers infected in industrial settings or through direct contact with patients, and Group 3 had 6,870 cases of indirect transmission. The study recommends targeted interventions and continued monitoring and evaluation based on the clusters. The findings can help improve government policies, medical facilities, and treatment

Downloads

Download data is not yet available.

Comparison of K-Means and Two-Step Cluster Analysis Methods for Clustering COVID-19 Data

Authors

DOI:

Keywords:

Abstract

Downloads

Downloads

Published

How to Cite

Issue

Section

License