Comparison of K-Means and Two-Step Cluster Analysis Methods for Clustering COVID-19 Data

Authors

  • Sawitree Pansayta Department of Information Science, Faculty of Humanities and Social Sciences, Khon Kaen University, Khon Kaen, Thailand
  • Wirapong Chansanam Department of Information Science, Faculty of Humanities and Social Sciences, Khon Kaen University, Khon Kaen, Thailand

DOI:

https://doi.org/10.15379/ijmst.v10i2.1203

Keywords:

K-Means, Two-step cluster analysis, clustering, COVID-19, Thailand

Abstract

This study compares the K-Means and two-step cluster analysis methods for clustering COVID-19 data. The dataset had 1,893,941 cumulative cases from January 2020 to October 2021. K-means clustering resulted in eight clusters, while two-step cluster analysis clustering resulted in three grouped cases by nationality, occupation, patient type, and risk group. These clusters were categorized based on age, gender, nationality, occupation, and region of infection. Group 1 had 5,883 workers infected in community settings, Group 2 had 7,420 foreign migrant workers infected in industrial settings or through direct contact with patients, and Group 3 had 6,870 cases of indirect transmission. The study recommends targeted interventions and continued monitoring and evaluation based on the clusters. The findings can help improve government policies, medical facilities, and treatment

Downloads

Download data is not yet available.

Downloads

Published

2023-06-21

How to Cite

[1]
S. . Pansayta and W. . Chansanam, “Comparison of K-Means and Two-Step Cluster Analysis Methods for Clustering COVID-19 Data”, ijmst, vol. 10, no. 2, pp. 341-348, Jun. 2023.