K-Means Algorithm Implementation for Project Health Clustering

  • Ajeng Arifa Chantika Rindu Universitas Pembangunan Nasional Veteran Jakarta
  • Ria Astriratma Universitas Pembangunan Nasional Veteran Jakarta https://orcid.org/0000-0003-1493-9169
  • Ati Zaidiah Universitas Pembangunan Nasional Veteran Jakarta
Keywords: project, project health, clustering, K-Means

Abstract

Indonesia has several companies that are involved in the telecommunications sector. Various projects run in parallel to support the success of telecommunications companies. The potential of a project can increase company revenue and productivity. On the other hand, there are some risks that need to be considered for every project when it is about to start. Project data is recorded from start to finish so that the project's progress and improvements can be monitored and analyzed. As the project runs, the project team at one of Indonesia's telecommunication companies, which is responsible for the processes leading to project success, requires a project health category. Therefore, this study is conducted to develop a clustering project health process, which is included in a type of unsupervised learning that runs on unlabeled data. One of the clustering algorithms is K-Means, which groups data based on similar criteria. Researchers also use dimensionality reduction with the principal component analysis (PCA) method to determine its impact on the clustering process with the K-Means algorithm. From this study, the researcher obtained three groups or project health categories, consisting of groups 0, 1, and 2. The evaluation results with the Calinski-Harabasz index showed that the K-Means model in the PCA dimensionality reduction data performed better than the standard K-Means model with a Calinski-Harabasz index value of 55633,12776405707, which is higher than 25914,578262576793.

Downloads

Download data is not yet available.

References

M. Hopmere, L. Crawford, and M. S. Harré, “Proactively Monitoring Large Project Portfolios,” Proj. Manag. J., vol. 51, pp. 656–669, 2020.

A. A. Wael, A. Elyamany, and A. Elhakeem, “Classification of Evaluation Metrics for Project Baseline Schedules,” Int. J. Eng. Adv. Technol., vol. 10, no. 1, pp. 235–239, 2020, doi: 10.35940/ijeat.c5456.1010120.

T. Jo, Machine Learning Foundations: Supervised, Unsupervised, and Advanced Learning. 2021.

M. Mohri, A. Rostamizadeh, and A. Talwalkar, Foundations of Machine Learning. The MIT Press, 2018.

M. Garzon, Dimensionality Reduction in Data Science. 2022.

O. Nasraoui and C.-E. Ben N’Cir, Clustering Methods for Big Data Analytics. 2019.

F. Aksan et al., “Clustering methods for power quality measurements in virtual power plant,” Energies, vol. 14, no. 18, 2021, doi: 10.3390/en14185902.

M. Zubair, M. Asif Iqbal, A. Shil, E. Haque, M. Moshiul Hoque, and I. H. Sarker, “An Efficient K-Means Clustering Algorithm for Analysing COVID-19,” Adv. Intell. Syst. Comput., vol. 1375 AIST, pp. 422–432, 2021, doi: 10.1007/978-3-030-73050-5_43.

Fairleigh Dickinson University, “Purchasing Policies and Procedures,” no. October, 2019.

M. Ashika and V. Monisha, “A Material Management in Construction Project Using Inventory Management System,” Int. J. Mod. Trends Sci. Technol., vol. 6, pp. 32–40, 2020, doi: 10.46501/IJMTST060506.

One Finance, How to Receipt Purchase Orders. The London School of Economics and Political Science, 2019.

R. Jafari, Hands-On Data Preprocessing in Python. Packt Publishing, 2022.

E. Antony, N. S. Sreekanth, R. K. Sunil Kumar, and T. Nishanth, “Data preprocessing techniques for handling time series data for environmental science studies,” Int. J. Eng. Trends Technol., vol. 69, no. 5, pp. 196–207, 2021, doi: 10.14445/22315381/IJETT-V69I5P227.

Abhishek Thakur, Approaching (Almost) Any Machine Learning Problem. Abhishek Thakur, 2020.

A. Zheng and A. Casari, Feature Engineering for Machine Learning: Principles and Techniques for Data Scientists. O’Reilly Media, 2018.

G. R. Naik, Advances in principal component analysis: Research and development. 2018.

K. Sud, P. Erdogmus, and S. Kadry, Introduction to Data Science and Machine Learning. Rijeka: IntechOpen, 2020.

I. Zada et al., “Performance Evaluation of Simple K-Mean and Parallel K-Mean Clustering Algorithms: Big Data Business Process Management Concept,” Mob. Inf. Syst., vol. 2022, 2022, doi: 10.1155/2022/1277765.

N. Zumel, J. Mount, J. Howard, and R. Thomas, Practical Data Science With R. Manning, 2020.

Published
2023-09-30
How to Cite
Ajeng Arifa Chantika Rindu, Ria Astriratma, & Ati Zaidiah. (2023). K-Means Algorithm Implementation for Project Health Clustering. Jurnal RESTI (Rekayasa Sistem Dan Teknologi Informasi), 7(5), 1064 - 1076. https://doi.org/10.29207/resti.v7i5.5181
Section
Information Technology Articles