SISTEM KLASIFIKASI PENYAKIT DIABETES MENGGUNAKAN ALGORITMA RANDOM FOREST BERDASARKAN DATA MEDIS
Keywords:
Random Forest, Deteksi Dini, Diabetes Mellitus, Data Medis, Klasifikasi, machine learningAbstract
Penelitian ini membahas pembangunan model klasifikasi untuk deteksi dini diabetes mellitus menggunakan algoritma Random Forest. Diabetes merupakan penyakit kronis dengan prevalensi yang terus meningkat, sehingga deteksi awal menjadi krusial untuk mencegah komplikasi. Pemanfaatan data mining dan machine learning memungkinkan analisis data medis secara lebih akurat dalam mendukung proses diagnosis. Pendekatan penelitian menggunakan metode supervised learning dengan dataset berisi atribut kesehatan seperti kadar glukosa, tekanan darah, BMI, usia, dan variabel medis relevan lainnya. Data diproses melalui tahapan pre-processing, meliputi pembersihan, transformasi, normalisasi, serta pembagian data menjadi 80% data latih dan 20% data uji. Model Random Forest dibangun sebagai algoritma klasifikasi, sementara kinerjanya dievaluasi dengan metrik akurasi, presisi, recall, F1-score, dan confusion matrix. Hasil eksperimen menunjukkan bahwa model mencapai akurasi 96,76%, dengan variasi presisi dan recall antar kelas yang mengindikasikan adanya ketidakseimbangan data dan kompleksitas fitur. Meskipun akurasinya belum optimal, penelitian ini menegaskan bahwa Random Forest tetap potensial sebagai model dasar dalam sistem pendukung keputusan medis. Kinerja model masih dapat ditingkatkan melalui optimasi parameter, penyeimbangan data, serta peningkatan kualitas pre-processing. Pendekatan ini diharapkan menjadi langkah awal pengembangan analitik data untuk deteksi dini diabetes.
References
S. Raghavendra and S. K. J, “Performance Evaluation of Random Forest With Feature Selection Methods in Prediction of Diabetes,” International Journal of Electrical and Computer Engineering (Ijece), vol. 10, no. 1, p. 353, 2020, doi: 10.11591/ijece.v10i1.pp353-359.
I. J. B. do Nascimento et al., “Impact of Big Data Analytics on People’s Health: Overview of Systematic Reviews and Recommendations for Future Studies (Preprint),” 2021, doi: 10.2196/preprints.27275.
G. A. B. Suryanegara, A. Adiwijaya, and M. D. Purbolaksono, “Peningkatan Hasil Klasifikasi Pada Algoritma Random Forest Untuk Deteksi Pasien Penderita Diabetes Menggunakan Metode Normalisasi,” Jurnal Resti (Rekayasa Sistem Dan Teknologi Informasi), vol. 5, no. 1, pp. 114–122, 2021, doi: 10.29207/resti.v5i1.2880.
D. A. Hadi and D. A. N. Sirodj, “Metode Random Forest Untuk Klasifikasi Penyakit Diabetes,” Bandung Conference Series Statistics, vol. 3, no. 2, pp. 428–435, 2023, doi: 10.29313/bcss.v3i2.8354.
S. Alkhalefah, I. Al-Turaiki, and Ν. Altwaijry, “Advancing Diabetic Foot Ulcer Care: AI and Generative AI Approaches for Classification, Prediction, Segmentation, and Detection,” Healthcare, vol. 13, no. 6, p. 648, 2025, doi: 10.3390/healthcare13060648.
P. V. S. Kumar and N. S. Kumar, “Analysis and Comparison for Prediction of Diabetic Among Pregnant Women Using Innovative Support Vector Machine Algorithm Over Random Forest Algorithm With Improved Accuracy,” Cm, no. 25, pp. 956–962, 2023, doi: 10.18137/cardiometry.2022.25.956962.
F. N. Rohman, F. Farikhin, and B. Surarso, “Hyperparameter Tuning of Random Forest Algorithm for Diabetes Classification,” International Journal of Current Science Research and Review, vol. 08, no. 01, 2025, doi: 10.47191/ijcsrr/v8-i1-31.
M. Wang, H. Cao, Z. Ai, and Q. Zhang, “Fault diagnosis of ship ballast water system based on support vector machine optimized by improved sparrow search algorithm,” IEEE Access, vol. 12, pp. 17045–17057, 2024, doi: 10.1109/access.2024.3351171.
L. Li, “Comparative Research on Diabetes Influencing Factors Based on Random Forest and Decision Tree Models,” Highlights in Science Engineering and Technology, vol. 72, pp. 231–242, 2023, doi: 10.54097/7m4x7j04.
N. G. Ramadhan, A. Adiwijaya, W. Maharani, and A. A. Gozali, “Chronic Diseases Prediction Using Machine Learning With Data Preprocessing Handling: A Critical Review,” Ieee Access, vol. 12, pp. 80698–80730, 2024, doi: 10.1109/access.2024.3406748.
C. N. Noviyanti and A. Alamsyah, “Early Detection of Diabetes Using Random Forest Algorithm,” J. Inf. Syst. Explor. Res., vol. 2, no. 1, 2024, doi: 10.52465/joiser.v2i1.245.
J. Jumanto, M. A. Muslim, Y. Dasril, and T. Mustaqim, “Accuracy of Malaysia Public Response to Economic Factors During the Covid-19 Pandemic Using Vader and Random Forest,” J. Inf. Syst. Explor. Res., vol. 1, no. 1, pp. 49–70, 2022, doi: 10.52465/joiser.v1i1.104.
L. P. Sari, N. Darmawulan, F. Agustina, and N. Nursiswati, “Application of Artificial Intelligence for Managing Diabetes Mellitus Patients During the Covid-19 Pandemic,” Journal of Nursing Care, vol. 5, no. 2, 2023, doi: 10.24198/jnc.v5i2.36971.
A. Sadagopan, “Artificial Intelligence Driving Diabetes Care,” Journal for International Medical Graduates, vol. 2, no. 1, 2023, doi: 10.56570/jimgs.v2i1.92.
G. Chemello, B. Salvatori, M. Morettini, and A. Tura, “Artificial Intelligence Methodologies Applied to Technologies for Screening, Diagnosis and Care of the Diabetic Foot: A Narrative Review,” Biosensors, vol. 12, no. 11, p. 985, 2022, doi: 10.3390/bios12110985.
Z. Guan et al., “Artificial Intelligence in Diabetes Management: Advancements, Opportunities, and Challenges,” Cell Reports Medicine, vol. 4, no. 10, p. 101213, 2023, doi: 10.1016/j.xcrm.2023.101213.
S. Ziajor et al., “The Use of Artificial Intelligence in the Diagnosis and Detection of Complications of Diabetes,” Journal of Education Health and Sport, vol. 65, pp. 11–27, 2024, doi: 10.12775/jehs.2024.65.001.
K. Thanigainathan, “USING ENSEMBLE CLUSTERING TO IDENTIFY PHENOTYPES,” no. March, 2022.
Downloads
Published
Issue
Section
Citation Check
License
Copyright (c) 2026 Hafist Rezallul Fikry, Rudi Kurniawan, Bani Nurhakim, Umi Hayati

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.




