Klasifikasi Risiko Kredit Nasabah Menggunakan Algoritma Machine Learning dan Teknik Explainable AI
DOI:
https://doi.org/10.55681/jige.v7i2.5941Keywords:
Classification, Credit Risk, Machine Learning, SMOTE, SHAP, German Credit DatasetAbstract
The banking sector faces a major challenge in the form of credit risk due to customers' inability to repay loans, which can threaten financial stability. Traditional methods are less effective at handling complex data and class imbalances, so a machine learning approach is needed to improve classification accuracy. This study compares the performance of the Naïve Bayes, AdaBoost, Random Forest, and XGBoost algorithms on the German Credit dataset (1000 entries, 70% low risk and 30% high risk) with SMOTE techniques to overcome class imbalances as well as SHAP as Explainable AI for model interpretability. Data processing is carried out using Python (Pandas, Scikit-learn, XGBoost, SHAP) in Google Colab, including preprocessing (handling missing values, encoding, scaling), 10-fold cross-validation evaluation, and SHAP analysis. The results showed that Random Forest achieved the best performance with an average accuracy of 87% (Std: 0.036), precision 0.88, recall 0.87, F1-score 0.87, and ROC-AUC 0.93, followed by XGBoost (85%, Std: 0.041). Naïve Bayes and AdaBoost only reached 81%. SHAP analysis revealed Credit Amount and Duration as the most influential features on high risk prediction (positive correlation ~0.62). The ensemble model excels in accuracy and stability, while the integration of SMOTE and SHAP improves minority class recall as well as transparency for banking decision-making. This study outperformed previous studies (71.33% in ANN and 83% in Random Forest) thanks to the combination of these techniques, supporting more accurate, ethical, and regulated credit risk management in financial institutions.
Downloads
References
Ardana, Y., Wulandari, W., & Septiadi, D. (2021). Perbankan dan pertumbuhan ekonomi di Indonesia: Analisis error correction model. Jurnal Masharif Al-Syariah: Jurnal Ekonomi dan Perbankan Syariah, 6(3), 674. https://doi.org/10.30651/jms.v6i3.7578
Aryani, Y., & Wijayanto, A. W. (2021). Klasifikasi pengembalian radar dari ionosfer menggunakan SVM, Naïve Bayes dan Random Forest. Komputika: Jurnal Sistem Komputer, 10(2), 111–117. https://doi.org/10.34010/komputika.v10i2.4347
Bintoro, I. P. P., Ratnasari, R., Wihardjo, E., & Asari, A. (n.d.). Pengantar machine learning.
Budi, E. S., Chan, A. N., Alda, P. P., & Idris, M. A. F. (2024). Optimasi model machine learning untuk klasifikasi dan prediksi citra menggunakan algoritma convolutional neural network. RESOLUSI: Rekayasa Teknik Informatika dan Informasi, 4(5), 502–509.
Chang, V., Xu, Q. A., Akinloye, S. H., Benson, V., & Hall, K. (2024). Prediction of bank credit worthiness through credit risk analysis: An explainable machine learning study. Annals of Operations Research. https://doi.org/10.1007/s10479-024-06134-x
Fauzihan, A., Sajiwo, B., Rahmat, B., & Junaidi, A. (2024). Klasifikasi indeks standar pencemaran udara (ISPU) menggunakan XGBoost dengan teknik imbalanced data. [Nama Jurnal], 12(3).
Ghildiyal, V., Danu, A., Verma, V., & Bhat, A. (2025). Explainable AI. https://doi.org/10.1201/9781003536901-8
Ismafillah, D., Rohana, T., & Cahyana, Y. (2023). Analisis algoritma pohon keputusan untuk memprediksi penyakit diabetes menggunakan oversampling SMOTE. [Nama Jurnal], 4, 27–36.
Momole, G. M. (2022). Perbandingan Naïve Bayes dan Random Forest dalam klasifikasi bahasa daerah. JATISI (Jurnal Teknik Informatika dan Sistem Informasi), 9(2), 855–863. https://doi.org/10.35957/jatisi.v9i2.1857
Permata, I., & Nababan, E. S. M. (2023). Analisis perbandingan algoritma XGBoost dan algoritma Random Forest ensemble learning pada klasifikasi keputusan kredit. Jurnal Riset Rumpun Matematika dan Ilmu Pengetahuan Alam, 2(2), 65–71.
Perwira. (2019). Pembuatan sistem deteksi serangan DOS menggunakan algoritma Adaboost classifier dan Raspberry PI3. http://eprints.upnyk.ac.id/31575/1/BUKU%20Adabost.pdf
Prasojo, B., & Haryatmi, E. (2021). Analisa prediksi kelayakan pemberian kredit pinjaman dengan metode Random Forest. Jurnal Nasional Teknologi dan Sistem Informasi, 7(2), 79–89. https://doi.org/10.25077/teknosi.v7i2.2021.79-89
Qurahman, T., Mustakim, & Jaini, A. (2019). Penerapan algoritma Naïve Bayes classifier dan probabilistic neural network untuk klasifikasi nasabah bank dalam membayar kredit. Seminar Nasional Teknologi Informasi, Komunikasi dan Industri, 205–213.
Raharjo, B. (2021). Pembelajaran mesin. Codepolitan. https://www.codepolitan.com/mengenal-teknologi-machine-learning-pembelajaran-mesin
Santoso, J. T. (2023). Ilmu data (Vol. 1).
Sari, V., Firdausi, F., & Azhar, Y. (2020). Perbandingan prediksi kualitas kopi Arabika dengan menggunakan algoritma SGD, Random Forest dan Naive Bayes. Edumatic: Jurnal Pendidikan Informatika, 4(2), 1–9. https://doi.org/10.29408/edumatic.v4i2.2202
Scheule, H. (2020). Deep credit risk.
Yani, A. (2019). Analisa kelayakan kredit menggunakan artificial neural network dan backpropagation (Studi kasus German credit data). Jurnal Ilmiah Komputasi, 18(4), 385–390. https://doi.org/10.32409/jikstik.18.4.2672
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Ismi Hayati Nabila, Sri Lestari

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.








