Comparison of the Naïve Bayes Classifier and Decision Tree J48 for Credit Classification of Bank Customers

Alifia Tanza; Dina Tri Utari

doi:10.20885/EKSAKTA.vol3.iss2.art2

Submitted

July 13, 2022

Accepted

August 22, 2022

Published

August 29, 2022

Download

PDF

Statistic

Read Counter : 959 Download : 929

Abstract

The bank conducts an analysis or survey in the credit system to determine whether the customer is eligible to receive credit. With a case study of Bank BJB debtor data in December 2021, credit classification analysis was carried out by forming a model using the Naïve Bayes Classifier and Decision Tree J48. Thus it is expected to minimize the occurrence of bad loans. The data are divided into several categories: debtors with good, substandard, doubtful, and bad credit. The analysis was carried out using a 10-fold cross-validation model, where the results obtained from both tests, the highest accuracy value was the Decision Tree J48 of 78.26%. While the Naïve Bayes Classifier has a lower level of accuracy, the prediction results tend to be better than the Decision Tree J48. The prediction results with the Naïve Bayes Classifier can predict all classes and the most influential variable in classifying credit is the loan term.

Keywords

Credit classification Naïve Bayes Classifier Decision Tree J48

License

Authors who publish with this journal agree to the following terms:

Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under aÂ Creative Commons Attribution LicenseÂ that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (SeeÂ The Effect of Open Access).

How to Cite

Tanza, A., & Utari, D. T. (2022). Comparison of the Naïve Bayes Classifier and Decision Tree J48 for Credit Classification of Bank Customers. EKSAKTA: Journal of Sciences and Data Analysis, 3(2). https://doi.org/10.20885/EKSAKTA.vol3.iss2.art2

Download Citation

References

Kasmir, Manajemen Perbankan, PT. Raja Grafindo Persada, Jakarta, 2011.
A. N. Kholifah and N. Insani, Analisis Klasifikasi Pada Nasabah Kredit Koperasi X Menggunakan Decision Tree C4.5 dan Naive Bayes, Jurnal Pendidikan Matematika dan Sains (2016).
Rahmadeni, Susandi, R. Yendra, and A. P. Desvina, Analisis Diskriminan Fisher Untuk Klasifikasi Risiko Kredit, Seminar Nasional Teknologi Informasi, Komunikasi dan Industri (SNTIKI) 11 (2019).
D. S. L. Ting, P. W. H. Ip, and D. H. C. Tsang, Is Naïve Bayes a Good Classifier for Document Classification, Int. J. Softw. Eng. its Appl., 5(3) (2011) 37–46.
Ketjie, V. C. Mawardi, and N. J. Perdana, Prediction of Credit Card Using the Naïve Bayes Method and C4.5 Algorithm, IOP Conf. Ser.: Mater. Sci. Eng., (2020).
A. Nafalski and A. P. Wibawa, Machine translation with Javanese speech levels’ classification, Informatyka, Automatyka, Pomiary W Gospodarce I Ochronie Środowiska, 6(1) (2016) 21–25.
Bustami, Penerapan Algoritma Naive Bayes Untuk Mengklasifikasi Data Nasabah Asuransi, Jurnal Informatika, 8(1) (2014) 884–898.
D. Yusuf and E. Sestri, Metode Decision Tree Dalam Klasifikasi Kredit Pada Nasabah PT Bank Perkreditan Rakyat (Studi Kasus : PT BPR Lubuk Raya Mandiri), Jurnal Sistem Informasi (JUSIN), 1(1) (2020) 21–28.
M. R. Camana, S. Ahmed, C. E. Garcia, and I. Koo, Extremely Randomized Trees-BasedScheme for Stealthy Cyber-Attack Detection in Smart Grid Networks, IEEE Access 4(2016) (2020) 1–13.
Rusito and M. T. Firmansyah, Implementasi Metode Decision Tree dan Algoritma C4.5 Untuk Klasifikasi Data Nasabah, INFOKAM, 1(12) (2016) 1–12.
E. v. Venkatesan, Performance Analysis of Decision Tree Algorithms for Breast Cancer Classification, Indian Journal of Science and Technology (2015).
N. Saravanan and V. Gayathri, Performance and Classification Evaluation of J48 Algorithm and Kendall’s Based J48 Algorithm (KNJ48), Int. j. comput. intell. inform., 7(4) (2018) 188–198.
P. Gulati, A. Sharma, and M. Gupta, Theoretical Study of Decision Tree Algorithms to Identify Pivotal Factors for Performance Improvement: A Review, Int. J. Comput. Appl., 141(14) (2016) 19–25.
T.-T. Wong and P.-Y. Yeh, Reliable Accuracy Estimates from k-Fold Cross Validation, IEEE Trans. Knowl. Data Eng., 32(8) (2020).
I. M. Nasir, M. A. Khan, M. Yasmin, J. H. Shah, M. Gabryel, and R. Damaševičius, Pearson Correlation-Based Feature Selection for Document Classification Using Balanced Training, sensors, 20(6793) (2020).

References

Kasmir, Manajemen Perbankan, PT. Raja Grafindo Persada, Jakarta, 2011.

A. N. Kholifah and N. Insani, Analisis Klasifikasi Pada Nasabah Kredit Koperasi X Menggunakan Decision Tree C4.5 dan Naive Bayes, Jurnal Pendidikan Matematika dan Sains (2016).

Rahmadeni, Susandi, R. Yendra, and A. P. Desvina, Analisis Diskriminan Fisher Untuk Klasifikasi Risiko Kredit, Seminar Nasional Teknologi Informasi, Komunikasi dan Industri (SNTIKI) 11 (2019).

D. S. L. Ting, P. W. H. Ip, and D. H. C. Tsang, Is Naïve Bayes a Good Classifier for Document Classification, Int. J. Softw. Eng. its Appl., 5(3) (2011) 37–46.

Ketjie, V. C. Mawardi, and N. J. Perdana, Prediction of Credit Card Using the Naïve Bayes Method and C4.5 Algorithm, IOP Conf. Ser.: Mater. Sci. Eng., (2020).

A. Nafalski and A. P. Wibawa, Machine translation with Javanese speech levels’ classification, Informatyka, Automatyka, Pomiary W Gospodarce I Ochronie Środowiska, 6(1) (2016) 21–25.

Bustami, Penerapan Algoritma Naive Bayes Untuk Mengklasifikasi Data Nasabah Asuransi, Jurnal Informatika, 8(1) (2014) 884–898.

D. Yusuf and E. Sestri, Metode Decision Tree Dalam Klasifikasi Kredit Pada Nasabah PT Bank Perkreditan Rakyat (Studi Kasus : PT BPR Lubuk Raya Mandiri), Jurnal Sistem Informasi (JUSIN), 1(1) (2020) 21–28.

M. R. Camana, S. Ahmed, C. E. Garcia, and I. Koo, Extremely Randomized Trees-BasedScheme for Stealthy Cyber-Attack Detection in Smart Grid Networks, IEEE Access 4(2016) (2020) 1–13.

Rusito and M. T. Firmansyah, Implementasi Metode Decision Tree dan Algoritma C4.5 Untuk Klasifikasi Data Nasabah, INFOKAM, 1(12) (2016) 1–12.

E. v. Venkatesan, Performance Analysis of Decision Tree Algorithms for Breast Cancer Classification, Indian Journal of Science and Technology (2015).

N. Saravanan and V. Gayathri, Performance and Classification Evaluation of J48 Algorithm and Kendall’s Based J48 Algorithm (KNJ48), Int. j. comput. intell. inform., 7(4) (2018) 188–198.

P. Gulati, A. Sharma, and M. Gupta, Theoretical Study of Decision Tree Algorithms to Identify Pivotal Factors for Performance Improvement: A Review, Int. J. Comput. Appl., 141(14) (2016) 19–25.

T.-T. Wong and P.-Y. Yeh, Reliable Accuracy Estimates from k-Fold Cross Validation, IEEE Trans. Knowl. Data Eng., 32(8) (2020).

I. M. Nasir, M. A. Khan, M. Yasmin, J. H. Shah, M. Gabryel, and R. Damaševičius, Pearson Correlation-Based Feature Selection for Document Classification Using Balanced Training, sensors, 20(6793) (2020).

Comparison of the Naïve Bayes Classifier and Decision Tree J48 for Credit Classification of Bank Customers

Abstract

Keywords

References

References

Most read articles by the same author(s)

Related Articles based on the article keywords

Comparison of the Naïve Bayes Classifier and Decision Tree J48 for Credit Classification of Bank Customers

Article Sidebar

Main Article Content

Abstract

Keywords

Article Details

References

References

Most read articles by the same author(s)

Related Articles based on the article keywords