Digital Library

Title:      ARE DECISION TREES THE BEST TOOL FOR IMPROVEMENT OF THE CLASSIFICATION ACCURACY RATES AND EXPLAINABILITY OF LOAN GRANTING DECISIONS?
Author(s):      Jozef Zurada
ISBN:      978-972-8939-47-2
Editors:      Miguel Baptista Nunes, Pedro Isaías and Philip Powell
Year:      2011
Edition:      Single
Keywords:      Loan granting decisions, Decision trees, Data mining methods, Classification accuracy rates, Interpretability, ROC charts, Knowledge representation
Type:      Full Paper
First Page:      13
Last Page:      20
Language:      English
Paper Abstract:      The paper compares the classification performance of eight models: logistic regression (LR), neural network (NN), radial basis function neural network (RBFNN), support vector machine (SVM), case-based reasoning (CBR), and three decision trees (DTs). We build the models and test their classification accuracy rates on a historical data set provided by a German financial institution. The data set contains 21 financial attributes of 1000 customers, of whom 300 defaulted on a loan and 700 paid it off. To obtain reliable and unbiased error estimates for each of the eight models, we apply 10-fold cross-validation and repeat the experiment 10 times. We found that, in overall classification accuracy at the 0.5 probability cut-off, two of the three DT models significantly outperformed (at α = 0.05) the remaining models. We then concentrate on the DT models and compare their performance at the 0.3 and 0.7 cut-off levels, which are more likely to be used by financial institutions. The DT models not only classify better than the other models, but the knowledge they learn in the form of if-then rules is easy to interpret, makes sense, and may be of value to financial institutions, which may have to explain the reasons for a loan denial.
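The evaluation protocol named in the abstract (10-fold cross-validation repeated 10 times, then classification at 0.3, 0.5 and 0.7 probability cut-offs) can be illustrated with a minimal standard-library Python sketch. It is not the paper's code. The function names, the stratified splitting, the fixed seed, and the convention that the cut-off is applied to the predicted probability of default are all assumptions made for illustration; only the 300/700 class balance and the cut-off values come from the abstract.

```python
import random
from collections import defaultdict


def repeated_stratified_folds(labels, k=10, repeats=10, seed=0):
    """Yield (repeat, fold, train_idx, test_idx) tuples.

    Stratification (keeping the class ratio in every fold) is an assumption;
    the abstract only states "10-fold cross-validation" repeated 10 times.
    """
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for i, y in enumerate(labels):
        by_class[y].append(i)
    for r in range(repeats):
        folds = [[] for _ in range(k)]
        for idxs in by_class.values():
            shuffled = idxs[:]
            rng.shuffle(shuffled)
            # Deal each class's indices round-robin across the k folds.
            for pos, i in enumerate(shuffled):
                folds[pos % k].append(i)
        for f in range(k):
            test = sorted(folds[f])
            train = sorted(i for g in range(k) if g != f for i in folds[g])
            yield r, f, train, test


def classify(p_default, cutoff=0.5):
    """Return 1 (defaulter) when the predicted P(default) reaches the cut-off.

    Applying the cut-off to P(default) is an assumed convention.
    """
    return 1 if p_default >= cutoff else 0


def accuracy(probs, labels, cutoff):
    """Fraction of cases whose thresholded prediction matches the true label."""
    hits = sum(classify(p, cutoff) == y for p, y in zip(probs, labels))
    return hits / len(labels)


# Class balance from the abstract: 300 defaulters (1), 700 non-defaulters (0).
labels = [1] * 300 + [0] * 700
splits = list(repeated_stratified_folds(labels))

# Toy probabilities (made up) to show how the cut-off changes the decisions.
toy_probs = [0.2, 0.4, 0.6, 0.8]
toy_labels = [0, 0, 1, 1]
toy_accuracy = {c: accuracy(toy_probs, toy_labels, c) for c in (0.3, 0.5, 0.7)}
```

In this sketch each of the 100 train/test splits would be used to fit a model on the training indices and score it on the test indices; the per-split accuracies could then be averaged and compared across models. Lowering the cut-off to 0.3 flags more applicants as likely defaulters, while raising it to 0.7 flags fewer.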
   
