January 22, 2019

Introduction

Data Description

The description of the used dataset and its variables is as follows:

## 'data.frame':    500 obs. of  3 variables:
##  $ loan_status  : Factor w/ 3 levels "COLLECTION","COLLECTION_PAIDOFF",..: 3 3 3 3 3 3 3 3 3 3 ...
##  $ paid_off     : num  1 1 1 1 1 1 1 1 1 1 ...
##  $ past_due_days: num  0 0 0 0 0 0 0 0 0 0 ...

Variables:

loan_status: Whether a loan is paid off, in collection, new customer yet to payoff, or paid off after the collection efforts paid_off: Whether the customer paid on time (0 yes, 1 no) past_due_days: Whether the payment has been past due (0 no, 1 yes)

Application Description

Application Description

The application allows the user to choose between two models for building a decision tree algorithm that classifies the loan payment status. Model 1 uses "Past Due Days" and Model 2 "Paid off" and "Past Due Days" as features. In addition, the user can also manipulate the amount of percentage of the training set for model, and the rest will be used for the test set. Finally, the accuracy of the two models is presented to the user and the final product should look like this:

My Awesome figure

My Awesome figure