lab3.1 Presentation

Abigail Russell

Data Understanding

Data Understanding

   Age Gender Housing Saving accounts Checking account Credit amount Duration
1   67   male     own            <NA>           little          1169        6
2   22 female     own          little         moderate          5951       48
3   49   male     own          little             <NA>          2096       12
4   45   male    free          little           little          7882       42
5   53   male    free          little           little          4870       24
6   35   male    free            <NA>             <NA>          9055       36
7   53   male     own      quite rich             <NA>          2835       24
8   35   male    rent          little         moderate          6948       36
9   61   male     own            rich             <NA>          3059       12
10  28   male     own          little         moderate          5234       30
               Purpose Class Risk
1             radio/TV          1
2             radio/TV          2
3            education          1
4  furniture/equipment          1
5                  car          2
6            education          1
7  furniture/equipment          1
8                  car          1
9             radio/TV          1
10                 car          2
     Age Gender Housing Saving accounts Checking account Credit amount Duration
991   37   male     own            <NA>             <NA>          3565       12
992   34   male     own        moderate             <NA>          1569       15
993   23   male    rent            <NA>           little          1936       18
994   30   male     own          little           little          3959       36
995   50   male     own            <NA>             <NA>          2390       12
996   31 female     own          little             <NA>          1736       12
997   40   male     own          little           little          3857       30
998   38   male     own          little             <NA>           804       12
999   23   male    free          little           little          1845       45
1000  27   male     own        moderate         moderate          4576       45
                 Purpose Class Risk
991            education          1
992             radio/TV          1
993             radio/TV          1
994  furniture/equipment          1
995                  car          1
996  furniture/equipment          1
997                  car          1
998             radio/TV          1
999             radio/TV          2
1000                 car          1

Summary Statistics for Credit Amount and Duration

   vars    n    mean      sd median trimmed     mad min   max range skew
X1    1 1000 3271.26 2822.74 2319.5 2754.57 1627.15 250 18424 18174 1.94
   kurtosis    se
X1     4.25 89.26
   vars    n mean    sd median trimmed mad min max range skew kurtosis   se
X1    1 1000 20.9 12.06     18   19.47 8.9   4  72    68 1.09      0.9 0.38

Categorical Tables


  1   2 
700 300 

1 is equal to customers that are considered good according to his/her failure to repay. 2 is equal to customers that are considered bad according to their failure to repay.

Final Graph

Interpretation

This graph shows the relationship between the credit amount borrowed, the duration in months of that amount and the class that they are associated with. When looking at the graph we see that the two classes are very similar to each other. There is a lot of overlap in the distribution between the amount borrowed and the duration. Due to this overlap the credit amount and the duration due have a positive relationship but it is not strong enough to predict the Class Risk.

Improvements

The changes that were made included fixing labels to better describe the x and y axis. Fixing the legend label so that the title of the legend is correct, this also included changing the colors.Finally, fixing the main titles punctuation and grammar.