KKher
8/20/2020
Create a web page presentation using R Markdown that features a plot created with Plotly.
This data set contains 113,937 loans with 81 variables on each loan, including loan amount, borrower rate (or interest rate), current loan status, borrower income, and many others. Can be downloaded from here
This data dictionary explains the variables in the data set.
Plotly is an interactive plotting package, feel free to explore the graphs ;)
From this graph we can see that entries for unidentified states ended in 2008. Moreover we notice huge spike in 2013 for almost all states.
This graph gives an insight on how in 2011, almost all states have close-range APRs, unlike prior to 2011, too many variabilities are found.
By taking a closer look we can new listings were introduced after 2011.
This one is messy but insightful!
Further data are definitely needed to understand this spike in loans in 2013, also noticeable here when certain categories ended!
Despite the fact that (Not Displayed) had entries in 2005, there is no APR data for that period.
Inerestingly, there is no data for some years above certain thershold!
Further breakdown of the previous graph per status. (Plotting top 5 LoanStatuses)
Unlike EstimatedLoss, EstimatedReturn range is wider for some years, and have a steeper relation with APR. Double click on the year of interest to see its graph alone!
Unlike EstimatedLoss, obvious variability existed across years, however it seems it was handled for current loans.