Can client demographic, financial, and campaign-related characteristics predict whether a client subscribes to a bank term deposit?
Why is this important?
Dataset: Bank Marketing Dataset
Source:
- Portuguese Retail Bank
- UCI Repository (Kaggle Mirror)
- 2008–2010
Observations: 11,162 clients
Outcome Variable:
- Deposit (Yes / No)
Key Predictors: Balance, Housing Loan, Personal Loan, Duration, Previous Campaign Outcome
- 52.6% No
- 47.4% Yes
- Balanced dataset
We built two logistic regression models.
Financial Variables: - Balance - Housing Loan - Personal Loan
Financial + Campaign Variables: - Balance - Housing Loan - Personal Loan - Duration - Previous - Previous Campaign Outcome (poutcome)
Determine whether campaign-related variables improve predictive performance.
| Model | Accuracy | Precision | Recall |
|---|---|---|---|
| Model 1 | 0.603 | 0.627 | 0.626 |
| Model 2 | 0.789 | 0.770 | 0.861 |
Model 2 outperformed Model 1 across all evaluation metrics.
| Metric | Test Set | Cross Validation |
|---|---|---|
| Accuracy | 0.789 | 0.786 |
| Precision | 0.770 | 0.769 |
| Recall | 0.861 | 0.846 |
The cross-validation results are very similar to the test-set results.
Interpretation: Model 2 generalizes well and shows no strong evidence of overfitting.
↑ Balance
↑ Duration
↑ Previous Success
↓ Housing Loan
↓ Personal Loan
→ Better customer targeting