library(tidyverse)
library(openintro)
The file UniversalBank.rds contains data on 5000
customers. The data include customer demographic information (age,
income, etc.), the customer’s relationship with the bank (mortgage,
securities account, etc.), and the customer response to the last
personal loan campaign (Personal Loan). Among these 5000 customers, only
480 (= 9.6%) accepted the personal loan that was offered to them in the
earlier campaign.
Data Description:
| ID |
Customer ID |
| Age |
Customer’s age in completed years |
| Experience |
# years of professional experience |
| Income |
Annual income of the customer ($000) |
| ZIPCode |
Home Address ZIP code |
| Family |
Family size of the customer |
| CCAvg |
Avg. spending on credit cards per month ($000) |
| Education_1 |
Education Level = 1 if Undergrad; 0 otherwise |
| Education_2 |
Education Level = 1 if Graduate; 0 otherwise |
| Education_3 |
Education Level = 1 if Advanced/Professional; 0
otherwise |
| Mortgage |
Value of house mortgage if any. ($000) |
| PersonalLoan |
Did this customer accept the personal loan offered in
the last campaign? |
| SecuritiesAccount |
Does the customer have a securities account with the
bank? |
| CDAccount |
Does the customer have a certificate of deposit (CD)
account with the bank? |
| Online |
Does the customer use internet banking facilities? |
| CreditCard |
Does the customer use a credit card issued by
UniversalBank? |
Question 1
Read and preprocess the data: a. Read the data file and assign into
an R object. b. Drop ID and ZIPcode from the dataset.
c. Convert binary categorical variables into factor variables. d. Handle
missing values, if there is any. e. Normalize all numerical
variables.
Answer to Question 1
# Insert code yor code here
Question 2
Partition the data into training (60%) and testing (40%) sets.
Use
Answer to Question 2
# Insert code yor code here
Question 3
Perform a k-NN classification with all predictors except ID and ZIP
code using:
a. k = 1. b. k = 2. c. k = 3. d. k = 4.
For each k=1,2,3,4, compute accuracy based on test dataset. Whick k
is the best?
Answer to Question 3
# Insert code yor code here
Question 4
Consider the following customer:
Age = 40, Experience = 10, Income = 84, Family = 2, CCAvg = 2,
Education_1 = 0, Education_2 = 1, Education_3 = 0, Mortgage = 0,
SecuritiesAccount = 0, CDAccount = 0, Online = 1, and CreditCard =
1.
Perform a k-NN classification with all predictors except ID and ZIP
code using th best k. How would this customer be classified?
Answer to Question 4
# Insert code yor code here
LS0tCnRpdGxlOiAiRUNPTiAzMjAwOiBIb21ld29yayAxIgphdXRob3I6ICJBdXRob3IgTmFtZSIKZGF0ZTogImByIFN5cy5EYXRlKClgIgpvdXRwdXQ6IG9wZW5pbnRybzo6bGFiX3JlcG9ydAotLS0KCmBgYHtyIGxvYWQtcGFja2FnZXMsIG1lc3NhZ2U9RkFMU0V9CmxpYnJhcnkodGlkeXZlcnNlKQpsaWJyYXJ5KG9wZW5pbnRybykKYGBgCgpUaGUgZmlsZSBgVW5pdmVyc2FsQmFuay5yZHNgIGNvbnRhaW5zIGRhdGEgb24gNTAwMCBjdXN0b21lcnMuIFRoZSBkYXRhIGluY2x1ZGUgY3VzdG9tZXIgZGVtb2dyYXBoaWMgaW5mb3JtYXRpb24gKGFnZSwgaW5jb21lLCBldGMuKSwgdGhlIGN1c3RvbWVy4oCZcyByZWxhdGlvbnNoaXAgd2l0aCB0aGUgYmFuayAobW9ydGdhZ2UsIHNlY3VyaXRpZXMgYWNjb3VudCwgZXRjLiksIGFuZCB0aGUgY3VzdG9tZXIgcmVzcG9uc2UgdG8gdGhlIGxhc3QgcGVyc29uYWwgbG9hbiBjYW1wYWlnbiAoUGVyc29uYWwgTG9hbikuIEFtb25nIHRoZXNlIDUwMDAgY3VzdG9tZXJzLCBvbmx5IDQ4MCAoPSA5LjYlKSBhY2NlcHRlZCB0aGUgcGVyc29uYWwgbG9hbiB0aGF0IHdhcyBvZmZlcmVkIHRvIHRoZW0gaW4gdGhlIGVhcmxpZXIgY2FtcGFpZ24uCgpEYXRhIERlc2NyaXB0aW9uOgoKfCBWYXJpYWJsZSAgICAgICAgICAgIHwgRGVzY3JpcHRpb24gICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgfAp8Oi0tLS0tLS0tLS0tLS0tLS0tLS0tfDotLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS18CnwgSUQgICAgICAgICAgICAgICAgICB8IEN1c3RvbWVyIElEICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgIHwKfCBBZ2UgICAgICAgICAgICAgICAgIHwgQ3VzdG9tZXIncyBhZ2UgaW4gY29tcGxldGVkIHllYXJzICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgfAp8IEV4cGVyaWVuY2UgICAgICAgICAgfCAjIHllYXJzIG9mIHByb2Zlc3Npb25hbCBleHBlcmllbmNlICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICB8CnwgSW5jb21lICAgICAgICAgICAgICB8IEFubnVhbCBpbmNvbWUgb2YgdGhlIGN1c3RvbWVyICgkMDAwKSAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgIHwKfCBaSVBDb2RlICAgICAgICAgICAgIHwgSG9tZSBBZGRyZXNzIFpJUCBjb2RlICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgfAp8IEZhbWlseSAgICAgICAgICAgICAgfCBGYW1pbHkgc2l6ZSBvZiB0aGUgY3VzdG9tZXIgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICB8CnwgQ0NBdmcgICAgICAgICAgICAgICB8IEF2Zy4gc3BlbmRpbmcgb24gY3JlZGl0IGNhcmRzIHBlciBtb250aCAoJDAwMCkgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgIHwKfCBFZHVjYXRpb25fMSAgICAgICAgIHwgRWR1Y2F0aW9uIExldmVsID0gMSBpZiBVbmRlcmdyYWQ7IDAgb3RoZXJ3aXNlICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgfAp8IEVkdWNhdGlvbl8yICAgICAgICAgfCBFZHVjYXRpb24gTGV2ZWwgPSAxIGlmIEdyYWR1YXRlOyAwIG90aGVyd2lzZSAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICB8CnwgRWR1Y2F0aW9uXzMgICAgICAgICB8IEVkdWNhdGlvbiBMZXZlbCA9IDEgaWYgQWR2YW5jZWQvUHJvZmVzc2lvbmFsOyAwIG90aGVyd2lzZSAgICAgICAgICAgICAgICAgICAgICAgICAgIHwKfCBNb3J0Z2FnZSAgICAgICAgICAgIHwgVmFsdWUgb2YgaG91c2UgbW9ydGdhZ2UgaWYgYW55LiAoJDAwMCkgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgfAp8IFBlcnNvbmFsTG9hbiAgICAgICAgfCBEaWQgdGhpcyBjdXN0b21lciBhY2NlcHQgdGhlIHBlcnNvbmFsIGxvYW4gb2ZmZXJlZCBpbiB0aGUgbGFzdCBjYW1wYWlnbj8gICAgICAgICAgICB8CnwgU2VjdXJpdGllc0FjY291bnQgICB8IERvZXMgdGhlIGN1c3RvbWVyIGhhdmUgYSBzZWN1cml0aWVzIGFjY291bnQgd2l0aCB0aGUgYmFuaz8gICAgICAgICAgICAgICAgICAgICAgICAgIHwKfCBDREFjY291bnQgICAgICAgICAgIHwgRG9lcyB0aGUgY3VzdG9tZXIgaGF2ZSBhIGNlcnRpZmljYXRlIG9mIGRlcG9zaXQgKENEKSBhY2NvdW50IHdpdGggdGhlIGJhbms/ICAgICAgICAgfAp8IE9ubGluZSAgICAgICAgICAgICAgfCBEb2VzIHRoZSBjdXN0b21lciB1c2UgaW50ZXJuZXQgYmFua2luZyBmYWNpbGl0aWVzPyAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICB8CnwgQ3JlZGl0Q2FyZCAgICAgICAgICB8IERvZXMgdGhlIGN1c3RvbWVyIHVzZSBhIGNyZWRpdCBjYXJkIGlzc3VlZCBieSBVbml2ZXJzYWxCYW5rPyAgICAgICAgICAgICAgICAgICAgICAgIHwKCiMjIyBRdWVzdGlvbiAxCgpSZWFkIGFuZCBwcmVwcm9jZXNzIHRoZSBkYXRhOiAKYS4gUmVhZCB0aGUgZGF0YSBmaWxlIGFuZCBhc3NpZ24gaW50byBhbiBgUmAgb2JqZWN0LgpiLiBEcm9wIElEIGFuZCBaSVBjb2RlIGZyb20gdGhlIGRhdGFzZXQuCmMuIENvbnZlcnQgYmluYXJ5IGNhdGVnb3JpY2FsIHZhcmlhYmxlcyBpbnRvIGZhY3RvciB2YXJpYWJsZXMuCmQuIEhhbmRsZSBtaXNzaW5nIHZhbHVlcywgaWYgdGhlcmUgaXMgYW55LgplLiBOb3JtYWxpemUgYWxsIG51bWVyaWNhbCB2YXJpYWJsZXMuCgoKIyMjIF9BbnN3ZXIgdG8gUXVlc3Rpb24gMV8KCmBgYHtyIFExfQojIEluc2VydCBjb2RlIHlvciBjb2RlIGhlcmUKCmBgYAoKCiMjIyBRdWVzdGlvbiAyCgpQYXJ0aXRpb24gdGhlIGRhdGEgaW50byB0cmFpbmluZyAoNjAlKSBhbmQgdGVzdGluZyAoNDAlKSBzZXRzLiBVc2UKCiMjIyBfQW5zd2VyIHRvIFF1ZXN0aW9uIDJfCgpgYGB7ciBRMn0KIyBJbnNlcnQgY29kZSB5b3IgY29kZSBoZXJlCgpgYGAKCiMjIyBRdWVzdGlvbiAzCgpQZXJmb3JtIGEgay1OTiBjbGFzc2lmaWNhdGlvbiB3aXRoIGFsbCBwcmVkaWN0b3JzIGV4Y2VwdCBJRCBhbmQgWklQIGNvZGUgdXNpbmc6ICAKYS4gayA9IDEuIApiLiBrID0gMi4KYy4gayA9IDMuCmQuIGsgPSA0LgoKRm9yIGVhY2ggaz0xLDIsMyw0LCBjb21wdXRlIGFjY3VyYWN5IGJhc2VkIG9uIHRlc3QgZGF0YXNldC4gV2hpY2sgayBpcyB0aGUgYmVzdD8KCiMjIyBfQW5zd2VyIHRvIFF1ZXN0aW9uIDNfCgpgYGB7ciBRM30KIyBJbnNlcnQgY29kZSB5b3IgY29kZSBoZXJlCgpgYGAKCiMjIyBRdWVzdGlvbiA0CgpDb25zaWRlciB0aGUgZm9sbG93aW5nIGN1c3RvbWVyOiAKCkFnZSA9IDQwLCBFeHBlcmllbmNlID0gMTAsIEluY29tZSA9IDg0LCBGYW1pbHkgPSAyLCBDQ0F2ZyA9IDIsIEVkdWNhdGlvbl8xID0gMCwgRWR1Y2F0aW9uXzIgPSAxLCBFZHVjYXRpb25fMyA9IDAsIE1vcnRnYWdlID0gMCwgU2VjdXJpdGllc0FjY291bnQgPSAwLCBDREFjY291bnQgPSAwLCBPbmxpbmUgPSAxLCBhbmQgQ3JlZGl0Q2FyZCA9IDEuCgpQZXJmb3JtIGEgay1OTiBjbGFzc2lmaWNhdGlvbiB3aXRoIGFsbCBwcmVkaWN0b3JzIGV4Y2VwdCBJRCBhbmQgWklQIGNvZGUgdXNpbmcgdGggYmVzdCBrLiBIb3cgd291bGQgdGhpcyBjdXN0b21lciBiZSBjbGFzc2lmaWVkPwoKIyMjIF9BbnN3ZXIgdG8gUXVlc3Rpb24gNF8KCmBgYHtyIFE0fQojIEluc2VydCBjb2RlIHlvciBjb2RlIGhlcmUKCmBgYAoK