library(tidyverse)
library(openintro)

The file UniversalBank.rds contains data on 5000 customers. The data include customer demographic information (age, income, etc.), the customer’s relationship with the bank (mortgage, securities account, etc.), and the customer response to the last personal loan campaign (Personal Loan). Among these 5000 customers, only 480 (= 9.6%) accepted the personal loan that was offered to them in the earlier campaign.

Data Description:

Variable Description
ID Customer ID
Age Customer’s age in completed years
Experience # years of professional experience
Income Annual income of the customer ($000)
ZIPCode Home Address ZIP code
Family Family size of the customer
CCAvg Avg. spending on credit cards per month ($000)
Education_1 Education Level = 1 if Undergrad; 0 otherwise
Education_2 Education Level = 1 if Graduate; 0 otherwise
Education_3 Education Level = 1 if Advanced/Professional; 0 otherwise
Mortgage Value of house mortgage if any. ($000)
PersonalLoan Did this customer accept the personal loan offered in the last campaign?
SecuritiesAccount Does the customer have a securities account with the bank?
CDAccount Does the customer have a certificate of deposit (CD) account with the bank?
Online Does the customer use internet banking facilities?
CreditCard Does the customer use a credit card issued by UniversalBank?

Question 1

Read and preprocess the data: a. Read the data file and assign into an R object. b. Drop ID and ZIPcode from the dataset. c. Convert binary categorical variables into factor variables. d. Handle missing values, if there is any. e. Normalize all numerical variables.

Answer to Question 1

# Insert code yor code here

Question 2

Partition the data into training (60%) and testing (40%) sets. Use

Answer to Question 2

# Insert code yor code here

Question 3

Perform a k-NN classification with all predictors except ID and ZIP code using:
a. k = 1. b. k = 2. c. k = 3. d. k = 4.

For each k=1,2,3,4, compute accuracy based on test dataset. Whick k is the best?

Answer to Question 3

# Insert code yor code here

Question 4

Consider the following customer:

Age = 40, Experience = 10, Income = 84, Family = 2, CCAvg = 2, Education_1 = 0, Education_2 = 1, Education_3 = 0, Mortgage = 0, SecuritiesAccount = 0, CDAccount = 0, Online = 1, and CreditCard = 1.

Perform a k-NN classification with all predictors except ID and ZIP code using th best k. How would this customer be classified?

Answer to Question 4

# Insert code yor code here
LS0tCnRpdGxlOiAiRUNPTiAzMjAwOiBIb21ld29yayAxIgphdXRob3I6ICJBdXRob3IgTmFtZSIKZGF0ZTogImByIFN5cy5EYXRlKClgIgpvdXRwdXQ6IG9wZW5pbnRybzo6bGFiX3JlcG9ydAotLS0KCmBgYHtyIGxvYWQtcGFja2FnZXMsIG1lc3NhZ2U9RkFMU0V9CmxpYnJhcnkodGlkeXZlcnNlKQpsaWJyYXJ5KG9wZW5pbnRybykKYGBgCgpUaGUgZmlsZSBgVW5pdmVyc2FsQmFuay5yZHNgIGNvbnRhaW5zIGRhdGEgb24gNTAwMCBjdXN0b21lcnMuIFRoZSBkYXRhIGluY2x1ZGUgY3VzdG9tZXIgZGVtb2dyYXBoaWMgaW5mb3JtYXRpb24gKGFnZSwgaW5jb21lLCBldGMuKSwgdGhlIGN1c3RvbWVy4oCZcyByZWxhdGlvbnNoaXAgd2l0aCB0aGUgYmFuayAobW9ydGdhZ2UsIHNlY3VyaXRpZXMgYWNjb3VudCwgZXRjLiksIGFuZCB0aGUgY3VzdG9tZXIgcmVzcG9uc2UgdG8gdGhlIGxhc3QgcGVyc29uYWwgbG9hbiBjYW1wYWlnbiAoUGVyc29uYWwgTG9hbikuIEFtb25nIHRoZXNlIDUwMDAgY3VzdG9tZXJzLCBvbmx5IDQ4MCAoPSA5LjYlKSBhY2NlcHRlZCB0aGUgcGVyc29uYWwgbG9hbiB0aGF0IHdhcyBvZmZlcmVkIHRvIHRoZW0gaW4gdGhlIGVhcmxpZXIgY2FtcGFpZ24uCgpEYXRhIERlc2NyaXB0aW9uOgoKfCBWYXJpYWJsZSAgICAgICAgICAgIHwgRGVzY3JpcHRpb24gICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgfAp8Oi0tLS0tLS0tLS0tLS0tLS0tLS0tfDotLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS18CnwgSUQgICAgICAgICAgICAgICAgICB8IEN1c3RvbWVyIElEICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgIHwKfCBBZ2UgICAgICAgICAgICAgICAgIHwgQ3VzdG9tZXIncyBhZ2UgaW4gY29tcGxldGVkIHllYXJzICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgfAp8IEV4cGVyaWVuY2UgICAgICAgICAgfCAjIHllYXJzIG9mIHByb2Zlc3Npb25hbCBleHBlcmllbmNlICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICB8CnwgSW5jb21lICAgICAgICAgICAgICB8IEFubnVhbCBpbmNvbWUgb2YgdGhlIGN1c3RvbWVyICgkMDAwKSAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgIHwKfCBaSVBDb2RlICAgICAgICAgICAgIHwgSG9tZSBBZGRyZXNzIFpJUCBjb2RlICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgfAp8IEZhbWlseSAgICAgICAgICAgICAgfCBGYW1pbHkgc2l6ZSBvZiB0aGUgY3VzdG9tZXIgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICB8CnwgQ0NBdmcgICAgICAgICAgICAgICB8IEF2Zy4gc3BlbmRpbmcgb24gY3JlZGl0IGNhcmRzIHBlciBtb250aCAoJDAwMCkgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgIHwKfCBFZHVjYXRpb25fMSAgICAgICAgIHwgRWR1Y2F0aW9uIExldmVsID0gMSBpZiBVbmRlcmdyYWQ7IDAgb3RoZXJ3aXNlICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgfAp8IEVkdWNhdGlvbl8yICAgICAgICAgfCBFZHVjYXRpb24gTGV2ZWwgPSAxIGlmIEdyYWR1YXRlOyAwIG90aGVyd2lzZSAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICB8CnwgRWR1Y2F0aW9uXzMgICAgICAgICB8IEVkdWNhdGlvbiBMZXZlbCA9IDEgaWYgQWR2YW5jZWQvUHJvZmVzc2lvbmFsOyAwIG90aGVyd2lzZSAgICAgICAgICAgICAgICAgICAgICAgICAgIHwKfCBNb3J0Z2FnZSAgICAgICAgICAgIHwgVmFsdWUgb2YgaG91c2UgbW9ydGdhZ2UgaWYgYW55LiAoJDAwMCkgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgfAp8IFBlcnNvbmFsTG9hbiAgICAgICAgfCBEaWQgdGhpcyBjdXN0b21lciBhY2NlcHQgdGhlIHBlcnNvbmFsIGxvYW4gb2ZmZXJlZCBpbiB0aGUgbGFzdCBjYW1wYWlnbj8gICAgICAgICAgICB8CnwgU2VjdXJpdGllc0FjY291bnQgICB8IERvZXMgdGhlIGN1c3RvbWVyIGhhdmUgYSBzZWN1cml0aWVzIGFjY291bnQgd2l0aCB0aGUgYmFuaz8gICAgICAgICAgICAgICAgICAgICAgICAgIHwKfCBDREFjY291bnQgICAgICAgICAgIHwgRG9lcyB0aGUgY3VzdG9tZXIgaGF2ZSBhIGNlcnRpZmljYXRlIG9mIGRlcG9zaXQgKENEKSBhY2NvdW50IHdpdGggdGhlIGJhbms/ICAgICAgICAgfAp8IE9ubGluZSAgICAgICAgICAgICAgfCBEb2VzIHRoZSBjdXN0b21lciB1c2UgaW50ZXJuZXQgYmFua2luZyBmYWNpbGl0aWVzPyAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICB8CnwgQ3JlZGl0Q2FyZCAgICAgICAgICB8IERvZXMgdGhlIGN1c3RvbWVyIHVzZSBhIGNyZWRpdCBjYXJkIGlzc3VlZCBieSBVbml2ZXJzYWxCYW5rPyAgICAgICAgICAgICAgICAgICAgICAgIHwKCiMjIyBRdWVzdGlvbiAxCgpSZWFkIGFuZCBwcmVwcm9jZXNzIHRoZSBkYXRhOiAKYS4gUmVhZCB0aGUgZGF0YSBmaWxlIGFuZCBhc3NpZ24gaW50byBhbiBgUmAgb2JqZWN0LgpiLiBEcm9wIElEIGFuZCBaSVBjb2RlIGZyb20gdGhlIGRhdGFzZXQuCmMuIENvbnZlcnQgYmluYXJ5IGNhdGVnb3JpY2FsIHZhcmlhYmxlcyBpbnRvIGZhY3RvciB2YXJpYWJsZXMuCmQuIEhhbmRsZSBtaXNzaW5nIHZhbHVlcywgaWYgdGhlcmUgaXMgYW55LgplLiBOb3JtYWxpemUgYWxsIG51bWVyaWNhbCB2YXJpYWJsZXMuCgoKIyMjIF9BbnN3ZXIgdG8gUXVlc3Rpb24gMV8KCmBgYHtyIFExfQojIEluc2VydCBjb2RlIHlvciBjb2RlIGhlcmUKCmBgYAoKCiMjIyBRdWVzdGlvbiAyCgpQYXJ0aXRpb24gdGhlIGRhdGEgaW50byB0cmFpbmluZyAoNjAlKSBhbmQgdGVzdGluZyAoNDAlKSBzZXRzLiBVc2UKCiMjIyBfQW5zd2VyIHRvIFF1ZXN0aW9uIDJfCgpgYGB7ciBRMn0KIyBJbnNlcnQgY29kZSB5b3IgY29kZSBoZXJlCgpgYGAKCiMjIyBRdWVzdGlvbiAzCgpQZXJmb3JtIGEgay1OTiBjbGFzc2lmaWNhdGlvbiB3aXRoIGFsbCBwcmVkaWN0b3JzIGV4Y2VwdCBJRCBhbmQgWklQIGNvZGUgdXNpbmc6ICAKYS4gayA9IDEuIApiLiBrID0gMi4KYy4gayA9IDMuCmQuIGsgPSA0LgoKRm9yIGVhY2ggaz0xLDIsMyw0LCBjb21wdXRlIGFjY3VyYWN5IGJhc2VkIG9uIHRlc3QgZGF0YXNldC4gV2hpY2sgayBpcyB0aGUgYmVzdD8KCiMjIyBfQW5zd2VyIHRvIFF1ZXN0aW9uIDNfCgpgYGB7ciBRM30KIyBJbnNlcnQgY29kZSB5b3IgY29kZSBoZXJlCgpgYGAKCiMjIyBRdWVzdGlvbiA0CgpDb25zaWRlciB0aGUgZm9sbG93aW5nIGN1c3RvbWVyOiAKCkFnZSA9IDQwLCBFeHBlcmllbmNlID0gMTAsIEluY29tZSA9IDg0LCBGYW1pbHkgPSAyLCBDQ0F2ZyA9IDIsIEVkdWNhdGlvbl8xID0gMCwgRWR1Y2F0aW9uXzIgPSAxLCBFZHVjYXRpb25fMyA9IDAsIE1vcnRnYWdlID0gMCwgU2VjdXJpdGllc0FjY291bnQgPSAwLCBDREFjY291bnQgPSAwLCBPbmxpbmUgPSAxLCBhbmQgQ3JlZGl0Q2FyZCA9IDEuCgpQZXJmb3JtIGEgay1OTiBjbGFzc2lmaWNhdGlvbiB3aXRoIGFsbCBwcmVkaWN0b3JzIGV4Y2VwdCBJRCBhbmQgWklQIGNvZGUgdXNpbmcgdGggYmVzdCBrLiBIb3cgd291bGQgdGhpcyBjdXN0b21lciBiZSBjbGFzc2lmaWVkPwoKIyMjIF9BbnN3ZXIgdG8gUXVlc3Rpb24gNF8KCmBgYHtyIFE0fQojIEluc2VydCBjb2RlIHlvciBjb2RlIGhlcmUKCmBgYAoK