library(data.table)
library(reshape2)
library(recommenderlab)
library(ggplot2)
library(pander)

1.0 Objective

As in your previous assignments, compare the accuracy of at least two recommender system algorithms against your offline data.
Implement support for at least one business or user experience goal such as increased serendipity, novelty, or diversity.
Compare and report on any change in accuracy before and after you’ve made the change in #2.
As part of your textual conclusion, discuss one or more additional experiments that could be performed and/or metrics that could be evaluated only if online evaluation was possible. Also, briefly propose how you would design a reasonable online evaluation environment.

2.0 Data Sourcing and Loading

Our dataset contains social networking, tagging, and music artist listening information from a set of 2K users from Last.fm online music system.

# user artist plays
data_set <- fread('user_artists.dat',header = T,sep='\t')
data_set <- as.data.frame(data_set)
# artist listing
artist_ds <- fread('artists.dat',header = T,sep='\t')
artist_ds <- as.data.frame(artist_ds)
# create programmer friendly column names. 
colnames(data_set) <- c('userID','artistID','listeningCount')
head(data_set)

##   userID artistID listeningCount
## 1      2       51          13883
## 2      2       52          11690
## 3      2       53          11351
## 4      2       54          10300
## 5      2       55           8983
## 6      2       56           6152

3.0 Data Transformation

Our goal in this section is to restructure our user-artist dataset into a wide matrix format. There are 1892 users and 17632 artists.

# coverting our dataset into a wide format. Where the artistID are the col names and userID row names.
data_matrix<- acast(data_set, userID ~ artistID)
data_matrix<- as.matrix(data_matrix)
dim(data_matrix)

## [1]  1892 17632

3.1 Creating Recommender Matrices

We plan to represent our rating matrix in two forms:

Real rating matrix
Binary rating matrix (If the user has played the song we will assign it a vaule of one (1) and zero (0) if the song wasn’t played.)

# creating rating matrix
s_matrix <- as(data_matrix,'realRatingMatrix')

# creating binary matrix
data_matrix_bin <- data_matrix

# set values more than 1 to 1 and NA values to 0
data_matrix_bin[data_matrix_bin > 1] <- 1
data_matrix_bin[is.na(data_matrix_bin)] <- 0

#binarize(data_matrix_bin, minRating = 1)
bin_matrix <- as(data_matrix_bin,'binaryRatingMatrix')

# normalizing the matrix
#s_norm <- normalize(s_matrix)
#bin_norm <- normalize(s_matrix)

3.2 Data Visualization

artist_play <- colCounts(s_matrix)

# sorting artist by number of plays
artist_lis <- data.frame(artist= artist_ds[which(artist_ds$id== names(artist_play)),]$name,plays= artist_play)
artist_lis <- artist_lis[order(artist_lis$plays, decreasing =TRUE), ]
ggplot(artist_lis[1:6, ], aes(x = artist, y = plays)) +geom_bar(stat="identity") + ggtitle("Number of plays of the top artists")

Lady Gaga received the most plays with Britney Spares following. Surprising they both received more plays than Rihanna.

3.2.1 Heatmaps

Let’s visualize the listeners and artists using heatmaps.

The heatmap is dense.The cart is unreadable because there are too many users and plays. We will build a smaller heatmap with the first 100 users and first 100 artists.

4.0 Split the dataset into Train and Test

We decided to use 80% of our dataset for training and the remaining 20% for testing.

## Picking 80% of the sample
set.seed(123)

which_train <- sample(x = c(TRUE, FALSE),size = nrow(s_matrix),replace = TRUE,prob = c(0.8, 0.2))
which_train_bin <- sample(x = c(TRUE, FALSE),size = nrow(bin_matrix),replace = TRUE,prob = c(0.8, 0.2))

train_matrix_raw <- s_matrix[which_train, ]
test_matrix_raw <- s_matrix[!which_train, ]

train_matrix_bin <- bin_matrix[which_train_bin, ]
test_matrix_bin <- bin_matrix[!which_train_bin, ]

4.1 Dimensions of Test and Training Matrices

Raw Listenig Matrix

Training

## 1512 x 17632 rating matrix of class 'realRatingMatrix' with 74292 ratings.

Test

## 380 x 17632 rating matrix of class 'realRatingMatrix' with 18542 ratings.

Binarized Listenig Matrix

Training

## 1505 x 17632 rating matrix of class 'binaryRatingMatrix' with 73891 ratings.

Test

## 387 x 17632 rating matrix of class 'binaryRatingMatrix' with 18943 ratings.

5.0 Model Building

We will construct User Based and Item Based Collaborative Filtering Models for our Raw Listening Matrix and our Binarized Listening Matrix.

We decided to restrict our matrix to 500 items when performing only item-based collaborative filtering since the Item-based models took over 2 hours to create.

# creating models for our raw matrix
raw_ubcf_model <- Recommender(train_matrix_raw[,1:500],method="UBCF",param=NULL)
raw_ibcf_model <- Recommender(train_matrix_raw[,1:500],method="IBCF",param=NULL)

# creating model for our binarized matrix. Using Jaccard as our distance method since it is preferred choice for binary data.
bin_ubcf_model <- Recommender(train_matrix_bin[,1:500],method="UBCF",param= list(method = "Jaccard"))
bin_ibcf_model <- Recommender(train_matrix_bin[,1:500],method="IBCF",param= list(method = "Jaccard"))

#recom <- predict(rec, r[1:nrow(r)], type="ratings")
#recom
#names(getModel(rec))

5.1 Visualizing our Item Based Collaborative Filtering Models

Raw Listening Matrix Model

Binarized Listening Matrix Model

6.0 Prediction

We will peform prediction using the models we created above against our test data. Our plan is to recommend 10 items to each user.

n_recommended <- 10
bin_predicted_ubcf <- predict(object = bin_ubcf_model,newdata = test_matrix_bin[,1:500],n = n_recommended)
bin_predicted_ibcf <- predict(object = bin_ibcf_model,newdata = test_matrix_bin[,1:500],n = n_recommended)


raw_predicted_ubcf <- predict(object = raw_ubcf_model,newdata = test_matrix_raw[,1:500],n = n_recommended)
raw_predicted_ibcf <- predict(object = raw_ibcf_model,newdata = test_matrix_raw[,1:500],n = n_recommended)

# create functions that returns prediction 

table_labels <- data.frame(id = bin_predicted_ubcf@itemLabels)
table_labels <- merge(table_labels, artist_ds,by = "id", all.x = TRUE, all.y = FALSE,sort = FALSE)
descriptions <- as(table_labels$name, "character")

getModelPredictionForUser <- function(userIndex,model) 
{
  user_rec <- model@items[[userIndex]]  
  return(descriptions[user_rec])
}

head(getModelPredictionForUser(1,bin_predicted_ubcf))

## [1] "New Order"   "Madonna"     "The Killers" "The Beatles" "Portishead" 
## [6] "Camouflage"

6.1 Top 5 Recommendations for User 1

bin_ibcf_pre <- head(getModelPredictionForUser(1,bin_predicted_ibcf))
bin_ubcf_pre <- head(getModelPredictionForUser(1,bin_predicted_ubcf))

raw_ibcf_pre <- head(getModelPredictionForUser(1,raw_predicted_ibcf))
raw_ubcf_pre <- head(getModelPredictionForUser(1,raw_predicted_ubcf))

output <- data.frame(binaryIBCF = bin_ibcf_pre, binaryUBCF = bin_ubcf_pre, rawIBCF= raw_ibcf_pre,rawUBCF= raw_ubcf_pre, stringsAsFactors = FALSE)
knitr::kable(output)

binaryIBCF	binaryUBCF	rawIBCF	rawUBCF
Limbonic Art	New Order	Sinamore	Sigur RÃ³s
Artista sconosciuto	Madonna	Artista sconosciuto	Thievery Corporation
Madonna	The Killers	:wumpscut:	The Cure
Jennifer Lopez	The Beatles	The Birthday Massacre	Opeth
BeyoncÃ©	Portishead	Agonoize	Kylie Minogue
Rihanna	Camouflage	Sopor Aeternus & The Ensemble of Shadows	Porcupine Tree

7.0 Model Evaluation

we can measure the performance depending on number of neighbors. Since we are optimizing the IBCF part only, we will set weight description to zero. Using lapply, we can build a list of elements that contain the performance for the value of nn_to_test:

7.1 Item Based

True Positives (TP): These are recommended items that have been purchased
False Positives (FP): These are recommended items that haven’t been purchased
False Negatives( FN): These are not recommended items that have been purchased
True Negatives (TN): These are not recommended items that haven’t been purchased
True Positive Rate (TPR): This is the percentage of purchased items that have been recommended
False Positive Rate (FPR): This is the percentage of not purchased items that have been recommended

A perfect (or overfitted) model would have only TP and TN.

Raw Ratings

# neigbors to test
n_n_test <- 25

list_performance_1 <- lapply(X = n_n_test,FUN = function(nn){
evaluateModel(train_data = train_matrix_raw[,1:500],test_data=test_matrix_raw[,1:500],number_neighbors = nn,goodRatingVal=1,paramMethod="cosine")})

pander(unlist(list_performance_1))

TP	FP	FN	TN	precision	recall	TPR	FPR
0	9.263	13.71	467	0	0	0	0.0195

The results show TP = 0, meaning the recommended items have not been purchased.

Binarized Ratings

list_performance_2 <- lapply(X = n_n_test,FUN = function(nn){
evaluateModel(train_data = train_matrix_bin[,1:500],test_data=test_matrix_bin[,1:500],number_neighbors = nn)})

pander(unlist(list_performance_2))

TP	FP	FN	TN	precision	recall	TPR	FPR
0	9.793	14.56	465.6	0	0	0	0.02062

The results show TP = 0, meaning the recommended items have not been purchased.

7.2 User Based

True Positives (TP): These are recommended items that have been purchased
False Positives (FP): These are recommended items that haven’t been purchased
False Negatives( FN): These are not recommended items that have been purchased
True Negatives (TN): These are not recommended items that haven’t been purchased
True Positive Rate (TPR): This is the percentage of purchased items that have been recommended
False Positive Rate (FPR): This is the percentage of not purchased items that have been recommended

A perfect (or overfitted) model would have only TP and TN.

Raw Ratings

list_performance_3 <- lapply(X = n_n_test,FUN = function(nn){
evaluateModel(train_data = train_matrix_raw[,1:500],test_data=test_matrix_raw[,1:500],recMethod ="UBCF",number_neighbors =nn,goodRatingVal=1,paramMethod="cosine")})

pander(unlist(list_performance_3))

TP	FP	FN	TN	precision	recall	TPR	FPR
0	9.289	13.71	467	0	0	0	0.01955

The results show TP = 0, meaning the recommended items have not been purchased.

Binarized Ratings

# neigbors to test

list_performance_4 <- lapply(X = n_n_test,FUN = function(nn){
evaluateModel(train_data = train_matrix_bin[,1:500],test_data=test_matrix_bin[,1:500],recMethod ="UBCF",number_neighbors = nn)})

pander(unlist(list_performance_4))

TP	FP	FN	TN	precision	recall	TPR	FPR
0	10	14.56	465.4	0	0	0	0.02104

The results show TP = 0, meaning the recommended items have not been purchased.

7.1 Increased Serendipity

Serendipity is defined as finding something good or useful while not specifically searching for it.

To achieve increased serendipity in our approach, we can try append random artists into the suggestion list. This will introduce the user to new artists. A detail analysis of this approach would entail further tweaking and running of the model, which is outside the scope of our work here due to the time constraint.

Summary

Based on the above, we demonstrated how to apply the techniques in a real-life context. Starting with raw unstructured data, we built a rating matrix, which is the input of collaborative filtering. Using performance evaluations, we optimized the model parameters. The same approach can be applied in real-life contexts, if properly refined models are used.

The model evaluation using the item based as well as the user based matrix did not yield any True Positives, meaning there were no purchases made based on the recommendations. This could mean the model is not appropriate for this particular exercise or might need firther fine turing and optimization.

There are different ways to evaluate performances that might potentially lead to different choices. Depending on the business target, the evaluation metric is different.

Dataset Credit

Source url: http://files.grouplens.org/datasets/hetrec2011/hetrec2011-lastfm-2k.zip, http://www.lastfm.com
Authors: Cantador, Iv,Brusilovsky, Peter and Kuflik, Tsvi
Title: 2nd Workshop on Information Heterogeneity and Fusion in Recommender Systems (HetRec 2011)

DATA 643 Project 4 | Accuracy and Beyond

Jason Joseph, Srini Illapani

July 02, 2017

1.0 Objective

2.0 Data Sourcing and Loading

3.0 Data Transformation

3.1 Creating Recommender Matrices

3.2 Data Visualization

3.2.1 Heatmaps

4.0 Split the dataset into Train and Test

4.1 Dimensions of Test and Training Matrices

Raw Listenig Matrix

Binarized Listenig Matrix

5.0 Model Building

5.1 Visualizing our Item Based Collaborative Filtering Models

6.0 Prediction

6.1 Top 5 Recommendations for User 1

7.0 Model Evaluation

7.1 Item Based

Raw Ratings

Binarized Ratings

7.2 User Based

Raw Ratings

Binarized Ratings

7.1 Increased Serendipity

Summary

Dataset Credit