Introduction
The goal of this assignment is give you practice working with accuracy and other recommender system metrics.
In this assignment you’re asked to do at least one or (if you like) both of the following:
• Work in a small group, and/or • Choose a different dataset to work with from your previous projects.
Assignment Highlights
As in your previous assignments, compare the accuracy of at least two recommender system algorithms against your offline data.
Implement support for at least one business or user experience goal such as increased serendipity, novelty, or diversity.
Compare and report on any change in accuracy before and after you’ve made the change in #2.
As part of your textual conclusion, discuss one or more additional experiments that could be performed and/or metrics that could be evaluated only if online evaluation was possible. Also, briefly propose how you would design a reasonable online evaluation environment.
About the Data
For Project 4, I will build on Project 3 using the same MovieLens dataset, and evaluate the models accuracy using RMSE, MSE and MAE.
The MovieLens Latest Small Datasets contain 100,000 ratings and 3,600 tag applications applied to 9,000 movies by 600 users.
Loading the MovieLense Data
## 943 x 1664 rating matrix of class 'realRatingMatrix' with 99392 ratings.
## Min. 1st Qu. Median Mean 3rd Qu. Max.
## 19.0 32.0 64.0 105.4 147.5 735.0
## [1] "data" "normalize"
## [1] "realRatingMatrix"
## attr(,"package")
## [1] "recommenderlab"
## [1] "Toy Story (1995)"
## [2] "GoldenEye (1995)"
## [3] "Four Rooms (1995)"
## [4] "Get Shorty (1995)"
## [5] "Copycat (1995)"
## [6] "Shanghai Triad (Yao a yao yao dao waipo qiao) (1995)"
## [7] "Twelve Monkeys (1995)"
## [8] "Babe (1995)"
## [9] "Dead Man Walking (1995)"
## [10] "Richard III (1995)"
Data structure
## Formal class 'realRatingMatrix' [package "recommenderlab"] with 2 slots
## ..@ data :Formal class 'dgCMatrix' [package "Matrix"] with 6 slots
## .. .. ..@ i : int [1:99392] 0 1 4 5 9 12 14 15 16 17 ...
## .. .. ..@ p : int [1:1665] 0 452 583 673 882 968 994 1386 1605 1904 ...
## .. .. ..@ Dim : int [1:2] 943 1664
## .. .. ..@ Dimnames:List of 2
## .. .. .. ..$ : chr [1:943] "1" "2" "3" "4" ...
## .. .. .. ..$ : chr [1:1664] "Toy Story (1995)" "GoldenEye (1995)" "Four Rooms (1995)" "Get Shorty (1995)" ...
## .. .. ..@ x : num [1:99392] 5 4 4 4 4 3 1 5 4 5 ...
## .. .. ..@ factors : list()
## ..@ normalize: NULL
Distribution of ratings

Exploring the ratings values
## [1] 5 4 0 3 1 2
## ratingvalues
## 0 1 2 3 4 5
## 1469760 6059 11307 27002 33947 21077
Excluding the missing values
Distribution of the Ratings

Pre-processing
Convert data to numeric
Create the sparse Matrix
ratingsMat <- sparseMatrix(i = movies$user, j = movies$item, x = movies$rating,
dims = c(length(unique(movies$user)), length(unique(movies$item))),
dimnames = list(paste("u", 1:length(unique(movies$user)), sep = ""),
paste("m", 1:length(unique(movies$item)), sep = "")))
ratingsReal <- new("realRatingMatrix", data = ratingsMat)
ratingsReal
## 943 x 1664 rating matrix of class 'realRatingMatrix' with 99392 ratings.
I’m selecting the Users who have rated at least 100 movies and those movies that have been watched at least 150 times
Modified Distributions


I’m selecting the Users who have rated at least 100 movies and those movies that have been watched at least 150 times
Spliting the Data Set
Use 90% for training the model
## Evaluation scheme with 10 items given
## Method: 'split' with 1 run(s).
## Training set proportion: 0.900
## Good ratings: >=4.000000
## Data set: 358 x 200 rating matrix of class 'realRatingMatrix' with 32713 ratings.
ModelAlgorithms <- list(
"Random" = list(name="RANDOM", param=list(normalize = "Z-score")),
"Popular" = list(name="POPULAR", param=list(normalize = "Z-score")),
"UserBased" = list(name="UBCF", param=list(normalize = "Z-score",
method="Cosine",
nn=50, minRating=3)),
"ItemBased" = list(name="IBCF2", param=list(normalize = "Z-score"
))
)
Algorithms to Predict Ratings
## RANDOM run fold/sample [model time/prediction time]
## 1 [0sec/0.08sec]
## POPULAR run fold/sample [model time/prediction time]
## 1 [0.06sec/0.02sec]
## UBCF run fold/sample [model time/prediction time]
## 1
## Warning: Unknown parameters: minRating
## Available parameter (with default values):
## method = cosine
## nn = 25
## sample = FALSE
## normalize = center
## verbose = FALSE
## [0.01sec/0.07sec]
## IBCF2 run fold/sample [model time/prediction time]
## 1
## Timing stopped at: 0 0 0
## Error in .local(data, ...) :
## Recommender method IBCF2 not implemented for data type realRatingMatrix .
## Warning in .local(x, method, ...):
## Recommender 'ItemBased' has failed and has been removed from the results!
Comparing the Recommender Systems

Popular appears to be the best performing Model with the lowest RMSE, MSE and MAE.
Let’s predict next movies
## RANDOM run fold/sample [model time/prediction time]
## 1 [0.02sec/0.01sec]
## POPULAR run fold/sample [model time/prediction time]
## 1 [0.02sec/0.09sec]
## UBCF run fold/sample [model time/prediction time]
## 1
## Warning: Unknown parameters: minRating
## Available parameter (with default values):
## method = cosine
## nn = 25
## sample = FALSE
## normalize = center
## verbose = FALSE
## [0.01sec/0.16sec]
## IBCF2 run fold/sample [model time/prediction time]
## 1
## Timing stopped at: 0 0 0
## Error in .local(data, ...) :
## Recommender method IBCF2 not implemented for data type realRatingMatrix .
## Warning in .local(x, method, ...):
## Recommender 'ItemBased' has failed and has been removed from the results!
Implement Support for Business/User Experiance Goal
The Popular and UserBased models outperforms the Random, I will create a hybrid model consisting of Popular and UserBased models.
It may not always be desirable to recommend products that are likely to be most highly rated by a user. Recommending somewhat unexpected products may improve user experience, expand user preferences, provide additional knowledge about a user.
Training, Testing and Evaluation Data sets
Prepare training dataset
Prepare testing set
Prepare evaluation set
Recommender Systems
The Popular Based Model
Building the User Based Model using 25 nearest neighbor. Building the user collaborative filtering system to recommend movies to users based on how similar they are with other users.
Making Predictions with newdata = test
Model Evaluation - Popular Model
## RMSE MSE MAE
## 0.9076623 0.8238508 0.7153872
The User Based Model
Building the User Based Model using 25 nearest neighbor. Building the user collaborative filtering system to recommend movies to users based on how similar they are with other users.
Making Predictions with newdata = test
Model Evaluation - User Based
## RMSE MSE MAE
## 0.9604391 0.9224433 0.7627016
The Hybrid Model
Making Predictions with newdata = test
Model Evaluation - Hybrid Model
## RMSE MSE MAE
## 0.9077973 0.8240959 0.7154864
Comprison of the Model Accuracy
|
|
RMSE
|
MSE
|
MAE
|
|
Popular
|
0.908
|
0.824
|
0.715
|
|
UserBased
|
0.960
|
0.922
|
0.763
|
|
HYBRID
|
0.908
|
0.824
|
0.715
|
The prediction accuracy of the Hybrid Model is similar to the accuracy of the Popular Model. Based on a blend of the two models, the RMSE, MSE and MAE of the Hybrid model decreased slightly in accuracy.
Conclusion
The purpose of this exercise was to build a recommender system and to practice working with accuracy and other recommender system metrics.
For the given data set, POPULAR and the Userbased Models within recommenderlab provided the best results. It is worth remembering that the choice of the recommender algorithm depends on the context. A Hybrid of both models provided performance accuarcy similar to the Popular Model.
In addition to the methods provided within the recommenderlab library, there are other supervised and unsupervised machine learning algorithm as well as neutral network-driven deep learning algorithms that could be used to build recommendation systems.
References
Breese JS, Heckerman D, Kadie C (1998). “Empirical Analysis of Predictive Algorithms for Collaborative Filtering.” In Uncertainty in Artificial Intelligence. Proceedings of the Fourteenth Conference, pp. 43-52.
Kohavi, Ron (1995). “A study of cross-validation and bootstrap for accuracy estimation and model selection”. Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence, pp. 1137-1143.
Koren, Y., Bell, R., & Volinsky, C. (2009). Matrix Factorization Techniques for Recommender Systems. Computer, 42(8), 30–37. https://doi.org/10.1109/MC.2009.263
Michael Hahsler (2016). recommenderlab: A Framework for Developing and Testing Recommendation Algorithms, R package. https://CRAN.R-project.org/package=recommenderlab
---
title: "Data 612 - Project 4"
author: Emmanuel Hayble-Gomes
date: "06/30/2020"
output:
  html_document:
    code_download: yes
    code_folding: hide
    highlight: pygments
    number_sections: yes
    theme: flatly
    toc: yes
    toc_float: yes
  pdf_document:
    toc: yes
---

# Introduction

The goal of this assignment is give you practice working with accuracy and other recommender system metrics.

In this assignment you’re asked to do at least one or (if you like) both of the following:

   • Work in a small group, and/or
   • Choose a different dataset to work with from your previous projects.

# Assignment Highlights

1. As in your previous assignments, compare the accuracy of at least two recommender system algorithms against    your offline data.

2. Implement support for at least one business or user experience goal such as increased serendipity, novelty,    or diversity.

3. Compare and report on any change in accuracy before and after you’ve made the change in #2.

4. As part of your textual conclusion, discuss one or more additional experiments that could be performed        and/or metrics that could be evaluated only if online evaluation was possible. Also, briefly propose how      you would design a reasonable online evaluation environment.

# About the Data

For Project 4, I will build on Project 3 using the same MovieLens dataset, and evaluate the models accuracy using RMSE, MSE and MAE.

The MovieLens Latest Small Datasets contain 100,000 ratings and 3,600 tag applications applied to 9,000 movies by 600 users.

```{r, message=FALSE, warning=FALSE}
library(dplyr)
library(ggplot2)
library(kableExtra)
library(recommenderlab)
```

## Loading the MovieLense Data

```{r}
data (MovieLense)
MovieLense
```


```{r}
summary(rowCounts(MovieLense))
```

```{r}
slotNames(MovieLense)
class(MovieLense)
head(colnames(MovieLense),10)
```

## Data structure

```{r}
str(MovieLense)
```

## Distribution of ratings

```{r}
hist(rowCounts(MovieLense), 
     breaks = 20, 
     main = "Raw Distribution of User's Rating"
     )
```

## Exploring the ratings values

```{r}
ratingvalues <- as.vector(MovieLense@data)
unique(ratingvalues) # The rating is numeric with the least value as 0.5 and the highest values as 5.

tablerating <- table(ratingvalues)
tablerating
```

## Excluding the missing values

```{r}
ratingvalues <- ratingvalues[ratingvalues !=0]
```


## Distribution of the Ratings

```{r}
hist(ratingvalues, 
     breaks = 5, 
     main="Distribution of Ratings",
     xlab="Ratings",
     col="grey",
     freq=TRUE
     )
```

# Pre-processing

**Convert data to numeric**

```{r}
movies <- as(MovieLense, 'data.frame')
movies$user <- as.numeric(movies$user)
movies$item <- as.numeric(movies$item)
```

## Create the sparse Matrix

```{r}
ratingsMat <- sparseMatrix(i = movies$user, j = movies$item, x = movies$rating, 
                               dims = c(length(unique(movies$user)), length(unique(movies$item))),  
                               dimnames = list(paste("u", 1:length(unique(movies$user)), sep = ""), 
                                               paste("m", 1:length(unique(movies$item)), sep = "")))

ratingsReal <- new("realRatingMatrix", data = ratingsMat)
ratingsReal
```

**I'm selecting the Users who have rated at least 100 movies and those movies that have been watched at least 150 times**

```{r}
ratings <- MovieLense[rowCounts(MovieLense) > 100, colCounts(MovieLense) > 150]
```

## Modified Distributions

```{r}
hist(rowCounts(ratings), 
     breaks = 20, 
     main = "Modified Distribution of User Rating"
     )

hist(colCounts(ratings), 
     breaks = 20, 
     main = "Modified Distribution of Movie Rating"
     )
```

**I'm selecting the Users who have rated at least 100 movies and those movies that have been watched at least 150 times**

```{r}
ratings <- MovieLense[rowCounts(MovieLense) > 100, colCounts(MovieLense) > 150]
```


# Spliting the Data Set

Use 90% for training the model

```{r}
Scheme <- evaluationScheme(ratings, method = "split", k = 1,
                        train = 0.9, given= 10, goodRating=4)
```


```{r}
Scheme
```


```{r}
ModelAlgorithms <- list(
  "Random" = list(name="RANDOM", param=list(normalize = "Z-score")),
  "Popular" = list(name="POPULAR", param=list(normalize = "Z-score")),
  "UserBased" = list(name="UBCF", param=list(normalize = "Z-score",
                                                 method="Cosine",
                                                 nn=50, minRating=3)),
  "ItemBased" = list(name="IBCF2", param=list(normalize = "Z-score"
                                                 ))
  
  )
```

# Algorithms to Predict Ratings

```{r}
ModelResults <- evaluate(Scheme, ModelAlgorithms, type = "ratings")
```

## **Comparing the Recommender Systems**

```{r}
plot(ModelResults)
title(main = "Model Error Comparison")
```

Popular appears to be the best performing Model with the lowest RMSE, MSE and MAE.

## Let's predict next movies

```{r}
ModelResults <- evaluate(Scheme, ModelAlgorithms, n=c(1, 5, 10, 15, 20, 25))
```

# Model Metrics/Performance

## ROC curve

```{r}
plot(ModelResults, annotate = TRUE, legend = "topleft")
title("ROC Curve")
```

## Precision and Recall

```{r}
plot(ModelResults, "prec/rec", annotate = TRUE) # precision and recall
title("Precision and Recall")
```


The Popular Model performs better than Random and the User Based model.


# Implement Support for Business/User Experiance Goal

The Popular and UserBased models outperforms the Random, I will create a hybrid model consisting of Popular and UserBased models.

It may not always be desirable to recommend products that are likely to be most highly rated by a user. Recommending somewhat unexpected products may improve user experience, expand user preferences, provide additional knowledge about a user.

## Training, Testing and Evaluation Data sets

**Prepare training dataset**

```{r}
train <- getData(Scheme, "train")
```

**Prepare testing set**

```{r}
test <- getData(Scheme, "known")
```

**Prepare evaluation set**

```{r}
evaluation <- getData(Scheme, "unknown")
```

# Recommender Systems

## The Popular Based Model

Building the User Based Model using 25 nearest neighbor. Building the user collaborative filtering system to recommend movies to users based on how similar they are with other users.

```{r}
Popularmodel <- Recommender(train, method = "POPULAR")
```

### Making Predictions with newdata = test

```{r}
Popularpred <- predict(Popularmodel, newdata = test, type = "ratings")
```

### Model Evaluation - Popular Model

```{r}
Popularaccuracy <- calcPredictionAccuracy(Popularpred, evaluation)
Popularaccuracy
```


## The User Based Model

Building the User Based Model using 25 nearest neighbor. Building the user collaborative filtering system to recommend movies to users based on how similar they are with other users.

```{r}
Usermodel <- Recommender(train, method = "UBCF", 
                     param=list(normalize = "center", method="Cosine", nn=25))
```

### Making Predictions with newdata = test

```{r}
Userpred <- predict(Usermodel, newdata = test, type = "ratings")
```

### Model Evaluation - User Based

```{r}
Useraccuracy <- calcPredictionAccuracy(Userpred, evaluation)
Useraccuracy
```

## The Hybrid Model

```{r}
HybridModel <- HybridRecommender(Popularmodel, Usermodel, weights = c(0.99, 
    0.01))
```

### Making Predictions with newdata = test

```{r}
predHybrid <- predict(HybridModel, newdata = test, type = "ratings")
```

### Model Evaluation - Hybrid Model

```{r}
accuracyHybrid <- calcPredictionAccuracy(predHybrid, evaluation)
accuracyHybrid
```

# Comprison of the Model Accuracy

```{r}
ModelError <- rbind(
    Popular = calcPredictionAccuracy(Popularpred, evaluation),
    UserBased = calcPredictionAccuracy(Userpred, evaluation),
    HYBRID = calcPredictionAccuracy(predHybrid, evaluation)
    )
ModelError %>% round(3) %>% 
kable()%>%
kable_styling(bootstrap_options = c("striped", "hover"))
```

The prediction accuracy of the Hybrid Model is similar to the accuracy of the Popular Model. Based on a blend of the two models, the RMSE, MSE and MAE of the Hybrid model decreased slightly in accuracy.

# Conclusion

The purpose of this exercise was to build a recommender system and to practice working with accuracy and other recommender system metrics.

For the given data set, POPULAR and the Userbased Models within recommenderlab provided the best results. It is worth remembering that the choice of the recommender algorithm depends on the context. A Hybrid of both models provided performance accuarcy similar to the Popular Model.

In addition to the methods provided within the recommenderlab library, there are other supervised and unsupervised machine learning algorithm as well as neutral network-driven deep learning algorithms that could be used to build recommendation systems.


### **References**

Breese JS, Heckerman D, Kadie C (1998). "Empirical Analysis of Predictive Algorithms for Collaborative Filtering." In Uncertainty in Artificial Intelligence. Proceedings of the Fourteenth Conference, pp. 43-52.

Kohavi, Ron (1995). "A study of cross-validation and bootstrap for accuracy estimation and model selection". Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence, pp. 1137-1143.

Koren, Y., Bell, R., & Volinsky, C. (2009). Matrix Factorization Techniques for Recommender Systems. Computer, 42(8), 30–37. https://doi.org/10.1109/MC.2009.263

Michael Hahsler (2016). recommenderlab: A Framework for Developing and Testing Recommendation Algorithms, R package. https://CRAN.R-project.org/package=recommenderlab
