library(RCurl)
## Loading required package: bitops
movings_ratings_url <- getURL("https://raw.githubusercontent.com/fivethirtyeight/data/master/fandango/fandango_score_comparison.csv")
movie_ratings <-data.frame(read.csv(text=movings_ratings_url, header=T))
dim(movie_ratings)
## [1] 146  22
head(movie_ratings)
##                             FILM RottenTomatoes RottenTomatoes_User
## 1 Avengers: Age of Ultron (2015)             74                  86
## 2              Cinderella (2015)             85                  80
## 3                 Ant-Man (2015)             80                  90
## 4         Do You Believe? (2015)             18                  84
## 5  Hot Tub Time Machine 2 (2015)             14                  28
## 6       The Water Diviner (2015)             63                  62
##   Metacritic Metacritic_User IMDB Fandango_Stars Fandango_Ratingvalue
## 1         66             7.1  7.8            5.0                  4.5
## 2         67             7.5  7.1            5.0                  4.5
## 3         64             8.1  7.8            5.0                  4.5
## 4         22             4.7  5.4            5.0                  4.5
## 5         29             3.4  5.1            3.5                  3.0
## 6         50             6.8  7.2            4.5                  4.0
##   RT_norm RT_user_norm Metacritic_norm Metacritic_user_nom IMDB_norm
## 1    3.70          4.3            3.30                3.55      3.90
## 2    4.25          4.0            3.35                3.75      3.55
## 3    4.00          4.5            3.20                4.05      3.90
## 4    0.90          4.2            1.10                2.35      2.70
## 5    0.70          1.4            1.45                1.70      2.55
## 6    3.15          3.1            2.50                3.40      3.60
##   RT_norm_round RT_user_norm_round Metacritic_norm_round
## 1           3.5                4.5                   3.5
## 2           4.5                4.0                   3.5
## 3           4.0                4.5                   3.0
## 4           1.0                4.0                   1.0
## 5           0.5                1.5                   1.5
## 6           3.0                3.0                   2.5
##   Metacritic_user_norm_round IMDB_norm_round Metacritic_user_vote_count
## 1                        3.5             4.0                       1330
## 2                        4.0             3.5                        249
## 3                        4.0             4.0                        627
## 4                        2.5             2.5                         31
## 5                        1.5             2.5                         88
## 6                        3.5             3.5                         34
##   IMDB_user_vote_count Fandango_votes Fandango_Difference
## 1               271107          14846                 0.5
## 2                65709          12640                 0.5
## 3               103660          12055                 0.5
## 4                 3136           1793                 0.5
## 5                19560           1021                 0.5
## 6                39373            397                 0.5

Introduction

From article: Fandango is using a 3 to 5 star scale. And that’s not the only thing wrong with its ratings. I found an issue with the methodology Fandango uses to average user ratings on its website: Fandango never rounds the average down.

Always rounding up, and often against expectations of normal rounding practices

Returning to the 209 films that had 30 or more user reviews on Fandango.com, the average movie gained 0.25 stars from this rounding.

R Markdown

This is an R Markdown document. Markdown is a simple formatting syntax for authoring HTML, PDF, and MS Word documents. For more details on using R Markdown see http://rmarkdown.rstudio.com.

When you click the Knit button a document will be generated that includes both content as well as the output of any embedded R code chunks within the document. You can embed an R code chunk like this:

summary(cars)
##      speed           dist       
##  Min.   : 4.0   Min.   :  2.00  
##  1st Qu.:12.0   1st Qu.: 26.00  
##  Median :15.0   Median : 36.00  
##  Mean   :15.4   Mean   : 42.98  
##  3rd Qu.:19.0   3rd Qu.: 56.00  
##  Max.   :25.0   Max.   :120.00

Including Plots

You can also embed plots, for example:

Note that the echo = FALSE parameter was added to the code chunk to prevent printing of the R code that generated the plot.