“THE BEST DECADE FOR MOVIES”

“A Comparison between the Movie Ratings of the 1990s and 2000s”

Rioflorido, Cha (S3789114)

Last updated: 22 October, 2019

RPUBS

Link information
The Best Decade for Movies: http://rpubs.com/cha_rioflorido/Movie_Ratings

INTRODUCTION

THE EVOLUTION OF THE WORLD OF ENTERTAINMENT

Before the film industry was revolutionized with the proliferation of video-on-demand and online streaming services, 3d/4d films, and popularity of blockbuster comic book movies, the cinematic universe was influenced by the significant developments which happened in the earlier decades.

INTRODUCTION

DEVELOPMENTS THAT DEFINED 1990s AND 2000s

Key Shifts Decade: 1990s Decade: 2000s
Known For Decade of remakes, re-releases, and more sequels Era of franchise films
Innovation Rise of computer-generated imagery Age of advanced special effects (CGI, performance capture)
Video Access Popularity of video rentals Popularity of the internet and digital videos
In-Home Formats Launch of DVDs Launch of high-definition DVDs and Blu Rays
Popular Genres and Blockbusters
  • Action-Thrillers [Mission Impossible, Top Gun, Face/Off]
  • Romantic-Comedy [Pretty Woman, Sleepless in Seattle]
  • Animated-Cartoon [The Lion King, Aladdin, Toy Story]
  • Drama [Titanic, Forrest Gump, Armageddon]
  • Adventure-Fantasy [Jurassic Park, Star Wars]
  • Adventure-Fantasy [Avatar, The Lord of the Rings, Harry Potter, X-Men]
  • Musicals [Chicago, Moulin Rouge]
  • Animated Feature [Finding Nemo, Shrek]
  • Horror-Suspense [Saw, Scary Movie, Final Destination]
  • Foreign Films [Slumdog Millionnaire, Crouching Tiger Hidden Dragon]
  • PROBLEM STATEMENT

    Both decades were known for its respective iconic, award-winning, and top-grossing pictures.

    METHODOLOGY

    DATA SOURCE

    DATA MANIPULATION

    Key Variables Data Types Observations
    Title Number Character Title’s unique identifier
    Average Rating Numeric (Dbl) Scale from 1.0 to 10.0
    Number of Votes Numeric (Dbl) Ranges from 5 to 2,138,866
    Primary Title Character Actual titles used by the filmmakers
    Start Year Numeric (Dbl) Ranges from 1990 to 2009
    Decade Factor 2 Levels: 1990s and 2000s

    METHODOLOGY

    SAMPLING

    STATISTICAL TESTS

    An independent two-sample t-test (at 95% CI) was performed upon meeting the following conditions:

    1. Scanning and handling of outliers using Tukey’s method
    2. Testing the assumption of normality using skewness and kurtosis tests
    3. Testing the homogeneity of variance using Levene’s test

    RESULTS

    CONDITION 1: TUKEY’S METHOD OF OUTLIER DETECTION

    RESULTS

    SUMMARY STATISTICS

    Measures of central tendency, percentiles, and variability of both decades were almost identical except for the differences in the mean, standard deviation, and sample size as highlighted below.

    Decade Min Q1 Median Q3 Max Mean SD n Missing
    1990s 2.9 5.3 6.2 6.9 9.2 6.030050 1.160287 14882 0
    2000s 2.9 5.3 6.2 6.9 9.2 6.061325 1.205790 28114 0

    RESULTS

    CONDITION 2: TESTING THE ASSUMPTION OF NORMALITY

    RESULTS

    CONDITION 2: TESTING THE ASSUMPTION OF NORMALITY

    Decade: 1990s

    ## [1] -0.3432102
    ## [1] -0.3270806

    Decade: 2000s

    ## [1] -0.4261925
    ## [1] -0.2611371

    RESULTS

    CONDITION 3: HOMOGENEITY OF VARIANCE (LEVENE’S TEST)

    leveneTest(IMDB5$averageRating ~ IMDB5$Decade, data = IMDB5)
    ## Levene's Test for Homogeneity of Variance (center = median)
    ##          Df F value    Pr(>F)    
    ## group     1  13.584 0.0002284 ***
    ##       42994                      
    ## ---
    ## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

    HYPOTHESIS TESTING

    AN INDEPENDENT TWO-SAMPLE t-TEST ASSUMING UNEQUAL VARIANCE

    \[H_0: \mu_1 - \mu_2 = 0\]

    \[H_A: \mu_1 - \mu_2 \ne 0\]

    where \(\mu_1\) and \(\mu_2\) represent the population means of the two groups: 1990s and 2000s respectively.

    DECISION RULES

    HYPOTHESIS TESTING

    AN INDEPENDENT TWO-SAMPLE t-TEST ASSUMING UNEQUAL VARIANCE

    Two-tailed test’s significance level was \(\alpha = 0.05\). Welch t-test was applied following the unequal variance assumption.

    t.test(Decade_1990$averageRating, Decade_2000$averageRating, var.equal = FALSE, alternative = "two.sided")
    ## 
    ##  Welch Two Sample t-test
    ## 
    ## data:  Decade_1990$averageRating and Decade_2000$averageRating
    ## t = -2.6229, df = 31337, p-value = 0.008722
    ## alternative hypothesis: true difference in means is not equal to 0
    ## 95 percent confidence interval:
    ##  -0.05464682 -0.00790437
    ## sample estimates:
    ## mean of x mean of y 
    ##  6.030050  6.061325

    CRITICAL VALUE THRESHOLD: +/- 1.96

    qt(p = 0.05/2, df = 14882   + 28114 - 2, lower.tail = FALSE)
    ## [1] 1.960019

    DISCUSSION

    INTERPRETATION OF THE RESULTS

    A two-sample t-test was performed to check for a significant difference between the mean average user rating of the movies in the 1990s and 2000s.

    DISCUSSION

    INTERPRETATION OF THE RESULTS

    The central tendencies of both decades lie within the rating of 6.0 to 6.2, with majority of the data ranging between 4.5 to 7.5. However, note that the distinguishable variations came from the distribution of higher ratings in 2000s.

    CONCLUSION

    SUMMARY

    The results of the investigation suggest that there is a statistically significant difference between the average user rating of the movies in the 1990s and 2000s, t(df = 31,337) = −2.62, p = 0.01, 95% [-0.05 -0.01].

    Movies in the 2000s have significantly higher average user ratings than the movies in the 1990s.

    RECOMMENDATIONS

    In spite of the significant developments observed in the 1990s and 2000s, statistical tests applied, and substantial sample size, there are limitations associated with claiming 2000s as “The Best Decade for Movies”.

    A more stringent criteria, sample, and/or advanced statistical test would be the best approach to reduce further variabilities and maximize the outcome of this research.

    Factors Limitations Considerations
    Variables
    • Potential factors such as critic’s ratings, awards, and box-office sales were not considered.
    • All movies were taken into consideration, including indie and local films.
    • Only two decades were represented.
    • Despite setting the number of votes above 30, weighted average could be a better option to consider the ratings according to the number of votes.
  • Categorization according to film type (e.g. mainstream or hollywood, international films) or movie rating (rated-r, pg-13) could also indicate differences in ratings.
  • Expand period coverage to include past and present.
  • Sample Limited to registered users with internet access. Option to include critics, frequent movie watchers aged 18+ years old, and/or tailor-fit the composition of the audience according to the movie rating.
    Statistical Test Only an independent two-sample t-test was applied Deep dive analytics using regression analysis can be performed to identify which variables can strongly predict the highest-rated movies and in which decade.

    REFERENCES

    Reference List
    Anonymous. [Website Image]. Retrieved from http://editorial.designtaxi.com/editorial-images/news-flatdesigns160316/1.jpg
    Anonymous. [Website Image]. Retrieved from https://wallpapercave.com/wp/wp2714502.jpg
    Anonymous. Finding Nemo Poster. [Website Image]. Retrieved from https://images-na.ssl-images-amazon.com/images/I/519tGMFll3L._SY445_.jpg
    Anonymous. Slumdog Millionaire Poster. [Website Image]. Retrieved from https://www.movieposter.com/posters/archive/main/110/MPW-55462
    Big Screen Movie Posters. (2015). The Dark Knight Poster. [Website Image]. Retrieved from https://images-na.ssl-images-amazon.com/images/I/81AJdOIEIhL._SY679_.jpg
    Dirks, T. (2019). The History of Film The 1990s. Retrieved from https://www.filmsite.org/90sintro.html
    Dirks, T. (2019). The History of Film The 2000s. Retrieved from https://www.filmsite.org/2000sintro.html
    IMDb Datasets. (2019). Retrieved from https://www.imdb.com/interfaces/
    Movie Poster Arena. Ironman Poster [Website Image]. Retrieved from https://images-na.ssl-images-amazon.com/images/I/61h3QYQmxeL._SY679_.jpg
    Movie Poster Arena. The Lord of the Rings The Fellowship of the Rings Poster. [Website Image]. Retrieved from https://images-na.ssl-images-amazon.com/images/I/51uKITEiT1L.jpg
    Pop Culture Graphics. (2009). Avatar Poster. [Website Image]. Retrieved from https://images-na.ssl-images-amazon.com/images/I/41kTVLeW1CL.jpg
    Pop Culture Graphics. Inglourious Basterds Poster. [Website Image]. Retrieved from https://m.media-amazon.com/images/M/MV5BOTJiNDEzOWYtMTVjOC00ZjlmLWE0NGMtZmE1OWVmZDQ2OWJhXkEyXkFqcGdeQXVyNTIzOTk5ODM@._V1_.jpg
    Poster Stop Online. Gladiator Poster. [Website Image]. Retrieved from https://m.media-amazon.com/images/M/MV5BMDliMmNhNDEtODUyOS00MjNlLTgxODEtN2U3NzIxMGVkZTA1L2ltYWdlXkEyXkFqcGdeQXVyNjU0OTQ0OTY@._V1_.jpg
    R Logo. (2016). [Website Image]. Retrieved from https://www.r-project.org/logo/
    Wildish, S. 00s Film Alphabet. [Website Image]. Retrieved from http://stephenwildish.co.uk/images/00s.jpg
    Wildish, S. 90s Film Alphabet. [Website Image]. Retrieved from http://stephenwildish.co.uk/images/90sfilmalphabet-600.jpg