This analysis uses the movies dataset available in the ggplot2 library. It is a smaller subset of the largest freely available database of international movies published by the Internet Movie Database (IMDB).
The dataset used here captures the following variables from the IMDB database:
## [1] "title" "year" "length" "budget" "rating"
## [6] "votes" "r1" "r2" "r3" "r4"
## [11] "r5" "r6" "r7" "r8" "r9"
## [16] "r10" "mpaa" "Action" "Animation" "Comedy"
## [21] "Drama" "Documentary" "Romance" "Short"
This application provides the following visuals for exploratory analysis of the movies dataset:
Shows movies released by year -- Allows user to select the starting and ending years
Shows a density curve of the viewer ratings -- This page also gives the mean rating for each MPAA category
Plots a histogram of movie length by genre -- Allows the user to select any genre of movies to see the frequency distribution of their lengths
This application was developed using the Shiny platform by RStudio and hosted on the publicly available Shiny.io website at the url https://rwebapps.shinyapps.io/movie-app
References for some of the resources used for building this application: