Movies Dataset Analysis

A Visual Exploration

Overview

This analysis uses the movies dataset available in the ggplot2 library. It is a smaller subset of the largest freely available database of international movies published by the Internet Movie Database (IMDB).

The dataset used here captures the following variables from the IMDB database:

##  [1] "title"       "year"        "length"      "budget"      "rating"     
##  [6] "votes"       "r1"          "r2"          "r3"          "r4"         
## [11] "r5"          "r6"          "r7"          "r8"          "r9"         
## [16] "r10"         "mpaa"        "Action"      "Animation"   "Comedy"     
## [21] "Drama"       "Documentary" "Romance"     "Short"

Functional Features

This application provides the following visuals for exploratory analysis of the movies dataset:

  • Shows movies released by year -- Allows user to select the starting and ending years

  • Shows a density curve of the viewer ratings -- This page also gives the mean rating for each MPAA category

  • Plots a histogram of movie length by genre -- Allows the user to select any genre of movies to see the frequency distribution of their lengths

Movie Analysis Application

This application was developed using the Shiny platform by RStudio and hosted on the publicly available Shiny.io website at the url https://rwebapps.shinyapps.io/movie-app

References

References for some of the resources used for building this application: