IMDB, the “Internet Movie Database”, is a great resource for looking up the details of movie releases. This ranking is based on titles released in 2020 that are of type “feature”. The world of movies is continually changing, and especially so in 2020, when COVID led many movies to be released directly to streaming instead of through the typical box office release of a normal year.
I would like to better understand the impact of genre and runtime on a movie's overall rating, and whether certain genres have longer films on average. Gross revenue is an interesting factor, but with the limited number of theatrical releases and many features going direct to streaming, box office gross is likely less relevant today than it has been in the past.
The data collected from IMDB covers the top 100 ranked films and includes the title, genre (using the first listed genre as primary), run time (minutes), gross box office revenue, and overall rating.
Looking at movie run times in minutes by genre, it may come as no surprise that biographies have the longest average run time. Perhaps it is a mystery, however, that the mystery genre has the shortest.
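The per-genre averages described above can be sketched in a few lines of Python. This is only a minimal illustration, not the tool actually used for the analysis: the rows are made-up placeholders rather than the real IMDB data, and the field names (`genre`, `runtime_min`, `rating`) and the helpers `primary_genre` and `mean_by_genre` are hypothetical.

```python
from collections import defaultdict
from statistics import mean

# Made-up placeholder rows for illustration only; NOT the actual IMDB data.
movies = [
    {"title": "Film A", "genre": "Biography, Drama", "runtime_min": 130, "rating": 7.5},
    {"title": "Film B", "genre": "Biography", "runtime_min": 120, "rating": 7.1},
    {"title": "Film C", "genre": "Mystery, Thriller", "runtime_min": 95, "rating": 6.9},
    {"title": "Film D", "genre": "Animation, Adventure", "runtime_min": 100, "rating": 8.0},
    {"title": "Film E", "genre": "Action", "runtime_min": 110, "rating": 7.0},
]


def primary_genre(genre_field):
    """Return the first listed genre, which is treated as the primary one."""
    return genre_field.split(",")[0].strip()


def mean_by_genre(rows, column):
    """Average a numeric column per primary genre, skipping missing values."""
    groups = defaultdict(list)
    for row in rows:
        value = row.get(column)
        if value is None:
            continue
        groups[primary_genre(row["genre"])].append(value)
    return {genre: mean(values) for genre, values in groups.items()}


runtime_by_genre = mean_by_genre(movies, "runtime_min")

# Print genres from longest to shortest average run time.
for genre, avg in sorted(runtime_by_genre.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{genre}: {avg:.1f} min")
```

Passing `"rating"` instead of `"runtime_min"` gives the average rating per genre with the same helper.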
Which movies have a higher rating? Animation is the clear favorite when it comes to the genre rated highest in the top 100 for 2020. Maybe this is influenced by people needing an escape from reality in the time of COVID?
As mentioned before, the impact of streaming makes it more of a challenge to understand a film's overall revenue. Based on the films with reported revenue, action films are the clear winner at the box office.
Reviewing rating against gross box office revenue does not appear to show a strong relationship between the two. Revenue also appears to fall into distinct clusters.
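One way to put a number on "no strong relationship" is a Pearson correlation coefficient computed only over films that report a gross. The sketch below assumes that approach; the values are made up for illustration (not taken from the IMDB data), and `pearson` is a hypothetical helper written out by hand so that nothing beyond the standard library is needed.

```python
from math import sqrt

# Made-up (rating, gross in $ millions) pairs for illustration only.
# None marks a film with no reported box office gross (e.g. direct to streaming).
pairs = [
    (8.1, 55.0),
    (7.4, None),
    (7.9, 12.5),
    (6.8, 160.0),
    (7.2, None),
    (7.0, 48.0),
]


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length with at least 2 points")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined when one variable is constant")
    return cov / sqrt(var_x * var_y)


# Keep only the films that actually report a gross.
reported = [(rating, gross) for rating, gross in pairs if gross is not None]
ratings = [rating for rating, _ in reported]
grosses = [gross for _, gross in reported]

r_value = pearson(ratings, grosses)
print(f"films with reported gross: {len(reported)}, Pearson r = {r_value:.2f}")
```

A value near 0 would be consistent with a weak linear relationship, while values near 1 or -1 would indicate a strong one. With so many films missing a gross, the coefficient only describes the subset that had a box office release.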
It is interesting to review how movies generate revenue. You might expect the higher rated films to produce the highest gross revenue, but the results above do not clearly show that. My next step would be to run a regression to better understand the influence of rating on gross revenue, or to identify whether other factors such as genre or runtime have a greater impact.
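As a rough idea of what that regression step could look like, here is a minimal sketch of a one-variable least-squares fit of gross revenue on rating. The numbers are made-up placeholders rather than the IMDB data, and `fit_line` and `r_squared` are hypothetical helpers; a fuller analysis that includes genre and runtime would need a multiple regression, most likely from a statistics library.

```python
# Made-up ratings and grosses ($ millions) for illustration only.
ratings = [6.8, 7.0, 7.4, 7.9, 8.1]
grosses = [160.0, 48.0, 20.0, 12.5, 55.0]


def fit_line(xs, ys):
    """Ordinary least squares for y = intercept + slope * x."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length with at least 2 points")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("slope is undefined when x is constant")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def r_squared(xs, ys, slope, intercept):
    """Share of the variance in y explained by the fitted line."""
    mean_y = sum(ys) / len(ys)
    ss_tot = sum((y - mean_y) ** 2 for y in ys)
    if ss_tot == 0:
        raise ValueError("R^2 is undefined when y is constant")
    ss_res = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))
    return 1 - ss_res / ss_tot


slope, intercept = fit_line(ratings, grosses)
fit_quality = r_squared(ratings, grosses, slope, intercept)
print(f"gross = {intercept:.1f} + {slope:.1f} * rating (R^2 = {fit_quality:.2f})")
```

The slope estimates how much gross changes per rating point, and a low R² would support the impression that rating alone explains little of the revenue.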