Introduction

In this report, I will clarify and analyze the data by looking at several key relationships. The report will analyze the distribution of the continuous variables, examine the relationships between the key variables, isolate exceptions to the generalizations made about those relationships, and analyze trends within individual seasons.

The data comes from the office_ratings data set, which describes 6 variables for each episode. Three of them are important for this report: viewers, the number of viewers in millions on the original air date; imdb_rating, the average fan rating on IMDb.com on a scale of 1 to 10; and total_votes, the number of ratings left by viewers on IMDb.com.

In this report, I will use the tidyverse to create visualizations.

library(tidyverse)
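
As a rough illustration of the kind of distribution summary this report relies on, the sketch below computes the minimum, median, and maximum of the three variables of interest. The toy_ratings table and its values are made-up placeholders, not rows from office_ratings, and the name toy_summary is introduced here only for illustration.

```r
library(tidyverse)

# Hypothetical stand-in rows; the values are placeholders, not real data.
toy_ratings <- tibble(
  season      = c(1, 1, 2, 2),
  viewers     = c(11.2, 6.0, 9.0, 8.1),
  imdb_rating = c(7.6, 8.3, 8.7, 8.2),
  total_votes = c(3700, 3300, 3200, 2900)
)

# Reshape to one row per (episode, variable) pair, then summarise each variable.
toy_summary <- toy_ratings %>%
  pivot_longer(c(viewers, imdb_rating, total_votes),
               names_to = "variable", values_to = "value") %>%
  group_by(variable) %>%
  summarise(min = min(value), median = median(value), max = max(value))

print(toy_summary)
```

Running the same pipeline on the real data frame, assuming it uses the column names given above, would produce one summary row per variable.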

Conclusion

To summarize, several general conclusions can be drawn from this data. First, and most obviously, the last two seasons of The Office were received far more poorly than the others. Second, with a few exceptions, the more an episode is viewed, the higher it is rated and the more people leave a rating. Third, popularity and appeal stayed roughly the same after season 1 and before season 8, with only a very slight downward trend. Finally, each season had its ups and downs, but viewership appeared to fall off toward the end of the series.
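
One way conclusions like these could be checked is sketched below, assuming a data frame with the season, viewers, and imdb_rating columns described in the introduction. The toy_ratings rows are made-up placeholders rather than real episodes, and season_means and viewer_rating_cor are names introduced only for this sketch.

```r
library(tidyverse)

# Hypothetical stand-in rows; the values are placeholders, not real data.
toy_ratings <- tibble(
  season      = c(1, 1, 2, 2, 3, 3),
  viewers     = c(6.0, 5.4, 9.0, 8.1, 8.5, 8.9),
  imdb_rating = c(7.6, 7.9, 8.7, 8.2, 8.4, 8.8)
)

# Per-season averages, useful for comparing seasons against one another.
season_means <- toy_ratings %>%
  group_by(season) %>%
  summarise(mean_viewers = mean(viewers),
            mean_rating  = mean(imdb_rating))

# Pearson correlation between viewership and rating across episodes.
viewer_rating_cor <- cor(toy_ratings$viewers, toy_ratings$imdb_rating)

print(season_means)
print(viewer_rating_cor)
```

A correlation near 1 would be consistent with the claim that more-viewed episodes tend to be rated higher, though it would not by itself account for the exceptions noted in the report.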