Import the Movie attendance in Iceland dataset
I first imported the ice_movies dataset using read_delim.
I then used the glimpse() command to show the variables in the Ice Movies data set.
## Rows: 3,799
## Columns: 12
## $ blog.date <date> 2021-01-25, 2021-01-25, 2021-01-25, 2021-01-25, 2…
## $ rank.this.week <dbl> 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15,…
## $ rank.last.week <dbl> NA, 4, 1, 5, 2, 8, 3, 11, 12, 10, 9, NA, 13, 14, 7…
## $ weeks.in.release <dbl> NA, 16, 3, 3, 6, 8, 3, 2, 17, 8, 4, NA, 13, 2, 2, …
## $ film.title <chr> "Monster Hunter", "Dragon Rider", "Honest Thief", …
## $ distributor.name <chr> "Sena", "Samfilm", "Myndform", "Samfilm", "Samfilm…
## $ gross.box.o.weekend <dbl> 1572126, 1417327, 934945, 688795, 662733, 520025, …
## $ adm.weekend <dbl> 1045, 1204, 650, 594, 467, 320, 266, 274, 220, 197…
## $ weekend.start <date> 2021-01-22, 2021-01-22, 2021-01-22, 2021-01-22, 2…
## $ weekend.end <date> 2021-01-24, 2021-01-24, 2021-01-24, 2021-01-24, 2…
## $ adm.to.date <dbl> 1045, 4744, 3580, 2071, 15197, 3974, 2470, 543, 58…
## $ total.box.o.to.date <dbl> 1572126, 5067923, 4924054, 2293264, 21034734, 5837…
Data Reorganization
I then broke the blog.date variable into month, day and year using the separate() command and calculated the average total.box.o.to.date for each month and year using the group_by() and summarise() commands. I then joined the new dataset with the original dataset using the left_join() command.
Box Office Earnins
I then created a chart of the mean total box office earnings to date by month for the years 2018, 2019 and 2020. (I decided to remove 2017 and 2021 due to missing data which would have created an incomplete chart)
Conclusion
The chart indicates there may be some seasonality in box office earnings, with lower box office earnings in the spring and higher box office earnings in summer months. However, more data from additional years would be necessary to confirm this. Additionally, there was a marked drop in box office earnings beginning in March of 2020, likely due to the COVID-19 pandemic.