library(readxl)     # provides read_excel()
library(tidyverse)  # provides ggplot2, forcats, and %>%
# excel file
monster_comparison <- read_excel("../00_data/MyData.xlsx")
monster_comparison
## # A tibble: 603 × 75
## index series_name network season title imdb engagement date_aired
## <dbl> <chr> <chr> <chr> <chr> <dbl> <dbl> <dttm>
## 1 1 Scooby Doo, … CBS 1 What… 8.1 556 1969-09-13 00:00:00
## 2 2 Scooby Doo, … CBS 1 A Cl… 8.1 479 1969-09-20 00:00:00
## 3 3 Scooby Doo, … CBS 1 Hass… 8 455 1969-09-27 00:00:00
## 4 4 Scooby Doo, … CBS 1 Mine… 7.8 426 1969-10-04 00:00:00
## 5 5 Scooby Doo, … CBS 1 Deco… 7.5 391 1969-10-11 00:00:00
## 6 6 Scooby Doo, … CBS 1 What… 8.4 384 1969-10-18 00:00:00
## 7 7 Scooby Doo, … CBS 1 Neve… 7.6 358 1969-10-25 00:00:00
## 8 8 Scooby Doo, … CBS 1 Foul… 8.2 358 1969-11-01 00:00:00
## 9 9 Scooby Doo, … CBS 1 The … 8.1 371 1969-11-08 00:00:00
## 10 10 Scooby Doo, … CBS 1 Bedl… 8 346 1969-11-15 00:00:00
## # ℹ 593 more rows
## # ℹ 67 more variables: run_time <dbl>, format <chr>, monster_name <chr>,
## # monster_gender <chr>, monster_type <chr>, monster_subtype <chr>,
## # monster_species <chr>, monster_real <chr>, monster_amount <dbl>,
## # caught_fred <chr>, caught_daphnie <chr>, caught_velma <chr>,
## # caught_shaggy <chr>, caught_scooby <chr>, captured_fred <chr>,
## # captured_daphnie <chr>, captured_velma <chr>, captured_shaggy <chr>, …
Do the newer series have more monsters appear in their episodes than the earlier ones?
monster_comparison %>% ggplot(aes(monster_amount, series_name %>% as_factor())) +
geom_point(mapping = aes(x = monster_amount, y = series_name))
According to the data, the older and mid-period series are the ones with consistently higher monster counts, while the newer ones tend to stay under five monsters per episode. The oldest series is at the bottom of the plot, with the newest at the top.
How do I prevent my data from being alphabetized in the scatter plot? I want it sorted in the order it was in on my spreadsheet.
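One likely cause is that `geom_point()` maps `y = series_name` again. A layer's aesthetics override those set in `ggplot()`, so the `as_factor()` conversion is discarded and the plain character column is plotted in alphabetical order. A minimal sketch of one fix, assuming the spreadsheet rows are already in the order you want: convert the column to a factor before plotting and map the aesthetics only once. The `demo` tibble and its series names are made up for illustration; substitute `monster_comparison`.

```r
library(tidyverse)  # provides tibble, dplyr, forcats, ggplot2, and %>%

# Hypothetical stand-in for monster_comparison (invented rows, not real data).
# The series are deliberately NOT in alphabetical order.
demo <- tibble(
  series_name    = c("Series C", "Series C", "Series A", "Series B"),
  monster_amount = c(1, 2, 5, 3)
)

# as_factor() on a character vector sets the levels in order of first
# appearance, so the factor keeps the row order of the data.
demo_ordered <- demo %>%
  mutate(series_name = as_factor(series_name))

levels(demo_ordered$series_name)

# Map x and y once in ggplot(); geom_point() inherits them, so nothing
# overrides the factor ordering.
p <- ggplot(demo_ordered, aes(x = monster_amount, y = series_name)) +
  geom_point()
p
```

On a discrete y axis the first level is drawn at the bottom, so the first series in the spreadsheet ends up lowest on the plot. Wrapping the factor in `fct_rev()` would reverse that if you want it at the top.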