Import data

# csv file
Mydata <- readr::read_csv("../00_data/Mydata.csv")
Mydata
## # A tibble: 65,706 × 8
##     ...1  year lake  species      grand_total comments region            values
##    <dbl> <dbl> <chr> <chr>              <dbl> <chr>    <chr>              <dbl>
##  1     1  1991 Erie  American Eel           1 <NA>     Michigan (MI)          0
##  2     2  1991 Erie  American Eel           1 <NA>     New York (NY)          0
##  3     3  1991 Erie  American Eel           1 <NA>     Ohio (OH)              0
##  4     4  1991 Erie  American Eel           1 <NA>     Pennsylvania (PA)      0
##  5     5  1991 Erie  American Eel           1 <NA>     U.S. Total             0
##  6     6  1991 Erie  American Eel           1 <NA>     Canada (ONT)           1
##  7     7  1992 Erie  American Eel           0 <NA>     Michigan (MI)          0
##  8     8  1992 Erie  American Eel           0 <NA>     New York (NY)          0
##  9     9  1992 Erie  American Eel           0 <NA>     Ohio (OH)              0
## 10    10  1992 Erie  American Eel           0 <NA>     Pennsylvania (PA)      0
## # ℹ 65,696 more rows

State one question

What regions do each lake cover?

Plot data

ggplot(data = Mydata) + 
  geom_point(mapping = aes(x = lake, y = region))

Interpret

Lake Michigan and Lake Huron cover many regions.