Import data

# csv file

# excel file
library(tidyverse) # ggplot2 and dplyr, used below
library(readxl)    # provides read_excel()
data <- read_excel("../00_data/myData.xlsx")
data
## # A tibble: 1,187 × 19
##    cheese    url   milk  country region family type  fat_content calcium_content
##    <chr>     <chr> <chr> <chr>   <chr>  <chr>  <chr> <chr>       <chr>          
##  1 Aarewass… http… cow   Switze… NA     NA     semi… NA          NA             
##  2 Abbaye d… http… sheep France  Pays … NA     semi… NA          NA             
##  3 Abbaye d… http… cow   France  NA     NA     semi… 40-46%      NA             
##  4 Abbaye d… http… cow   France  Burgu… NA     semi… NA          NA             
##  5 Abbaye d… http… cow   France  Savoie NA     soft… NA          NA             
##  6 Abbaye d… http… cow   France  provi… NA     semi… NA          NA             
##  7 Abbaye d… http… cow   France  Nord-… NA     semi… 50%         NA             
##  8 Abbot’s … http… cow   Englan… North… Chedd… semi… NA          NA             
##  9 Abertam   http… sheep Czech … Karlo… NA     hard… 45%         NA             
## 10 Abondance http… cow   France  NA     NA     semi… NA          NA             
## # ℹ 1,177 more rows
## # ℹ 10 more variables: texture <chr>, rind <chr>, color <chr>, flavor <chr>,
## #   aroma <chr>, vegetarian <chr>, vegan <chr>, synonyms <chr>,
## #   alt_spellings <chr>, producers <chr>

State one question

What is the relationship between whether a cheese is vegan and the type of milk used to make it?

Plot data

# Both variables are categorical, so geom_count() sizes each point by the number of cheeses
ggplot(data) +
  geom_count(mapping = aes(x = vegan, y = milk))

Interpret

Cheeses made with animal milks are not vegan, while the plant-based cheeses are vegan. Milk type therefore determines vegan status almost by definition.
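
One way to check this reading is to count how many cheeses fall into each milk/vegan combination. The sketch below uses a small made-up tibble (`toy`, not the real data) so it runs on its own. It assumes `vegan` is stored as the text values "TRUE" and "FALSE", which the `<chr>` column type suggests but does not confirm. With the real data, `toy` would be replaced by `data`.

```r
library(dplyr)

# Hypothetical toy rows standing in for `data`; the values are illustrative only.
toy <- tibble(
  milk  = c("cow", "sheep", "plant-based", "cow", NA),
  vegan = c("FALSE", "FALSE", "TRUE", NA, "TRUE")
)

# Count each milk/vegan combination; missing values show up as their own group.
milk_by_vegan <- toy %>%
  count(milk, vegan, name = "n_cheeses")

milk_by_vegan
```

Rows where `milk` or `vegan` is NA are worth reading separately, since they say nothing about the relationship either way.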