ggplot(olive, aes(x = oleic, y = linoleic, color = region, size = palmitic)) +geom_point(alpha =0.7) +facet_wrap(~ region) +labs(title ="Relationship Between Oleic and Linoleic Acid by Region",x ="Oleic Acid (%)",y ="Linoleic Acid (%)",color ="Region",size ="Palmitic Acid (%)" ) +theme_minimal(base_size =12)
ggplot(olive, aes(x = oleic, y = linoleic, color = region, size = palmitic)) +geom_point(alpha =0.7) +labs(title ="Relationship Between Oleic and Linoleic Acid by Region",x ="Oleic Acid (%)",y ="Linoleic Acid (%)",color ="Region",size ="Palmitic Acid (%)" ) +theme_minimal(base_size =12)
I chose the olive dataset because my country is known for having good quality olive oil, so I found it interesting to work with. This dataset includes information about the fatty acid composition of olive oil samples from different regions in Italy. Before making the graph, I checked the variables I used (oleic, linoleic, palmitic, and region) and there were no missing values, so I didn’t need to do any data cleaning. I created a scatterplot to look at the relationship between oleic acid and linoleic acid. I also used color to show the different regions and size to represent the palmitic acid levels. From the graph, I noticed that as oleic acid increases, linoleic acid decreases. I also saw that the points form clusters by region, which suggests that olive oil composition changes depending on the region.