In this chunk, I’m going the load the dataset that I picked, “greenhouse_gases”, then I’m going to do some wrangling to check if the Data is clean.
Visualisation
In this code, I filtered the dataset to include only the years 20, 40, 60, 80, and 100. Then, I created a scatterplot to show how the concentration of different gases changed during those selected years. Each point represents a gas, with different colors showing different gas types. I also added a smooth trend line to show the overall pattern, and customized the plot with a title, axis labels, and a clean black-and-white theme.
library(tidyverse)
── Attaching core tidyverse packages ──────────────────────── tidyverse 2.0.0 ──
✔ dplyr 1.1.4 ✔ readr 2.1.5
✔ forcats 1.0.0 ✔ stringr 1.5.1
✔ ggplot2 3.5.1 ✔ tibble 3.2.1
✔ lubridate 1.9.4 ✔ tidyr 1.3.1
✔ purrr 1.0.4
── Conflicts ────────────────────────────────────────── tidyverse_conflicts() ──
✖ dplyr::filter() masks stats::filter()
✖ dplyr::lag() masks stats::lag()
ℹ Use the conflicted package (<http://conflicted.r-lib.org/>) to force all conflicts to become errors
greenhouse_gases |>filter(year %in%c(20, 40, 60, 80, 100)) |>ggplot(aes(x = year, y = concentration)) +geom_point(aes(color = gas), alpha =0.5) +geom_smooth() +labs(x ="Year",y ="Concentration",color ="Gas",title ="Concentration of Gas at Selected Years" ) +theme(legend.position =c(0.11, 0.78)) +theme_bw()
Warning: A numeric `legend.position` argument in `theme()` was deprecated in ggplot2
3.5.0.
ℹ Please use the `legend.position.inside` argument of `theme()` instead.
`geom_smooth()` using method = 'loess' and formula = 'y ~ x'