Import two related datasets from TidyTuesday Project.
## Rows: 100000 Columns: 22
## ── Column specification ────────────────────────────────────────────────────────
## Delimiter: ","
## chr (8): loc_id, subnational1_code, entry_technique, sub_id, obs_id, PROJ_P...
## dbl (14): latitude, longitude, Month, Day, Year, how_many, valid, reviewed, ...
##
## ℹ Use `spec()` to retrieve the full column specification for this data.
## ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
## Rows: 254355 Columns: 62
## ── Column specification ────────────────────────────────────────────────────────
## Delimiter: ","
## chr (2): loc_id, proj_period_id
## dbl (60): yard_type_pavement, yard_type_garden, yard_type_landsca, yard_type...
##
## ℹ Use `spec()` to retrieve the full column specification for this data.
## ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
Describe the two datasets:
The datasets include information regarding bird counting in backyards at
specific times of day. Data 1 is called ‘feedwatch’ and includes
information regarding number of birds, time of spotting and some other
aspects. Data 2 is about the location/habitat the birds are spotted in,
and the setting/populations of people and other animals around that
could be interfering with the bird count.
Data 1: feedwatch
Data 2: site_data
## # A tibble: 10 × 4
## loc_id Year Month how_many
## <chr> <dbl> <dbl> <dbl>
## 1 L12896176 2021 1 2
## 2 L12744873 2020 11 2
## 3 L12726635 2020 12 3
## 4 L8112697 2021 4 2
## 5 L12761982 2021 3 2
## 6 L24457 2021 4 1
## 7 L5187485 2021 2 3
## 8 L8757955 2020 12 3
## 9 L217527 2020 12 1
## 10 L12772384 2021 4 6
## # A tibble: 10 × 3
## loc_id water_srcs_atleast nearby_feeders
## <chr> <dbl> <dbl>
## 1 L37787 0 0
## 2 L1803099 1 1
## 3 L10860534 0 0
## 4 L29442 1 1
## 5 L8125373 0 1
## 6 L80901 0 1
## 7 L77784 0 0
## 8 L37449 NA 0
## 9 L4101017 0 0
## 10 L72372 0 0
Describe the resulting data:
The data only includes the combined columns and the data is not
apearing, I am not sure if this is the expected outcome.
How is it different from the original two
datasets?
This data set includes items from columns in both data sets.
## # A tibble: 0 × 6
## # ℹ 6 variables: loc_id <chr>, water_srcs_atleast <dbl>, nearby_feeders <dbl>,
## # Year <dbl>, Month <dbl>, how_many <dbl>
Describe the resulting data:
How is it different from the original two datasets?
## # A tibble: 10 × 6
## loc_id water_srcs_atleast nearby_feeders Year Month how_many
## <chr> <dbl> <dbl> <dbl> <dbl> <dbl>
## 1 L37787 0 0 NA NA NA
## 2 L1803099 1 1 NA NA NA
## 3 L10860534 0 0 NA NA NA
## 4 L29442 1 1 NA NA NA
## 5 L8125373 0 1 NA NA NA
## 6 L80901 0 1 NA NA NA
## 7 L77784 0 0 NA NA NA
## 8 L37449 NA 0 NA NA NA
## 9 L4101017 0 0 NA NA NA
## 10 L72372 0 0 NA NA NA
Describe the resulting data:
How is it different from the original two datasets?
## # A tibble: 10 × 6
## loc_id water_srcs_atleast nearby_feeders Year Month how_many
## <chr> <dbl> <dbl> <dbl> <dbl> <dbl>
## 1 L12896176 NA NA 2021 1 2
## 2 L12744873 NA NA 2020 11 2
## 3 L12726635 NA NA 2020 12 3
## 4 L8112697 NA NA 2021 4 2
## 5 L12761982 NA NA 2021 3 2
## 6 L24457 NA NA 2021 4 1
## 7 L5187485 NA NA 2021 2 3
## 8 L8757955 NA NA 2020 12 3
## 9 L217527 NA NA 2020 12 1
## 10 L12772384 NA NA 2021 4 6
Describe the resulting data:
How is it different from the original two datasets?
## # A tibble: 20 × 6
## loc_id water_srcs_atleast nearby_feeders Year Month how_many
## <chr> <dbl> <dbl> <dbl> <dbl> <dbl>
## 1 L37787 0 0 NA NA NA
## 2 L1803099 1 1 NA NA NA
## 3 L10860534 0 0 NA NA NA
## 4 L29442 1 1 NA NA NA
## 5 L8125373 0 1 NA NA NA
## 6 L80901 0 1 NA NA NA
## 7 L77784 0 0 NA NA NA
## 8 L37449 NA 0 NA NA NA
## 9 L4101017 0 0 NA NA NA
## 10 L72372 0 0 NA NA NA
## 11 L12896176 NA NA 2021 1 2
## 12 L12744873 NA NA 2020 11 2
## 13 L12726635 NA NA 2020 12 3
## 14 L8112697 NA NA 2021 4 2
## 15 L12761982 NA NA 2021 3 2
## 16 L24457 NA NA 2021 4 1
## 17 L5187485 NA NA 2021 2 3
## 18 L8757955 NA NA 2020 12 3
## 19 L217527 NA NA 2020 12 1
## 20 L12772384 NA NA 2021 4 6
Describe the resulting data:
How is it different from the original two datasets?
## # A tibble: 0 × 3
## # ℹ 3 variables: loc_id <chr>, water_srcs_atleast <dbl>, nearby_feeders <dbl>
Describe the resulting data:
How is it different from the original two datasets?
## # A tibble: 10 × 3
## loc_id water_srcs_atleast nearby_feeders
## <chr> <dbl> <dbl>
## 1 L37787 0 0
## 2 L1803099 1 1
## 3 L10860534 0 0
## 4 L29442 1 1
## 5 L8125373 0 1
## 6 L80901 0 1
## 7 L77784 0 0
## 8 L37449 NA 0
## 9 L4101017 0 0
## 10 L72372 0 0