Creation date: 2018 February 27
Updated: 2018 March 22
This document provides basic QA/QC assessments for the willow height dataset updated with 2016 and 2017 data entered from field sheets (version 2018-03-21). The analyses aim to characterize the basic data structure and identify potential issues.
Questions/Issues:
1. “NA”, “na” and “missing” are used inconsistently for ‘plantht’.
2. Unsure of the interpretation of the “2” in the dam field. There are three unique values (0, 1, 2), but only 0 and 1 were expected. Interpreted as “obs”.
3. Is there any special meaning to the “ID” field, or is it a file-specific index?
4. Previous QA/QC checks noted that willid is not unique across all sites. The temporary fix combines “willid” and “siteid2”.
Issue: As noted above, “NA”, “na” and “missing” are used inconsistently for ‘plantht’.
Solution: Values for plantht equal to “missing” or “na” were changed to NA.
From the above analyses, it’s clear that not all of the historical data are present. In particular, many spring height measurements are missing.
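The recoding described above can be sketched in base R as follows. This is a minimal illustration rather than the code used for this report: the vector `plantht_raw` and its values are made up, and only the three labels named in the issue are treated as missing.

```r
# Made-up example values; the real data come from the willow height file
plantht_raw <- c("120", "NA", "na", "missing", "85", ".5")

# Labels that have been used to mark a missing height
missing_labels <- c("NA", "na", "missing")

# Replace every inconsistent label with a true NA
plantht_clean <- plantht_raw
plantht_clean[plantht_clean %in% missing_labels] <- NA

# Convert to numeric once only numbers and NA remain
plantht_num <- as.numeric(plantht_clean)

# Count the missing heights after recoding
sum(is.na(plantht_num))
```

Recoding before the numeric conversion keeps the conversion free of coercion warnings, so any warning that does appear points to a value that is neither a number nor one of the known labels.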
## Parsed with column specification:
## cols(
## ID1 = col_integer(),
## site = col_character(),
## year = col_integer(),
## season = col_character(),
## willid = col_integer(),
## exp = col_integer(),
## dam = col_integer(),
## browse = col_integer(),
## plantht = col_integer(),
## browseintensity = col_double(),
## production = col_double()
## )
## Warning in rbind(names(probs), probs_f): number of columns of result is not
## a multiple of vector length (arg 1)
## Warning: 5 parsing failures.
## # A tibble: 5 x 5
##     row col     expected               actual file
##   <int> <chr>   <chr>                  <chr>  <chr>
## 1  2827 plantht no trailing characters .5     'D:/Dropbox/PROJECTS/Yell_N~
## 2  2832 plantht no trailing characters .5     'D:/Dropbox/PROJECTS/Yell_N~
## 3  2836 plantht no trailing characters .5     'D:/Dropbox/PROJECTS/Yell_N~
## 4  3564 plantht no trailing characters .158   'D:/Dropbox/PROJECTS/Yell_N~
## 5  6528 plantht no trailing characters .4     'D:/Dropbox/PROJECTS/Yell_N~
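The parsing failures above arise because plantht was read as an integer column while a few rows hold decimal values such as .5. One possible way to avoid them, sketched here under assumptions, is to declare plantht as a double when reading the file. The inline text `csv_text` is a made-up stand-in for the real file, and this is not necessarily how the report's data were eventually read.

```r
library(readr)

# Made-up stand-in for the willow height file (a string containing
# newlines is read by readr as literal data)
csv_text <- "willid,plantht\n1,120\n2,.5\n3,.158\n4,85\n"

# Declare plantht as double so decimal heights parse cleanly
heights <- read_csv(
  csv_text,
  col_types = cols(
    willid  = col_integer(),
    plantht = col_double()
  )
)

# List any remaining parsing failures (none expected for this input)
problems(heights)
```

Whether decimal heights are valid measurements or data-entry errors is a separate question that the field sheets would need to answer.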
## [1] "ID1" "site" "year"
## [4] "season" "willid" "exp"
## [7] "dam" "browse" "plantht"
## [10] "browseintensity" "production"
## [1] "ID" "site" "year" "season"
## [5] "willid" "exp" "dam" "browse"
## [9] "plantht" "treat" "site2" "willid.site2"
Includes duplicate willid values across sites, seasons, and years…
Only willid values duplicated within a site, season, and year. These may represent moved tags.
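A check of this kind can be sketched in base R as below. The rows are made up, the column names follow the dataset (`site2`, `willid.site2`), and the "-" separator in the composite ID is an assumption, since the actual format of willid.site2 is not shown here.

```r
# Made-up rows for illustration only
will <- data.frame(
  site2  = c("A", "A", "B", "B", "B"),
  year   = c(2016, 2016, 2016, 2016, 2017),
  season = c("fall", "fall", "fall", "fall", "fall"),
  willid = c(1, 1, 1, 2, 1),
  stringsAsFactors = FALSE
)

# Composite ID, because willid alone repeats across sites
# (the "-" separator is an assumption)
will$willid.site2 <- paste(will$willid, will$site2, sep = "-")

# Flag every row whose composite ID repeats within a season and year
key <- will[, c("willid.site2", "season", "year")]
is_dup <- duplicated(key) | duplicated(key, fromLast = TRUE)

# Rows to review as possible moved tags
will[is_dup, ]
```

Combining `duplicated()` with `fromLast = TRUE` returns all copies of a repeated ID rather than only the second and later ones, which makes it easier to compare the conflicting records side by side.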
## Warning: Removed 285 rows containing non-finite values (stat_boxplot).
## Warning: Removed 283 rows containing non-finite values (stat_boxplot).
## Warning: Removed 2 rows containing non-finite values (stat_boxplot).