Create a new script for this class and call it Lecture 4. Download the Desmoines data from the Teams site and save it in the Data folder of this project. Load the Desmoines data, call it dat, and view it
Code
remove(list=ls()) #clear all from memoryrequire("data.table")require("ggplot2")require("lubridate")require("tidyverse")dat <-fread("Data/Weather_DesMoines.csv")str(dat)
Ok, so there is a lot of variables. What are we working with?
Code
summary(dat)
WBANNO LST_DATE CRX_VN LONGITUDE
Min. :54902 Min. :20200101 Min. :2.622 Min. :-93.29
1st Qu.:54902 1st Qu.:20200401 1st Qu.:2.622 1st Qu.:-93.29
Median :54902 Median :20200702 Median :2.622 Median :-93.29
Mean :54902 Mean :20200667 Mean :2.622 Mean :-93.29
3rd Qu.:54902 3rd Qu.:20200983 3rd Qu.:2.622 3rd Qu.:-93.29
Max. :54902 Max. :20201231 Max. :2.622 Max. :-93.29
LATITUDE T_DAILY_MAX T_DAILY_MIN T_DAILY_MEAN
Min. :41.56 Min. :-9999.000 Min. :-9999.00 Min. :-18.80
1st Qu.:41.56 1st Qu.: 6.025 1st Qu.: -3.75 1st Qu.: 1.60
Median :41.56 Median : 17.050 Median : 3.45 Median : 10.10
Mean :41.56 Mean : -11.286 Mean : -214.24 Mean : 10.42
3rd Qu.:41.56 3rd Qu.: 26.700 3rd Qu.: 12.97 3rd Qu.: 20.07
Max. :41.56 Max. : 35.100 Max. : 23.80 Max. : 28.20
T_DAILY_AVG P_DAILY_CALC SOLARAD_DAILY SUR_TEMP_DAILY_TYPE
Min. :-19.000 Min. : 0.000 Min. : 0.140 Length:366
1st Qu.: 1.125 1st Qu.: 0.000 1st Qu.: 7.173 Class :character
Median : 10.200 Median : 0.000 Median :13.260 Mode :character
Mean : 10.161 Mean : 2.129 Mean :13.735
3rd Qu.: 20.000 3rd Qu.: 0.400 3rd Qu.:20.058
Max. : 27.900 Max. :75.300 Max. :30.050
SUR_TEMP_DAILY_MAX SUR_TEMP_DAILY_MIN SUR_TEMP_DAILY_AVG RH_DAILY_MAX
Min. :-12.70 Min. :-25.900 Min. :-19.000 Min. : 55.40
1st Qu.: 8.85 1st Qu.: -5.175 1st Qu.: 0.825 1st Qu.: 88.40
Median : 20.65 Median : 2.700 Median : 9.750 Median : 94.40
Mean : 18.26 Mean : 2.886 Mean : 9.722 Mean : 91.63
3rd Qu.: 28.68 3rd Qu.: 11.775 3rd Qu.: 19.475 3rd Qu.: 97.40
Max. : 37.70 Max. : 23.100 Max. : 27.200 Max. :100.00
RH_DAILY_MIN RH_DAILY_AVG SOIL_MOISTURE_5_DAILY SOIL_MOISTURE_10_DAILY
Min. :17.20 Min. :35.30 Min. :-99.0000 Min. :-99.0000
1st Qu.:39.65 1st Qu.:65.55 1st Qu.: 0.2102 1st Qu.: 0.2422
Median :49.85 Median :74.00 Median : 0.3060 Median : 0.3345
Mean :52.00 Mean :73.67 Mean : -8.6550 Mean : -8.6252
3rd Qu.:63.35 3rd Qu.:82.62 3rd Qu.: 0.3680 3rd Qu.: 0.3970
Max. :97.90 Max. :99.70 Max. : 0.4280 Max. : 0.4410
SOIL_MOISTURE_20_DAILY SOIL_MOISTURE_50_DAILY SOIL_MOISTURE_100_DAILY
Min. :-99.0000 Min. :-99.0000 Min. :-99.0000
1st Qu.: 0.3320 1st Qu.: 0.3892 1st Qu.: 0.4537
Median : 0.3835 Median : 0.4345 Median : 0.4900
Mean : -3.9722 Mean : -4.4716 Mean : -3.3280
3rd Qu.: 0.4170 3rd Qu.: 0.4490 3rd Qu.: 0.5000
Max. : 0.4710 Max. : 0.4870 Max. : 0.5260
SOIL_TEMP_5_DAILY SOIL_TEMP_10_DAILY SOIL_TEMP_20_DAILY SOIL_TEMP_50_DAILY
Min. :-9999.00 Min. :-9999.00 Min. :-9999.00 Min. :-9999.00
1st Qu.: 1.00 1st Qu.: 0.90 1st Qu.: 2.10 1st Qu.: 3.00
Median : 6.95 Median : 6.75 Median : 7.45 Median : 7.25
Mean : -837.34 Mean :-1056.32 Mean : -892.04 Mean :-1165.39
3rd Qu.: 17.90 3rd Qu.: 17.30 3rd Qu.: 17.43 3rd Qu.: 17.00
Max. : 24.00 Max. : 23.30 Max. : 22.60 Max. : 21.10
SOIL_TEMP_100_DAILY
Min. :-9999.00
1st Qu.: 4.20
Median : 8.05
Mean : -974.07
3rd Qu.: 15.88
Max. : 20.00
WBANNO is the location code of the measurements.
LST_DATE is the measurement data ranging from 2020/01/01 to 2020/12/31.
LONGITUDE and LATITUDE shows the location of the measurements, all are at the same location.
T_DAILY_MAX,T_DAILY_MIN,T_DAILY_MEAN,T_DAILY_AVG,P_DAILY_CALC are all temperature variables
SOLARAD_DAILY represents daily solar radiation
RH_ is relative humidity
SOIL_ shows soil variables
Subsetting
We are not interested in all of these variables, lets make a copy of the data and call it d.OR and create a new version of dat wherein we keep the variables of interest as T_DAILY_MEAN and SOLARAD_DAILY.