Summary

Data on extreme weather events in the USA recorded between 1950 and 2011 is analysed in terms of the effect on population health and the US economy. The data is processed and cleaned according to the documentation provided on the U.S. National Oceanic and Atmospheric Administration’s (NOAA) website. The results are summarised and tabulated. They show that tornadoes cause the most harm to the population and that floods are the most costly to the economy.

Introduction

Storms and other severe weather events can cause both public health and economic problems for communities and municipalities. Many severe events can result in fatalities, injuries, and property damage, and preventing such outcomes to the extent possible is a key concern.

This project involves exploring the U.S. National Oceanic and Atmospheric Administration’s (NOAA) storm database. This database tracks characteristics of major storms and weather events in the United States, including when and where they occur, as well as estimates of any fatalities, injuries, and property damage.


Data Processing

Data

The data for this assignment come in the form of a comma-separated-value file compressed via the bzip2 algorithm to reduce its size. The file can be downloaded from the National Climatic Data Center’s website. Storm Data Documentation for the data is also available on their website, or can be accessed directly from the link provided.

The data is downloaded from the website and the compressed file is decompressed into the working directory in which the analysis is to take place.
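The download step can be sketched as follows. This is a minimal illustration, not the code used for this report: the helper name `fetch_storm_data` and its arguments `data_url` and `dest` are hypothetical, and the caller has to supply the real download address. The sketch relies on `bzfile()`, which decompresses bzip2 files on the fly, so a separate decompression step is optional.

```r
# Hypothetical helper: download the compressed file once, then read it.
# `data_url` and `dest` are placeholders supplied by the caller.
fetch_storm_data <- function(data_url, dest, ...) {
  if (!file.exists(dest)) {
    download.file(data_url, destfile = dest, mode = "wb")
  }
  # bzfile() decompresses while reading, so the .bz2 file is read directly
  read.csv(bzfile(dest), stringsAsFactors = FALSE, ...)
}
```

Extra arguments such as `nrows = 10` are passed through to `read.csv`, which may be convenient for a first look at the file.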

Data Details

The events in the database start in the year 1950 and end in November 2011. In the earlier years of the database there are generally fewer events recorded, most likely due to a lack of good records. More recent years are considered more complete, although all the data is used for the analysis provided here. An extensive amount of data processing is required in order to determine the effect of the weather events on the population and economy.

Data Processing Details

The working directory is set to the analysis directory and the packages required for the analysis are loaded.

setwd("/Users/Tina/Desktop/ReproducibleResearch/RRStorm")
install.packages("dplyr", repos = "http://cran.us.r-project.org")
## 
## The downloaded binary packages are in
##  /var/folders/s1/g0plq2957pg4py2k629m0fz00000gn/T//Rtmpf2MZEN/downloaded_packages
library("dplyr")
## 
## Attaching package: 'dplyr'
## 
## The following objects are masked from 'package:stats':
## 
##     filter, lag
## 
## The following objects are masked from 'package:base':
## 
##     intersect, setdiff, setequal, union
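Reinstalling a package on every run is usually unnecessary. A possible alternative, sketched here under the assumption that installing only when the package is missing is acceptable, is a small helper; the name `ensure_package` and its `installer` argument are hypothetical and not part of the original analysis.

```r
# Hypothetical helper: install a package only when it is not yet available.
ensure_package <- function(pkg, installer = utils::install.packages) {
  if (!requireNamespace(pkg, quietly = TRUE)) {
    installer(pkg)
  }
  # Report (invisibly) whether the package can now be loaded
  invisible(requireNamespace(pkg, quietly = TRUE))
}
```

With this in place, `ensure_package("dplyr")` followed by `library("dplyr")` would replace the unconditional `install.packages` call.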

A few lines are read in to see the data structure:

StormDataShort <- read.csv("/Users/Tina/Desktop/ReproducibleResearch/repdata-data-StormData.csv", header = TRUE, sep =",", nrows = 10)
str(StormDataShort)
## 'data.frame':    10 obs. of  37 variables:
##  $ STATE__   : num  1 1 1 1 1 1 1 1 1 1
##  $ BGN_DATE  : Factor w/ 7 levels "1/22/1952 0:00:00",..: 6 6 5 7 2 2 3 1 4 4
##  $ BGN_TIME  : int  130 145 1600 900 1500 2000 100 900 2000 2000
##  $ TIME_ZONE : Factor w/ 1 level "CST": 1 1 1 1 1 1 1 1 1 1
##  $ COUNTY    : num  97 3 57 89 43 77 9 123 125 57
##  $ COUNTYNAME: Factor w/ 9 levels "BALDWIN","BLOUNT",..: 7 1 4 6 3 5 2 8 9 4
##  $ STATE     : Factor w/ 1 level "AL": 1 1 1 1 1 1 1 1 1 1
##  $ EVTYPE    : Factor w/ 1 level "TORNADO": 1 1 1 1 1 1 1 1 1 1
##  $ BGN_RANGE : num  0 0 0 0 0 0 0 0 0 0
##  $ BGN_AZI   : logi  NA NA NA NA NA NA ...
##  $ BGN_LOCATI: logi  NA NA NA NA NA NA ...
##  $ END_DATE  : logi  NA NA NA NA NA NA ...
##  $ END_TIME  : logi  NA NA NA NA NA NA ...
##  $ COUNTY_END: num  0 0 0 0 0 0 0 0 0 0
##  $ COUNTYENDN: logi  NA NA NA NA NA NA ...
##  $ END_RANGE : num  0 0 0 0 0 0 0 0 0 0
##  $ END_AZI   : logi  NA NA NA NA NA NA ...
##  $ END_LOCATI: logi  NA NA NA NA NA NA ...
##  $ LENGTH    : num  14 2 0.1 0 0 1.5 1.5 0 3.3 2.3
##  $ WIDTH     : num  100 150 123 100 150 177 33 33 100 100
##  $ F         : int  3 2 2 2 2 2 2 1 3 3
##  $ MAG       : num  0 0 0 0 0 0 0 0 0 0
##  $ FATALITIES: num  0 0 0 0 0 0 0 0 1 0
##  $ INJURIES  : num  15 0 2 2 2 6 1 0 14 0
##  $ PROPDMG   : num  25 2.5 25 2.5 2.5 2.5 2.5 2.5 25 25
##  $ PROPDMGEXP: Factor w/ 1 level "K": 1 1 1 1 1 1 1 1 1 1
##  $ CROPDMG   : num  0 0 0 0 0 0 0 0 0 0
##  $ CROPDMGEXP: logi  NA NA NA NA NA NA ...
##  $ WFO       : logi  NA NA NA NA NA NA ...
##  $ STATEOFFIC: logi  NA NA NA NA NA NA ...
##  $ ZONENAMES : logi  NA NA NA NA NA NA ...
##  $ LATITUDE  : num  3040 3042 3340 3458 3412 ...
##  $ LONGITUDE : num  8812 8755 8742 8626 8642 ...
##  $ LATITUDE_E: num  3051 0 0 0 0 ...
##  $ LONGITUDE_: num  8806 0 0 0 0 ...
##  $ REMARKS   : logi  NA NA NA NA NA NA ...
##  $ REFNUM    : num  1 2 3 4 5 6 7 8 9 10

Get a short summary to see which columns will most likely have missing values:

summary(StormDataShort)
##     STATE__                BGN_DATE    BGN_TIME      TIME_ZONE
##  Min.   :1   1/22/1952 0:00:00 :1   Min.   : 100.0   CST:10   
##  1st Qu.:1   11/15/1951 0:00:00:2   1st Qu.: 333.8            
##  Median :1   11/16/1951 0:00:00:1   Median :1200.0            
##  Mean   :1   2/13/1952 0:00:00 :2   Mean   :1127.5            
##  3rd Qu.:1   2/20/1951 0:00:00 :1   3rd Qu.:1900.0            
##  Max.   :1   4/18/1950 0:00:00 :2   Max.   :2000.0            
##              6/8/1951 0:00:00  :1                             
##      COUNTY           COUNTYNAME STATE       EVTYPE     BGN_RANGE
##  Min.   :  3.0   FAYETTE   :2    AL:10   TORNADO:10   Min.   :0  
##  1st Qu.: 46.5   BALDWIN   :1                         1st Qu.:0  
##  Median : 67.0   BLOUNT    :1                         Median :0  
##  Mean   : 68.0   CULLMAN   :1                         Mean   :0  
##  3rd Qu.: 95.0   LAUDERDALE:1                         3rd Qu.:0  
##  Max.   :125.0   MADISON   :1                         Max.   :0  
##                  (Other)   :3                                    
##  BGN_AZI        BGN_LOCATI     END_DATE       END_TIME         COUNTY_END
##  Mode:logical   Mode:logical   Mode:logical   Mode:logical   Min.   :0   
##  NA's:10        NA's:10        NA's:10        NA's:10        1st Qu.:0   
##                                                              Median :0   
##                                                              Mean   :0   
##                                                              3rd Qu.:0   
##                                                              Max.   :0   
##                                                                          
##  COUNTYENDN       END_RANGE END_AZI        END_LOCATI         LENGTH      
##  Mode:logical   Min.   :0   Mode:logical   Mode:logical   Min.   : 0.000  
##  NA's:10        1st Qu.:0   NA's:10        NA's:10        1st Qu.: 0.025  
##                 Median :0                                 Median : 1.500  
##                 Mean   :0                                 Mean   : 2.470  
##                 3rd Qu.:0                                 3rd Qu.: 2.225  
##                 Max.   :0                                 Max.   :14.000  
##                                                                           
##      WIDTH             F             MAG      FATALITIES     INJURIES    
##  Min.   : 33.0   Min.   :1.00   Min.   :0   Min.   :0.0   Min.   : 0.00  
##  1st Qu.:100.0   1st Qu.:2.00   1st Qu.:0   1st Qu.:0.0   1st Qu.: 0.25  
##  Median :100.0   Median :2.00   Median :0   Median :0.0   Median : 2.00  
##  Mean   :106.6   Mean   :2.20   Mean   :0   Mean   :0.1   Mean   : 4.20  
##  3rd Qu.:143.2   3rd Qu.:2.75   3rd Qu.:0   3rd Qu.:0.0   3rd Qu.: 5.00  
##  Max.   :177.0   Max.   :3.00   Max.   :0   Max.   :1.0   Max.   :15.00  
##                                                                          
##     PROPDMG     PROPDMGEXP    CROPDMG  CROPDMGEXP       WFO         
##  Min.   : 2.5   K:10       Min.   :0   Mode:logical   Mode:logical  
##  1st Qu.: 2.5              1st Qu.:0   NA's:10        NA's:10       
##  Median : 2.5              Median :0                                
##  Mean   :11.5              Mean   :0                                
##  3rd Qu.:25.0              3rd Qu.:0                                
##  Max.   :25.0              Max.   :0                                
##                                                                     
##  STATEOFFIC     ZONENAMES         LATITUDE      LONGITUDE   
##  Mode:logical   Mode:logical   Min.   :3040   Min.   :8558  
##  NA's:10        NA's:10        1st Qu.:3275   1st Qu.:8634  
##                                Median :3338   Median :8739  
##                                Mean   :3307   Mean   :8699  
##                                3rd Qu.:3410   3rd Qu.:8746  
##                                Max.   :3458   Max.   :8812  
##                                                             
##    LATITUDE_E       LONGITUDE_   REMARKS            REFNUM     
##  Min.   :   0.0   Min.   :   0   Mode:logical   Min.   : 1.00  
##  1st Qu.:   0.0   1st Qu.:   0   NA's:10        1st Qu.: 3.25  
##  Median :   0.0   Median :   0                  Median : 5.50  
##  Mean   : 972.4   Mean   :2628                  Mean   : 5.50  
##  3rd Qu.:2288.2   3rd Qu.:6553                  3rd Qu.: 7.75  
##  Max.   :3337.0   Max.   :8806                  Max.   :10.00  
## 

In this 10-row sample, NA’s are observed in the columns “BGN_AZI”, “BGN_LOCATI”, “END_DATE”, “END_TIME”, “COUNTYENDN”, “END_AZI”, “END_LOCATI”, “CROPDMGEXP”, “WFO”, “STATEOFFIC”, “ZONENAMES” and “REMARKS”. Note that most of these columns are not going to be needed.
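Rather than reading the columns with missing values off the summary by eye, they can be listed programmatically. The sketch below uses a small made-up data frame (`toy`) in place of the storm data, so the column values are illustrative only.

```r
# Toy data frame standing in for the first rows of the storm data
toy <- data.frame(
  EVTYPE     = c("TORNADO", "TORNADO", "HAIL"),
  BGN_AZI    = c(NA, NA, NA),
  CROPDMGEXP = c(NA, "K", NA),
  INJURIES   = c(15, 0, 2),
  stringsAsFactors = FALSE
)

# Count the missing values in each column
na_counts <- colSums(is.na(toy))

# Names of the columns that contain at least one NA
cols_with_na <- names(na_counts)[na_counts > 0]
```

Applied to `StormDataShort`, the same two lines would give the list of columns with missing values in the sample.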

Columns needed are: “BGN_DATE”, “EVTYPE”, “FATALITIES”, “INJURIES”, “PROPDMG”, “PROPDMGEXP”, “CROPDMG”, “CROPDMGEXP”

Get header names

StormDataColNames <- make.names(names(StormDataShort))
StormDataColNames
##  [1] "STATE__"    "BGN_DATE"   "BGN_TIME"   "TIME_ZONE"  "COUNTY"    
##  [6] "COUNTYNAME" "STATE"      "EVTYPE"     "BGN_RANGE"  "BGN_AZI"   
## [11] "BGN_LOCATI" "END_DATE"   "END_TIME"   "COUNTY_END" "COUNTYENDN"
## [16] "END_RANGE"  "END_AZI"    "END_LOCATI" "LENGTH"     "WIDTH"     
## [21] "F"          "MAG"        "FATALITIES" "INJURIES"   "PROPDMG"   
## [26] "PROPDMGEXP" "CROPDMG"    "CROPDMGEXP" "WFO"        "STATEOFFIC"
## [31] "ZONENAMES"  "LATITUDE"   "LONGITUDE"  "LATITUDE_E" "LONGITUDE_"
## [36] "REMARKS"    "REFNUM"

All the data is now read in, with text columns kept as characters rather than factors. This takes a while!

StormDataRaw <- read.table("/Users/Tina/Desktop/ReproducibleResearch/repdata-data-StormData.csv", header = TRUE, sep =",", stringsAsFactors = FALSE)
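Since only a handful of columns are needed, one way to shorten the read is to skip the other columns at import time. The following is a sketch on a tiny temporary CSV, not the approach used in this report; `csv_path`, `cols_needed`, `col_classes` and `slim` are illustrative names. It relies on the `colClasses` argument of `read.csv`, where `"NULL"` drops a column and `NA` lets R guess its type.

```r
# Toy CSV standing in for the full file (the real file has 37 columns)
csv_path <- tempfile(fileext = ".csv")
writeLines(c("STATE,EVTYPE,MAG,INJURIES",
             "AL,TORNADO,0,15",
             "AL,HAIL,0,0"), csv_path)

cols_needed <- c("EVTYPE", "INJURIES")

# Read a single row just to learn the column names
header <- names(read.csv(csv_path, nrows = 1))

# "NULL" skips a column; NA leaves the type to be guessed by read.csv
col_classes <- ifelse(header %in% cols_needed, NA, "NULL")

slim <- read.csv(csv_path, colClasses = col_classes, stringsAsFactors = FALSE)
```

Whether this saves a noticeable amount of time on the full storm file has not been measured here.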

The columns needed for the analysis are extracted, keeping only the rows in which at least one of injuries, fatalities, property damage or crop damage is non-zero.

ColsNeeded <- c("BGN_DATE", "EVTYPE", "FATALITIES", "INJURIES", "PROPDMG", "PROPDMGEXP", "CROPDMG", "CROPDMGEXP")

A logical expression is needed for subsetting. A row is kept if at least one of its entries for fatalities, injuries, property damage or crop damage is non-zero:

Note that we need the columns ending with EXP to see the scale of the damage numbers

SubsetRequirement <- c(StormDataRaw$INJURIES > 0 | StormDataRaw$FATALITIES > 0 | 
                               StormDataRaw$PROPDMG > 0 | StormDataRaw$CROPDMG > 0)
StormDataSubset <- subset(x = StormDataRaw, subset = SubsetRequirement, select = ColsNeeded)
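The effect of the "at least one non-zero value" rule can be checked on a small example. The data frame below is made up for illustration, and `rowSums` over the comparison is simply an equivalent way of writing the chain of `|` conditions.

```r
# Toy events: only TORNADO and FLOOD have any recorded harm
toy <- data.frame(
  EVTYPE     = c("TORNADO", "HAIL", "FLOOD", "FOG"),
  FATALITIES = c(0, 0, 0, 0),
  INJURIES   = c(15, 0, 0, 0),
  PROPDMG    = c(25, 0, 0, 0),
  CROPDMG    = c(0, 0, 3, 0),
  stringsAsFactors = FALSE
)

harm_cols <- c("FATALITIES", "INJURIES", "PROPDMG", "CROPDMG")

# TRUE for rows where at least one harm column is greater than zero
keep <- rowSums(toy[, harm_cols] > 0) > 0

toy_subset <- toy[keep, ]
```

Rows such as HAIL and FOG, where every harm column is zero, are dropped. Note that this form assumes the harm columns contain no missing values.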

Changing the date column to date format:

StormDataSubset$Start_Date <- as.Date(StormDataSubset$BGN_DATE, format = "%m/%d/%Y %H:%M:%S")
str(StormDataSubset)
## 'data.frame':    254633 obs. of  9 variables:
##  $ BGN_DATE  : chr  "4/18/1950 0:00:00" "4/18/1950 0:00:00" "2/20/1951 0:00:00" "6/8/1951 0:00:00" ...
##  $ EVTYPE    : chr  "TORNADO" "TORNADO" "TORNADO" "TORNADO" ...
##  $ FATALITIES: num  0 0 0 0 0 0 0 0 1 0 ...
##  $ INJURIES  : num  15 0 2 2 2 6 1 0 14 0 ...
##  $ PROPDMG   : num  25 2.5 25 2.5 2.5 2.5 2.5 2.5 25 25 ...
##  $ PROPDMGEXP: chr  "K" "K" "K" "K" ...
##  $ CROPDMG   : num  0 0 0 0 0 0 0 0 0 0 ...
##  $ CROPDMGEXP: chr  "" "" "" "" ...
##  $ Start_Date: Date, format: "1950-04-18" "1950-04-18" ...
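The PROPDMGEXP and CROPDMGEXP columns kept in the subset hold a magnitude code for the damage figures. A minimal sketch of how such codes could be turned into dollar amounts is given below. The meanings of K, M and B (thousands, millions, billions) follow the Storm Data documentation; treating every other code, including the empty string, as a multiplier of 1 is an assumption of this sketch, and the function names are hypothetical.

```r
# Multipliers for the magnitude codes. K, M and B follow the Storm Data
# documentation; mapping anything else to 1 is an assumption of this sketch.
exp_multiplier <- function(code) {
  lookup <- c(K = 1e3, M = 1e6, B = 1e9)
  mult <- lookup[toupper(code)]
  mult[is.na(mult)] <- 1
  unname(mult)
}

# Damage in dollars: recorded amount times the multiplier for its code
damage_usd <- function(amount, code) amount * exp_multiplier(code)

example_damage <- damage_usd(c(25, 2.5, 3, 10), c("K", "M", "", "b"))
```

Here the first entry corresponds to the `25` with code `"K"` seen in the first row of the data, i.e. 25,000 dollars.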

We now have only 254633 observations of 9 variables to deal with.

Find the unique event types. The event types are extracted into a single variable for ease of inspection:

EventType <- StormDataSubset$EVTYPE
str(EventType)
##  chr [1:254633] "TORNADO" "TORNADO" "TORNADO" "TORNADO" ...
unique(EventType) 
##   [1] "TORNADO"                        "TSTM WIND"                     
##   [3] "HAIL"                           "ICE STORM/FLASH FLOOD"         
##   [5] "WINTER STORM"                   "HURRICANE OPAL/HIGH WINDS"     
##   [7] "THUNDERSTORM WINDS"             "HURRICANE ERIN"                
##   [9] "HURRICANE OPAL"                 "HEAVY RAIN"                    
##  [11] "LIGHTNING"                      "THUNDERSTORM WIND"             
##  [13] "DENSE FOG"                      "RIP CURRENT"                   
##  [15] "THUNDERSTORM WINS"              "FLASH FLOODING"                
##  [17] "FLASH FLOOD"                    "TORNADO F0"                    
##  [19] "THUNDERSTORM WINDS LIGHTNING"   "THUNDERSTORM WINDS/HAIL"       
##  [21] "HEAT"                           "HIGH WINDS"                    
##  [23] "WIND"                           "HEAVY RAINS"                   
##  [25] "LIGHTNING AND HEAVY RAIN"       "THUNDERSTORM WINDS HAIL"       
##  [27] "COLD"                           "HEAVY RAIN/LIGHTNING"          
##  [29] "FLASH FLOODING/THUNDERSTORM WI" "FLOODING"                      
##  [31] "WATERSPOUT"                     "EXTREME COLD"                  
##  [33] "LIGHTNING/HEAVY RAIN"           "BREAKUP FLOODING"              
##  [35] "HIGH WIND"                      "FREEZE"                        
##  [37] "RIVER FLOOD"                    "HIGH WINDS HEAVY RAINS"        
##  [39] "AVALANCHE"                      "MARINE MISHAP"                 
##  [41] "HIGH TIDES"                     "HIGH WIND/SEAS"                
##  [43] "HIGH WINDS/HEAVY RAIN"          "HIGH SEAS"                     
##  [45] "COASTAL FLOOD"                  "SEVERE TURBULENCE"             
##  [47] "RECORD RAINFALL"                "HEAVY SNOW"                    
##  [49] "HEAVY SNOW/WIND"                "DUST STORM"                    
##  [51] "FLOOD"                          "APACHE COUNTY"                 
##  [53] "SLEET"                          "DUST DEVIL"                    
##  [55] "ICE STORM"                      "EXCESSIVE HEAT"                
##  [57] "THUNDERSTORM WINDS/FUNNEL CLOU" "GUSTY WINDS"                   
##  [59] "FLOODING/HEAVY RAIN"            "HEAVY SURF COASTAL FLOODING"   
##  [61] "HIGH SURF"                      "WILD FIRES"                    
##  [63] "HIGH"                           "WINTER STORM HIGH WINDS"       
##  [65] "WINTER STORMS"                  "MUDSLIDES"                     
##  [67] "RAINSTORM"                      "SEVERE THUNDERSTORM"           
##  [69] "SEVERE THUNDERSTORMS"           "SEVERE THUNDERSTORM WINDS"     
##  [71] "THUNDERSTORMS WINDS"            "FLOOD/FLASH FLOOD"             
##  [73] "FLOOD/RAIN/WINDS"               "THUNDERSTORMS"                 
##  [75] "FLASH FLOOD WINDS"              "WINDS"                         
##  [77] "FUNNEL CLOUD"                   "HIGH WIND DAMAGE"              
##  [79] "STRONG WIND"                    "HEAVY SNOWPACK"                
##  [81] "FLASH FLOOD/"                   "HEAVY SURF"                    
##  [83] "DRY MIRCOBURST WINDS"           "DRY MICROBURST"                
##  [85] "URBAN FLOOD"                    "THUNDERSTORM WINDSS"           
##  [87] "MICROBURST WINDS"               "HEAT WAVE"                     
##  [89] "UNSEASONABLY WARM"              "COASTAL FLOODING"              
##  [91] "STRONG WINDS"                   "BLIZZARD"                      
##  [93] "WATERSPOUT/TORNADO"             "WATERSPOUT TORNADO"            
##  [95] "STORM SURGE"                    "URBAN/SMALL STREAM FLOOD"      
##  [97] "WATERSPOUT-"                    "TORNADOES, TSTM WIND, HAIL"    
##  [99] "TROPICAL STORM ALBERTO"         "TROPICAL STORM"                
## [101] "TROPICAL STORM GORDON"          "TROPICAL STORM JERRY"          
## [103] "LIGHTNING THUNDERSTORM WINDS"   "URBAN FLOODING"                
## [105] "MINOR FLOODING"                 "WATERSPOUT-TORNADO"            
## [107] "LIGHTNING INJURY"               "LIGHTNING AND THUNDERSTORM WIN"
## [109] "FLASH FLOODS"                   "THUNDERSTORM WINDS53"          
## [111] "WILDFIRE"                       "DAMAGING FREEZE"               
## [113] "THUNDERSTORM WINDS 13"          "HURRICANE"                     
## [115] "SNOW"                           "LIGNTNING"                     
## [117] "FROST"                          "FREEZING RAIN/SNOW"            
## [119] "HIGH WINDS/"                    "THUNDERSNOW"                   
## [121] "FLOODS"                         "COOL AND WET"                  
## [123] "HEAVY RAIN/SNOW"                "GLAZE ICE"                     
## [125] "MUD SLIDE"                      "HIGH  WINDS"                   
## [127] "RURAL FLOOD"                    "MUD SLIDES"                    
## [129] "EXTREME HEAT"                   "DROUGHT"                       
## [131] "COLD AND WET CONDITIONS"        "EXCESSIVE WETNESS"             
## [133] "SLEET/ICE STORM"                "GUSTNADO"                      
## [135] "FREEZING RAIN"                  "SNOW AND HEAVY SNOW"           
## [137] "GROUND BLIZZARD"                "EXTREME WIND CHILL"            
## [139] "MAJOR FLOOD"                    "SNOW/HEAVY SNOW"               
## [141] "FREEZING RAIN/SLEET"            "ICE JAM FLOODING"              
## [143] "COLD AIR TORNADO"               "WIND DAMAGE"                   
## [145] "FOG"                            "TSTM WIND 55"                  
## [147] "SMALL STREAM FLOOD"             "THUNDERTORM WINDS"             
## [149] "HAIL/WINDS"                     "SNOW AND ICE"                  
## [151] "WIND STORM"                     "GRASS FIRES"                   
## [153] "LAKE FLOOD"                     "HAIL/WIND"                     
## [155] "WIND/HAIL"                      "ICE"                           
## [157] "SNOW AND ICE STORM"             "THUNDERSTORM  WINDS"           
## [159] "WINTER WEATHER"                 "DROUGHT/EXCESSIVE HEAT"        
## [161] "THUNDERSTORMS WIND"             "TUNDERSTORM WIND"              
## [163] "URBAN AND SMALL STREAM FLOODIN" "THUNDERSTORM WIND/LIGHTNING"   
## [165] "HEAVY RAIN/SEVERE WEATHER"      "THUNDERSTORM"                  
## [167] "WATERSPOUT/ TORNADO"            "LIGHTNING."                    
## [169] "HURRICANE-GENERATED SWELLS"     "RIVER AND STREAM FLOOD"        
## [171] "HIGH WINDS/COASTAL FLOOD"       "RAIN"                          
## [173] "RIVER FLOODING"                 "ICE FLOES"                     
## [175] "THUNDERSTORM WIND G50"          "LIGHTNING FIRE"                
## [177] "HEAVY LAKE SNOW"                "RECORD COLD"                   
## [179] "HEAVY SNOW/FREEZING RAIN"       "COLD WAVE"                     
## [181] "DUST DEVIL WATERSPOUT"          "TORNADO F3"                    
## [183] "TORNDAO"                        "FLOOD/RIVER FLOOD"             
## [185] "MUD SLIDES URBAN FLOODING"      "TORNADO F1"                    
## [187] "GLAZE/ICE STORM"                "GLAZE"                         
## [189] "HEAVY SNOW/WINTER STORM"        "MICROBURST"                    
## [191] "AVALANCE"                       "BLIZZARD/WINTER STORM"         
## [193] "DUST STORM/HIGH WINDS"          "ICE JAM"                       
## [195] "FOREST FIRES"                   "FROST\\FREEZE"                 
## [197] "THUNDERSTORM WINDS."            "HVY RAIN"                      
## [199] "HAIL 150"                       "HAIL 075"                      
## [201] "HAIL 100"                       "THUNDERSTORM WIND G55"         
## [203] "HAIL 125"                       "THUNDERSTORM WIND G60"         
## [205] "THUNDERSTORM WINDS G60"         "HARD FREEZE"                   
## [207] "HAIL 200"                       "HEAVY SNOW AND HIGH WINDS"     
## [209] "HEAVY SNOW/HIGH WINDS & FLOOD"  "HEAVY RAIN AND FLOOD"          
## [211] "RIP CURRENTS/HEAVY SURF"        "URBAN AND SMALL"               
## [213] "WILDFIRES"                      "FOG AND COLD TEMPERATURES"     
## [215] "SNOW/COLD"                      "FLASH FLOOD FROM ICE JAMS"     
## [217] "TSTM WIND G58"                  "MUDSLIDE"                      
## [219] "HEAVY SNOW SQUALLS"             "SNOW SQUALL"                   
## [221] "SNOW/ICE STORM"                 "HEAVY SNOW/SQUALLS"            
## [223] "HEAVY SNOW-SQUALLS"             "ICY ROADS"                     
## [225] "HEAVY MIX"                      "SNOW FREEZING RAIN"            
## [227] "SNOW/SLEET"                     "SNOW/FREEZING RAIN"            
## [229] "SNOW SQUALLS"                   "SNOW/SLEET/FREEZING RAIN"      
## [231] "RECORD SNOW"                    "HAIL 0.75"                     
## [233] "RECORD HEAT"                    "THUNDERSTORM WIND 65MPH"       
## [235] "THUNDERSTORM WIND/ TREES"       "THUNDERSTORM WIND/AWNING"      
## [237] "THUNDERSTORM WIND 98 MPH"       "THUNDERSTORM WIND TREES"       
## [239] "TORNADO F2"                     "RIP CURRENTS"                  
## [241] "HURRICANE EMILY"                "COASTAL SURGE"                 
## [243] "HURRICANE GORDON"               "HURRICANE FELIX"               
## [245] "THUNDERSTORM WIND 60 MPH"       "THUNDERSTORM WINDS 63 MPH"     
## [247] "THUNDERSTORM WIND/ TREE"        "THUNDERSTORM DAMAGE TO"        
## [249] "THUNDERSTORM WIND 65 MPH"       "FLASH FLOOD - HEAVY RAIN"      
## [251] "THUNDERSTORM WIND."             "FLASH FLOOD/ STREET"           
## [253] "BLOWING SNOW"                   "HEAVY SNOW/BLIZZARD"           
## [255] "THUNDERSTORM HAIL"              "THUNDERSTORM WINDSHAIL"        
## [257] "LIGHTNING  WAUSEON"             "THUDERSTORM WINDS"             
## [259] "ICE AND SNOW"                   "STORM FORCE WINDS"             
## [261] "HEAVY SNOW/ICE"                 "LIGHTING"                      
## [263] "HIGH WIND/HEAVY SNOW"           "THUNDERSTORM WINDS AND"        
## [265] "HEAVY PRECIPITATION"            "HIGH WIND/BLIZZARD"            
## [267] "TSTM WIND DAMAGE"               "FLOOD FLASH"                   
## [269] "RAIN/WIND"                      "SNOW/ICE"                      
## [271] "HAIL 75"                        "HEAT WAVE DROUGHT"             
## [273] "HEAVY SNOW/BLIZZARD/AVALANCHE"  "HEAT WAVES"                    
## [275] "UNSEASONABLY WARM AND DRY"      "UNSEASONABLY COLD"             
## [277] "RECORD/EXCESSIVE HEAT"          "THUNDERSTORM WIND G52"         
## [279] "HIGH WAVES"                     "FLASH FLOOD/FLOOD"             
## [281] "FLOOD/FLASH"                    "LOW TEMPERATURE"               
## [283] "HEAVY RAINS/FLOODING"           "THUNDERESTORM WINDS"           
## [285] "THUNDERSTORM WINDS/FLOODING"    "HYPOTHERMIA"                   
## [287] "THUNDEERSTORM WINDS"            "THUNERSTORM WINDS"             
## [289] "HIGH WINDS/COLD"                "COLD/WINDS"                    
## [291] "SNOW/ BITTER COLD"              "COLD WEATHER"                  
## [293] "RAPIDLY RISING WATER"           "WILD/FOREST FIRE"              
## [295] "ICE/STRONG WINDS"               "SNOW/HIGH WINDS"               
## [297] "HIGH WINDS/SNOW"                "SNOWMELT FLOODING"             
## [299] "HEAVY SNOW AND STRONG WINDS"    "SNOW ACCUMULATION"             
## [301] "SNOW/ ICE"                      "SNOW/BLOWING SNOW"             
## [303] "TORNADOES"                      "THUNDERSTORM WIND/HAIL"        
## [305] "FREEZING DRIZZLE"               "HAIL 175"                      
## [307] "FLASH FLOODING/FLOOD"           "HAIL 275"                      
## [309] "HAIL 450"                       "EXCESSIVE RAINFALL"            
## [311] "THUNDERSTORMW"                  "HAILSTORM"                     
## [313] "TSTM WINDS"                     "TSTMW"                         
## [315] "TSTM WIND 65)"                  "TROPICAL STORM DEAN"           
## [317] "THUNDERSTORM WINDS/ FLOOD"      "LANDSLIDE"                     
## [319] "HIGH WIND AND SEAS"             "THUNDERSTORMWINDS"             
## [321] "WILD/FOREST FIRES"              "HEAVY SEAS"                    
## [323] "HAIL DAMAGE"                    "FLOOD & HEAVY RAIN"            
## [325] "?"                              "THUNDERSTROM WIND"             
## [327] "FLOOD/FLASHFLOOD"               "HIGH WATER"                    
## [329] "HIGH WIND 48"                   "LANDSLIDES"                    
## [331] "URBAN/SMALL STREAM"             "BRUSH FIRE"                    
## [333] "HEAVY SHOWER"                   "HEAVY SWELLS"                  
## [335] "URBAN SMALL"                    "URBAN FLOODS"                  
## [337] "FLASH FLOOD/LANDSLIDE"          "HEAVY RAIN/SMALL STREAM URBAN" 
## [339] "FLASH FLOOD LANDSLIDES"         "TSTM WIND/HAIL"                
## [341] "Other"                          "Ice jam flood (minor"          
## [343] "Tstm Wind"                      "URBAN/SML STREAM FLD"          
## [345] "ROUGH SURF"                     "Heavy Surf"                    
## [347] "Dust Devil"                     "Marine Accident"               
## [349] "Freeze"                         "Strong Wind"                   
## [351] "COASTAL STORM"                  "Erosion/Cstl Flood"            
## [353] "River Flooding"                 "Damaging Freeze"               
## [355] "Beach Erosion"                  "High Surf"                     
## [357] "Heavy Rain/High Surf"           "Unseasonable Cold"             
## [359] "Early Frost"                    "Wintry Mix"                    
## [361] "Extreme Cold"                   "Coastal Flooding"              
## [363] "Torrential Rainfall"            "Landslump"                     
## [365] "Hurricane Edouard"              "Coastal Storm"                 
## [367] "TIDAL FLOODING"                 "Tidal Flooding"                
## [369] "Strong Winds"                   "EXTREME WINDCHILL"             
## [371] "Glaze"                          "Extended Cold"                 
## [373] "Whirlwind"                      "Heavy snow shower"             
## [375] "Light snow"                     "Light Snow"                    
## [377] "MIXED PRECIP"                   "Freezing Spray"                
## [379] "DOWNBURST"                      "Mudslides"                     
## [381] "Microburst"                     "Mudslide"                      
## [383] "Cold"                           "Coastal Flood"                 
## [385] "Snow Squalls"                   "Wind Damage"                   
## [387] "Light Snowfall"                 "Freezing Drizzle"              
## [389] "Gusty wind/rain"                "GUSTY WIND/HVY RAIN"           
## [391] "Wind"                           "Cold Temperature"              
## [393] "Heat Wave"                      "Snow"                          
## [395] "COLD AND SNOW"                  "RAIN/SNOW"                     
## [397] "TSTM WIND (G45)"                "Gusty Winds"                   
## [399] "GUSTY WIND"                     "TSTM WIND 40"                  
## [401] "TSTM WIND 45"                   "TSTM WIND (41)"                
## [403] "TSTM WIND (G40)"                "Frost/Freeze"                  
## [405] "AGRICULTURAL FREEZE"            "OTHER"                         
## [407] "Hypothermia/Exposure"           "HYPOTHERMIA/EXPOSURE"          
## [409] "Lake Effect Snow"               "Freezing Rain"                 
## [411] "Mixed Precipitation"            "BLACK ICE"                     
## [413] "COASTALSTORM"                   "LIGHT SNOW"                    
## [415] "DAM BREAK"                      "Gusty winds"                   
## [417] "blowing snow"                   "GRADIENT WIND"                 
## [419] "TSTM WIND AND LIGHTNING"        "gradient wind"                 
## [421] "Gradient wind"                  "Freezing drizzle"              
## [423] "WET MICROBURST"                 "Heavy surf and wind"           
## [425] "TYPHOON"                        "HIGH SWELLS"                   
## [427] "SMALL HAIL"                     "UNSEASONAL RAIN"               
## [429] "COASTAL FLOODING/EROSION"       " TSTM WIND (G45)"              
## [431] "TSTM WIND  (G45)"               "HIGH WIND (G40)"               
## [433] "TSTM WIND (G35)"                "COASTAL EROSION"               
## [435] "SEICHE"                         "COASTAL  FLOODING/EROSION"     
## [437] "HYPERTHERMIA/EXPOSURE"          "WINTRY MIX"                    
## [439] "ROCK SLIDE"                     "GUSTY WIND/HAIL"               
## [441] " TSTM WIND"                     "LANDSPOUT"                     
## [443] "EXCESSIVE SNOW"                 "LAKE EFFECT SNOW"              
## [445] "FLOOD/FLASH/FLOOD"              "MIXED PRECIPITATION"           
## [447] "WIND AND WAVE"                  "LIGHT FREEZING RAIN"           
## [449] "ICE ROADS"                      "ROUGH SEAS"                    
## [451] "TSTM WIND G45"                  "NON-SEVERE WIND DAMAGE"        
## [453] "WARM WEATHER"                   "THUNDERSTORM WIND (G40)"       
## [455] " FLASH FLOOD"                   "LATE SEASON SNOW"              
## [457] "WINTER WEATHER MIX"             "ROGUE WAVE"                    
## [459] "FALLING SNOW/ICE"               "NON-TSTM WIND"                 
## [461] "NON TSTM WIND"                  "BLOWING DUST"                  
## [463] "VOLCANIC ASH"                   "   HIGH SURF ADVISORY"         
## [465] "HAZARDOUS SURF"                 "WHIRLWIND"                     
## [467] "ICE ON ROAD"                    "DROWNING"                      
## [469] "EXTREME COLD/WIND CHILL"        "MARINE TSTM WIND"              
## [471] "HURRICANE/TYPHOON"              "WINTER WEATHER/MIX"            
## [473] "FROST/FREEZE"                   "ASTRONOMICAL HIGH TIDE"        
## [475] "HEAVY SURF/HIGH SURF"           "TROPICAL DEPRESSION"           
## [477] "LAKE-EFFECT SNOW"               "MARINE HIGH WIND"              
## [479] "TSUNAMI"                        "STORM SURGE/TIDE"              
## [481] "COLD/WIND CHILL"                "LAKESHORE FLOOD"               
## [483] "MARINE THUNDERSTORM WIND"       "MARINE STRONG WIND"            
## [485] "ASTRONOMICAL LOW TIDE"          "DENSE SMOKE"                   
## [487] "MARINE HAIL"                    "FREEZING FOG"
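Several of the entries above differ only in letter case or spacing, for example "Light snow" and "Light Snow", or " TSTM WIND" and "TSTM WIND". A simple normalisation pass, sketched here with a hypothetical helper `normalise_evtype`, would merge such variants before any further matching. It does not address misspellings or combined event types.

```r
# Hypothetical helper: harmonise case and whitespace in event type labels
normalise_evtype <- function(x) {
  x <- toupper(x)              # "Light snow" -> "LIGHT SNOW"
  x <- gsub("\\s+", " ", x)    # collapse runs of whitespace to one space
  trimws(x)                    # drop leading and trailing spaces
}

# A few of the variants seen in the listing above
raw <- c(" TSTM WIND", "TSTM WIND", "HIGH  WINDS",
         "Light snow", "Light Snow", "LIGHT SNOW")

cleaned <- unique(normalise_evtype(raw))
```

In this small example the six raw labels reduce to three distinct values.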

These are not the same categories as in the documentation. A vector of event types is therefore created manually from the Storm Data Event Table given in the National Weather Service Storm Data Documentation:

EventNames <- c("Astronomical Low Tide", "Avalanche","Blizzard", "Coastal Flood", "Cold/Wind Chill",
                "Debris Flow", "Dense Fog", "Dense Smoke", "Drought", "Dust Devil", "Dust Storm",
                "Excessive Heat", "Extreme Cold/Wind Chill",
                "Flash Flood", "Flood", "Frost/Freeze", "Funnel Cloud", "Freezing Fog",
                "Hail", "Heat", "Heavy Rain", "Heavy Snow", "High Surf", "High Wind", "Hurricane (Typhoon)",
                "Ice Storm", "Lake-Effect Snow", "Lakeshore Flood", "Lightning", 
                "Marine Hail", "Marine High Wind", "Marine Strong Wind", "Marine Thunderstorm Wind",
                "Rip Current",
                "Seiche", "Sleet", "Storm Surge/Tide", "Strong Wind",
                "Thunderstorm Wind", "Tornado", "Tropical Depression", "Tropical Storm", "Tsunami",
                "Volcanic Ash",
                "Waterspout", "Wildfire", "Winter Storm", "Winter Weather")

The names are changed to upper case to match the format of the events in the data, and the manually created vector is checked against the documentation; there should be 48 event types:

EventNames <- toupper(EventNames)
summary(EventNames)
##    Length     Class      Mode 
##        48 character character
unique(EventNames)
##  [1] "ASTRONOMICAL LOW TIDE"    "AVALANCHE"               
##  [3] "BLIZZARD"                 "COASTAL FLOOD"           
##  [5] "COLD/WIND CHILL"          "DEBRIS FLOW"             
##  [7] "DENSE FOG"                "DENSE SMOKE"             
##  [9] "DROUGHT"                  "DUST DEVIL"              
## [11] "DUST STORM"               "EXCESSIVE HEAT"          
## [13] "EXTREME COLD/WIND CHILL"  "FLASH FLOOD"             
## [15] "FLOOD"                    "FROST/FREEZE"            
## [17] "FUNNEL CLOUD"             "FREEZING FOG"            
## [19] "HAIL"                     "HEAT"                    
## [21] "HEAVY RAIN"               "HEAVY SNOW"              
## [23] "HIGH SURF"                "HIGH WIND"               
## [25] "HURRICANE (TYPHOON)"      "ICE STORM"               
## [27] "LAKE-EFFECT SNOW"         "LAKESHORE FLOOD"         
## [29] "LIGHTNING"                "MARINE HAIL"             
## [31] "MARINE HIGH WIND"         "MARINE STRONG WIND"      
## [33] "MARINE THUNDERSTORM WIND" "RIP CURRENT"             
## [35] "SEICHE"                   "SLEET"                   
## [37] "STORM SURGE/TIDE"         "STRONG WIND"             
## [39] "THUNDERSTORM WIND"        "TORNADO"                 
## [41] "TROPICAL DEPRESSION"      "TROPICAL STORM"          
## [43] "TSUNAMI"                  "VOLCANIC ASH"            
## [45] "WATERSPOUT"               "WILDFIRE"                
## [47] "WINTER STORM"             "WINTER WEATHER"
str(EventNames)
##  chr [1:48] "ASTRONOMICAL LOW TIDE" "AVALANCHE" "BLIZZARD" ...

We now have a character vector of the permissible event names, which we can use as a reference to match the EVTYPE values against.
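As a minimal sketch of what matching against such a reference vector could look like, the snippet below uses a small made-up sample rather than the storm data itself. The names `permitted` and `observed` are hypothetical stand-ins for `EventNames` and the `EVTYPE` column; `%in%` and `setdiff()` are base R.

```r
# Hypothetical stand-ins: a few permitted names and a few raw values
permitted <- c("FLASH FLOOD", "HAIL", "THUNDERSTORM WIND", "TORNADO")
observed  <- c("TORNADO", "TSTM WIND", "HAIL", "TORNADO F0", " FLASH FLOOD")

# TRUE where the raw value is already a permitted event name
is_permitted <- observed %in% permitted

# Raw values that still need cleaning or manual mapping
unmatched <- setdiff(observed, permitted)

# Share of raw values that match a permitted name exactly
match_rate <- mean(is_permitted)
```

Here `unmatched` holds the abbreviated, suffixed and space-padded variants, which is the kind of value the cleaning steps need to deal with.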

unique(StormDataSubset$EVTYPE)
##   [1] "TORNADO"                        "TSTM WIND"                     
##   [3] "HAIL"                           "ICE STORM/FLASH FLOOD"         
##   [5] "WINTER STORM"                   "HURRICANE OPAL/HIGH WINDS"     
##   [7] "THUNDERSTORM WINDS"             "HURRICANE ERIN"                
##   [9] "HURRICANE OPAL"                 "HEAVY RAIN"                    
##  [11] "LIGHTNING"                      "THUNDERSTORM WIND"             
##  [13] "DENSE FOG"                      "RIP CURRENT"                   
##  [15] "THUNDERSTORM WINS"              "FLASH FLOODING"                
##  [17] "FLASH FLOOD"                    "TORNADO F0"                    
##  [19] "THUNDERSTORM WINDS LIGHTNING"   "THUNDERSTORM WINDS/HAIL"       
##  [21] "HEAT"                           "HIGH WINDS"                    
##  [23] "WIND"                           "HEAVY RAINS"                   
##  [25] "LIGHTNING AND HEAVY RAIN"       "THUNDERSTORM WINDS HAIL"       
##  [27] "COLD"                           "HEAVY RAIN/LIGHTNING"          
##  [29] "FLASH FLOODING/THUNDERSTORM WI" "FLOODING"                      
##  [31] "WATERSPOUT"                     "EXTREME COLD"                  
##  [33] "LIGHTNING/HEAVY RAIN"           "BREAKUP FLOODING"              
##  [35] "HIGH WIND"                      "FREEZE"                        
##  [37] "RIVER FLOOD"                    "HIGH WINDS HEAVY RAINS"        
##  [39] "AVALANCHE"                      "MARINE MISHAP"                 
##  [41] "HIGH TIDES"                     "HIGH WIND/SEAS"                
##  [43] "HIGH WINDS/HEAVY RAIN"          "HIGH SEAS"                     
##  [45] "COASTAL FLOOD"                  "SEVERE TURBULENCE"             
##  [47] "RECORD RAINFALL"                "HEAVY SNOW"                    
##  [49] "HEAVY SNOW/WIND"                "DUST STORM"                    
##  [51] "FLOOD"                          "APACHE COUNTY"                 
##  [53] "SLEET"                          "DUST DEVIL"                    
##  [55] "ICE STORM"                      "EXCESSIVE HEAT"                
##  [57] "THUNDERSTORM WINDS/FUNNEL CLOU" "GUSTY WINDS"                   
##  [59] "FLOODING/HEAVY RAIN"            "HEAVY SURF COASTAL FLOODING"   
##  [61] "HIGH SURF"                      "WILD FIRES"                    
##  [63] "HIGH"                           "WINTER STORM HIGH WINDS"       
##  [65] "WINTER STORMS"                  "MUDSLIDES"                     
##  [67] "RAINSTORM"                      "SEVERE THUNDERSTORM"           
##  [69] "SEVERE THUNDERSTORMS"           "SEVERE THUNDERSTORM WINDS"     
##  [71] "THUNDERSTORMS WINDS"            "FLOOD/FLASH FLOOD"             
##  [73] "FLOOD/RAIN/WINDS"               "THUNDERSTORMS"                 
##  [75] "FLASH FLOOD WINDS"              "WINDS"                         
##  [77] "FUNNEL CLOUD"                   "HIGH WIND DAMAGE"              
##  [79] "STRONG WIND"                    "HEAVY SNOWPACK"                
##  [81] "FLASH FLOOD/"                   "HEAVY SURF"                    
##  [83] "DRY MIRCOBURST WINDS"           "DRY MICROBURST"                
##  [85] "URBAN FLOOD"                    "THUNDERSTORM WINDSS"           
##  [87] "MICROBURST WINDS"               "HEAT WAVE"                     
##  [89] "UNSEASONABLY WARM"              "COASTAL FLOODING"              
##  [91] "STRONG WINDS"                   "BLIZZARD"                      
##  [93] "WATERSPOUT/TORNADO"             "WATERSPOUT TORNADO"            
##  [95] "STORM SURGE"                    "URBAN/SMALL STREAM FLOOD"      
##  [97] "WATERSPOUT-"                    "TORNADOES, TSTM WIND, HAIL"    
##  [99] "TROPICAL STORM ALBERTO"         "TROPICAL STORM"                
## [101] "TROPICAL STORM GORDON"          "TROPICAL STORM JERRY"          
## [103] "LIGHTNING THUNDERSTORM WINDS"   "URBAN FLOODING"                
## [105] "MINOR FLOODING"                 "WATERSPOUT-TORNADO"            
## [107] "LIGHTNING INJURY"               "LIGHTNING AND THUNDERSTORM WIN"
## [109] "FLASH FLOODS"                   "THUNDERSTORM WINDS53"          
## [111] "WILDFIRE"                       "DAMAGING FREEZE"               
## [113] "THUNDERSTORM WINDS 13"          "HURRICANE"                     
## [115] "SNOW"                           "LIGNTNING"                     
## [117] "FROST"                          "FREEZING RAIN/SNOW"            
## [119] "HIGH WINDS/"                    "THUNDERSNOW"                   
## [121] "FLOODS"                         "COOL AND WET"                  
## [123] "HEAVY RAIN/SNOW"                "GLAZE ICE"                     
## [125] "MUD SLIDE"                      "HIGH  WINDS"                   
## [127] "RURAL FLOOD"                    "MUD SLIDES"                    
## [129] "EXTREME HEAT"                   "DROUGHT"                       
## [131] "COLD AND WET CONDITIONS"        "EXCESSIVE WETNESS"             
## [133] "SLEET/ICE STORM"                "GUSTNADO"                      
## [135] "FREEZING RAIN"                  "SNOW AND HEAVY SNOW"           
## [137] "GROUND BLIZZARD"                "EXTREME WIND CHILL"            
## [139] "MAJOR FLOOD"                    "SNOW/HEAVY SNOW"               
## [141] "FREEZING RAIN/SLEET"            "ICE JAM FLOODING"              
## [143] "COLD AIR TORNADO"               "WIND DAMAGE"                   
## [145] "FOG"                            "TSTM WIND 55"                  
## [147] "SMALL STREAM FLOOD"             "THUNDERTORM WINDS"             
## [149] "HAIL/WINDS"                     "SNOW AND ICE"                  
## [151] "WIND STORM"                     "GRASS FIRES"                   
## [153] "LAKE FLOOD"                     "HAIL/WIND"                     
## [155] "WIND/HAIL"                      "ICE"                           
## [157] "SNOW AND ICE STORM"             "THUNDERSTORM  WINDS"           
## [159] "WINTER WEATHER"                 "DROUGHT/EXCESSIVE HEAT"        
## [161] "THUNDERSTORMS WIND"             "TUNDERSTORM WIND"              
## [163] "URBAN AND SMALL STREAM FLOODIN" "THUNDERSTORM WIND/LIGHTNING"   
## [165] "HEAVY RAIN/SEVERE WEATHER"      "THUNDERSTORM"                  
## [167] "WATERSPOUT/ TORNADO"            "LIGHTNING."                    
## [169] "HURRICANE-GENERATED SWELLS"     "RIVER AND STREAM FLOOD"        
## [171] "HIGH WINDS/COASTAL FLOOD"       "RAIN"                          
## [173] "RIVER FLOODING"                 "ICE FLOES"                     
## [175] "THUNDERSTORM WIND G50"          "LIGHTNING FIRE"                
## [177] "HEAVY LAKE SNOW"                "RECORD COLD"                   
## [179] "HEAVY SNOW/FREEZING RAIN"       "COLD WAVE"                     
## [181] "DUST DEVIL WATERSPOUT"          "TORNADO F3"                    
## [183] "TORNDAO"                        "FLOOD/RIVER FLOOD"             
## [185] "MUD SLIDES URBAN FLOODING"      "TORNADO F1"                    
## [187] "GLAZE/ICE STORM"                "GLAZE"                         
## [189] "HEAVY SNOW/WINTER STORM"        "MICROBURST"                    
## [191] "AVALANCE"                       "BLIZZARD/WINTER STORM"         
## [193] "DUST STORM/HIGH WINDS"          "ICE JAM"                       
## [195] "FOREST FIRES"                   "FROST\\FREEZE"                 
## [197] "THUNDERSTORM WINDS."            "HVY RAIN"                      
## [199] "HAIL 150"                       "HAIL 075"                      
## [201] "HAIL 100"                       "THUNDERSTORM WIND G55"         
## [203] "HAIL 125"                       "THUNDERSTORM WIND G60"         
## [205] "THUNDERSTORM WINDS G60"         "HARD FREEZE"                   
## [207] "HAIL 200"                       "HEAVY SNOW AND HIGH WINDS"     
## [209] "HEAVY SNOW/HIGH WINDS & FLOOD"  "HEAVY RAIN AND FLOOD"          
## [211] "RIP CURRENTS/HEAVY SURF"        "URBAN AND SMALL"               
## [213] "WILDFIRES"                      "FOG AND COLD TEMPERATURES"     
## [215] "SNOW/COLD"                      "FLASH FLOOD FROM ICE JAMS"     
## [217] "TSTM WIND G58"                  "MUDSLIDE"                      
## [219] "HEAVY SNOW SQUALLS"             "SNOW SQUALL"                   
## [221] "SNOW/ICE STORM"                 "HEAVY SNOW/SQUALLS"            
## [223] "HEAVY SNOW-SQUALLS"             "ICY ROADS"                     
## [225] "HEAVY MIX"                      "SNOW FREEZING RAIN"            
## [227] "SNOW/SLEET"                     "SNOW/FREEZING RAIN"            
## [229] "SNOW SQUALLS"                   "SNOW/SLEET/FREEZING RAIN"      
## [231] "RECORD SNOW"                    "HAIL 0.75"                     
## [233] "RECORD HEAT"                    "THUNDERSTORM WIND 65MPH"       
## [235] "THUNDERSTORM WIND/ TREES"       "THUNDERSTORM WIND/AWNING"      
## [237] "THUNDERSTORM WIND 98 MPH"       "THUNDERSTORM WIND TREES"       
## [239] "TORNADO F2"                     "RIP CURRENTS"                  
## [241] "HURRICANE EMILY"                "COASTAL SURGE"                 
## [243] "HURRICANE GORDON"               "HURRICANE FELIX"               
## [245] "THUNDERSTORM WIND 60 MPH"       "THUNDERSTORM WINDS 63 MPH"     
## [247] "THUNDERSTORM WIND/ TREE"        "THUNDERSTORM DAMAGE TO"        
## [249] "THUNDERSTORM WIND 65 MPH"       "FLASH FLOOD - HEAVY RAIN"      
## [251] "THUNDERSTORM WIND."             "FLASH FLOOD/ STREET"           
## [253] "BLOWING SNOW"                   "HEAVY SNOW/BLIZZARD"           
## [255] "THUNDERSTORM HAIL"              "THUNDERSTORM WINDSHAIL"        
## [257] "LIGHTNING  WAUSEON"             "THUDERSTORM WINDS"             
## [259] "ICE AND SNOW"                   "STORM FORCE WINDS"             
## [261] "HEAVY SNOW/ICE"                 "LIGHTING"                      
## [263] "HIGH WIND/HEAVY SNOW"           "THUNDERSTORM WINDS AND"        
## [265] "HEAVY PRECIPITATION"            "HIGH WIND/BLIZZARD"            
## [267] "TSTM WIND DAMAGE"               "FLOOD FLASH"                   
## [269] "RAIN/WIND"                      "SNOW/ICE"                      
## [271] "HAIL 75"                        "HEAT WAVE DROUGHT"             
## [273] "HEAVY SNOW/BLIZZARD/AVALANCHE"  "HEAT WAVES"                    
## [275] "UNSEASONABLY WARM AND DRY"      "UNSEASONABLY COLD"             
## [277] "RECORD/EXCESSIVE HEAT"          "THUNDERSTORM WIND G52"         
## [279] "HIGH WAVES"                     "FLASH FLOOD/FLOOD"             
## [281] "FLOOD/FLASH"                    "LOW TEMPERATURE"               
## [283] "HEAVY RAINS/FLOODING"           "THUNDERESTORM WINDS"           
## [285] "THUNDERSTORM WINDS/FLOODING"    "HYPOTHERMIA"                   
## [287] "THUNDEERSTORM WINDS"            "THUNERSTORM WINDS"             
## [289] "HIGH WINDS/COLD"                "COLD/WINDS"                    
## [291] "SNOW/ BITTER COLD"              "COLD WEATHER"                  
## [293] "RAPIDLY RISING WATER"           "WILD/FOREST FIRE"              
## [295] "ICE/STRONG WINDS"               "SNOW/HIGH WINDS"               
## [297] "HIGH WINDS/SNOW"                "SNOWMELT FLOODING"             
## [299] "HEAVY SNOW AND STRONG WINDS"    "SNOW ACCUMULATION"             
## [301] "SNOW/ ICE"                      "SNOW/BLOWING SNOW"             
## [303] "TORNADOES"                      "THUNDERSTORM WIND/HAIL"        
## [305] "FREEZING DRIZZLE"               "HAIL 175"                      
## [307] "FLASH FLOODING/FLOOD"           "HAIL 275"                      
## [309] "HAIL 450"                       "EXCESSIVE RAINFALL"            
## [311] "THUNDERSTORMW"                  "HAILSTORM"                     
## [313] "TSTM WINDS"                     "TSTMW"                         
## [315] "TSTM WIND 65)"                  "TROPICAL STORM DEAN"           
## [317] "THUNDERSTORM WINDS/ FLOOD"      "LANDSLIDE"                     
## [319] "HIGH WIND AND SEAS"             "THUNDERSTORMWINDS"             
## [321] "WILD/FOREST FIRES"              "HEAVY SEAS"                    
## [323] "HAIL DAMAGE"                    "FLOOD & HEAVY RAIN"            
## [325] "?"                              "THUNDERSTROM WIND"             
## [327] "FLOOD/FLASHFLOOD"               "HIGH WATER"                    
## [329] "HIGH WIND 48"                   "LANDSLIDES"                    
## [331] "URBAN/SMALL STREAM"             "BRUSH FIRE"                    
## [333] "HEAVY SHOWER"                   "HEAVY SWELLS"                  
## [335] "URBAN SMALL"                    "URBAN FLOODS"                  
## [337] "FLASH FLOOD/LANDSLIDE"          "HEAVY RAIN/SMALL STREAM URBAN" 
## [339] "FLASH FLOOD LANDSLIDES"         "TSTM WIND/HAIL"                
## [341] "Other"                          "Ice jam flood (minor"          
## [343] "Tstm Wind"                      "URBAN/SML STREAM FLD"          
## [345] "ROUGH SURF"                     "Heavy Surf"                    
## [347] "Dust Devil"                     "Marine Accident"               
## [349] "Freeze"                         "Strong Wind"                   
## [351] "COASTAL STORM"                  "Erosion/Cstl Flood"            
## [353] "River Flooding"                 "Damaging Freeze"               
## [355] "Beach Erosion"                  "High Surf"                     
## [357] "Heavy Rain/High Surf"           "Unseasonable Cold"             
## [359] "Early Frost"                    "Wintry Mix"                    
## [361] "Extreme Cold"                   "Coastal Flooding"              
## [363] "Torrential Rainfall"            "Landslump"                     
## [365] "Hurricane Edouard"              "Coastal Storm"                 
## [367] "TIDAL FLOODING"                 "Tidal Flooding"                
## [369] "Strong Winds"                   "EXTREME WINDCHILL"             
## [371] "Glaze"                          "Extended Cold"                 
## [373] "Whirlwind"                      "Heavy snow shower"             
## [375] "Light snow"                     "Light Snow"                    
## [377] "MIXED PRECIP"                   "Freezing Spray"                
## [379] "DOWNBURST"                      "Mudslides"                     
## [381] "Microburst"                     "Mudslide"                      
## [383] "Cold"                           "Coastal Flood"                 
## [385] "Snow Squalls"                   "Wind Damage"                   
## [387] "Light Snowfall"                 "Freezing Drizzle"              
## [389] "Gusty wind/rain"                "GUSTY WIND/HVY RAIN"           
## [391] "Wind"                           "Cold Temperature"              
## [393] "Heat Wave"                      "Snow"                          
## [395] "COLD AND SNOW"                  "RAIN/SNOW"                     
## [397] "TSTM WIND (G45)"                "Gusty Winds"                   
## [399] "GUSTY WIND"                     "TSTM WIND 40"                  
## [401] "TSTM WIND 45"                   "TSTM WIND (41)"                
## [403] "TSTM WIND (G40)"                "Frost/Freeze"                  
## [405] "AGRICULTURAL FREEZE"            "OTHER"                         
## [407] "Hypothermia/Exposure"           "HYPOTHERMIA/EXPOSURE"          
## [409] "Lake Effect Snow"               "Freezing Rain"                 
## [411] "Mixed Precipitation"            "BLACK ICE"                     
## [413] "COASTALSTORM"                   "LIGHT SNOW"                    
## [415] "DAM BREAK"                      "Gusty winds"                   
## [417] "blowing snow"                   "GRADIENT WIND"                 
## [419] "TSTM WIND AND LIGHTNING"        "gradient wind"                 
## [421] "Gradient wind"                  "Freezing drizzle"              
## [423] "WET MICROBURST"                 "Heavy surf and wind"           
## [425] "TYPHOON"                        "HIGH SWELLS"                   
## [427] "SMALL HAIL"                     "UNSEASONAL RAIN"               
## [429] "COASTAL FLOODING/EROSION"       " TSTM WIND (G45)"              
## [431] "TSTM WIND  (G45)"               "HIGH WIND (G40)"               
## [433] "TSTM WIND (G35)"                "COASTAL EROSION"               
## [435] "SEICHE"                         "COASTAL  FLOODING/EROSION"     
## [437] "HYPERTHERMIA/EXPOSURE"          "WINTRY MIX"                    
## [439] "ROCK SLIDE"                     "GUSTY WIND/HAIL"               
## [441] " TSTM WIND"                     "LANDSPOUT"                     
## [443] "EXCESSIVE SNOW"                 "LAKE EFFECT SNOW"              
## [445] "FLOOD/FLASH/FLOOD"              "MIXED PRECIPITATION"           
## [447] "WIND AND WAVE"                  "LIGHT FREEZING RAIN"           
## [449] "ICE ROADS"                      "ROUGH SEAS"                    
## [451] "TSTM WIND G45"                  "NON-SEVERE WIND DAMAGE"        
## [453] "WARM WEATHER"                   "THUNDERSTORM WIND (G40)"       
## [455] " FLASH FLOOD"                   "LATE SEASON SNOW"              
## [457] "WINTER WEATHER MIX"             "ROGUE WAVE"                    
## [459] "FALLING SNOW/ICE"               "NON-TSTM WIND"                 
## [461] "NON TSTM WIND"                  "BLOWING DUST"                  
## [463] "VOLCANIC ASH"                   "   HIGH SURF ADVISORY"         
## [465] "HAZARDOUS SURF"                 "WHIRLWIND"                     
## [467] "ICE ON ROAD"                    "DROWNING"                      
## [469] "EXTREME COLD/WIND CHILL"        "MARINE TSTM WIND"              
## [471] "HURRICANE/TYPHOON"              "WINTER WEATHER/MIX"            
## [473] "FROST/FREEZE"                   "ASTRONOMICAL HIGH TIDE"        
## [475] "HEAVY SURF/HIGH SURF"           "TROPICAL DEPRESSION"           
## [477] "LAKE-EFFECT SNOW"               "MARINE HIGH WIND"              
## [479] "TSUNAMI"                        "STORM SURGE/TIDE"              
## [481] "COLD/WIND CHILL"                "LAKESHORE FLOOD"               
## [483] "MARINE THUNDERSTORM WIND"       "MARINE STRONG WIND"            
## [485] "ASTRONOMICAL LOW TIDE"          "DENSE SMOKE"                   
## [487] "MARINE HAIL"                    "FREEZING FOG"

There are 488 different EVTYPE values in the subsetted data. Strip leading and trailing white space from these character vectors and convert them to upper case, storing the result in a new column, to see whether this reduces the list of unique event types. Please note that this code takes a little while to run.
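The cleaning described above does not have to be applied one element at a time: `gsub()`, `trimws()` and `toupper()` are vectorised in base R, so a whole column can be processed in a single call, which is typically much faster on a large column. A minimal sketch under assumptions: `raw_events` is a made-up sample, not the storm data, and `trimws()` (base R 3.2.0 or later) is swapped in for a hand-written regular expression.

```r
# Hypothetical sample standing in for StormDataSubset$EVTYPE
raw_events <- c(" TSTM WIND", "Tstm Wind", "   HIGH SURF ADVISORY",
                "Heavy Surf", "HEAVY SURF")

# Strip leading/trailing white space, then upper-case, on the whole vector at once
clean_events <- toupper(trimws(raw_events))

# Variants that differed only by case or padding collapse together
distinct_events <- unique(clean_events)
```

On this sample the five raw values reduce to three distinct event names.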

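# Helper that strips leading and trailing white space from a string
trim <- function (x) gsub("^\\s+|\\s+$", "", x)
# Apply the helper to every EVTYPE value and store the result in a new column
StormDataSubset$Event <- sapply(StormDataSubset$EVTYPE, trim) 
# Upper-case the trimmed names so that case variants collapse together
StormDataSubset$Event <- toupper(StormDataSubset$Event)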
unique(StormDataSubset$Event)
##   [1] "TORNADO"                        "TSTM WIND"                     
##   [3] "HAIL"                           "ICE STORM/FLASH FLOOD"         
##   [5] "WINTER STORM"                   "HURRICANE OPAL/HIGH WINDS"     
##   [7] "THUNDERSTORM WINDS"             "HURRICANE ERIN"                
##   [9] "HURRICANE OPAL"                 "HEAVY RAIN"                    
##  [11] "LIGHTNING"                      "THUNDERSTORM WIND"             
##  [13] "DENSE FOG"                      "RIP CURRENT"                   
##  [15] "THUNDERSTORM WINS"              "FLASH FLOODING"                
##  [17] "FLASH FLOOD"                    "TORNADO F0"                    
##  [19] "THUNDERSTORM WINDS LIGHTNING"   "THUNDERSTORM WINDS/HAIL"       
##  [21] "HEAT"                           "HIGH WINDS"                    
##  [23] "WIND"                           "HEAVY RAINS"                   
##  [25] "LIGHTNING AND HEAVY RAIN"       "THUNDERSTORM WINDS HAIL"       
##  [27] "COLD"                           "HEAVY RAIN/LIGHTNING"          
##  [29] "FLASH FLOODING/THUNDERSTORM WI" "FLOODING"                      
##  [31] "WATERSPOUT"                     "EXTREME COLD"                  
##  [33] "LIGHTNING/HEAVY RAIN"           "BREAKUP FLOODING"              
##  [35] "HIGH WIND"                      "FREEZE"                        
##  [37] "RIVER FLOOD"                    "HIGH WINDS HEAVY RAINS"        
##  [39] "AVALANCHE"                      "MARINE MISHAP"                 
##  [41] "HIGH TIDES"                     "HIGH WIND/SEAS"                
##  [43] "HIGH WINDS/HEAVY RAIN"          "HIGH SEAS"                     
##  [45] "COASTAL FLOOD"                  "SEVERE TURBULENCE"             
##  [47] "RECORD RAINFALL"                "HEAVY SNOW"                    
##  [49] "HEAVY SNOW/WIND"                "DUST STORM"                    
##  [51] "FLOOD"                          "APACHE COUNTY"                 
##  [53] "SLEET"                          "DUST DEVIL"                    
##  [55] "ICE STORM"                      "EXCESSIVE HEAT"                
##  [57] "THUNDERSTORM WINDS/FUNNEL CLOU" "GUSTY WINDS"                   
##  [59] "FLOODING/HEAVY RAIN"            "HEAVY SURF COASTAL FLOODING"   
##  [61] "HIGH SURF"                      "WILD FIRES"                    
##  [63] "HIGH"                           "WINTER STORM HIGH WINDS"       
##  [65] "WINTER STORMS"                  "MUDSLIDES"                     
##  [67] "RAINSTORM"                      "SEVERE THUNDERSTORM"           
##  [69] "SEVERE THUNDERSTORMS"           "SEVERE THUNDERSTORM WINDS"     
##  [71] "THUNDERSTORMS WINDS"            "FLOOD/FLASH FLOOD"             
##  [73] "FLOOD/RAIN/WINDS"               "THUNDERSTORMS"                 
##  [75] "FLASH FLOOD WINDS"              "WINDS"                         
##  [77] "FUNNEL CLOUD"                   "HIGH WIND DAMAGE"              
##  [79] "STRONG WIND"                    "HEAVY SNOWPACK"                
##  [81] "FLASH FLOOD/"                   "HEAVY SURF"                    
##  [83] "DRY MIRCOBURST WINDS"           "DRY MICROBURST"                
##  [85] "URBAN FLOOD"                    "THUNDERSTORM WINDSS"           
##  [87] "MICROBURST WINDS"               "HEAT WAVE"                     
##  [89] "UNSEASONABLY WARM"              "COASTAL FLOODING"              
##  [91] "STRONG WINDS"                   "BLIZZARD"                      
##  [93] "WATERSPOUT/TORNADO"             "WATERSPOUT TORNADO"            
##  [95] "STORM SURGE"                    "URBAN/SMALL STREAM FLOOD"      
##  [97] "WATERSPOUT-"                    "TORNADOES, TSTM WIND, HAIL"    
##  [99] "TROPICAL STORM ALBERTO"         "TROPICAL STORM"                
## [101] "TROPICAL STORM GORDON"          "TROPICAL STORM JERRY"          
## [103] "LIGHTNING THUNDERSTORM WINDS"   "URBAN FLOODING"                
## [105] "MINOR FLOODING"                 "WATERSPOUT-TORNADO"            
## [107] "LIGHTNING INJURY"               "LIGHTNING AND THUNDERSTORM WIN"
## [109] "FLASH FLOODS"                   "THUNDERSTORM WINDS53"          
## [111] "WILDFIRE"                       "DAMAGING FREEZE"               
## [113] "THUNDERSTORM WINDS 13"          "HURRICANE"                     
## [115] "SNOW"                           "LIGNTNING"                     
## [117] "FROST"                          "FREEZING RAIN/SNOW"            
## [119] "HIGH WINDS/"                    "THUNDERSNOW"                   
## [121] "FLOODS"                         "COOL AND WET"                  
## [123] "HEAVY RAIN/SNOW"                "GLAZE ICE"                     
## [125] "MUD SLIDE"                      "HIGH  WINDS"                   
## [127] "RURAL FLOOD"                    "MUD SLIDES"                    
## [129] "EXTREME HEAT"                   "DROUGHT"                       
## [131] "COLD AND WET CONDITIONS"        "EXCESSIVE WETNESS"             
## [133] "SLEET/ICE STORM"                "GUSTNADO"                      
## [135] "FREEZING RAIN"                  "SNOW AND HEAVY SNOW"           
## [137] "GROUND BLIZZARD"                "EXTREME WIND CHILL"            
## [139] "MAJOR FLOOD"                    "SNOW/HEAVY SNOW"               
## [141] "FREEZING RAIN/SLEET"            "ICE JAM FLOODING"              
## [143] "COLD AIR TORNADO"               "WIND DAMAGE"                   
## [145] "FOG"                            "TSTM WIND 55"                  
## [147] "SMALL STREAM FLOOD"             "THUNDERTORM WINDS"             
## [149] "HAIL/WINDS"                     "SNOW AND ICE"                  
## [151] "WIND STORM"                     "GRASS FIRES"                   
## [153] "LAKE FLOOD"                     "HAIL/WIND"                     
## [155] "WIND/HAIL"                      "ICE"                           
## [157] "SNOW AND ICE STORM"             "THUNDERSTORM  WINDS"           
## [159] "WINTER WEATHER"                 "DROUGHT/EXCESSIVE HEAT"        
## [161] "THUNDERSTORMS WIND"             "TUNDERSTORM WIND"              
## [163] "URBAN AND SMALL STREAM FLOODIN" "THUNDERSTORM WIND/LIGHTNING"   
## [165] "HEAVY RAIN/SEVERE WEATHER"      "THUNDERSTORM"                  
## [167] "WATERSPOUT/ TORNADO"            "LIGHTNING."                    
## [169] "HURRICANE-GENERATED SWELLS"     "RIVER AND STREAM FLOOD"        
## [171] "HIGH WINDS/COASTAL FLOOD"       "RAIN"                          
## [173] "RIVER FLOODING"                 "ICE FLOES"                     
## [175] "THUNDERSTORM WIND G50"          "LIGHTNING FIRE"                
## [177] "HEAVY LAKE SNOW"                "RECORD COLD"                   
## [179] "HEAVY SNOW/FREEZING RAIN"       "COLD WAVE"                     
## [181] "DUST DEVIL WATERSPOUT"          "TORNADO F3"                    
## [183] "TORNDAO"                        "FLOOD/RIVER FLOOD"             
## [185] "MUD SLIDES URBAN FLOODING"      "TORNADO F1"                    
## [187] "GLAZE/ICE STORM"                "GLAZE"                         
## [189] "HEAVY SNOW/WINTER STORM"        "MICROBURST"                    
## [191] "AVALANCE"                       "BLIZZARD/WINTER STORM"         
## [193] "DUST STORM/HIGH WINDS"          "ICE JAM"                       
## [195] "FOREST FIRES"                   "FROST\\FREEZE"                 
## [197] "THUNDERSTORM WINDS."            "HVY RAIN"                      
## [199] "HAIL 150"                       "HAIL 075"                      
## [201] "HAIL 100"                       "THUNDERSTORM WIND G55"         
## [203] "HAIL 125"                       "THUNDERSTORM WIND G60"         
## [205] "THUNDERSTORM WINDS G60"         "HARD FREEZE"                   
## [207] "HAIL 200"                       "HEAVY SNOW AND HIGH WINDS"     
## [209] "HEAVY SNOW/HIGH WINDS & FLOOD"  "HEAVY RAIN AND FLOOD"          
## [211] "RIP CURRENTS/HEAVY SURF"        "URBAN AND SMALL"               
## [213] "WILDFIRES"                      "FOG AND COLD TEMPERATURES"     
## [215] "SNOW/COLD"                      "FLASH FLOOD FROM ICE JAMS"     
## [217] "TSTM WIND G58"                  "MUDSLIDE"                      
## [219] "HEAVY SNOW SQUALLS"             "SNOW SQUALL"                   
## [221] "SNOW/ICE STORM"                 "HEAVY SNOW/SQUALLS"            
## [223] "HEAVY SNOW-SQUALLS"             "ICY ROADS"                     
## [225] "HEAVY MIX"                      "SNOW FREEZING RAIN"            
## [227] "SNOW/SLEET"                     "SNOW/FREEZING RAIN"            
## [229] "SNOW SQUALLS"                   "SNOW/SLEET/FREEZING RAIN"      
## [231] "RECORD SNOW"                    "HAIL 0.75"                     
## [233] "RECORD HEAT"                    "THUNDERSTORM WIND 65MPH"       
## [235] "THUNDERSTORM WIND/ TREES"       "THUNDERSTORM WIND/AWNING"      
## [237] "THUNDERSTORM WIND 98 MPH"       "THUNDERSTORM WIND TREES"       
## [239] "TORNADO F2"                     "RIP CURRENTS"                  
## [241] "HURRICANE EMILY"                "COASTAL SURGE"                 
## [243] "HURRICANE GORDON"               "HURRICANE FELIX"               
## [245] "THUNDERSTORM WIND 60 MPH"       "THUNDERSTORM WINDS 63 MPH"     
## [247] "THUNDERSTORM WIND/ TREE"        "THUNDERSTORM DAMAGE TO"        
## [249] "THUNDERSTORM WIND 65 MPH"       "FLASH FLOOD - HEAVY RAIN"      
## [251] "THUNDERSTORM WIND."             "FLASH FLOOD/ STREET"           
## [253] "BLOWING SNOW"                   "HEAVY SNOW/BLIZZARD"           
## [255] "THUNDERSTORM HAIL"              "THUNDERSTORM WINDSHAIL"        
## [257] "LIGHTNING  WAUSEON"             "THUDERSTORM WINDS"             
## [259] "ICE AND SNOW"                   "STORM FORCE WINDS"             
## [261] "HEAVY SNOW/ICE"                 "LIGHTING"                      
## [263] "HIGH WIND/HEAVY SNOW"           "THUNDERSTORM WINDS AND"        
## [265] "HEAVY PRECIPITATION"            "HIGH WIND/BLIZZARD"            
## [267] "TSTM WIND DAMAGE"               "FLOOD FLASH"                   
## [269] "RAIN/WIND"                      "SNOW/ICE"                      
## [271] "HAIL 75"                        "HEAT WAVE DROUGHT"             
## [273] "HEAVY SNOW/BLIZZARD/AVALANCHE"  "HEAT WAVES"                    
## [275] "UNSEASONABLY WARM AND DRY"      "UNSEASONABLY COLD"             
## [277] "RECORD/EXCESSIVE HEAT"          "THUNDERSTORM WIND G52"         
## [279] "HIGH WAVES"                     "FLASH FLOOD/FLOOD"             
## [281] "FLOOD/FLASH"                    "LOW TEMPERATURE"               
## [283] "HEAVY RAINS/FLOODING"           "THUNDERESTORM WINDS"           
## [285] "THUNDERSTORM WINDS/FLOODING"    "HYPOTHERMIA"                   
## [287] "THUNDEERSTORM WINDS"            "THUNERSTORM WINDS"             
## [289] "HIGH WINDS/COLD"                "COLD/WINDS"                    
## [291] "SNOW/ BITTER COLD"              "COLD WEATHER"                  
## [293] "RAPIDLY RISING WATER"           "WILD/FOREST FIRE"              
## [295] "ICE/STRONG WINDS"               "SNOW/HIGH WINDS"               
## [297] "HIGH WINDS/SNOW"                "SNOWMELT FLOODING"             
## [299] "HEAVY SNOW AND STRONG WINDS"    "SNOW ACCUMULATION"             
## [301] "SNOW/ ICE"                      "SNOW/BLOWING SNOW"             
## [303] "TORNADOES"                      "THUNDERSTORM WIND/HAIL"        
## [305] "FREEZING DRIZZLE"               "HAIL 175"                      
## [307] "FLASH FLOODING/FLOOD"           "HAIL 275"                      
## [309] "HAIL 450"                       "EXCESSIVE RAINFALL"            
## [311] "THUNDERSTORMW"                  "HAILSTORM"                     
## [313] "TSTM WINDS"                     "TSTMW"                         
## [315] "TSTM WIND 65)"                  "TROPICAL STORM DEAN"           
## [317] "THUNDERSTORM WINDS/ FLOOD"      "LANDSLIDE"                     
## [319] "HIGH WIND AND SEAS"             "THUNDERSTORMWINDS"             
## [321] "WILD/FOREST FIRES"              "HEAVY SEAS"                    
## [323] "HAIL DAMAGE"                    "FLOOD & HEAVY RAIN"            
## [325] "?"                              "THUNDERSTROM WIND"             
## [327] "FLOOD/FLASHFLOOD"               "HIGH WATER"                    
## [329] "HIGH WIND 48"                   "LANDSLIDES"                    
## [331] "URBAN/SMALL STREAM"             "BRUSH FIRE"                    
## [333] "HEAVY SHOWER"                   "HEAVY SWELLS"                  
## [335] "URBAN SMALL"                    "URBAN FLOODS"                  
## [337] "FLASH FLOOD/LANDSLIDE"          "HEAVY RAIN/SMALL STREAM URBAN" 
## [339] "FLASH FLOOD LANDSLIDES"         "TSTM WIND/HAIL"                
## [341] "OTHER"                          "ICE JAM FLOOD (MINOR"          
## [343] "URBAN/SML STREAM FLD"           "ROUGH SURF"                    
## [345] "MARINE ACCIDENT"                "COASTAL STORM"                 
## [347] "EROSION/CSTL FLOOD"             "BEACH EROSION"                 
## [349] "HEAVY RAIN/HIGH SURF"           "UNSEASONABLE COLD"             
## [351] "EARLY FROST"                    "WINTRY MIX"                    
## [353] "TORRENTIAL RAINFALL"            "LANDSLUMP"                     
## [355] "HURRICANE EDOUARD"              "TIDAL FLOODING"                
## [357] "EXTREME WINDCHILL"              "EXTENDED COLD"                 
## [359] "WHIRLWIND"                      "HEAVY SNOW SHOWER"             
## [361] "LIGHT SNOW"                     "MIXED PRECIP"                  
## [363] "FREEZING SPRAY"                 "DOWNBURST"                     
## [365] "LIGHT SNOWFALL"                 "GUSTY WIND/RAIN"               
## [367] "GUSTY WIND/HVY RAIN"            "COLD TEMPERATURE"              
## [369] "COLD AND SNOW"                  "RAIN/SNOW"                     
## [371] "TSTM WIND (G45)"                "GUSTY WIND"                    
## [373] "TSTM WIND 40"                   "TSTM WIND 45"                  
## [375] "TSTM WIND (41)"                 "TSTM WIND (G40)"               
## [377] "FROST/FREEZE"                   "AGRICULTURAL FREEZE"           
## [379] "HYPOTHERMIA/EXPOSURE"           "LAKE EFFECT SNOW"              
## [381] "MIXED PRECIPITATION"            "BLACK ICE"                     
## [383] "COASTALSTORM"                   "DAM BREAK"                     
## [385] "GRADIENT WIND"                  "TSTM WIND AND LIGHTNING"       
## [387] "WET MICROBURST"                 "HEAVY SURF AND WIND"           
## [389] "TYPHOON"                        "HIGH SWELLS"                   
## [391] "SMALL HAIL"                     "UNSEASONAL RAIN"               
## [393] "COASTAL FLOODING/EROSION"       "TSTM WIND  (G45)"              
## [395] "HIGH WIND (G40)"                "TSTM WIND (G35)"               
## [397] "COASTAL EROSION"                "SEICHE"                        
## [399] "COASTAL  FLOODING/EROSION"      "HYPERTHERMIA/EXPOSURE"         
## [401] "ROCK SLIDE"                     "GUSTY WIND/HAIL"               
## [403] "LANDSPOUT"                      "EXCESSIVE SNOW"                
## [405] "FLOOD/FLASH/FLOOD"              "WIND AND WAVE"                 
## [407] "LIGHT FREEZING RAIN"            "ICE ROADS"                     
## [409] "ROUGH SEAS"                     "TSTM WIND G45"                 
## [411] "NON-SEVERE WIND DAMAGE"         "WARM WEATHER"                  
## [413] "THUNDERSTORM WIND (G40)"        "LATE SEASON SNOW"              
## [415] "WINTER WEATHER MIX"             "ROGUE WAVE"                    
## [417] "FALLING SNOW/ICE"               "NON-TSTM WIND"                 
## [419] "NON TSTM WIND"                  "BLOWING DUST"                  
## [421] "VOLCANIC ASH"                   "HIGH SURF ADVISORY"            
## [423] "HAZARDOUS SURF"                 "ICE ON ROAD"                   
## [425] "DROWNING"                       "EXTREME COLD/WIND CHILL"       
## [427] "MARINE TSTM WIND"               "HURRICANE/TYPHOON"             
## [429] "WINTER WEATHER/MIX"             "ASTRONOMICAL HIGH TIDE"        
## [431] "HEAVY SURF/HIGH SURF"           "TROPICAL DEPRESSION"           
## [433] "LAKE-EFFECT SNOW"               "MARINE HIGH WIND"              
## [435] "TSUNAMI"                        "STORM SURGE/TIDE"              
## [437] "COLD/WIND CHILL"                "LAKESHORE FLOOD"               
## [439] "MARINE THUNDERSTORM WIND"       "MARINE STRONG WIND"            
## [441] "ASTRONOMICAL LOW TIDE"          "DENSE SMOKE"                   
## [443] "MARINE HAIL"                    "FREEZING FOG"

Stripping white space and converting everything to upper case reduces the list to 444 unique event names. Next, create a new column that records whether each event name is in the list of official categories.

StormDataSubset$EventClassified <- ifelse(StormDataSubset$Event %in% 
                                        EventNames, "Yes", "No")
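As a minimal sketch of the clean-and-check step, the snippet below runs on a small toy vector. The names `official_names`, `raw_events`, `clean_events` and `classified` are hypothetical and used only for illustration; the actual analysis uses `StormDataSubset$Event` and `EventNames`.

```r
# Toy data; these names and values are illustrative, not taken from the analysis.
official_names <- c("TORNADO", "HAIL", "THUNDERSTORM WIND")
raw_events <- c(" Tornado", "HAIL ", "TSTM WIND", "hail")

# Strip leading/trailing white space and convert to upper case.
clean_events <- toupper(trimws(raw_events))

# Flag each cleaned name as matching the official list or not.
classified <- ifelse(clean_events %in% official_names, "Yes", "No")

# Count how many names matched and how many still need attention.
table(classified)
```

Here the abbreviation "TSTM WIND" is the only entry flagged "No", which mirrors the kind of mismatch handled later in the analysis.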

All events that are still not in the category list have to be inspected and categorised. The need for reclassification stems from changes in how the database was populated over the years, as described in the NOAA documentation. From the results we see that these records were captured between 5/16/1968 (16th May 1968) and 4/19/1996 (19th April 1996). The documentation states that:

* Between 1950 and 1954, only Tornado events were recorded.
* Between 1955 and 1992, only Tornado, Thunderstorm Wind and Hail events were recorded; from 1993 to 1995, only these same event types were extracted from text files.
* From 1996 onwards, the 48 different categories of weather events were used.

Using this information, we can classify all the unclassified events as follows:

* Between 1950 and 1954, all events are Tornados.
* Between 1955 and 1995, events are Tornados, Thunderstorm Wind or Hail.
* After 1995, all events should already be categorised correctly. Any discrepancies are probably typos.
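The period rules listed above could be written down as a small lookup function. This is only a sketch: `expected_events` is a hypothetical helper that is not part of the original analysis, and it returns `NULL` for years after 1995 to signal that the full official category list applies rather than enumerating it.

```r
# Hypothetical helper: which event types does the documentation lead us
# to expect for a given year?
expected_events <- function(year) {
  if (year < 1955) {
    "TORNADO"
  } else if (year <= 1995) {
    c("TORNADO", "THUNDERSTORM WIND", "HAIL")
  } else {
    NULL  # after 1995 the full official category list applies
  }
}

expected_events(1952)
expected_events(1980)
```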

Subsetting the data into the three groups:

# Use >= on the lower bounds so that events dated exactly 1955-01-01 or
# 1993-01-01 are not dropped from every group.
StormDataPre55 <- StormDataSubset[StormDataSubset$Start_Date < "1955-01-01", ]
StormDataBtw55And93 <- StormDataSubset[StormDataSubset$Start_Date >= "1955-01-01" & StormDataSubset$Start_Date < "1993-01-01", ]
StormDataPost93 <- StormDataSubset[StormDataSubset$Start_Date >= "1993-01-01", ]
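One way to check that every record lands in exactly one of the three groups is to assign a period label with `cut()` and compare row counts. The sketch below uses a made-up data frame `toy` in place of `StormDataSubset`; the `Period` column, the labels and the outer break dates are assumptions chosen for illustration.

```r
# Toy data frame standing in for StormDataSubset (values are made up).
toy <- data.frame(
  Start_Date = as.Date(c("1950-06-01", "1955-01-01", "1970-03-15",
                         "1993-01-01", "2005-08-29")),
  Event = c("TORNADO", "HAIL", "TSTM WIND", "FLOOD", "HURRICANE"),
  stringsAsFactors = FALSE
)

# Period boundaries; with right = FALSE each interval includes its lower
# bound and excludes its upper bound.
breaks <- as.Date(c("1950-01-01", "1955-01-01", "1993-01-01", "2012-01-01"))
toy$Period <- cut(toy$Start_Date, breaks = breaks,
                  labels = c("Pre55", "Btw55And93", "Post93"), right = FALSE)

# Split into one data frame per period.
groups <- split(toy, toy$Period)

# Every row should appear in exactly one group.
stopifnot(sum(sapply(groups, nrow)) == nrow(toy))
```

Records dated exactly on a boundary, such as 1955-01-01, are assigned to the later period here.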

Looking at the unique event names in these three groups:

Pre 1955:

unique(StormDataPre55$EVTYPE)
## [1] "TORNADO"
unique(StormDataPre55$Event)
## [1] "TORNADO"
unique(StormDataPre55$EventClassified)
## [1] "Yes"

The only event type in this group is Tornado, which is what the documentation leads us to expect.

Between 1955 and 1993:

unique(StormDataBtw55And93$EVTYPE)
## [1] "TORNADO"   "TSTM WIND" "HAIL"
unique(StormDataBtw55And93$Event)
## [1] "TORNADO"   "TSTM WIND" "HAIL"
unique(StormDataBtw55And93$EventClassified)
## [1] "Yes" "No"

Here we have three categories in both EVTYPE and Event: TORNADO, TSTM WIND and HAIL. EventClassified contains "No" because TSTM WIND does not match any entry in EventNames, which follows the documentation. We rename these events to the official name used in the documentation:

StormDataBtw55And93[StormDataBtw55And93$Event == "TSTM WIND", "Event"] <- "THUNDERSTORM WIND"
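After a rename like this, the EventClassified flag still holds the values computed before the change, so it may be worth recomputing it. The sketch below shows the idea on toy data; `EventNamesToy` and `toy` are hypothetical stand-ins for `EventNames` and `StormDataBtw55And93`.

```r
# Toy stand-ins for the analysis objects (values are illustrative only).
EventNamesToy <- c("TORNADO", "THUNDERSTORM WIND", "HAIL")
toy <- data.frame(Event = c("TORNADO", "TSTM WIND", "HAIL", "TSTM WIND"),
                  stringsAsFactors = FALSE)

# Rename the abbreviation to the official category name.
toy[toy$Event == "TSTM WIND", "Event"] <- "THUNDERSTORM WIND"

# Recompute the flag so it reflects the renamed events.
toy$EventClassified <- ifelse(toy$Event %in% EventNamesToy, "Yes", "No")

unique(toy$EventClassified)
```

Once the flag is recomputed, every row in the toy data matches the official list.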

Post 1993:

StormDataPost93$EventClassified <- ifelse(StormDataPost93$Event %in% 
                                                      EventNames, "Yes", "No")
unique(StormDataPost93$EVTYPE)
##   [1] "ICE STORM/FLASH FLOOD"          "WINTER STORM"                  
##   [3] "HURRICANE OPAL/HIGH WINDS"      "THUNDERSTORM WINDS"            
##   [5] "TORNADO"                        "HURRICANE ERIN"                
##   [7] "HURRICANE OPAL"                 "HEAVY RAIN"                    
##   [9] "LIGHTNING"                      "THUNDERSTORM WIND"             
##  [11] "DENSE FOG"                      "HAIL"                          
##  [13] "RIP CURRENT"                    "THUNDERSTORM WINS"             
##  [15] "FLASH FLOODING"                 "FLASH FLOOD"                   
##  [17] "TORNADO F0"                     "THUNDERSTORM WINDS LIGHTNING"  
##  [19] "THUNDERSTORM WINDS/HAIL"        "HEAT"                          
##  [21] "HIGH WINDS"                     "WIND"                          
##  [23] "HEAVY RAINS"                    "LIGHTNING AND HEAVY RAIN"      
##  [25] "THUNDERSTORM WINDS HAIL"        "COLD"                          
##  [27] "HEAVY RAIN/LIGHTNING"           "FLASH FLOODING/THUNDERSTORM WI"
##  [29] "FLOODING"                       "WATERSPOUT"                    
##  [31] "EXTREME COLD"                   "LIGHTNING/HEAVY RAIN"          
##  [33] "BREAKUP FLOODING"               "HIGH WIND"                     
##  [35] "FREEZE"                         "RIVER FLOOD"                   
##  [37] "HIGH WINDS HEAVY RAINS"         "AVALANCHE"                     
##  [39] "MARINE MISHAP"                  "HIGH TIDES"                    
##  [41] "HIGH WIND/SEAS"                 "HIGH WINDS/HEAVY RAIN"         
##  [43] "HIGH SEAS"                      "COASTAL FLOOD"                 
##  [45] "SEVERE TURBULENCE"              "RECORD RAINFALL"               
##  [47] "HEAVY SNOW"                     "HEAVY SNOW/WIND"               
##  [49] "DUST STORM"                     "FLOOD"                         
##  [51] "APACHE COUNTY"                  "SLEET"                         
##  [53] "DUST DEVIL"                     "ICE STORM"                     
##  [55] "TSTM WIND"                      "EXCESSIVE HEAT"                
##  [57] "THUNDERSTORM WINDS/FUNNEL CLOU" "GUSTY WINDS"                   
##  [59] "FLOODING/HEAVY RAIN"            "HEAVY SURF COASTAL FLOODING"   
##  [61] "HIGH SURF"                      "WILD FIRES"                    
##  [63] "HIGH"                           "WINTER STORM HIGH WINDS"       
##  [65] "WINTER STORMS"                  "MUDSLIDES"                     
##  [67] "RAINSTORM"                      "SEVERE THUNDERSTORM"           
##  [69] "SEVERE THUNDERSTORMS"           "SEVERE THUNDERSTORM WINDS"     
##  [71] "THUNDERSTORMS WINDS"            "FLOOD/FLASH FLOOD"             
##  [73] "FLOOD/RAIN/WINDS"               "THUNDERSTORMS"                 
##  [75] "FLASH FLOOD WINDS"              "WINDS"                         
##  [77] "FUNNEL CLOUD"                   "HIGH WIND DAMAGE"              
##  [79] "STRONG WIND"                    "HEAVY SNOWPACK"                
##  [81] "FLASH FLOOD/"                   "HEAVY SURF"                    
##  [83] "DRY MIRCOBURST WINDS"           "DRY MICROBURST"                
##  [85] "URBAN FLOOD"                    "THUNDERSTORM WINDSS"           
##  [87] "MICROBURST WINDS"               "HEAT WAVE"                     
##  [89] "UNSEASONABLY WARM"              "COASTAL FLOODING"              
##  [91] "STRONG WINDS"                   "BLIZZARD"                      
##  [93] "WATERSPOUT/TORNADO"             "WATERSPOUT TORNADO"            
##  [95] "STORM SURGE"                    "URBAN/SMALL STREAM FLOOD"      
##  [97] "WATERSPOUT-"                    "TORNADOES, TSTM WIND, HAIL"    
##  [99] "TROPICAL STORM ALBERTO"         "TROPICAL STORM"                
## [101] "TROPICAL STORM GORDON"          "TROPICAL STORM JERRY"          
## [103] "LIGHTNING THUNDERSTORM WINDS"   "URBAN FLOODING"                
## [105] "MINOR FLOODING"                 "WATERSPOUT-TORNADO"            
## [107] "LIGHTNING INJURY"               "LIGHTNING AND THUNDERSTORM WIN"
## [109] "FLASH FLOODS"                   "THUNDERSTORM WINDS53"          
## [111] "WILDFIRE"                       "DAMAGING FREEZE"               
## [113] "THUNDERSTORM WINDS 13"          "HURRICANE"                     
## [115] "SNOW"                           "LIGNTNING"                     
## [117] "FROST"                          "FREEZING RAIN/SNOW"            
## [119] "HIGH WINDS/"                    "THUNDERSNOW"                   
## [121] "FLOODS"                         "COOL AND WET"                  
## [123] "HEAVY RAIN/SNOW"                "GLAZE ICE"                     
## [125] "MUD SLIDE"                      "HIGH  WINDS"                   
## [127] "RURAL FLOOD"                    "MUD SLIDES"                    
## [129] "EXTREME HEAT"                   "DROUGHT"                       
## [131] "COLD AND WET CONDITIONS"        "EXCESSIVE WETNESS"             
## [133] "SLEET/ICE STORM"                "GUSTNADO"                      
## [135] "FREEZING RAIN"                  "SNOW AND HEAVY SNOW"           
## [137] "GROUND BLIZZARD"                "EXTREME WIND CHILL"            
## [139] "MAJOR FLOOD"                    "SNOW/HEAVY SNOW"               
## [141] "FREEZING RAIN/SLEET"            "ICE JAM FLOODING"              
## [143] "COLD AIR TORNADO"               "WIND DAMAGE"                   
## [145] "FOG"                            "TSTM WIND 55"                  
## [147] "SMALL STREAM FLOOD"             "THUNDERTORM WINDS"             
## [149] "HAIL/WINDS"                     "SNOW AND ICE"                  
## [151] "WIND STORM"                     "GRASS FIRES"                   
## [153] "LAKE FLOOD"                     "HAIL/WIND"                     
## [155] "WIND/HAIL"                      "ICE"                           
## [157] "SNOW AND ICE STORM"             "THUNDERSTORM  WINDS"           
## [159] "WINTER WEATHER"                 "DROUGHT/EXCESSIVE HEAT"        
## [161] "THUNDERSTORMS WIND"             "TUNDERSTORM WIND"              
## [163] "URBAN AND SMALL STREAM FLOODIN" "THUNDERSTORM WIND/LIGHTNING"   
## [165] "HEAVY RAIN/SEVERE WEATHER"      "THUNDERSTORM"                  
## [167] "WATERSPOUT/ TORNADO"            "LIGHTNING."                    
## [169] "HURRICANE-GENERATED SWELLS"     "RIVER AND STREAM FLOOD"        
## [171] "HIGH WINDS/COASTAL FLOOD"       "RAIN"                          
## [173] "RIVER FLOODING"                 "ICE FLOES"                     
## [175] "THUNDERSTORM WIND G50"          "LIGHTNING FIRE"                
## [177] "HEAVY LAKE SNOW"                "RECORD COLD"                   
## [179] "HEAVY SNOW/FREEZING RAIN"       "COLD WAVE"                     
## [181] "DUST DEVIL WATERSPOUT"          "TORNADO F3"                    
## [183] "TORNDAO"                        "FLOOD/RIVER FLOOD"             
## [185] "MUD SLIDES URBAN FLOODING"      "TORNADO F1"                    
## [187] "GLAZE/ICE STORM"                "GLAZE"                         
## [189] "HEAVY SNOW/WINTER STORM"        "MICROBURST"                    
## [191] "AVALANCE"                       "BLIZZARD/WINTER STORM"         
## [193] "DUST STORM/HIGH WINDS"          "ICE JAM"                       
## [195] "FOREST FIRES"                   "FROST\\FREEZE"                 
## [197] "THUNDERSTORM WINDS."            "HVY RAIN"                      
## [199] "HAIL 150"                       "HAIL 075"                      
## [201] "HAIL 100"                       "THUNDERSTORM WIND G55"         
## [203] "HAIL 125"                       "THUNDERSTORM WIND G60"         
## [205] "THUNDERSTORM WINDS G60"         "HARD FREEZE"                   
## [207] "HAIL 200"                       "HEAVY SNOW AND HIGH WINDS"     
## [209] "HEAVY SNOW/HIGH WINDS & FLOOD"  "HEAVY RAIN AND FLOOD"          
## [211] "RIP CURRENTS/HEAVY SURF"        "URBAN AND SMALL"               
## [213] "WILDFIRES"                      "FOG AND COLD TEMPERATURES"     
## [215] "SNOW/COLD"                      "FLASH FLOOD FROM ICE JAMS"     
## [217] "TSTM WIND G58"                  "MUDSLIDE"                      
## [219] "HEAVY SNOW SQUALLS"             "SNOW SQUALL"                   
## [221] "SNOW/ICE STORM"                 "HEAVY SNOW/SQUALLS"            
## [223] "HEAVY SNOW-SQUALLS"             "ICY ROADS"                     
## [225] "HEAVY MIX"                      "SNOW FREEZING RAIN"            
## [227] "SNOW/SLEET"                     "SNOW/FREEZING RAIN"            
## [229] "SNOW SQUALLS"                   "SNOW/SLEET/FREEZING RAIN"      
## [231] "RECORD SNOW"                    "HAIL 0.75"                     
## [233] "RECORD HEAT"                    "THUNDERSTORM WIND 65MPH"       
## [235] "THUNDERSTORM WIND/ TREES"       "THUNDERSTORM WIND/AWNING"      
## [237] "THUNDERSTORM WIND 98 MPH"       "THUNDERSTORM WIND TREES"       
## [239] "TORNADO F2"                     "RIP CURRENTS"                  
## [241] "HURRICANE EMILY"                "COASTAL SURGE"                 
## [243] "HURRICANE GORDON"               "HURRICANE FELIX"               
## [245] "THUNDERSTORM WIND 60 MPH"       "THUNDERSTORM WINDS 63 MPH"     
## [247] "THUNDERSTORM WIND/ TREE"        "THUNDERSTORM DAMAGE TO"        
## [249] "THUNDERSTORM WIND 65 MPH"       "FLASH FLOOD - HEAVY RAIN"      
## [251] "THUNDERSTORM WIND."             "FLASH FLOOD/ STREET"           
## [253] "BLOWING SNOW"                   "HEAVY SNOW/BLIZZARD"           
## [255] "THUNDERSTORM HAIL"              "THUNDERSTORM WINDSHAIL"        
## [257] "LIGHTNING  WAUSEON"             "THUDERSTORM WINDS"             
## [259] "ICE AND SNOW"                   "STORM FORCE WINDS"             
## [261] "HEAVY SNOW/ICE"                 "LIGHTING"                      
## [263] "HIGH WIND/HEAVY SNOW"           "THUNDERSTORM WINDS AND"        
## [265] "HEAVY PRECIPITATION"            "HIGH WIND/BLIZZARD"            
## [267] "TSTM WIND DAMAGE"               "FLOOD FLASH"                   
## [269] "RAIN/WIND"                      "SNOW/ICE"                      
## [271] "HAIL 75"                        "HEAT WAVE DROUGHT"             
## [273] "HEAVY SNOW/BLIZZARD/AVALANCHE"  "HEAT WAVES"                    
## [275] "UNSEASONABLY WARM AND DRY"      "UNSEASONABLY COLD"             
## [277] "RECORD/EXCESSIVE HEAT"          "THUNDERSTORM WIND G52"         
## [279] "HIGH WAVES"                     "FLASH FLOOD/FLOOD"             
## [281] "FLOOD/FLASH"                    "LOW TEMPERATURE"               
## [283] "HEAVY RAINS/FLOODING"           "THUNDERESTORM WINDS"           
## [285] "THUNDERSTORM WINDS/FLOODING"    "HYPOTHERMIA"                   
## [287] "THUNDEERSTORM WINDS"            "THUNERSTORM WINDS"             
## [289] "HIGH WINDS/COLD"                "COLD/WINDS"                    
## [291] "SNOW/ BITTER COLD"              "COLD WEATHER"                  
## [293] "RAPIDLY RISING WATER"           "WILD/FOREST FIRE"              
## [295] "ICE/STRONG WINDS"               "SNOW/HIGH WINDS"               
## [297] "HIGH WINDS/SNOW"                "SNOWMELT FLOODING"             
## [299] "HEAVY SNOW AND STRONG WINDS"    "SNOW ACCUMULATION"             
## [301] "SNOW/ ICE"                      "SNOW/BLOWING SNOW"             
## [303] "TORNADOES"                      "THUNDERSTORM WIND/HAIL"        
## [305] "FREEZING DRIZZLE"               "HAIL 175"                      
## [307] "FLASH FLOODING/FLOOD"           "HAIL 275"                      
## [309] "HAIL 450"                       "EXCESSIVE RAINFALL"            
## [311] "THUNDERSTORMW"                  "HAILSTORM"                     
## [313] "TSTM WINDS"                     "TSTMW"                         
## [315] "TSTM WIND 65)"                  "TROPICAL STORM DEAN"           
## [317] "THUNDERSTORM WINDS/ FLOOD"      "LANDSLIDE"                     
## [319] "HIGH WIND AND SEAS"             "THUNDERSTORMWINDS"             
## [321] "WILD/FOREST FIRES"              "HEAVY SEAS"                    
## [323] "HAIL DAMAGE"                    "FLOOD & HEAVY RAIN"            
## [325] "?"                              "THUNDERSTROM WIND"             
## [327] "FLOOD/FLASHFLOOD"               "HIGH WATER"                    
## [329] "HIGH WIND 48"                   "LANDSLIDES"                    
## [331] "URBAN/SMALL STREAM"             "BRUSH FIRE"                    
## [333] "HEAVY SHOWER"                   "HEAVY SWELLS"                  
## [335] "URBAN SMALL"                    "URBAN FLOODS"                  
## [337] "FLASH FLOOD/LANDSLIDE"          "HEAVY RAIN/SMALL STREAM URBAN" 
## [339] "FLASH FLOOD LANDSLIDES"         "TSTM WIND/HAIL"                
## [341] "Other"                          "Ice jam flood (minor"          
## [343] "Tstm Wind"                      "URBAN/SML STREAM FLD"          
## [345] "ROUGH SURF"                     "Heavy Surf"                    
## [347] "Dust Devil"                     "Marine Accident"               
## [349] "Freeze"                         "Strong Wind"                   
## [351] "COASTAL STORM"                  "Erosion/Cstl Flood"            
## [353] "River Flooding"                 "Damaging Freeze"               
## [355] "Beach Erosion"                  "High Surf"                     
## [357] "Heavy Rain/High Surf"           "Unseasonable Cold"             
## [359] "Early Frost"                    "Wintry Mix"                    
## [361] "Extreme Cold"                   "Coastal Flooding"              
## [363] "Torrential Rainfall"            "Landslump"                     
## [365] "Hurricane Edouard"              "Coastal Storm"                 
## [367] "TIDAL FLOODING"                 "Tidal Flooding"                
## [369] "Strong Winds"                   "EXTREME WINDCHILL"             
## [371] "Glaze"                          "Extended Cold"                 
## [373] "Whirlwind"                      "Heavy snow shower"             
## [375] "Light snow"                     "Light Snow"                    
## [377] "MIXED PRECIP"                   "Freezing Spray"                
## [379] "DOWNBURST"                      "Mudslides"                     
## [381] "Microburst"                     "Mudslide"                      
## [383] "Cold"                           "Coastal Flood"                 
## [385] "Snow Squalls"                   "Wind Damage"                   
## [387] "Light Snowfall"                 "Freezing Drizzle"              
## [389] "Gusty wind/rain"                "GUSTY WIND/HVY RAIN"           
## [391] "Wind"                           "Cold Temperature"              
## [393] "Heat Wave"                      "Snow"                          
## [395] "COLD AND SNOW"                  "RAIN/SNOW"                     
## [397] "TSTM WIND (G45)"                "Gusty Winds"                   
## [399] "GUSTY WIND"                     "TSTM WIND 40"                  
## [401] "TSTM WIND 45"                   "TSTM WIND (41)"                
## [403] "TSTM WIND (G40)"                "Frost/Freeze"                  
## [405] "AGRICULTURAL FREEZE"            "OTHER"                         
## [407] "Hypothermia/Exposure"           "HYPOTHERMIA/EXPOSURE"          
## [409] "Lake Effect Snow"               "Freezing Rain"                 
## [411] "Mixed Precipitation"            "BLACK ICE"                     
## [413] "COASTALSTORM"                   "LIGHT SNOW"                    
## [415] "DAM BREAK"                      "Gusty winds"                   
## [417] "blowing snow"                   "GRADIENT WIND"                 
## [419] "TSTM WIND AND LIGHTNING"        "gradient wind"                 
## [421] "Gradient wind"                  "Freezing drizzle"              
## [423] "WET MICROBURST"                 "Heavy surf and wind"           
## [425] "TYPHOON"                        "HIGH SWELLS"                   
## [427] "SMALL HAIL"                     "UNSEASONAL RAIN"               
## [429] "COASTAL FLOODING/EROSION"       " TSTM WIND (G45)"              
## [431] "TSTM WIND  (G45)"               "HIGH WIND (G40)"               
## [433] "TSTM WIND (G35)"                "COASTAL EROSION"               
## [435] "SEICHE"                         "COASTAL  FLOODING/EROSION"     
## [437] "HYPERTHERMIA/EXPOSURE"          "WINTRY MIX"                    
## [439] "ROCK SLIDE"                     "GUSTY WIND/HAIL"               
## [441] " TSTM WIND"                     "LANDSPOUT"                     
## [443] "EXCESSIVE SNOW"                 "LAKE EFFECT SNOW"              
## [445] "FLOOD/FLASH/FLOOD"              "MIXED PRECIPITATION"           
## [447] "WIND AND WAVE"                  "LIGHT FREEZING RAIN"           
## [449] "ICE ROADS"                      "ROUGH SEAS"                    
## [451] "TSTM WIND G45"                  "NON-SEVERE WIND DAMAGE"        
## [453] "WARM WEATHER"                   "THUNDERSTORM WIND (G40)"       
## [455] " FLASH FLOOD"                   "LATE SEASON SNOW"              
## [457] "WINTER WEATHER MIX"             "ROGUE WAVE"                    
## [459] "FALLING SNOW/ICE"               "NON-TSTM WIND"                 
## [461] "NON TSTM WIND"                  "BLOWING DUST"                  
## [463] "VOLCANIC ASH"                   "   HIGH SURF ADVISORY"         
## [465] "HAZARDOUS SURF"                 "WHIRLWIND"                     
## [467] "ICE ON ROAD"                    "DROWNING"                      
## [469] "EXTREME COLD/WIND CHILL"        "MARINE TSTM WIND"              
## [471] "HURRICANE/TYPHOON"              "WINTER WEATHER/MIX"            
## [473] "FROST/FREEZE"                   "ASTRONOMICAL HIGH TIDE"        
## [475] "HEAVY SURF/HIGH SURF"           "TROPICAL DEPRESSION"           
## [477] "LAKE-EFFECT SNOW"               "MARINE HIGH WIND"              
## [479] "TSUNAMI"                        "STORM SURGE/TIDE"              
## [481] "COLD/WIND CHILL"                "LAKESHORE FLOOD"               
## [483] "MARINE THUNDERSTORM WIND"       "MARINE STRONG WIND"            
## [485] "ASTRONOMICAL LOW TIDE"          "DENSE SMOKE"                   
## [487] "MARINE HAIL"                    "FREEZING FOG"
unique(StormDataPost93$Event)
##   [1] "ICE STORM/FLASH FLOOD"          "WINTER STORM"                  
##   [3] "HURRICANE OPAL/HIGH WINDS"      "THUNDERSTORM WINDS"            
##   [5] "TORNADO"                        "HURRICANE ERIN"                
##   [7] "HURRICANE OPAL"                 "HEAVY RAIN"                    
##   [9] "LIGHTNING"                      "THUNDERSTORM WIND"             
##  [11] "DENSE FOG"                      "HAIL"                          
##  [13] "RIP CURRENT"                    "THUNDERSTORM WINS"             
##  [15] "FLASH FLOODING"                 "FLASH FLOOD"                   
##  [17] "TORNADO F0"                     "THUNDERSTORM WINDS LIGHTNING"  
##  [19] "THUNDERSTORM WINDS/HAIL"        "HEAT"                          
##  [21] "HIGH WINDS"                     "WIND"                          
##  [23] "HEAVY RAINS"                    "LIGHTNING AND HEAVY RAIN"      
##  [25] "THUNDERSTORM WINDS HAIL"        "COLD"                          
##  [27] "HEAVY RAIN/LIGHTNING"           "FLASH FLOODING/THUNDERSTORM WI"
##  [29] "FLOODING"                       "WATERSPOUT"                    
##  [31] "EXTREME COLD"                   "LIGHTNING/HEAVY RAIN"          
##  [33] "BREAKUP FLOODING"               "HIGH WIND"                     
##  [35] "FREEZE"                         "RIVER FLOOD"                   
##  [37] "HIGH WINDS HEAVY RAINS"         "AVALANCHE"                     
##  [39] "MARINE MISHAP"                  "HIGH TIDES"                    
##  [41] "HIGH WIND/SEAS"                 "HIGH WINDS/HEAVY RAIN"         
##  [43] "HIGH SEAS"                      "COASTAL FLOOD"                 
##  [45] "SEVERE TURBULENCE"              "RECORD RAINFALL"               
##  [47] "HEAVY SNOW"                     "HEAVY SNOW/WIND"               
##  [49] "DUST STORM"                     "FLOOD"                         
##  [51] "APACHE COUNTY"                  "SLEET"                         
##  [53] "DUST DEVIL"                     "ICE STORM"                     
##  [55] "TSTM WIND"                      "EXCESSIVE HEAT"                
##  [57] "THUNDERSTORM WINDS/FUNNEL CLOU" "GUSTY WINDS"                   
##  [59] "FLOODING/HEAVY RAIN"            "HEAVY SURF COASTAL FLOODING"   
##  [61] "HIGH SURF"                      "WILD FIRES"                    
##  [63] "HIGH"                           "WINTER STORM HIGH WINDS"       
##  [65] "WINTER STORMS"                  "MUDSLIDES"                     
##  [67] "RAINSTORM"                      "SEVERE THUNDERSTORM"           
##  [69] "SEVERE THUNDERSTORMS"           "SEVERE THUNDERSTORM WINDS"     
##  [71] "THUNDERSTORMS WINDS"            "FLOOD/FLASH FLOOD"             
##  [73] "FLOOD/RAIN/WINDS"               "THUNDERSTORMS"                 
##  [75] "FLASH FLOOD WINDS"              "WINDS"                         
##  [77] "FUNNEL CLOUD"                   "HIGH WIND DAMAGE"              
##  [79] "STRONG WIND"                    "HEAVY SNOWPACK"                
##  [81] "FLASH FLOOD/"                   "HEAVY SURF"                    
##  [83] "DRY MIRCOBURST WINDS"           "DRY MICROBURST"                
##  [85] "URBAN FLOOD"                    "THUNDERSTORM WINDSS"           
##  [87] "MICROBURST WINDS"               "HEAT WAVE"                     
##  [89] "UNSEASONABLY WARM"              "COASTAL FLOODING"              
##  [91] "STRONG WINDS"                   "BLIZZARD"                      
##  [93] "WATERSPOUT/TORNADO"             "WATERSPOUT TORNADO"            
##  [95] "STORM SURGE"                    "URBAN/SMALL STREAM FLOOD"      
##  [97] "WATERSPOUT-"                    "TORNADOES, TSTM WIND, HAIL"    
##  [99] "TROPICAL STORM ALBERTO"         "TROPICAL STORM"                
## [101] "TROPICAL STORM GORDON"          "TROPICAL STORM JERRY"          
## [103] "LIGHTNING THUNDERSTORM WINDS"   "URBAN FLOODING"                
## [105] "MINOR FLOODING"                 "WATERSPOUT-TORNADO"            
## [107] "LIGHTNING INJURY"               "LIGHTNING AND THUNDERSTORM WIN"
## [109] "FLASH FLOODS"                   "THUNDERSTORM WINDS53"          
## [111] "WILDFIRE"                       "DAMAGING FREEZE"               
## [113] "THUNDERSTORM WINDS 13"          "HURRICANE"                     
## [115] "SNOW"                           "LIGNTNING"                     
## [117] "FROST"                          "FREEZING RAIN/SNOW"            
## [119] "HIGH WINDS/"                    "THUNDERSNOW"                   
## [121] "FLOODS"                         "COOL AND WET"                  
## [123] "HEAVY RAIN/SNOW"                "GLAZE ICE"                     
## [125] "MUD SLIDE"                      "HIGH  WINDS"                   
## [127] "RURAL FLOOD"                    "MUD SLIDES"                    
## [129] "EXTREME HEAT"                   "DROUGHT"                       
## [131] "COLD AND WET CONDITIONS"        "EXCESSIVE WETNESS"             
## [133] "SLEET/ICE STORM"                "GUSTNADO"                      
## [135] "FREEZING RAIN"                  "SNOW AND HEAVY SNOW"           
## [137] "GROUND BLIZZARD"                "EXTREME WIND CHILL"            
## [139] "MAJOR FLOOD"                    "SNOW/HEAVY SNOW"               
## [141] "FREEZING RAIN/SLEET"            "ICE JAM FLOODING"              
## [143] "COLD AIR TORNADO"               "WIND DAMAGE"                   
## [145] "FOG"                            "TSTM WIND 55"                  
## [147] "SMALL STREAM FLOOD"             "THUNDERTORM WINDS"             
## [149] "HAIL/WINDS"                     "SNOW AND ICE"                  
## [151] "WIND STORM"                     "GRASS FIRES"                   
## [153] "LAKE FLOOD"                     "HAIL/WIND"                     
## [155] "WIND/HAIL"                      "ICE"                           
## [157] "SNOW AND ICE STORM"             "THUNDERSTORM  WINDS"           
## [159] "WINTER WEATHER"                 "DROUGHT/EXCESSIVE HEAT"        
## [161] "THUNDERSTORMS WIND"             "TUNDERSTORM WIND"              
## [163] "URBAN AND SMALL STREAM FLOODIN" "THUNDERSTORM WIND/LIGHTNING"   
## [165] "HEAVY RAIN/SEVERE WEATHER"      "THUNDERSTORM"                  
## [167] "WATERSPOUT/ TORNADO"            "LIGHTNING."                    
## [169] "HURRICANE-GENERATED SWELLS"     "RIVER AND STREAM FLOOD"        
## [171] "HIGH WINDS/COASTAL FLOOD"       "RAIN"                          
## [173] "RIVER FLOODING"                 "ICE FLOES"                     
## [175] "THUNDERSTORM WIND G50"          "LIGHTNING FIRE"                
## [177] "HEAVY LAKE SNOW"                "RECORD COLD"                   
## [179] "HEAVY SNOW/FREEZING RAIN"       "COLD WAVE"                     
## [181] "DUST DEVIL WATERSPOUT"          "TORNADO F3"                    
## [183] "TORNDAO"                        "FLOOD/RIVER FLOOD"             
## [185] "MUD SLIDES URBAN FLOODING"      "TORNADO F1"                    
## [187] "GLAZE/ICE STORM"                "GLAZE"                         
## [189] "HEAVY SNOW/WINTER STORM"        "MICROBURST"                    
## [191] "AVALANCE"                       "BLIZZARD/WINTER STORM"         
## [193] "DUST STORM/HIGH WINDS"          "ICE JAM"                       
## [195] "FOREST FIRES"                   "FROST\\FREEZE"                 
## [197] "THUNDERSTORM WINDS."            "HVY RAIN"                      
## [199] "HAIL 150"                       "HAIL 075"                      
## [201] "HAIL 100"                       "THUNDERSTORM WIND G55"         
## [203] "HAIL 125"                       "THUNDERSTORM WIND G60"         
## [205] "THUNDERSTORM WINDS G60"         "HARD FREEZE"                   
## [207] "HAIL 200"                       "HEAVY SNOW AND HIGH WINDS"     
## [209] "HEAVY SNOW/HIGH WINDS & FLOOD"  "HEAVY RAIN AND FLOOD"          
## [211] "RIP CURRENTS/HEAVY SURF"        "URBAN AND SMALL"               
## [213] "WILDFIRES"                      "FOG AND COLD TEMPERATURES"     
## [215] "SNOW/COLD"                      "FLASH FLOOD FROM ICE JAMS"     
## [217] "TSTM WIND G58"                  "MUDSLIDE"                      
## [219] "HEAVY SNOW SQUALLS"             "SNOW SQUALL"                   
## [221] "SNOW/ICE STORM"                 "HEAVY SNOW/SQUALLS"            
## [223] "HEAVY SNOW-SQUALLS"             "ICY ROADS"                     
## [225] "HEAVY MIX"                      "SNOW FREEZING RAIN"            
## [227] "SNOW/SLEET"                     "SNOW/FREEZING RAIN"            
## [229] "SNOW SQUALLS"                   "SNOW/SLEET/FREEZING RAIN"      
## [231] "RECORD SNOW"                    "HAIL 0.75"                     
## [233] "RECORD HEAT"                    "THUNDERSTORM WIND 65MPH"       
## [235] "THUNDERSTORM WIND/ TREES"       "THUNDERSTORM WIND/AWNING"      
## [237] "THUNDERSTORM WIND 98 MPH"       "THUNDERSTORM WIND TREES"       
## [239] "TORNADO F2"                     "RIP CURRENTS"                  
## [241] "HURRICANE EMILY"                "COASTAL SURGE"                 
## [243] "HURRICANE GORDON"               "HURRICANE FELIX"               
## [245] "THUNDERSTORM WIND 60 MPH"       "THUNDERSTORM WINDS 63 MPH"     
## [247] "THUNDERSTORM WIND/ TREE"        "THUNDERSTORM DAMAGE TO"        
## [249] "THUNDERSTORM WIND 65 MPH"       "FLASH FLOOD - HEAVY RAIN"      
## [251] "THUNDERSTORM WIND."             "FLASH FLOOD/ STREET"           
## [253] "BLOWING SNOW"                   "HEAVY SNOW/BLIZZARD"           
## [255] "THUNDERSTORM HAIL"              "THUNDERSTORM WINDSHAIL"        
## [257] "LIGHTNING  WAUSEON"             "THUDERSTORM WINDS"             
## [259] "ICE AND SNOW"                   "STORM FORCE WINDS"             
## [261] "HEAVY SNOW/ICE"                 "LIGHTING"                      
## [263] "HIGH WIND/HEAVY SNOW"           "THUNDERSTORM WINDS AND"        
## [265] "HEAVY PRECIPITATION"            "HIGH WIND/BLIZZARD"            
## [267] "TSTM WIND DAMAGE"               "FLOOD FLASH"                   
## [269] "RAIN/WIND"                      "SNOW/ICE"                      
## [271] "HAIL 75"                        "HEAT WAVE DROUGHT"             
## [273] "HEAVY SNOW/BLIZZARD/AVALANCHE"  "HEAT WAVES"                    
## [275] "UNSEASONABLY WARM AND DRY"      "UNSEASONABLY COLD"             
## [277] "RECORD/EXCESSIVE HEAT"          "THUNDERSTORM WIND G52"         
## [279] "HIGH WAVES"                     "FLASH FLOOD/FLOOD"             
## [281] "FLOOD/FLASH"                    "LOW TEMPERATURE"               
## [283] "HEAVY RAINS/FLOODING"           "THUNDERESTORM WINDS"           
## [285] "THUNDERSTORM WINDS/FLOODING"    "HYPOTHERMIA"                   
## [287] "THUNDEERSTORM WINDS"            "THUNERSTORM WINDS"             
## [289] "HIGH WINDS/COLD"                "COLD/WINDS"                    
## [291] "SNOW/ BITTER COLD"              "COLD WEATHER"                  
## [293] "RAPIDLY RISING WATER"           "WILD/FOREST FIRE"              
## [295] "ICE/STRONG WINDS"               "SNOW/HIGH WINDS"               
## [297] "HIGH WINDS/SNOW"                "SNOWMELT FLOODING"             
## [299] "HEAVY SNOW AND STRONG WINDS"    "SNOW ACCUMULATION"             
## [301] "SNOW/ ICE"                      "SNOW/BLOWING SNOW"             
## [303] "TORNADOES"                      "THUNDERSTORM WIND/HAIL"        
## [305] "FREEZING DRIZZLE"               "HAIL 175"                      
## [307] "FLASH FLOODING/FLOOD"           "HAIL 275"                      
## [309] "HAIL 450"                       "EXCESSIVE RAINFALL"            
## [311] "THUNDERSTORMW"                  "HAILSTORM"                     
## [313] "TSTM WINDS"                     "TSTMW"                         
## [315] "TSTM WIND 65)"                  "TROPICAL STORM DEAN"           
## [317] "THUNDERSTORM WINDS/ FLOOD"      "LANDSLIDE"                     
## [319] "HIGH WIND AND SEAS"             "THUNDERSTORMWINDS"             
## [321] "WILD/FOREST FIRES"              "HEAVY SEAS"                    
## [323] "HAIL DAMAGE"                    "FLOOD & HEAVY RAIN"            
## [325] "?"                              "THUNDERSTROM WIND"             
## [327] "FLOOD/FLASHFLOOD"               "HIGH WATER"                    
## [329] "HIGH WIND 48"                   "LANDSLIDES"                    
## [331] "URBAN/SMALL STREAM"             "BRUSH FIRE"                    
## [333] "HEAVY SHOWER"                   "HEAVY SWELLS"                  
## [335] "URBAN SMALL"                    "URBAN FLOODS"                  
## [337] "FLASH FLOOD/LANDSLIDE"          "HEAVY RAIN/SMALL STREAM URBAN" 
## [339] "FLASH FLOOD LANDSLIDES"         "TSTM WIND/HAIL"                
## [341] "OTHER"                          "ICE JAM FLOOD (MINOR"          
## [343] "URBAN/SML STREAM FLD"           "ROUGH SURF"                    
## [345] "MARINE ACCIDENT"                "COASTAL STORM"                 
## [347] "EROSION/CSTL FLOOD"             "BEACH EROSION"                 
## [349] "HEAVY RAIN/HIGH SURF"           "UNSEASONABLE COLD"             
## [351] "EARLY FROST"                    "WINTRY MIX"                    
## [353] "TORRENTIAL RAINFALL"            "LANDSLUMP"                     
## [355] "HURRICANE EDOUARD"              "TIDAL FLOODING"                
## [357] "EXTREME WINDCHILL"              "EXTENDED COLD"                 
## [359] "WHIRLWIND"                      "HEAVY SNOW SHOWER"             
## [361] "LIGHT SNOW"                     "MIXED PRECIP"                  
## [363] "FREEZING SPRAY"                 "DOWNBURST"                     
## [365] "LIGHT SNOWFALL"                 "GUSTY WIND/RAIN"               
## [367] "GUSTY WIND/HVY RAIN"            "COLD TEMPERATURE"              
## [369] "COLD AND SNOW"                  "RAIN/SNOW"                     
## [371] "TSTM WIND (G45)"                "GUSTY WIND"                    
## [373] "TSTM WIND 40"                   "TSTM WIND 45"                  
## [375] "TSTM WIND (41)"                 "TSTM WIND (G40)"               
## [377] "FROST/FREEZE"                   "AGRICULTURAL FREEZE"           
## [379] "HYPOTHERMIA/EXPOSURE"           "LAKE EFFECT SNOW"              
## [381] "MIXED PRECIPITATION"            "BLACK ICE"                     
## [383] "COASTALSTORM"                   "DAM BREAK"                     
## [385] "GRADIENT WIND"                  "TSTM WIND AND LIGHTNING"       
## [387] "WET MICROBURST"                 "HEAVY SURF AND WIND"           
## [389] "TYPHOON"                        "HIGH SWELLS"                   
## [391] "SMALL HAIL"                     "UNSEASONAL RAIN"               
## [393] "COASTAL FLOODING/EROSION"       "TSTM WIND  (G45)"              
## [395] "HIGH WIND (G40)"                "TSTM WIND (G35)"               
## [397] "COASTAL EROSION"                "SEICHE"                        
## [399] "COASTAL  FLOODING/EROSION"      "HYPERTHERMIA/EXPOSURE"         
## [401] "ROCK SLIDE"                     "GUSTY WIND/HAIL"               
## [403] "LANDSPOUT"                      "EXCESSIVE SNOW"                
## [405] "FLOOD/FLASH/FLOOD"              "WIND AND WAVE"                 
## [407] "LIGHT FREEZING RAIN"            "ICE ROADS"                     
## [409] "ROUGH SEAS"                     "TSTM WIND G45"                 
## [411] "NON-SEVERE WIND DAMAGE"         "WARM WEATHER"                  
## [413] "THUNDERSTORM WIND (G40)"        "LATE SEASON SNOW"              
## [415] "WINTER WEATHER MIX"             "ROGUE WAVE"                    
## [417] "FALLING SNOW/ICE"               "NON-TSTM WIND"                 
## [419] "NON TSTM WIND"                  "BLOWING DUST"                  
## [421] "VOLCANIC ASH"                   "HIGH SURF ADVISORY"            
## [423] "HAZARDOUS SURF"                 "ICE ON ROAD"                   
## [425] "DROWNING"                       "EXTREME COLD/WIND CHILL"       
## [427] "MARINE TSTM WIND"               "HURRICANE/TYPHOON"             
## [429] "WINTER WEATHER/MIX"             "ASTRONOMICAL HIGH TIDE"        
## [431] "HEAVY SURF/HIGH SURF"           "TROPICAL DEPRESSION"           
## [433] "LAKE-EFFECT SNOW"               "MARINE HIGH WIND"              
## [435] "TSUNAMI"                        "STORM SURGE/TIDE"              
## [437] "COLD/WIND CHILL"                "LAKESHORE FLOOD"               
## [439] "MARINE THUNDERSTORM WIND"       "MARINE STRONG WIND"            
## [441] "ASTRONOMICAL LOW TIDE"          "DENSE SMOKE"                   
## [443] "MARINE HAIL"                    "FREEZING FOG"
unique(StormDataPost93$EventClassified)
## [1] "No"  "Yes"

This data needs extensive work as there are several entries in which the Event does not correspond with an event in the list.

DataNeedingWorkPost93 <- StormDataPost93[StormDataPost93$EventClassified == "No",]
DataNotNeedingWorkPost93 <- StormDataPost93[StormDataPost93$EventClassified == "Yes",]

The sum of the injuries and fatalities for these events is:

sum(DataNeedingWorkPost93$INJURIES) #returns 10507
## [1] 10507
sum(DataNeedingWorkPost93$FATALITIES) # returns 1824
## [1] 1824

Compare these with the total for the Data Set:

sum(StormDataSubset$INJURIES) # returns 140528
## [1] 140528
sum(StormDataSubset$FATALITIES) # returns 15145
## [1] 15145

The totals for the events in the data set that are not explicitly listed by NOAA are fairly large. As proportions of the overall totals:

HumanInjuriesProportion <- sum(DataNeedingWorkPost93$INJURIES)/sum(StormDataSubset$INJURIES)
HumanInjuriesProportion
## [1] 0.07476802

yields 0.0748 or roughly 7 %

HumanFatalitiesProportion <- sum(DataNeedingWorkPost93$FATALITIES)/sum(StormDataSubset$FATALITIES)
HumanFatalitiesProportion
## [1] 0.1204358

yields 0.12 or roughly 12 %

The sum of the property and cost damage for these events is:

sum(DataNeedingWorkPost93$PROPDMG)
## [1] 2174003

Returns 2174003

sum(DataNeedingWorkPost93$CROPDMG) 
## [1] 188933.9

Returns 188933.9

Compare these with the total for the Data Set:

sum(StormDataSubset$PROPDMG) # returns 10884500
## [1] 10884500
sum(StormDataSubset$CROPDMG) # returns 1377827
## [1] 1377827

The property and crop damage totals for the events that are not explicitly listed by NOAA are also fairly large. As proportions of the overall totals:

PropertyDamageProportion <- sum(DataNeedingWorkPost93$PROPDMG)/sum(StormDataSubset$PROPDMG)
PropertyDamageProportion 
## [1] 0.1997339
CropDamageProportion <- sum(DataNeedingWorkPost93$CROPDMG)/sum(StormDataSubset$CROPDMG)
CropDamageProportion
## [1] 0.1371245

Property damage yields 0.1997, or roughly 20 %, and crop damage yields 0.137, or roughly 14 %. Since these values are large, work is required to recategorise the data according to the list provided by NOAA.
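The four proportions above can also be computed in one call. The following is a minimal sketch on made-up numbers; `share_of_total` is a hypothetical helper and is not part of the original analysis.

```r
# Hypothetical helper: share of each column's total that falls in a subset.
# Column names follow the report; the toy values below are invented.
share_of_total <- function(subset_df, full_df, cols) {
  colSums(subset_df[, cols, drop = FALSE]) /
    colSums(full_df[, cols, drop = FALSE])
}

toy <- data.frame(
  EventClassified = c("No", "Yes", "Yes", "No"),
  INJURIES        = c(10, 60, 20, 10),
  FATALITIES      = c(1, 5, 3, 1)
)
toy_unclassified <- toy[toy$EventClassified == "No", ]

# Named vector with one proportion per column
shares <- share_of_total(toy_unclassified, toy, c("INJURIES", "FATALITIES"))
round(shares, 3)
```

The same call would accept PROPDMG and CROPDMG in `cols`, assuming the exponent columns have not yet been applied, as is the case in the comparison above.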

The events in DataNeedingWorkPost93 are recategorised first, and the result is then recombined with DataNotNeedingWorkPost93 using rbind. The following code is extensive and is most likely not optimal.

Use the subsetted data frame: DataNeedingWorkPost93 and reference list EventNames, recategorising the events:

#Thunderstorm Wind1:
ThunderstormRowsToRename1 <- grep("THU", DataNeedingWorkPost93$Event, ignore.case = FALSE)
DataNeedingWorkPost93[ThunderstormRowsToRename1, "Event"] <- "THUNDERSTORM WIND"
# Rechecking how many events we now have that are not classified:

#Thunderstorm Wind2:
ThunderstormRowsToRename2 <- grep("TSTM", DataNeedingWorkPost93$Event, ignore.case = FALSE)
DataNeedingWorkPost93[ThunderstormRowsToRename2, "Event"] <- "THUNDERSTORM WIND"

#Thunderstorm Wind3 (pattern anchored at the start so that labels containing "TEMPERATURE" are not matched):
ThunderstormRowsToRename3 <- grep("^TU", DataNeedingWorkPost93$Event, ignore.case = FALSE)
DataNeedingWorkPost93[ThunderstormRowsToRename3, "Event"] <- "THUNDERSTORM WIND"
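Short patterns can match more labels than intended, so it helps to look at what a pattern matches before assigning. The sketch below uses a few labels taken from the event list above; `value = TRUE` makes `grep` return the matching strings rather than their positions.

```r
# A few labels copied from the event list
labels <- c("TUNDERSTORM WIND", "LOW TEMPERATURE",
            "COLD TEMPERATURE", "TSTM WIND 45")

# Unanchored: matches "TU" anywhere in the label, including "TEMPERATURE"
loose <- grep("TU", labels, value = TRUE)

# Anchored: matches only labels that start with "TU"
anchored <- grep("^TU", labels, value = TRUE)

loose
anchored
```

Here the unanchored pattern picks up the two temperature labels as well as the misspelt thunderstorm label, while the anchored pattern picks up only the latter.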

In addition to conventional thunderstorms, dry microbursts also originate from high-based thunderstorms, so they are classified as thunderstorm wind as well.

DryMicroburstsRowsToReplace <- grep("DRY", DataNeedingWorkPost93$Event, ignore.case = FALSE)
DataNeedingWorkPost93[DryMicroburstsRowsToReplace, "Event"] <- "THUNDERSTORM WIND"

#Hail 1:
HailRowsToReplace <- grep("HAIL", DataNeedingWorkPost93$Event, ignore.case = FALSE)
DataNeedingWorkPost93[HailRowsToReplace, "Event"] <- "HAIL"

AstronomicalLowTideRowsToReplace <- grep("ASTRO", DataNeedingWorkPost93$Event, ignore.case = FALSE)
AstronomicalLowTideRowsToReplace # returns 8 matching rows; these are not reassigned here
## [1] 59676 61954 64479 65086 69927 69929 69930 69936
# EventNames[2]
AvalancheRowsToReplace <- grep("AVA", DataNeedingWorkPost93$Event, ignore.case = FALSE)
DataNeedingWorkPost93[AvalancheRowsToReplace, "Event"] <- "AVALANCHE"

# EventNames[3]
BlizzardRowsToReplace <- grep("BLIZ", DataNeedingWorkPost93$Event, ignore.case = FALSE)
DataNeedingWorkPost93[BlizzardRowsToReplace, "Event"] <- "BLIZZARD"

# EventNames[4]
CoastalFloodRowsToReplace <- grep("COAST", DataNeedingWorkPost93$Event, ignore.case = FALSE)
DataNeedingWorkPost93[CoastalFloodRowsToReplace, "Event"] <- EventNames[4]

# EventNames[6]
LandSlideRowsToReplace <- grep("LAND", DataNeedingWorkPost93$Event, ignore.case = FALSE)
DataNeedingWorkPost93[LandSlideRowsToReplace, "Event"] <- EventNames[6]

# EventNames[7]
DenseFogRowsToReplace <- grep("FOG", DataNeedingWorkPost93$Event, ignore.case = FALSE)
DataNeedingWorkPost93[DenseFogRowsToReplace, "Event"] <- EventNames[7]

# EventNames[8]
DenseSmokeRowsToReplace <- grep("SMOKE", DataNeedingWorkPost93$Event, ignore.case = FALSE)
DataNeedingWorkPost93[DenseSmokeRowsToReplace, "Event"] <- EventNames[8]

# EventNames[9]
DroughtRowsToReplace <- grep("DROUGHT", DataNeedingWorkPost93$Event, ignore.case = FALSE)
DataNeedingWorkPost93[DroughtRowsToReplace, "Event"] <- EventNames[9]

# EventNames[10]
DustDevilRowsToReplace <- grep("DEVIL", DataNeedingWorkPost93$Event, ignore.case = FALSE)
DataNeedingWorkPost93[DustDevilRowsToReplace, "Event"] <- EventNames[10]

# EventNames[11]
DustStormRowsToReplace <- grep("DUST STORM|BLOWING DUST", DataNeedingWorkPost93$Event, ignore.case = FALSE)
DataNeedingWorkPost93[DustStormRowsToReplace, "Event"] <- EventNames[11]

# EventNames[12]
HeatRowsToReplace <- grep("HEAT", DataNeedingWorkPost93$Event, ignore.case = FALSE)
DataNeedingWorkPost93[HeatRowsToReplace, "Event"] <- EventNames[12]

# EventNames[13]
ExtremeColdWindChillRowsToReplace <- grep("COLD", DataNeedingWorkPost93$Event, ignore.case = FALSE)
DataNeedingWorkPost93[ExtremeColdWindChillRowsToReplace, "Event"] <- EventNames[13]

# EventNames[14]
FlashFloodRowsToReplace <- grep("FLASH FLOOD|FLOOD FLASH|MAJOR FLOOD", DataNeedingWorkPost93$Event, ignore.case = FALSE)
DataNeedingWorkPost93[FlashFloodRowsToReplace, "Event"] <- EventNames[14]

# EventNames[15]
FloodRowsToReplace <- grep("FLOOD/|BREAKUP FLOODING|URBAN FLOODS|EROSION/CSTL FLOOD", DataNeedingWorkPost93$Event, ignore.case = FALSE)
DataNeedingWorkPost93[FloodRowsToReplace, "Event"]<- EventNames[15]

# EventNames[16]
FrostRowsToReplace <- grep("FROST", DataNeedingWorkPost93$Event, ignore.case = FALSE)
DataNeedingWorkPost93[FrostRowsToReplace, "Event"] <- EventNames[16]

# EventNames[17]
FunnelCloudRowsToReplace <- grep("FUNNEL", DataNeedingWorkPost93$Event, ignore.case = FALSE)
DataNeedingWorkPost93[FunnelCloudRowsToReplace, "Event"] <- EventNames[17]

# EventNames[19]
HailRowsToReplace <- grep("HAIL", DataNeedingWorkPost93$Event, ignore.case = FALSE)
DataNeedingWorkPost93[HailRowsToReplace, "Event"] <- EventNames[19]

# EventNames[20]; the pattern "HEAT" also matches rows already renamed above to a name containing "HEAT"
HeatRowsToReplace <- grep("HEAT", DataNeedingWorkPost93$Event, ignore.case = FALSE)
DataNeedingWorkPost93[HeatRowsToReplace, "Event"] <- EventNames[20]

# EventNames[21]
HeavyRainRowsToReplace <- grep("HEAVY RAIN", DataNeedingWorkPost93$Event, ignore.case = FALSE)
DataNeedingWorkPost93[HeavyRainRowsToReplace, "Event"] <- EventNames[21]

# EventNames[22]
HeavySnowRowsToReplace <- grep("HEAVY SNOW", DataNeedingWorkPost93$Event, ignore.case = FALSE)
DataNeedingWorkPost93[HeavySnowRowsToReplace, "Event"] <- EventNames[22]

# EventNames[23]
HighSurfRowsToReplace <- grep("HIGH SURF", DataNeedingWorkPost93$Event, ignore.case = FALSE)
DataNeedingWorkPost93[HighSurfRowsToReplace, "Event"] <- EventNames[23]

# EventNames[24]
HighWindRowsToReplace <- grep("HIGH WIND", DataNeedingWorkPost93$Event, ignore.case = FALSE)
DataNeedingWorkPost93[HighWindRowsToReplace, "Event"] <- EventNames[24]

# EventNames[25]
HurricaneRowsToReplace <- grep("HURRICAN|TYPHOON", DataNeedingWorkPost93$Event, ignore.case = FALSE)
DataNeedingWorkPost93[HurricaneRowsToReplace, "Event"] <- EventNames[25]

# EventNames[26]
IceStormRowsToReplace <- grep("ICE", DataNeedingWorkPost93$Event, ignore.case = FALSE)
DataNeedingWorkPost93[IceStormRowsToReplace, "Event"] <- EventNames[26]

# EventNames[27]
LakeEffectSnowRowsToReplace <- grep("LAKE", DataNeedingWorkPost93$Event, ignore.case = FALSE)
DataNeedingWorkPost93[LakeEffectSnowRowsToReplace, "Event"] <- EventNames[27]

# EventNames[29]
LightningRowsToReplace <- grep("LIGHTNING", DataNeedingWorkPost93$Event, ignore.case = FALSE)
DataNeedingWorkPost93[LightningRowsToReplace, "Event"] <- EventNames[29]

# EventNames[34]
RipCurrentRowsToReplace <- grep("RIP", DataNeedingWorkPost93$Event, ignore.case = FALSE)
DataNeedingWorkPost93[RipCurrentRowsToReplace, "Event"] <- EventNames[34]

# EventNames[36]
SleetRowsToReplace <- grep("SLEET", DataNeedingWorkPost93$Event, ignore.case = FALSE)
DataNeedingWorkPost93[SleetRowsToReplace, "Event"] <- EventNames[36]

# EventNames[37]
StormSurgeRowsToReplace <- grep("SURGE", DataNeedingWorkPost93$Event, ignore.case = FALSE)
DataNeedingWorkPost93[StormSurgeRowsToReplace, "Event"] <- EventNames[37]

# EventNames[38]
StrongWindRowsToReplace <- grep("STRONG", DataNeedingWorkPost93$Event, ignore.case = FALSE)
DataNeedingWorkPost93[StrongWindRowsToReplace, "Event"] <- EventNames[38]

# EventNames[40]
TornadoRowsToReplace <- grep("TORNADO|TORNDAO", DataNeedingWorkPost93$Event, ignore.case = FALSE)
DataNeedingWorkPost93[TornadoRowsToReplace, "Event"] <- EventNames[40]

# EventNames[42]
TropicalStormRowsToReplace <- grep("TROPICAL", DataNeedingWorkPost93$Event, ignore.case = FALSE)
DataNeedingWorkPost93[TropicalStormRowsToReplace, "Event"] <- EventNames[42]

# EventNames[45]
WaterspoutRowsToReplace <- grep("WATERSPOUT", DataNeedingWorkPost93$Event, ignore.case = FALSE)
DataNeedingWorkPost93[WaterspoutRowsToReplace, "Event"] <- EventNames[45]

# EventNames[46]
WildFireRowsToReplace <- grep("WILD|FIRE", DataNeedingWorkPost93$Event, ignore.case = FALSE)
DataNeedingWorkPost93[WildFireRowsToReplace, "Event"] <- EventNames[46]

# EventNames[48] (the assignment below uses index 48, not 47)
WinterStormRowsToReplace <- grep("WINTER", DataNeedingWorkPost93$Event, ignore.case = FALSE)
DataNeedingWorkPost93[WinterStormRowsToReplace, "Event"] <- EventNames[48]

# Done with replacing post '93 Events
StormDataPost93[StormDataPost93$EventClassified == "No","Event"] <- DataNeedingWorkPost93$Event
StormDataPost93 <- rbind(DataNeedingWorkPost93, DataNotNeedingWorkPost93)
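The long sequence of grep-and-assign steps above could also be driven by a small rule table. The following is only a sketch of that idea: the `rules` table and the `recategorise` function are hypothetical, cover just a handful of the patterns, and are not the code that produced the results in this report.

```r
# Hypothetical rule table: a regular expression and the name it maps to.
# Rules are applied top to bottom, so later rules see the results of
# earlier ones, just as in the step-by-step code.
rules <- data.frame(
  pattern = c("^THU|^TSTM|^TUNDER", "HAIL", "AVA", "TORNADO|TORNDAO"),
  target  = c("THUNDERSTORM WIND", "HAIL", "AVALANCHE", "TORNADO"),
  stringsAsFactors = FALSE
)

recategorise <- function(events, rules) {
  for (i in seq_len(nrow(rules))) {
    hits <- grepl(rules$pattern[i], events)
    events[hits] <- rules$target[i]
  }
  events
}

raw <- c("THUNDERSTORM WINDS 13", "TSTM WIND/HAIL", "HAIL 075",
         "AVALANCE", "TORNDAO", "FOG")
cleaned <- recategorise(raw, rules)
cleaned
```

Keeping the patterns in one table makes the order of the rules explicit, which matters because a label such as "TSTM WIND/HAIL" is claimed by whichever rule runs first.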

Recombining the data frames that were created by splitting the original subset of the data by date:

StormDataProcessed <- rbind(StormDataPre55, StormDataBtw55And93, StormDataPost93)

Converting the Events variable to factor:

EventsFactors <- unique(StormDataProcessed$Event)
StormDataProcessed$EventsFactors <- factor(StormDataProcessed$Event, EventsFactors)

Completed preprocessing of data.

Results

Question 1: Human Health Effect of Extreme Weather Events

We need to find which types of extreme weather events are the most harmful to the population. Harm is taken as the sum of injuries and fatalities for each event type:

TotalHarm <- aggregate(cbind(INJURIES, FATALITIES) ~ Event, data = StormDataProcessed, sum)
TotalHarm$Totals <- TotalHarm$INJURIES + TotalHarm$FATALITIES
TotalHarm$FatalVsInjured <- ((TotalHarm$FATALITIES)/(TotalHarm$INJURIES)*100)
TotalHarmOrdered <- TotalHarm[order(-TotalHarm$Totals),]
head(TotalHarmOrdered[,])
##                 Event INJURIES FATALITIES Totals FatalVsInjured
## 136           TORNADO    91407       5636  97043       6.165830
## 134 THUNDERSTORM WIND     9549        788  10337       8.252173
## 22     EXCESSIVE HEAT     6525       1903   8428      29.164751
## 30              FLOOD     6789        471   7260       6.937693
## 84          LIGHTNING     5231        817   6048      15.618429
## 51               HEAT     2684       1229   3913      45.789866
x1 <- TotalHarmOrdered$Event[1:10]
y1 <- TotalHarmOrdered$Totals[1:10]
barplot(y1, width = 1, names.arg = x1, xlab = "Weather Event", ylab = "Total Harm to Humans")

The results show that tornadoes have caused the most harm by a wide margin: 97,043 injuries and fatalities combined, compared with 10,337 for thunderstorm wind, the next largest category.

Some weather events are more often fatal than others. The events for which fatalities exceed injuries (FatalVsInjured above 100) are found with:

head(TotalHarmOrdered[which(TotalHarmOrdered$FatalVsInjured > 100),])
##                       Event INJURIES FATALITIES Totals FatalVsInjured
## 111             RIP CURRENT      529        577   1106       109.0737
## 26  EXTREME COLD/WIND CHILL      307        353    660       114.9837
## 5                 AVALANCHE      171        225    396       131.5789
## 10          COLD/WIND CHILL       12         95    107       791.6667
## 28        EXTREME WINDCHILL        5         17     22       340.0000
## 9             COASTAL FLOOD        9         10     19       111.1111

While the actual number of fatalities may be small compared with the harm that tornadoes cause, it is still worthwhile being aware of the most deadly weather event types.
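The ratio FatalVsInjured divides by the number of injuries, so an event type with no recorded injuries gives Inf or NaN. One possible way to handle this, shown here as a sketch, is to leave the ratio undefined in that case. The first two rows use figures from the table above; the third row, "EXAMPLE EVENT", is invented to show the zero-injury case.

```r
harm <- data.frame(
  Event      = c("RIP CURRENT", "AVALANCHE", "EXAMPLE EVENT"),
  INJURIES   = c(529, 171, 0),
  FATALITIES = c(577, 225, 3)
)

# Percentage of fatalities relative to injuries; NA when there are no injuries
harm$FatalVsInjured <- ifelse(harm$INJURIES > 0,
                              harm$FATALITIES / harm$INJURIES * 100,
                              NA_real_)

# Event types where fatalities exceed injuries, ignoring undefined ratios
deadlier <- harm[!is.na(harm$FatalVsInjured) & harm$FatalVsInjured > 100, ]
deadlier
```

Whether zero-injury events should be dropped or reported separately is a judgement call; this sketch simply keeps them out of the comparison.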

Question 2: Economic Cost of Extreme Weather Events

The PROPDMGEXP and CROPDMGEXP labels are first converted to upper case so that lower-case and upper-case versions of the same label are treated alike:

StormDataProcessed$PROPDMGEXP <- toupper(StormDataProcessed$PROPDMGEXP)
StormDataProcessed$CROPDMGEXP <- toupper(StormDataProcessed$CROPDMGEXP)

Assumption: every label other than H, K, M or B is given a conversion factor of 1. Further investigation into the remarks is needed to see if the other labels mean something.

Create a list of the allowed exponents for the property damage conversion, then create a list of the conversion factors for each exponent. Do the same for the crop damage:

PropDmgExp <- c("H", "K", "M", "B")
PropDmgConvert <- c(100, 1000, 1000000, 1000000000)
CropDmgExp <- c("H", "K", "M", "B")
CropDmgConvert <- c(100, 1000, 1000000, 1000000000)

Create a column for the conversion factors, and set it to 1 wherever the value in the EXP column is not in the exponents list:

StormDataProcessed$PropDmgConversion <- 0
StormDataProcessed[!StormDataProcessed$PROPDMGEXP %in% PropDmgExp, "PropDmgConversion"] <- 1
StormDataProcessed$CropDmgConversion <- 0
StormDataProcessed[!StormDataProcessed$CROPDMGEXP %in% CropDmgExp, "CropDmgConversion"] <- 1

Now set PropDmgConversion and CropDmgConversion to the appropriate value for each exponent:

StormDataProcessed[StormDataProcessed$PROPDMGEXP == "H", "PropDmgConversion"] <- 100
StormDataProcessed[StormDataProcessed$CROPDMGEXP == "H", "CropDmgConversion"] <- 100
StormDataProcessed[StormDataProcessed$PROPDMGEXP == "K", "PropDmgConversion"] <- 1000
StormDataProcessed[StormDataProcessed$CROPDMGEXP == "K", "CropDmgConversion"] <- 1000
StormDataProcessed[StormDataProcessed$PROPDMGEXP == "M", "PropDmgConversion"] <- 1000000
StormDataProcessed[StormDataProcessed$CROPDMGEXP == "M", "CropDmgConversion"] <- 1000000
# billions (assuming 1 000 000 000 in a US billion)
StormDataProcessed[StormDataProcessed$PROPDMGEXP == "B", "PropDmgConversion"] <- 1000000000
StormDataProcessed[StormDataProcessed$CROPDMGEXP == "B", "CropDmgConversion"] <- 1000000000
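The same mapping from exponent label to conversion factor can be written as a lookup in a named vector. This is a minimal sketch on a made-up vector of labels; the names `multipliers`, `exp_labels` and `conversion` are illustrative and do not appear in the analysis.

```r
# Conversion factor for each recognised exponent label
multipliers <- c(H = 1e2, K = 1e3, M = 1e6, B = 1e9)

# Made-up labels, including blanks and symbols that are not recognised
exp_labels <- toupper(c("k", "M", "", "B", "5", "h", "?"))

# Look each label up by name; unrecognised labels come back as NA
conversion <- unname(multipliers[exp_labels])

# Per the assumption above, anything unrecognised gets a factor of 1
conversion[is.na(conversion)] <- 1
conversion
```

With this approach the factors are stated once, so the property and crop columns cannot drift apart.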

We now have the appropriate conversion factor for each value

Create a new column that multiplies the damage amount given by the conversion:

StormDataEconomicCost <- StormDataProcessed
StormDataEconomicCostProperty <- mutate(StormDataEconomicCost, PropDmgCost = PROPDMG * PropDmgConversion)
StormDataEconomicCostCrop <- mutate(StormDataEconomicCost, CropDmgCost = CROPDMG * CropDmgConversion)

We now have two data frames, one with the converted amount for the property damage and the other for the crop damage. These are merged back into a single data frame:

StormDataEconomicCost <- merge(StormDataEconomicCostProperty,StormDataEconomicCostCrop)
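Both cost columns can also be added to one data frame in a single step, which avoids the merge. Because `merge` joins on every shared column, fully duplicated rows in the input would be multiplied by the join, so a single-step version may be safer. The sketch below uses base R's `transform` on a toy data frame with invented values; it is an alternative, not the code used for the results that follow.

```r
# Toy data with the same column names as the report; values are made up
costs <- data.frame(
  Event             = c("FLOOD", "HAIL"),
  PROPDMG           = c(2.5, 10),
  PropDmgConversion = c(1e6, 1e3),
  CROPDMG           = c(1, 0),
  CropDmgConversion = c(1e3, 1)
)

# Add both converted cost columns at once
costs <- transform(costs,
                   PropDmgCost = PROPDMG * PropDmgConversion,
                   CropDmgCost = CROPDMG * CropDmgConversion)
costs[, c("Event", "PropDmgCost", "CropDmgCost")]
```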

Aggregate the property and crop damage costs by event type, then order and plot the summary:

TotalCost <- aggregate(cbind(PropDmgCost, CropDmgCost) ~ Event, data = StormDataEconomicCost, sum)
TotalCost$TotalEconomicCost <- TotalCost$PropDmgCost + TotalCost$CropDmgCost
TotalCost$CropVsPropCost <- (TotalCost$CropDmgCost/TotalCost$PropDmgCost)
TotalCostOrdered <- TotalCost[order(-TotalCost$TotalEconomicCost),]
head(TotalCostOrdered[,])
##                 Event  PropDmgCost CropDmgCost TotalEconomicCost
## 30              FLOOD 152185361927 55089184450      207274546377
## 70  HURRICAN(TYPHOON)  85257070010  5506117800       90763187810
## 136           TORNADO  88934164359   432352870       89366517229
## 132  STORM SURGE/TIDE  48052699000      855000       48053554000
## 48               HAIL  20900962103  4278407323       25179369426
## 29        FLASH FLOOD  21435368261  1909227150       23344595411
##     CropVsPropCost
## 30    3.619874e-01
## 70    6.458254e-02
## 136   4.861494e-03
## 132   1.779297e-05
## 48    2.046991e-01
## 29    8.906902e-02
x2 <- TotalCostOrdered$Event[1:10]
y2 <- TotalCostOrdered$TotalEconomicCost[1:10]
barplot(y2, width = 1, names.arg = x2, ylab = "Total Economic Cost in USD", xlab = "Storm Weather Event")

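The results show that floods are by far the most costly event type, at roughly 207 billion USD in total, more than twice the cost of the next category. Of the damage caused by floods, property damage (about 152 billion USD) is roughly 2.8 times the crop damage (about 55 billion USD).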