In the first chart, primary breed is on the y axis and the percentage of the species is on the x axis. The most common licensed cat breed in Seattle is the Domestic Shorthair, at about 18.5% of licensed cats.
For dogs, the most common primary breed is the Retriever, Labrador, at about 9.6% of licensed dogs in Seattle.
Among pets licensed in Seattle, Washington between 2017 and 2018, the Domestic Shorthair is the most common cat breed at about 18.5%, so roughly 1 out of 5 licensed cats is a Domestic Shorthair.
The dog chart is more spread out across breeds: the top dog breed accounts for only about 9.6% of licensed dogs.
Across all licensed species in Seattle, dogs are the most common with 35,181, and cats come second with 17,294.
There are also 38 licensed goats within the city.
Among dogs, the most common name in Seattle is Lucy with 337 dogs, and Charlie comes second with 306.
There are about 1,900 Golden Retrievers within the city of Seattle.
## # A tibble: 6 x 11
##   primary_breed animal_name     n name_total breed_total percent_of_breed
##   <chr>         <chr>       <dbl>      <int>       <int>            <dbl>
## 1 Havanese      Oscar           8        100         466          0.0172
## 2 Spaniel, Ame~ Milo            7        120         362          0.0193
## 3 Boxer         Rocky           8        113         485          0.0165
## 4 Pug           Zoe             8        117         554          0.0144
## 5 Beagle        Lucy           19        337         542          0.0351
## 6 Retriever, L~ Scout          34        127        4867          0.00699
## # ... with 5 more variables: percent_overall <dbl>,
## #   overrepresented_ratio <dbl>, hypergeom_p_value <dbl>,
## #   holm_p_value <dbl>, fdr <dbl>
Looking at the table:
The first observation is Havanese dogs named Oscar: 8 of the 466 Havanese are named Oscar, out of 100 dogs named Oscar overall.
In the fifth observation, the name Lucy is overrepresented among Beagles: 19 of the 542 Beagles are named Lucy, a proportion of 0.0351 (about 3.5% of the breed).
In the sixth observation, 34 of the 4,867 dogs with the primary breed Retriever, Labrador are named Scout, a proportion of 0.00699 (about 0.7% of the breed).
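The percent_of_breed column can be checked by hand, since it appears to be n divided by breed_total. The short Python sketch below recomputes it from the rows of the table. It also guesses at two of the hidden columns: it assumes that percent_overall is name_total divided by the 35,181 licensed dogs, and that overrepresented_ratio is percent_of_breed divided by percent_overall. Those two definitions and the `TOTAL_DOGS` denominator are assumptions, not something the table confirms.

```python
# Rows copied from the table:
# (primary_breed, animal_name, n, name_total, breed_total, reported percent_of_breed)
rows = [
    ("Havanese", "Oscar", 8, 100, 466, 0.0172),
    ("Spaniel, Ame~", "Milo", 7, 120, 362, 0.0193),
    ("Boxer", "Rocky", 8, 113, 485, 0.0165),
    ("Pug", "Zoe", 8, 117, 554, 0.0144),
    ("Beagle", "Lucy", 19, 337, 542, 0.0351),
    ("Retriever, L~", "Scout", 34, 127, 4867, 0.00699),
]

# Assumption: the dog count quoted in the text is the denominator for percent_overall.
TOTAL_DOGS = 35181


def derived(n, name_total, breed_total, total=TOTAL_DOGS):
    """Return (percent_of_breed, percent_overall, overrepresented_ratio).

    Only percent_of_breed can be checked against the table; the other two
    follow assumed definitions.
    """
    percent_of_breed = n / breed_total
    percent_overall = name_total / total
    ratio = percent_of_breed / percent_overall
    return percent_of_breed, percent_overall, ratio


for breed, name, n, name_total, breed_total, reported in rows:
    of_breed, overall, ratio = derived(n, name_total, breed_total)
    print(f"{breed:14s} {name:6s} of_breed={of_breed:.5f} "
          f"(table: {reported}) overall={overall:.5f} ratio={ratio:.2f}")
```

Under these assumptions, the share of Beagles named Lucy comes out at roughly 3.7 times the share of all dogs named Lucy.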
These counts are small, so more statistical examination was needed: the raw counts for each primary breed do not carry enough information on their own to show which names are truly overrepresented.
If there were 100 million dogs, the overrepresented breeds would be obvious to detect.
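To see why a larger population would make overrepresentation easier to detect, here is a rough Python sketch using a normal-approximation z-score. This is a swapped-in illustration, not the test used for the table. It takes the Scout row (34 of 4,867) and assumes 35,181 total dogs as the denominator for the overall share of the name. It then scales the same proportions up to a hypothetical 100 million dogs.

```python
import math


def z_score(k, n, p0):
    """Normal-approximation z-score for k successes in n trials
    when the expected success rate is p0."""
    expected = n * p0
    spread = math.sqrt(n * p0 * (1 - p0))
    return (k - expected) / spread


TOTAL_DOGS = 35181          # assumption: dog count quoted in the text
p_scout = 127 / TOTAL_DOGS  # overall share of dogs named Scout

# Seattle-sized data: 34 of 4,867 dogs of this breed are named Scout.
z_seattle = z_score(34, 4867, p_scout)

# Hypothetical: same proportions, but 100 million dogs in total.
scale = 100_000_000 / TOTAL_DOGS
z_scaled = z_score(34 * scale, 4867 * scale, p_scout)

print(f"z with Seattle counts:   {z_seattle:.2f}")
print(f"z with 100 million dogs: {z_scaled:.2f}")
```

With the proportions held fixed, the z-score grows with the square root of the scale factor, so the same relative excess stands out much more clearly in a larger population.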
The graph does not look uniform because of the small number of breeds that were examined in Seattle.
The hypergeometric distribution is a discrete probability distribution that describes the probability of getting a given number of successes in a sample of fixed size drawn without replacement from a finite population. The hypergeometric test uses this distribution to identify which sub-populations are over- or under-represented in a sample. This test has a wide range of applications.
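As a minimal sketch of how such a p-value could be computed, the Python below sums the upper tail of the hypergeometric distribution for the Beagle and Lucy row. It assumes the population is the 35,181 licensed dogs and that the test is one-sided (the chance of seeing at least the observed count). The hypergeom_p_value column itself is not shown, so this may not match how it was actually calculated.

```python
from math import comb


def hypergeom_upper_tail(k, N, K, n):
    """P(X >= k) when n items are drawn without replacement from a
    population of N items, K of which count as successes."""
    upper = min(K, n)
    favorable = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return favorable / comb(N, n)


N = 35181  # assumption: all licensed dogs form the population
K = 337    # dogs named Lucy (name_total)
n = 542    # Beagles (breed_total)
k = 19     # Beagles named Lucy (n in the table)

p_value = hypergeom_upper_tail(k, N, K, n)
print(f"P(at least {k} Beagles named Lucy by chance) = {p_value:.3g}")
```

A very small result here would mean that 19 Beagles named Lucy is far more than chance alone would be expected to produce.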
The shape of the p-value scale shows the hypergeometric p-values for each pet name within each breed.
As the breed total goes up, the hypergeometric p-value gets closer to 1.
There are no zeros in this data set.
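The table also lists holm_p_value and fdr columns. Assuming these are Holm-adjusted p-values and Benjamini-Hochberg false discovery rate adjustments (the same adjustments that R's `p.adjust` offers), the Python sketch below shows how each one is calculated. The input p-values are made up for illustration and do not come from the pet data.

```python
def holm(p_values):
    """Holm step-down adjusted p-values, returned in the input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_max = 0.0
    for rank, idx in enumerate(order):
        # The smallest p-value is multiplied by m, the next by m - 1, and so on.
        candidate = min(1.0, (m - rank) * p_values[idx])
        running_max = max(running_max, candidate)  # keep the values monotone
        adjusted[idx] = running_max
    return adjusted


def benjamini_hochberg(p_values):
    """Benjamini-Hochberg adjusted p-values, returned in the input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    for rank in range(m - 1, -1, -1):  # walk from the largest p-value down
        idx = order[rank]
        candidate = min(1.0, p_values[idx] * m / (rank + 1))
        running_min = min(running_min, candidate)  # keep the values monotone
        adjusted[idx] = running_min
    return adjusted


example = [0.04, 0.001, 0.03, 0.01]  # hypothetical p-values
print("Holm:", holm(example))
print("BH:  ", benjamini_hochberg(example))
```

Holm controls the chance of any false positive and is the stricter of the two, while the Benjamini-Hochberg adjustment controls the expected share of false positives among the results that are flagged.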
Overall, Lucy is overrepresented among Beagles in Seattle: about 3.5% of Beagles are named Lucy (19 of 542), compared with 0.958% of all dogs (337 of 35,181).