For this week’s assignment, I chose to work with data provided on the Kaggle website, an online platform for data scientists, students and novices to data exploration etc. My topic of focus was Heart Disease and the data collected was from the Cleveland database. Unfortunately in America, heart disease is the number one killer of both men and women, it kills about 630,000 Americans each year, this data according to the CDC please find article here. The Kaggle data for Heart Disease can be found here, it is important to note that this initial data set contained 76 variables, however, this data set only published 14 of them. For this week’s assignment the following variables were used in my analysis.
Dependent variable: Chest pain type Independent variables: Gender & Age
My research question is as follows:
Does age and gender bear any effect on the type of chest pains an individual experiences?
HYPOTHESIS
I believe that age and gender will have a statistically significant effect on the type of chest pains that an individual experiences. I believe that men will experience greater chest pain types typically due to poor diet, higer cholesterol and lack lack of nutritional meals. I believe that the higher the person’s age, the more susceptible they are of experiencing different chest pain types due to old age, dietary habits and preventative care.
library(readr)
library(dplyr)
library(tidyr)
library(knitr)
library(ggplot2)
library(ggthemes)
Har<-read_csv("/Users/safiesaf/Downloads/heart.csv")
HeartData<-Har%>%
rename("Gender"=sex,
"Chest_Pain_Type"=cp,
"Age"=age)%>%
mutate(Gender=ifelse(Gender==0,"Female",
ifelse(Gender==1,"Male",NA)),
Chest_Pain_Type=ifelse(Chest_Pain_Type==1,"Typical Angina",
ifelse(Chest_Pain_Type==2,"Atypical Angina",
ifelse(Chest_Pain_Type==3,"Non-anginal",
ifelse(Chest_Pain_Type==4,"Asymptomatic",NA)))),
Age=ifelse(Age==997,NA,Age))%>%
filter(!is.na(Gender=="Male"|
Gender=="Female"),
!is.na(Chest_Pain_Type=="Typical Angina"|
Chest_Pain_Type=="Atypical Angina"|
Chest_Pain_Type=="Non-anginal"|
Chest_Pain_Type=="Asymptomatic"),
(Age!=997))%>%
select(Gender,
Chest_Pain_Type,
Age)
kable(HeartData)
| Gender | Chest_Pain_Type | Age |
|---|---|---|
| Male | Non-anginal | 63 |
| Male | Atypical Angina | 37 |
| Female | Typical Angina | 41 |
| Male | Typical Angina | 56 |
| Female | Typical Angina | 56 |
| Male | Typical Angina | 44 |
| Male | Atypical Angina | 52 |
| Male | Atypical Angina | 57 |
| Female | Atypical Angina | 48 |
| Male | Typical Angina | 49 |
| Male | Non-anginal | 64 |
| Female | Non-anginal | 58 |
| Female | Atypical Angina | 50 |
| Female | Atypical Angina | 58 |
| Female | Non-anginal | 66 |
| Female | Non-anginal | 69 |
| Male | Atypical Angina | 44 |
| Male | Atypical Angina | 61 |
| Male | Non-anginal | 40 |
| Female | Typical Angina | 71 |
| Male | Atypical Angina | 59 |
| Male | Atypical Angina | 51 |
| Female | Atypical Angina | 65 |
| Male | Atypical Angina | 53 |
| Female | Typical Angina | 41 |
| Male | Typical Angina | 44 |
| Male | Atypical Angina | 54 |
| Male | Non-anginal | 51 |
| Female | Atypical Angina | 46 |
| Female | Atypical Angina | 54 |
| Male | Atypical Angina | 54 |
| Female | Atypical Angina | 65 |
| Female | Atypical Angina | 65 |
| Female | Atypical Angina | 51 |
| Male | Typical Angina | 48 |
| Male | Atypical Angina | 39 |
| Male | Typical Angina | 52 |
| Male | Atypical Angina | 44 |
| Male | Atypical Angina | 47 |
| Female | Atypical Angina | 53 |
| Female | Atypical Angina | 51 |
| Male | Atypical Angina | 62 |
| Female | Atypical Angina | 44 |
| Female | Atypical Angina | 63 |
| Male | Typical Angina | 52 |
| Male | Non-anginal | 34 |
| Female | Atypical Angina | 71 |
| Male | Typical Angina | 54 |
| Male | Non-anginal | 52 |
| Male | Typical Angina | 41 |
| Male | Atypical Angina | 58 |
| Male | Atypical Angina | 51 |
| Female | Typical Angina | 45 |
| Male | Typical Angina | 44 |
| Male | Atypical Angina | 54 |
| Male | Atypical Angina | 51 |
| Male | Typical Angina | 29 |
| Female | Atypical Angina | 43 |
| Female | Typical Angina | 55 |
| Male | Atypical Angina | 51 |
| Male | Typical Angina | 59 |
| Male | Typical Angina | 52 |
| Male | Atypical Angina | 58 |
| Male | Atypical Angina | 41 |
| Male | Typical Angina | 45 |
| Female | Atypical Angina | 60 |
| Male | Non-anginal | 52 |
| Female | Atypical Angina | 67 |
| Male | Atypical Angina | 68 |
| Male | Typical Angina | 46 |
| Female | Atypical Angina | 54 |
| Male | Atypical Angina | 48 |
| Male | Atypical Angina | 52 |
| Female | Typical Angina | 54 |
| Female | Typical Angina | 45 |
| Male | Atypical Angina | 43 |
| Male | Atypical Angina | 53 |
| Male | Non-anginal | 42 |
| Male | Non-anginal | 59 |
| Female | Typical Angina | 63 |
| Male | Atypical Angina | 42 |
| Male | Atypical Angina | 50 |
| Female | Atypical Angina | 68 |
| Male | Non-anginal | 69 |
| Female | Typical Angina | 50 |
| Male | Atypical Angina | 57 |
| Female | Atypical Angina | 64 |
| Male | Typical Angina | 55 |
| Female | Atypical Angina | 37 |
| Male | Atypical Angina | 41 |
| Male | Non-anginal | 56 |
| Female | Typical Angina | 46 |
| Female | Atypical Angina | 41 |
| Female | Atypical Angina | 54 |
| Female | Atypical Angina | 39 |
| Female | Typical Angina | 34 |
| Female | Atypical Angina | 67 |
| Female | Atypical Angina | 52 |
| Female | Typical Angina | 74 |
| Female | Atypical Angina | 54 |
| Female | Typical Angina | 49 |
| Male | Typical Angina | 42 |
| Male | Typical Angina | 41 |
| Female | Typical Angina | 41 |
| Female | Atypical Angina | 60 |
| Male | Typical Angina | 62 |
| Female | Atypical Angina | 51 |
| Female | Atypical Angina | 42 |
| Female | Atypical Angina | 76 |
| Male | Typical Angina | 70 |
| Female | Atypical Angina | 44 |
| Female | Non-anginal | 60 |
| Male | Atypical Angina | 44 |
| Male | Atypical Angina | 42 |
| Male | Non-anginal | 64 |
| Female | Atypical Angina | 66 |
| Female | Atypical Angina | 39 |
| Male | Atypical Angina | 47 |
| Male | Typical Angina | 35 |
| Male | Typical Angina | 58 |
| Male | Typical Angina | 56 |
| Male | Typical Angina | 56 |
| Female | Typical Angina | 55 |
| Male | Typical Angina | 41 |
| Male | Atypical Angina | 38 |
| Male | Atypical Angina | 38 |
| Male | Atypical Angina | 56 |
| Male | Typical Angina | 48 |
| Male | Typical Angina | 58 |
| Male | Atypical Angina | 58 |
| Male | Atypical Angina | 64 |
| Male | Atypical Angina | 58 |
| Male | Atypical Angina | 50 |
| Male | Atypical Angina | 60 |
| Male | Atypical Angina | 46 |
| Male | Atypical Angina | 68 |
| Male | Atypical Angina | 49 |
| Male | Atypical Angina | 57 |
| Female | Atypical Angina | 62 |
| Male | Non-anginal | 65 |
| Male | Typical Angina | 62 |
| Male | Non-anginal | 59 |
| Male | Atypical Angina | 64 |
| Male | Atypical Angina | 47 |
| Male | Atypical Angina | 70 |
| Male | Typical Angina | 66 |
| Male | Typical Angina | 54 |
| Male | Atypical Angina | 69 |
| Male | Non-anginal | 59 |
| Male | Non-anginal | 38 |
| Male | Atypical Angina | 49 |
| Male | Non-anginal | 61 |
| Male | Typical Angina | 57 |
| Female | Typical Angina | 58 |
| Male | Atypical Angina | 59 |
| Male | Non-anginal | 59 |
| Male | Typical Angina | 57 |
| Male | Atypical Angina | 67 |
| Male | Non-anginal | 45 |
| Female | Typical Angina | 57 |
#unique(HeartData$Gender)
HeartData%>%
group_by(Chest_Pain_Type)%>%
summarize(MeanAge=mean(Age,na.rm=TRUE))%>%
kable()
| Chest_Pain_Type | MeanAge |
|---|---|
| Atypical Angina | 53.51724 |
| Non-anginal | 55.86957 |
| Typical Angina | 51.36000 |
Based on the data, it seems as though the majority of respondents experience atypical angina and are below age 50.
library(ggdark)
Chest<- ggplot(data=HeartData, aes(x=Age, y=Chest_Pain_Type, colour=Chest_Pain_Type)) +
geom_line() +
geom_point()+
ggtitle("Age of Respondents by Chest Pain Type")+theme(plot.title=element_text(family="Times", face="bold.italic", size=20))
Chest+dark_theme_grey()+ theme(legend.position="right")
g <- ggplot(HeartData, mapping = aes(x = Age,fill=Chest_Pain_Type))
g1 <- g + geom_histogram()+
ggtitle("Age of Respondents by Chest Pain Type")+theme(plot.title=element_text(family="Times", face="bold.italic", size=20))
g1+theme_fivethirtyeight()+ theme(legend.position="right")
g1
Chest2<-ggplot(HeartData, aes(Chest_Pain_Type, fill=Gender))
Chest2+geom_bar(position = "dodge")+theme_economist()+
ggtitle("Chest Pain Types by Gender")+theme(plot.title = element_text(family="Times", face= "bold.italic", size = 20))+theme(legend.position = "right")
In conclusion, my hypothesis was correct in thinking that male respondents experience more chest pain types than females. In terms of age, I was also right in my hypothesis that the older an individual is, the more susceptible they may be to experience chest pains. The data shows for age that the majority of respondents that experience atypical angina are over 50 years old and the same can be said for typical angina. In terms of Gender, males 50 and under experience all levels of chest pains more than their female counterparts. However, males typically over the age of 35 experience atypical angina more than females that are 35 and under.