This is an R Markdown document. Markdown is a simple formatting syntax for authoring HTML, PDF, and MS Word documents. For more details on using R Markdown see http://rmarkdown.rstudio.com.
When you click the Knit button a document will be generated that includes both content as well as the output of any embedded R code chunks within the document. You can embed an R code chunk like this:
## Q1 and Q7
AirPass <- read.csv(file = "https://raw.githubusercontent.com/vincentarelbundock/Rdatasets/master/csv/datasets/AirPassengers.csv", header=TRUE, sep=",")
summary(AirPass)
## X time AirPassengers
## Min. : 1.00 Min. :1949 Min. :104.0
## 1st Qu.: 36.75 1st Qu.:1952 1st Qu.:180.0
## Median : 72.50 Median :1955 Median :265.5
## Mean : 72.50 Mean :1955 Mean :280.3
## 3rd Qu.:108.25 3rd Qu.:1958 3rd Qu.:360.5
## Max. :144.00 Max. :1961 Max. :622.0
t<-summary(AirPass$time)
p<-summary(AirPass$AirPassengers)
mean_t <- mean(t,na.rm = TRUE)
mean_p <- mean(p,na.rm = TRUE)
median_t <- median(t, na.rm = TRUE)
median_p <- median(p, na.rm = TRUE)
print (paste("Mean time", mean_t, "Mean Passengers", mean_p,"Median time", median_t, "Median Passengers", median_p))
## [1] "Mean time 1955 Mean Passengers 302.05 Median time 1955 Median Passengers 272.9"
#### mean_time <- mean(AirPass$time)
#### mean_time
#### mean_AirPassengers <- mean(AirPass$AirPassengers)
#### mean_AirPassengers
## Q2
new_pass_set <- data.frame(head(AirPass, n=10))
## Q3
colnames(new_pass_set)<- c("seq", "duration", "passengerCount")
new_pass_set
## seq duration passengerCount
## 1 1 1949.000 112
## 2 2 1949.083 118
## 3 3 1949.167 132
## 4 4 1949.250 129
## 5 5 1949.333 121
## 6 6 1949.417 135
## 7 7 1949.500 148
## 8 8 1949.583 148
## 9 9 1949.667 136
## 10 10 1949.750 119
## Q4
t1<-summary(new_pass_set$duration)
p1<-summary(new_pass_set$passengerCount)
mean_t1 <- mean(t1,na.rm = TRUE)
mean_p1 <- mean(p1,na.rm = TRUE)
median_t1 <- median(t1, na.rm = TRUE)
median_p1 <- median(p1, na.rm = TRUE)
print (paste("Mean time", mean_t1, "Mean Passengers", mean_p1,"Median time", median_t1, "Median Passengers", median_p1))
## [1] "Mean time 1949.33333333333 Mean Passengers 129.266666666667 Median time 1949 Median Passengers 130.15"
## Q5
new_pass_set[which(new_pass_set[,2]>1949.2, arr.ind = TRUE), 2]<- 1111
new_pass_set
## seq duration passengerCount
## 1 1 1949.000 112
## 2 2 1949.083 118
## 3 3 1949.167 132
## 4 4 1111.000 129
## 5 5 1111.000 121
## 6 6 1111.000 135
## 7 7 1111.000 148
## 8 8 1111.000 148
## 9 9 1111.000 136
## 10 10 1111.000 119
You can also embed plots, for example:
Note that the echo = FALSE parameter was added to the code chunk to prevent printing of the R code that generated the plot.