Done as a part of class project, with data taken from a local source describing incoming and outgoing flights from Santa Monica Airport .
Impact of Columbus Day on flight frequency over Santa Monica Airport
The scope of the analysis is to see if there are any quantifiable differences before/after Columbus day (Monday, October 12th, 2015) - an oft cited milestone for recent changes in noise. These changes are thought to be related to the implementation of the NextGen program by the FAA.
Data Pre-processing:
Used the flight data from csv files for March 22 and November 15, 2015, and removed null values.
The mentioned dates have been chosen to observe the impact of Columbus day on flight frequency.
Private plane and no code flight data removed.
Some redundant values for latitude, longitude and altitude combinations have been removed .
The data is separated into day and night time observations, where day is 6:30AM to 12AM, and night is 12AM to 6:30AM.
library(ggplot2)
library(dplyr)
library(plotly)
#df1d_22<-read.csv("./RTL150322_day.csv",header = TRUE)
#df1d_15<-read.csv("./RTL151115_day.csv",header = TRUE)
#df1n_22<-read.csv("./RTL150322_night.csv",header = TRUE)
#df1n_15<-read.csv("./RTL151115_night.csv",header = TRUE)
SMO <- data.frame(label = 'SMO', lon=-118.456667, lat=34.010167)
smo <- c(SMO$lon, SMO$lat)
1. Overall Trend
From the below map plots we can see the different flights flowing in and out of Santa Monica Airport (SMO), during day and night time.
On the map: 22 March 15 Novemeber
Day time - 6:30AM to 12:00AM
(Mouse over the map to see latitude and longitude points. )

Night time - 12:00AM to 6:30AM

2 : Altitude of different flights over a point during day
Chosen Point : latitude = 34.03602, longitude= -118
It can be seen that on 15 Novemeber, 6 different flights were flying at different altitudes over the chosen point on map, represented as a fluctuating line graph moving from 4675 to 10375 as the highest altitude.
For the same point, on 22 March, a single flight was observed.

3 : Number of different flights during the day
15 different flights were seen between 19:00:00 -20:00:00 hours.

10 different flights were seen between 19:00:00 -20:00:00 hours.

4 : Average Altitude Descent for flight A57D52
Date : 22 March 2015, 06:30AM to 12AM
Between 08:37 to 08:45 the altitude of the flight descended from 10725 to 4050.
The observations have been grouped and averaged over every minute.

Date : 15 November 2015, 6:30AM to 12:00AM
The flight has two occurences on the same day.
Between 09:04 to 09:12 the altitude of the flight descended from 10801 to 4155, which is nearly similar to 22 March descent.
Between 23:29 to 23:35 the altitude of the flight descended from 10685 to 4800, whereas, there was no occurrence of this flight on 22 March around this time.
The observations have been grouped and averaged over every minute.

Conclusion:
From the overall trends, it can be observed that, more number of flights fly during the day time as compared to night time before and after the Columbus day.
The flight altitude is observed to be higher over the point with latitude = 34.08 and longitude = 118 , after Columbus Day.
10 different flights were seen between 19:00:00 -20:00:00 hours, which is less than the number of flights at the same time on Novemeber 15, a probable impact of Columbus day.
Flight A57D52 was observed to fly once on 22March, whereas it appeared twice in the 24 hour schedule on November 15. Surprisingly, both the chosen days are Sundays. This is a clear indicator of the change of flight pattern after Columbus Day on domestic flights from Santa Monica Airport.
Site created and maintained by Ankita Aggarwal| USC | ankitaa@ usc.edu
---
title: "Data Wrangling with R"
output: html_notebook
---
<br>
Done as a part of class project, with data taken from a local source describing <strong> incoming and outgoing flights from Santa Monica Airport </strong>.

## Impact of Columbus Day on flight frequency over Santa Monica Airport

The scope of the analysis is to see if there are any quantifiable differences before/after Columbus day (Monday, October 12th, 2015) - an oft cited milestone for recent changes in noise. These changes are thought to be related to the implementation of the NextGen program by the FAA.

<h4><strong>Data Pre-processing:</strong></h4>
<li>Used the flight data from csv files for March 22 and November 15, 2015, and removed null values. </li>
<li> The mentioned dates have been chosen to observe the impact of Columbus day on flight frequency. </li>
<li> Private plane and no code flight data removed.</li>
<li> Some redundant values for latitude, longitude and altitude combinations have been removed .</li>
<li> The data is separated into day and night time observations, where day is 6:30AM to 12AM, and night is 12AM to 6:30AM. </li>

```{r}
library(ggplot2)
library(dplyr)
library(plotly)
df1d_22<-read.csv("./RTL150322_day.csv",header = TRUE)
df1d_15<-read.csv("./RTL151115_day.csv",header = TRUE)
df1n_22<-read.csv("./RTL150322_night.csv",header = TRUE)
df1n_15<-read.csv("./RTL151115_night.csv",header = TRUE)
SMO <- data.frame(label = 'SMO', lon=-118.456667, lat=34.010167)
smo <- c(SMO$lon, SMO$lat)

```

## 1. Overall Trend

From the below map plots we can see the different flights flowing in and out of Santa Monica Airport (SMO), during day and night time.

On the map: <span style="color:blue">22 March </span>
<span style="color:green">15 Novemeber</span>

<center>
<h4> Day time - 6:30AM to 12:00AM </h4>
(**Mouse over the map to see latitude and longitude points. **)

```{r warning=FALSE,message=FALSE,error=FALSE, echo=FALSE}

map.google <- get_map(smo, zoom = 12, scale=1)  # get map around SMO
ggmap(map.google) +
  geom_point(data = SMO, aes(x = lon, y = lat), color="red", size=10, alpha=.5) +
  
  geom_point(data = df1d_22, 
             size=1, alpha=.5,colour = "blue",
             aes(x = lon, y = lat,colour = "blue", text = paste("Airline:", flight, "<br>code:", code, "<br>timestamp:", timestamp, "<br>alt:", alt))) +
  
  geom_point(data = df1d_15, 
             size=1, alpha=.05,colour = "green",
             aes(x = lon, y = lat,colour = "green", text = paste("Airline:", flight, "<br>code:", code, "<br>timestamp:", timestamp, "<br>alt:", alt))) +
  theme(legend.box.background = element_rect(), legend.position = "left", legend.key = element_rect(fill = "white", colour = "black"))+
  ggtitle("Overall Trend - day flights on 03/15 and 11/22")

ggplotly()
```

<h4> Night time - 12:00AM to 6:30AM </h4>

```{r echo=FALSE,warning=FALSE,message=FALSE,error=FALSE}
map.google <- get_map(smo, zoom = 12, scale = 1) 
ggmap(map.google) +
  geom_point(data = SMO, aes(x = lon, y = lat), color="red", size=10, alpha=.5) +
  
  geom_point(data = df1n_22, 
             size=1, alpha=.5,colour = "blue",
             aes(x = lon, y = lat,colour = "blue", text = paste("Airline:", flight, "<br>code:", code, "<br>timestamp:", timestamp, "<br>alt:", alt))) +
  
  geom_point(data = df1n_15, 
             size=1, alpha=.08,colour = "green",
             aes(x = lon, y = lat, text = paste("Airline:", flight, "<br>code:", code, "<br>timestamp:", timestamp, "<br>alt:", alt))) +
  theme(legend.box.background = element_rect(), legend.position = "left", legend.key = element_rect(fill = "white", colour = "black"))+
  ggtitle("Overall trend - night flights on 03/15 and 11/22")

ggplotly()
```
</center>
## 2 : Altitude of different flights over a point during day
<br>
<li><strong> Chosen Point : latitude = 34.03602, longitude= -118 </strong></li>

<li> It can be seen that on <span style="color:red">15 Novemeber</span>, <i><strong>6 different flights were flying at different altitudes over the chosen point on map,</strong></i> represented as a fluctuating line graph moving from 4675 to 10375 as the highest altitude. </li> 

<li> For the same point, on <span style="color:blue">22 March</span>, a single flight was observed. </li>


```{r echo= FALSE,warning=FALSE,message=FALSE,error=FALSE}


dfd_n1 <- df1d_15 %>% filter(lat== 34.03602, lon<= -118)
dfd_n2 <- df1d_22 %>% filter(lat== 34.03602)


alt1<- dfd_n1[8]
alt2<- dfd_n2[8]

ggplot(dfd_n1, aes(timestamp)) +
 geom_line(aes(y = alt1, colour = "alt1")) +
 geom_line(aes(y = alt2, colour = "alt2")) +
labs(title="Altitude of different flights during day time over latitude = 34.03" , subtitle ="alt1 = 15-Nov, alt2 = 22-Mar")


```
## 3 : Number of different flights during the day

 **15** *different flights were seen between 19:00:00 -20:00:00 hours.*

<center>
```{r echo=FALSE}
library(readr)
library(dplyr)
library(lubridate)


df1d_aaa<- df1d_15 %>% 
  mutate(hour_of_day = hour(as.POSIXct(strptime(timestamp, "%Y-%m-%d %H:%M:%S")))) %>% 
  group_by(hour_of_day) %>% 
  summarise(countFlights = n_distinct(code))

ggplot(df1d_aaa, aes(hour_of_day, countFlights, colour ="blue")) + geom_col(fill = "blue") + labs(title="Number of Flights on 15-Nov between 6AM to 12AM")


```
</center>

**10** *different flights were seen between 19:00:00 -20:00:00 hours.*
<center>
```{r echo=FALSE}
df1d_2a<- df1d_22 %>% 
  mutate(hour_of_day = hour(as.POSIXct(strptime(timestamp, "%Y-%m-%d %H:%M:%S")))) %>% 
  group_by(hour_of_day) %>% 
  summarise(countFlights = n_distinct(code))

ggplot(df1d_2a, aes(hour_of_day, countFlights, colour ="green")) + geom_col(fill = "green") + labs(title="Number of Flights on 22-March between 6AM to 12AM")
```
</center>

## 4 : Average Altitude Descent for flight A57D52 

<strong> Date : 22 March 2015, 06:30AM to 12AM </strong>

<li> Between 08:37 to 08:45 the altitude of the flight descended from 10725 to 4050. </li>
<li> The observations have been grouped and averaged over every minute. </li>

<center>
```{r echo=FALSE}
dfx<- df1d_22 %>% filter(code=="A57D52")
df1d_2a<- dfx %>% mutate(interval = floor_date(timestamp, unit="hour") +minutes(floor(minute(timestamp)/1)*1)) %>% 
  group_by(interval) %>% 
  summarise(avgAlt = mean(alt))

library(ggthemes)
p<- ggplot(df1d_2a, aes(interval, avgAlt)) +
  geom_point(colour ="orange")
p + theme_bw() + ggtitle("Average altitude descent grouped by minute", subtitle=" flight A57D52 on 22 March between 8:37AM to 8:45AM")

```
</center>

<strong> Date : 15 November 2015, 6:30AM to 12:00AM </strong>
<li> The flight has two occurences on the same day. </li>
<li>Between 09:04 to 09:12 the altitude of the flight descended from 10801 to 4155, which is nearly similar to 22 March descent.  </li>
<li>Between 23:29 to 23:35 the altitude of the flight descended from 10685 to 4800, whereas, there was no occurrence of this flight on 22 March around this time.  </li>
<li> The observations have been grouped and averaged over every minute. </li>

<center>
```{r echo=FALSE}
dfx<- df1d_15 %>% filter(code=="A57D52")
df1d_2a<- dfx %>% mutate(interval = floor_date(timestamp, unit="hour") +minutes(floor(minute(timestamp)/1)*1)) %>% 
  group_by(interval) %>% 
  summarise(avgAlt = mean(alt))

library(ggthemes)
p<- ggplot(df1d_2a, aes(interval, avgAlt)) +
  geom_point(colour ="orange")
p + theme_bw() + ggtitle("Average altitude descent grouped by minute ", subtitle="flight <strong>A57D52</strong> on 15 Novemeber between 9:04 AM to 9:12AM and 23:29PM to 23:35PM")
```

</center>

## Conclusion:

1. From the overall trends, it can be observed that, <strong>more number of flights fly during the day time as compared to night time before and after the Columbus day. </strong>

2. The flight <strong>altitude </strong>is observed to be <strong>higher over the point </strong>with latitude = 34.08 and longitude = 118 , after Columbus Day.

3. <strong>10 different flights</strong> were seen between 19:00:00 -20:00:00 hours, which is <strong>less than the number of flights at the same time on Novemeber 15</strong>, a probable impact of Columbus day.

4. Flight <strong>A57D52</strong> was observed to fly once on 22March, whereas it <strong>appeared twice in the 24 hour schedule on November 15</strong>. Surprisingly, both the chosen days are <strong>Sundays</strong>. This is a clear indicator of the change of flight pattern after Columbus Day on domestic flights from Santa Monica Airport.
<br>
<br>


            
<span style="font-size:70%"><center>Site created and maintained by Ankita Aggarwal| USC | ankitaa@ usc.edu </center></span>
<br>
<br>




