- Yue Wu
- YinChia Huang
- Francesco Ignazio Re
Apprehensions at the US-Mexico border have declined to near-historic lows over the last few years. The objective of this report is to give a deeper insight on this change that has been occuring. Through the analysis of the data collected by the U.S. Customs and Border Protection through the years, we intend to shed light on the general trend of this phenomenon, focusing on how factors such as time and place have influenced the given outcome.
The gray dots are Sector "Tucson", which had the most unstable number of apprehensions and was the sector who contributed the most in the apprehensions of 2010.
The pink dots are Sector "Rio Grande Valley" which contributed the most in the apprehensions of 2017. Notably,apprehensions dramatically increased from Sep to Oct.
The most trafficated months in 2010, such as March, April and May are also the ones with the biggest decline in 2017.
The greatest change has occured in Tuscon, the area with the highest number of apprehensions in the 2010, that observed a drop of over the 80% according to the data collected in 2017.
From 2010 to 2017, the U.S. Customs and Border Protection saw an overall 36 percent decrease in apprehensions for illegal entry to the country. The plots show a significant different trend of monthly apprehension changes between the two years. Sectors also exhibit different patterns of changes from each other.
## ## Paired t-test ## ## data: as.numeric(ap10[y, ]) and as.numeric(ap17[y, ]) ## t = 6.2428, df = 11, p-value = 6.324e-05 ## alternative hypothesis: true difference in means is not equal to 0 ## 95 percent confidence interval: ## 9363.28 19560.89 ## sample estimates: ## mean of the differences ## 14462.08
## ## Paired t-test ## ## data: as.numeric(ap10[y, ]) and as.numeric(ap17[y, ]) ## t = -2.4601, df = 11, p-value = 0.03167 ## alternative hypothesis: true difference in means is not equal to 0 ## 95 percent confidence interval: ## -12283.0782 -682.9218 ## sample estimates: ## mean of the differences ## -6483
## ## Paired t-test ## ## data: as.numeric(xb10) and as.numeric(xb17) ## t = 8.5141, df = 2, p-value = 0.01352 ## alternative hypothesis: true difference in means is not equal to 0 ## 95 percent confidence interval: ## 20742.45 63125.55 ## sample estimates: ## mean of the differences ## 41934
## ## Paired t-test ## ## data: as.numeric(xa10) and as.numeric(xa17) ## t = -3.2966, df = 2, p-value = 0.081 ## alternative hypothesis: true difference in means is not equal to 0 ## 95 percent confidence interval: ## -29127.621 3856.288 ## sample estimates: ## mean of the differences ## -12635.67
The time series plot shows that from 2000, there has been continuous decline, from a high of over 1.6 million in 2000 to around 300,000 in 2017. Over these years, US policies on immigration control have been rapidly developed, leading to potential correlation with the change in apprehensions.
Box plot across months will give us a sense on seasonal effect. We can see that the number of apprehension in March has the biggest range and highest median.
In addition to the overall decline, seasonal fluctuation is another noticeable trend in the plots. A reasonable explanation is that harsh weather decreases attempts of illegal entry while keeping other factors controlled. The seasonal effects cause the changes among months, but plays little role in the rapid changes among years.
Month plot graph displays the time series plot for each month from 2000 to 2017. The shape of each month is very similar with different magnitudes.
Using ARIMA Model to predict future apprehensions
The significant decline of border apprehensions since 2000 mainly come from changes in specific sectors, such as Rio Grande Valley, and time of a year, such as March to May. More research on political and economical factors can be done to further explain the causes.
Using R for Time Series Analysis
Time series decomposition
Time Series Analysis in R
Introduction to Forecasting with ARIMA in R
Border apprehensions hit 17-year low in March