Introduction
Nitrogen dioxide (NO2) emissions have widely been considered one of the major pollutants to our atmosphere. One of the major causes of high level gas emissions is attributed to human activity. The purpose of this analysis is to examine how much populations in the United States contribute to N02 emissions, since we are arguable one of the main reasons for such high green-house gas emissions in the United States.
library(sf)
library(tmap)
library(tigris)
library(spdep)
library(data.world)
library(dplyr)
library(tmaptools)
library(printr)
library(readr)
library(stargazer)
library(texreg)
library(ggplot2)
library(knitr)
Method
This analysis uses data from the U.S. Environmental Protection Agency to capture nitrogen dioxide emissions by state. In addition, population data from social explore to spatially map whether large populations have greater emissions of the greenhouse gas nitrogen dioxide.
library(sf)
us_map <- st_read('/Users/cruz/Desktop/tl_2017_us_state/tl_2017_us_state.shp', stringsAsFactors = FALSE)
Reading layer `tl_2017_us_state' from data source `/Users/cruz/Desktop/tl_2017_us_state/tl_2017_us_state.shp' using driver `ESRI Shapefile'
Simple feature collection with 56 features and 14 fields
geometry type: MULTIPOLYGON
dimension: XY
bbox: xmin: -179.2311 ymin: -14.60181 xmax: 179.8597 ymax: 71.43979
epsg (SRID): 4269
proj4string: +proj=longlat +datum=NAD83 +no_defs
us_map<- us_map%>%
mutate(STATEFP=parse_integer(STATEFP))
names(us_map)
[1] "REGION" "DIVISION" "STATEFP" "STATENS" "GEOID" "STUSPS"
[7] "NAME" "LSAD" "MTFCC" "FUNCSTAT" "ALAND" "AWATER"
[13] "INTPTLAT" "INTPTLON" "geometry"
library(readr)
uspop1_data <- read_csv("/Users/cruz/Desktop/uspop.csv")
names(uspop1_data)
[1] "FIPS" "Geo_NAME" "Geo_QNAME" "NATION" "STATE"
[6] "COUNTY" "totalpop" "popdensity"
usp_data2 <- usp_data %>%
rename(nitrogenm = `NO2 Mean`,
statecode = `State Code`,
countycode = `County Code`,
comean = `CO Mean`)%>%
mutate(statecode=parse_integer(statecode))%>%
filter(nitrogenm>0)%>%
select(nitrogenm, statecode, countycode, comean)%>%
group_by(statecode)%>%
summarize(nitrogenm=mean(nitrogenm))
usp_data2
# A tibble: 47 x 2
statecode nitrogenm
<int> <dbl>
1 1 9.410693
2 2 11.313152
3 4 19.099151
4 5 9.753701
5 6 13.790194
6 8 19.710465
7 9 9.073335
8 10 11.584773
9 11 17.689366
10 12 7.365660
# ... with 37 more rows
us_combo <- append_data(us_map, usp_data2, key.shp = "STATEFP", key.data = "statecode")
uspop2_data <- uspop1_data %>%
mutate(FIPS=parse_integer(FIPS))%>%
filter(totalpop>0)%>%
select(FIPS, Geo_NAME, Geo_QNAME, totalpop, popdensity)%>%
group_by(FIPS)%>%
summarize(totalpop=mean(totalpop))
uspop2_data
# A tibble: 51 x 2
FIPS totalpop
<int> <dbl>
1 1 4863300
2 2 741894
3 4 6931071
4 5 2988248
5 6 39250017
6 8 5540545
7 9 3576452
8 10 952065
9 11 681170
10 12 20612439
# ... with 41 more rows
uspop_combo <- append_data(us_map, uspop2_data, key.shp = "STATEFP", key.data = "FIPS")
uspop_sub_combo <- uspop_combo%>%
filter(!STATEFP == 02) %>%
filter(!STATEFP == 15) %>%
filter(!STATEFP == 60) %>%
filter(!STATEFP == 66) %>%
filter(!STATEFP == 69) %>%
filter(!STATEFP == 72) %>%
filter(!STATEFP == 79)
us_sub_combo <- us_combo%>%
filter(!STATEFP == 02) %>%
filter(!STATEFP == 15) %>%
filter(!STATEFP == 60) %>%
filter(!STATEFP == 66) %>%
filter(!STATEFP == 69) %>%
filter(!STATEFP == 72) %>%
filter(!STATEFP == 79)
us_ni_pop <- merge.data.frame(usp_data2, uspop2_data)
us_ni_pop <- us_ni_pop%>%
mutate(totalpopcat = ifelse(totalpop>0 & totalpop <=10000000, 1,
ifelse(totalpop > 10000000 & totalpop <=20000000, 2,
ifelse(totalpop > 20000000 & totalpop <=30000000, 3,
ifelse(totalpop > 30000000 & totalpop <=40000000, 4,
ifelse(totalpop > 40000000, 5,NA))))))
us_ni_pop%>%
group_by(totalpopcat)%>%
summarize(n=n())
lm1 <- lm(nitrogenm ~ totalpopcat, data = us_ni_pop)
Distribution of the N02 & Total Population in United States
The mean distribution of nitrogen dioxide into the atmosphere is from 8- 12.5 parts per billion in the United States. In addition, the distribution of total population in the United States displays that it falls between 10-20 million people.
ggplot(us_ni_pop, aes(x = nitrogenm)) + geom_histogram(color = "light coral", fill = "light green")

ggplot(us_ni_pop, aes(x = totalpopcat)) + geom_histogram(color = "light coral", fill = "light green")

Linear regression analysis
Surprisingly, the regression output displays that total population has a very small impact on nitrogen dioxide emissions in the Unites States and was not statistically significant. Going into this analysis I assumed that the impact of population size would have on the greenhouse gas nitrogen dioxide would be immense. However, both the spatial mapping and regression display otherwise. Of course, I must mention that there were certainly limitations. Many factors contribute to the emissions of nitrogen dioxide emission into the atmosphere, although populations may certainly have something to do with the emissions of this particular gas. Limitations occur when measuring exactly how much every person in the U.S contributes to air pollution because a spurious relationship occurs and it is difficult to control for everything. In the article examining air quality in California and population growth (Cramer 1998) researchers indicated that they had a hard time improving their model, even after accounting, even with all of the additional variables that they accounted for. Overall, it seems that getting exact results on how much nitrogen dioxide populations emit into the atmosphere is not as intuitive as it may look, rather air pollution has been mediated by regulatory efforts.
stargazer(lm1,digits = 17, type = "html")
|
|
Dependent variable:
|
|
|
|
nitrogenm
|
|
totalpopcat
|
0.00000000000000382
|
|
(0.15676250000000000)
|
|
|
Constant
|
11.10783000000000000***
|
|
(0.21951140000000000)
|
|
|
|
Observations
|
2,397
|
R2
|
0.00000000000000000
|
Adjusted R2
|
-0.00041753650000000
|
Residual Std. Error
|
4.76840100000000000 (df = 2395)
|
F Statistic
|
0.00000000000000000 (df = 1; 2395)
|
|
Note:
|
p<0.1; p<0.05; p<0.01
|
Nitrogen Dioxide map emissions of the United States Vs Population in The United States
The Environmental Protection agency data on NO2 emissions and population display quite inconsistent results. We see that some states with high population do have high emissions of N02, for example California has an extremely high population and relatively high NO2 emissions. However, this is not enough to argue that large population creates high N02 emissions.
tm_shape(us_sub_combo, projection = 2163) + tm_polygons("nitrogenm", palette = "BuGn") + tm_shape(us_map) + tm_borders(lwd = .36, col = "black", alpha = 1)

tm_shape(uspop_sub_combo, projection = 2163) + tm_polygons("totalpop", palette = "BuGn") + tm_shape(us_map) + tm_borders(lwd = .36, col = "black", alpha = 1)

Bibliography
(Cramer 1998) (Cramer 2002) (Dunlap and Jorgenson 2012)
Cramer, James C. 1998. “Population Growth and Air Quality in California.” Demography 35 (1). Springer: 45–56.
———. 2002. “Population Growth and Local Air Pollution: Methods, Models, and Results.” Population and Development Review 28. JSTOR: 22–52.
Dunlap, Riley E, and Andrew K Jorgenson. 2012. “Environmental Problems.” The Wiley-Blackwell Encyclopedia of Globalization. Wiley Online Library.
