#SDGs
Main SDG under study: SDG 1 - End poverty in all its forms everywhere. Other SDGs that we tackle: 1. SDG 2 - End hunger. 2. SDG 4 - Quality education. 3. SDG 11 - Sustainable cities and communities.
The objective that we have set for this work is to analyze how poverty is measured throughout the world and how serious the problem of poverty is. For this, we have recurred to the data available in the world bank. We have used two main variables: the headcount ratio of population living below poverty lines set at national levels (SI.POV.NAHC), and the headcount ratio of population living below the international level of 1.90$ a day (SI.POV.DDAY). The main goal of this project is to analyze the main differences between both measurements and extract conclusions about the political decisions behind these thresholds.
In addition to these, we have also used other variables to analyze possible relationships between poverty and other problems. The main variables chosen for such purpose are: the prevalence of undernourishment (% of population), the lower secondary completion rate and the proportion of the population living in slums or inadequate housing.
We did not want to focus on any specific region of the world, as we understand that to extract better conclusions on this topic it is necessary to see the global picture. Consequently, the geographical framework we use in our project covers the whole world .
my_indicators <- c(
int_pov_line = "SI.POV.DDAY",
nat_pov_line = "SI.POV.NAHC",
undernourishment = "SN.ITK.DEFC.ZS",
lower_secondary = "SE.SEC.CMPT.LO.ZS",
slums = "EN.POP.SLUM.UR.ZS",
gdp_capita ="NY.GDP.PCAP.CD",
pop = "SP.POP.TOTL"
)
In this section, we downloaded the whole data set for all of our variables, with the whole range of time covered by the World Bank. Then, we defined our main indicators: the proportion of the population living under the national poverty line (povna), the proportion of the population living under the international poverty line (povint); and the difference between both percentages from each country (povmix).It is important to note that the variable povmix is calculated subtracting the percentage of the international poverty line to the percentage of the national poverty line. This decision responds to the fact that national poverty lines are usually set at higher thresholds.
Applying filters, we also eliminated all the missing data for our variables in our investigation. Moreover, for our indicators, we used coding to reduce our data set to the latest data available for each country, thus assuring that our indicators have information on the highest number of countries possible. In this sense, we have not decided a start date nor and end date nor something like that. Instead, we worked with the whole time period, only extracting the data from the latest year available for each country and for each variable.
To make the work more visual and the data easier to understand, we created a series of tables, maps and graphs, showing and analyzing different bits of data. We will explain them as they show up along our chunks.
data_set <- wbstats::wb_data(my_indicators,) %>%
left_join(wbstats::wb_countries() %>% select(-country), by="iso3c")
data_set %>%
group_by(country) %>%
drop_na(nat_pov_line) %>%
drop_na(int_pov_line) %>%
drop_na(undernourishment) %>%
drop_na(lower_secondary) %>%
drop_na(slums)
## # A tibble: 87 x 27
## # Groups: country [53]
## iso2c.x iso3c country date slums gdp_capita lower_secondary int_pov_line
## <chr> <chr> <chr> <dbl> <dbl> <dbl> <dbl> <dbl>
## 1 AR ARG Argentina 2016 14.7 12790. 89.2 1.1
## 2 AR ARG Argentina 2018 14.7 11795. 90.9 1.4
## 3 BF BFA Burkina Fa~ 2018 57.1 805. 43.0 33.7
## 4 BD BGD Bangladesh 2005 70.8 499. 54.2 25.1
## 5 BD BGD Bangladesh 2010 61.6 781. 55.5 19.2
## 6 BD BGD Bangladesh 2016 49.4 1402. 79.4 14.3
## 7 BY BLR Belarus 2016 45.2 5040. 107. 0
## 8 BY BLR Belarus 2018 33.2 6360. 97.8 0
## 9 CN CHN China 2010 29.1 4550. 99.1 11.2
## 10 CI CIV Cote d'Ivo~ 2018 60.1 2314. 49.4 9.2
## # ... with 77 more rows, and 19 more variables: nat_pov_line <dbl>,
## # undernourishment <dbl>, pop <dbl>, iso2c.y <chr>, capital_city <chr>,
## # longitude <dbl>, latitude <dbl>, region_iso3c <chr>, region_iso2c <chr>,
## # region <chr>, admin_region_iso3c <chr>, admin_region_iso2c <chr>,
## # admin_region <chr>, income_level_iso3c <chr>, income_level_iso2c <chr>,
## # income_level <chr>, lending_type_iso3c <chr>, lending_type_iso2c <chr>,
## # lending_type <chr>
povna = data_set %>%
group_by(country) %>%
drop_na(nat_pov_line) %>%
arrange(-date) %>%
slice(1)
povint = data_set %>%
group_by(country) %>%
drop_na(int_pov_line) %>%
arrange(-date) %>%
slice(1)
povmix = data_set %>%
group_by(country) %>%
drop_na(int_pov_line) %>%
drop_na(nat_pov_line) %>%
arrange(-date) %>%
slice(1) %>%
mutate(difference = nat_pov_line - int_pov_line)
#"Countries with the highest proportion of population living below the national poverty line. Arranged by region"
povna_reg = povna %>%
left_join(codelist %>% rename(GeoAreaCode=iso3n)) %>%
group_by(region) %>%
slice_max(nat_pov_line) %>%
select(region,iso3c,country, nat_pov_line, date) %>%
arrange(-nat_pov_line)
povna_reg %>%
ungroup() %>%
mutate(date=as.integer(date)) %>%
flextable::flextable() %>%
flextable::colformat_double(digits = 2)
region | iso3c | country | nat_pov_line | date |
Sub-Saharan Africa | SSD | South Sudan | 82.30 | 2,016 |
Latin America & Caribbean | GTM | Guatemala | 59.30 | 2,014 |
South Asia | AFG | Afghanistan | 54.50 | 2,016 |
Middle East & North Africa | YEM | Yemen, Rep. | 48.60 | 2,014 |
East Asia & Pacific | TLS | Timor-Leste | 41.80 | 2,014 |
Europe & Central Asia | ARM | Armenia | 27.00 | 2,020 |
#"Countries with the highest proportion of population living below the international poverty line. Arranged by region"
povint_reg = povint %>%
left_join(codelist %>% rename(GeoAreaCode=iso3n)) %>%
group_by(region) %>%
slice_max(int_pov_line) %>%
select(region,iso3c,country, int_pov_line, date) %>%
arrange(-int_pov_line)
povint_reg %>%
ungroup() %>%
mutate(date=as.integer(date)) %>%
flextable::flextable() %>%
flextable::colformat_double(digits = 2)
region | iso3c | country | int_pov_line | date |
Sub-Saharan Africa | MDG | Madagascar | 78.80 | 2,012 |
Europe & Central Asia | UZB | Uzbekistan | 61.60 | 2,003 |
East Asia & Pacific | PNG | Papua New Guinea | 38.00 | 2,009 |
Latin America & Caribbean | HTI | Haiti | 24.50 | 2,012 |
South Asia | IND | India | 22.50 | 2,011 |
Middle East & North Africa | YEM | Yemen, Rep. | 18.30 | 2,014 |
North America | USA | United States | 1.00 | 2,019 |
In these two tables we show the highest values of each region. We can see that there is little difference in the percentages from one poverty line to the other. What may be striking is the fact that only one country repeats: Yemen. For the rest of regions, differences appear as to which country has the highest value. Another important fact to note is that the two countries of North America (Canda and the US) have not established any kind of National Poverty Line.
#"Countries with the highest difference between the percentages of national poverty line and international poverty line. Arranged by region"
povmix_reg = povmix %>%
left_join(codelist %>% rename(GeoAreaCode=iso3n)) %>%
group_by(region) %>%
slice_max(difference) %>%
select(region,iso3c,country, difference, date) %>%
arrange(-difference)
povmix_reg %>%
ungroup() %>%
mutate(date=as.integer(date)) %>%
flextable::flextable() %>%
flextable::colformat_double(digits = 2)
region | iso3c | country | difference | date |
Latin America & Caribbean | GTM | Guatemala | 50.50 | 2,014 |
Europe & Central Asia | AZE | Azerbaijan | 47.00 | 2,001 |
Sub-Saharan Africa | GMB | Gambia, The | 38.30 | 2,015 |
Middle East & North Africa | YEM | Yemen, Rep. | 30.30 | 2,014 |
East Asia & Pacific | MNG | Mongolia | 27.90 | 2,018 |
South Asia | PAK | Pakistan | 18.30 | 2,018 |
This table shows the countries with the highest difference between the percentages of the National Poverty Line and the International Poverty Line. As we have noted above, national poverty lines are usually set at higher thresholds. This explains the huge differences that appear in this table, highlighting Guatemala, Azerbaijan, Gambia and Yemen.
#"Countries with the lowest proportion of population living below the national poverty line. Arranged by region"
povna_reg2 = povna %>%
left_join(codelist %>% rename(GeoAreaCode=iso3n)) %>%
group_by(region) %>%
slice_min(nat_pov_line) %>%
select(region,iso3c,country,nat_pov_line, date) %>%
arrange(nat_pov_line)
povna_reg2 %>%
ungroup() %>%
mutate(date=as.integer(date)) %>%
flextable::flextable() %>%
flextable::colformat_double(digits = 2)
region | iso3c | country | nat_pov_line | date |
East Asia & Pacific | CHN | China | 0.00 | 2,020 |
Europe & Central Asia | UKR | Ukraine | 1.10 | 2,019 |
South Asia | LKA | Sri Lanka | 4.10 | 2,016 |
Middle East & North Africa | MAR | Morocco | 4.80 | 2,013 |
Sub-Saharan Africa | MUS | Mauritius | 10.30 | 2,017 |
Latin America & Caribbean | CHL | Chile | 10.80 | 2,020 |
#"Countries with the lowest proportion of population living below the international poverty line. Arranged by region"
povint_reg2 = povint %>%
left_join(codelist %>% rename(GeoAreaCode=iso3n)) %>%
group_by(region) %>%
slice_min(int_pov_line) %>%
select(region,iso3c,country,int_pov_line, date) %>%
arrange(int_pov_line)
#"Countries with the lowest difference between national poverty line and international poverty line. Arranged by region"
povmix_reg2 = povmix %>%
left_join(codelist %>% rename(GeoAreaCode=iso3n)) %>%
group_by(region) %>%
slice_min(difference) %>%
select(region,iso3c,country,difference, date) %>%
arrange(difference)
povmix_reg2 %>%
ungroup() %>%
mutate(date=as.integer(date)) %>%
flextable::flextable() %>%
flextable::colformat_double(digits = 2)
region | iso3c | country | difference | date |
Sub-Saharan Africa | MWI | Malawi | -22.80 | 2,019 |
East Asia & Pacific | SLB | Solomon Islands | -12.00 | 2,012 |
South Asia | IND | India | -0.60 | 2,011 |
Europe & Central Asia | UKR | Ukraine | 1.10 | 2,019 |
Middle East & North Africa | MAR | Morocco | 3.90 | 2,013 |
Latin America & Caribbean | CHL | Chile | 10.10 | 2,020 |
For this series of data, we have decided to omit the table for the international poverty line. This is because it did not provide relevant data: it only showed countries with 0.0 values. The conclusions that we can extract here is that in effect the national poverty lines are way stricter than the international poverty line. Also, for the table showing the lowest difference in countries, we can see an interesting fact: negative values. This means that for these countries (Malawi, Solomon Islands and India), the national poverty lines are in fact set below the international poverty line, contrary to the common standard.
#"Countries with the highest proportion of population living below the national poverty line"
povna20L = povna %>%
group_by(region) %>%
ungroup() %>%
arrange(-nat_pov_line) %>%
select(region,country,nat_pov_line,date) %>%
slice(1:20)
povna20L %>%
ungroup() %>%
mutate(date=as.integer(date)) %>%
flextable::flextable() %>%
flextable::colformat_double(digits = 2)
region | country | nat_pov_line | date |
Sub-Saharan Africa | South Sudan | 82.30 | 2,016 |
Sub-Saharan Africa | Equatorial Guinea | 76.80 | 2,006 |
Sub-Saharan Africa | Madagascar | 70.70 | 2,012 |
Sub-Saharan Africa | Eritrea | 69.00 | 1,993 |
Sub-Saharan Africa | Burundi | 64.90 | 2,013 |
Sub-Saharan Africa | Congo, Dem. Rep. | 63.90 | 2,012 |
Sub-Saharan Africa | Central African Republic | 62.00 | 2,008 |
Latin America & Caribbean | Guatemala | 59.30 | 2,014 |
Sub-Saharan Africa | Eswatini | 58.90 | 2,016 |
Latin America & Caribbean | Haiti | 58.50 | 2,012 |
Sub-Saharan Africa | Sierra Leone | 56.80 | 2,018 |
Sub-Saharan Africa | South Africa | 55.50 | 2,014 |
Sub-Saharan Africa | Togo | 55.10 | 2,015 |
South Asia | Afghanistan | 54.50 | 2,016 |
Sub-Saharan Africa | Zambia | 54.40 | 2,015 |
Sub-Saharan Africa | Liberia | 50.90 | 2,016 |
Sub-Saharan Africa | Malawi | 50.70 | 2,019 |
Sub-Saharan Africa | Lesotho | 49.70 | 2,017 |
Sub-Saharan Africa | Gambia, The | 48.60 | 2,015 |
Middle East & North Africa | Yemen, Rep. | 48.60 | 2,014 |
#"Countries with the highest proportion of population living below the international poverty line"
povint20L = povint %>%
group_by(region) %>%
ungroup() %>%
arrange(-int_pov_line) %>%
select(region,country,int_pov_line,date) %>%
slice(1:20)
povint20L %>%
ungroup() %>%
mutate(date=as.integer(date)) %>%
flextable::flextable() %>%
flextable::colformat_double(digits = 2)
region | country | int_pov_line | date |
Sub-Saharan Africa | Madagascar | 78.80 | 2,012 |
Sub-Saharan Africa | Congo, Dem. Rep. | 77.20 | 2,012 |
Sub-Saharan Africa | South Sudan | 76.50 | 2,016 |
Sub-Saharan Africa | Malawi | 73.50 | 2,019 |
Sub-Saharan Africa | Burundi | 72.80 | 2,013 |
Sub-Saharan Africa | Somalia | 68.60 | 2,017 |
Sub-Saharan Africa | Central African Republic | 65.90 | 2,008 |
Sub-Saharan Africa | Mozambique | 63.70 | 2,014 |
Europe & Central Asia | Uzbekistan | 61.60 | 2,003 |
Sub-Saharan Africa | Zambia | 58.70 | 2,015 |
Sub-Saharan Africa | Rwanda | 56.50 | 2,016 |
Sub-Saharan Africa | Angola | 49.90 | 2,018 |
Europe & Central Asia | Turkmenistan | 49.80 | 1,998 |
Sub-Saharan Africa | Tanzania | 49.40 | 2,018 |
Sub-Saharan Africa | Liberia | 44.40 | 2,016 |
Sub-Saharan Africa | Sierra Leone | 43.00 | 2,018 |
Sub-Saharan Africa | Niger | 41.40 | 2,018 |
Sub-Saharan Africa | Uganda | 41.00 | 2,019 |
Sub-Saharan Africa | Congo, Rep. | 39.60 | 2,011 |
Sub-Saharan Africa | Zimbabwe | 39.50 | 2,019 |
In these two tables we can see the countries with the highest percentages for both variables. Although there are differences between the two tables, it is clear that Sub-Saharan Africa predominates in both of them.
#"Countries with the lowest proportion of population living below the national poverty line"
povna20W = povna %>%
group_by(region) %>%
ungroup() %>%
arrange(nat_pov_line) %>%
select(region,country,nat_pov_line,date) %>%
slice(1:15)
povna20W %>%
ungroup() %>%
mutate(date=as.integer(date)) %>%
flextable::flextable() %>%
flextable::colformat_double(digits = 2)
region | country | nat_pov_line | date |
East Asia & Pacific | China | 0.00 | 2,020 |
Europe & Central Asia | Ukraine | 1.10 | 2,019 |
South Asia | Sri Lanka | 4.10 | 2,016 |
Europe & Central Asia | Belarus | 4.80 | 2,020 |
Middle East & North Africa | Morocco | 4.80 | 2,013 |
Europe & Central Asia | Kazakhstan | 5.30 | 2,020 |
South Asia | Maldives | 5.40 | 2,019 |
Middle East & North Africa | Algeria | 5.50 | 2,011 |
Europe & Central Asia | Azerbaijan | 6.00 | 2,012 |
East Asia & Pacific | Vietnam | 6.70 | 2,018 |
East Asia & Pacific | Thailand | 6.80 | 2,020 |
South Asia | Bhutan | 8.20 | 2,017 |
East Asia & Pacific | Malaysia | 8.40 | 2,019 |
Europe & Central Asia | Iceland | 8.80 | 2,017 |
Europe & Central Asia | Czech Republic | 9.50 | 2,019 |
#"Countries with the lowest proportion of population living below the international poverty line"
povint20W = povint %>%
group_by(region) %>%
ungroup() %>%
arrange(int_pov_line) %>%
select(region,country,int_pov_line,date) %>%
slice(1:10)
povint20W %>%
ungroup() %>%
mutate(date=as.integer(date)) %>%
flextable::flextable() %>%
flextable::colformat_double(digits = 2)
region | country | int_pov_line | date |
Europe & Central Asia | Albania | 0.00 | 2,019 |
Europe & Central Asia | Azerbaijan | 0.00 | 2,005 |
Europe & Central Asia | Belarus | 0.00 | 2,020 |
Europe & Central Asia | Czech Republic | 0.00 | 2,019 |
Europe & Central Asia | Finland | 0.00 | 2,019 |
Europe & Central Asia | France | 0.00 | 2,018 |
Europe & Central Asia | Germany | 0.00 | 2,018 |
Europe & Central Asia | Iceland | 0.00 | 2,017 |
Europe & Central Asia | Ireland | 0.00 | 2,018 |
Europe & Central Asia | Kazakhstan | 0.00 | 2,018 |
We have created these two tables to see once again the huge difference between national and international poverty lines. Both of them show the lowest values. However, while for the international poverty line there are a lot of countries with values of 0.0, the national poverty line shows a very different tendency. China is the only country which presents a value of 0.0, and the top 15 (Czech Republic), goes as high as 9.5%.
library(tmap)
data(World)
map = World %>%
filter(iso_a3 != "ATA") %>%
left_join(povna %>% rename(iso_a3=iso3c) %>% select(iso_a3,country,nat_pov_line),
by="iso_a3")
map %>% ggplot(aes(fill = nat_pov_line)) +
geom_sf() +
scale_fill_viridis_c() +
theme(legend.position="bottom") +
labs(
title = "Map of National Poverty Line",
fill = NULL,
caption = paste("Source: World Bank")
) +
coord_sf()
This map is quite illustrative: it shows the distribution of poverty throughout the world according to the proportion of the population living below national poverty lines. We can see that the Global South presents worrysome results.
map = World %>%
filter(iso_a3 != "ATA") %>%
left_join(povint %>% rename(iso_a3=iso3c) %>% select(iso_a3,country,int_pov_line),
by="iso_a3")
map %>% ggplot(aes(fill = int_pov_line)) +
geom_sf() +
scale_fill_viridis_c() +
theme(legend.position="bottom") +
labs(
title = "Map of International Poverty Line",
fill = NULL,
caption = paste("Source: World Bank")
) +
coord_sf()
This map illustrates the distribution of poverty according to the proportion of the population living below the international poverty line. We can see that the picture is considerably better if we compare it to the previous map. However, the trend persists in Sub-Saharan Africa, where results remain as worse as with the national poverty lines.
map = World %>%
filter(iso_a3 != "ATA") %>%
left_join(povmix %>% rename(iso_a3=iso3c) %>% select(iso_a3,country,difference),
by="iso_a3")
map %>% ggplot(aes(fill = difference)) +
geom_sf() +
scale_fill_viridis_c() +
theme(legend.position="bottom") +
labs(
title = "Map of the Difference between National and International Poverty Line",
fill = NULL,
caption = paste("Source: World Bank")
) +
coord_sf()
This map illustrates the difference between both poverty lines along the world. We can see that the difference is higher in the Global South.
povint %>%
group_by(region) %>%
arrange(int_pov_line) %>%
slice(1) %>%
select(region,country,gdp_capita) %>% flextable::flextable() %>%
flextable::colformat_double(digits = 0)
region | country | gdp_capita |
East Asia & Pacific | Malaysia | 9,955 |
Europe & Central Asia | Albania | 5,396 |
Latin America & Caribbean | Uruguay | 15,438 |
Middle East & North Africa | Lebanon | 7,675 |
North America | Canada | 45,129 |
South Asia | Maldives | 10,562 |
Sub-Saharan Africa | Mauritius | 10,485 |
povint %>%
group_by(region) %>%
arrange(-int_pov_line) %>%
slice(1) %>%
select(region,country,gdp_capita) %>% flextable::flextable() %>%
flextable::colformat_double(digits = 0)
region | country | gdp_capita |
East Asia & Pacific | Papua New Guinea | 1,626 |
Europe & Central Asia | Uzbekistan | 396 |
Latin America & Caribbean | Haiti | 1,337 |
Middle East & North Africa | Yemen, Rep. | 1,674 |
North America | United States | 65,280 |
South Asia | India | 1,458 |
Sub-Saharan Africa | Madagascar | 518 |
These tables show the highest and lowest values of poverty (fixed to the poverty international line) for each region, next to its GDP figure. We can see that there is a strong trend relating GDP per capita and proportion of population living in poverty.
povna %>%
group_by(region) %>%
drop_na(pop) %>%
summarize(`Weighted average`=weighted.mean(nat_pov_line,pop,na.rm=TRUE),
`Simple average`=mean(nat_pov_line,na.rm=TRUE),
Countries=n(),
Missing=sum(is.na(nat_pov_line)),
`Pop (millions)`=sum(pop)/1e6
) %>%
arrange(`Weighted average`) %>% flextable::flextable() %>%
flextable::colformat_double(digits = 2)
region | Weighted average | Simple average | Countries | Missing | Pop (millions) |
East Asia & Pacific | 3.87 | 20.40 | 21 | 0 | 2,077.53 |
Europe & Central Asia | 15.10 | 16.39 | 48 | 0 | 910.23 |
South Asia | 22.62 | 20.69 | 8 | 0 | 1,705.37 |
Middle East & North Africa | 24.15 | 22.58 | 12 | 0 | 276.57 |
Latin America & Caribbean | 38.29 | 32.36 | 20 | 0 | 418.48 |
Sub-Saharan Africa | 41.02 | 43.83 | 46 | 0 | 1,004.22 |
povint %>%
group_by(region) %>%
drop_na(pop) %>%
summarize(`Weighted average`=weighted.mean(int_pov_line,pop,na.rm=TRUE),
`Simple average`=mean(int_pov_line,na.rm=TRUE),
Countries=n(),
Missing=sum(is.na(int_pov_line)),
`Pop (millions)`=sum(pop)/1e6
) %>%
arrange(`Weighted average`) %>% flextable::flextable() %>%
flextable::colformat_double(digits = 2)
region | Weighted average | Simple average | Countries | Missing | Pop (millions) |
East Asia & Pacific | 0.59 | 5.95 | 23 | 0 | 1,987.93 |
North America | 0.92 | 0.60 | 2 | 0 | 364.88 |
Europe & Central Asia | 2.36 | 2.88 | 49 | 0 | 907.29 |
Middle East & North Africa | 2.76 | 3.01 | 15 | 0 | 372.83 |
Latin America & Caribbean | 3.94 | 6.14 | 25 | 0 | 630.30 |
South Asia | 18.91 | 8.26 | 7 | 0 | 1,669.99 |
Sub-Saharan Africa | 39.72 | 33.94 | 46 | 0 | 1,023.92 |
We have created these tables to show the weighted and simple average of the different rates of poverty for each region in the world. We can see differences between both types of averages, the reason being the existence of bigger and smaller countries that distort the simple average. On the other hand, we can see that there exists a notable difference between the averages of the national poverty line (which are considerably higher) and the averages of the international poverty line.
povmix %>%
group_by(region) %>%
drop_na(pop) %>%
summarize(National=weighted.mean(nat_pov_line,pop,na.rm=TRUE),
International=weighted.mean(int_pov_line,pop,na.rm=TRUE),
Countries=n(),
`Pop (millions)`=sum(pop)/1e6) %>%
ggplot(aes(
x = International,
y = National
)) +
geom_point(aes(size=`Pop (millions)`, color=region)) +
geom_smooth(se = FALSE,method="lm") +
geom_abline(slope=1,intercept=0) +
scale_size_continuous(
labels = scales::number_format(scale = 1/1e6, suffix = "m"),
breaks = seq(1e8,1e9, 2e8),
range = c(1,20)
) +
theme_minimal() +
labs(
title = "Comparison between both poverty lines",
x = "International Povery Line",
y = "National Poverty Line",
size = "Population",
color = NULL,
caption = "Source: World Bank, through `wbstats`"
)
These graph illustrated the difference of averages that we have just mentioned. The more far from the black line, the greater the difference within that region. We can see that every region is placed above the black line, meaning that the averages of national poverty lines are always higher. The blue line represents the deviation of the total calculation.
Moreover, the great differences that we have found throughout this project can be seen very clearly. We see, for example, that the region of East Asia & Pacific is the one with the least proportion of population living in poverty and, at the same time, the one region presenting the least difference between averages. Something similar happens with the Sub-Saharan Africa region, although it is located in the opposite side of the poverty spectrum.
povmix %>%
ggplot(aes(
x = gdp_capita,
y = nat_pov_line
)) +
geom_point(aes(size=pop, color=region)) +
geom_smooth(se = FALSE) +
scale_x_log10(
labels = scales::dollar_format(),
breaks = scales::log_breaks(n = 4)
) +
scale_size_continuous(
labels = scales::number_format(scale = 1/1e6, suffix = "m"),
breaks = seq(1e8,1e9, 2e8),
range = c(1,20)
) +
theme_minimal() +
labs(
title = "National Poverty Line according to
GDP per capita",
x = "GDP per Capita",
y = "National Poverty Line",
size = "Population",
color = NULL,
caption = "Source: World Bank, through `wbstats`"
)
povmix %>%
ggplot(aes(
x = gdp_capita,
y = int_pov_line
)) +
geom_point(aes(size=pop, color=region)) +
geom_smooth(se = FALSE) +
scale_x_log10(
labels = scales::dollar_format(),
breaks = scales::log_breaks(n = 4)
) +
scale_size_continuous(
labels = scales::number_format(scale = 1/1e6, suffix = "m"),
breaks = seq(1e8,1e9, 2e8),
range = c(1,20)
) +
theme_minimal() +
labs(
title = "International Poverty Line according to
GDP per capita",
x = "GDP per Capita",
y = "Internatopnal Poverty Line",
size = "Population",
color = NULL,
caption = "Source: World Bank, through `wbstats`"
)
These two graphs are pretty illustrative: they show a clear negative correlation between GDP per capita and the proportion of population living in poverty. These graphs are the visual representation of the tables shown above, and they confirm that the higher the GDP per capita, the lower the poverty in the country.
The following chunks were created to arrange the remaining variables in the same way we arranged our main indicators. We use them to analyze the impact of poverty, set at both international and national poverty lines.
per_undernourishment = data_set %>%
group_by(country) %>%
drop_na(int_pov_line) %>%
drop_na(undernourishment) %>%
arrange(-date) %>%
slice(1)
completion_lower_sec = data_set %>%
group_by(country) %>%
drop_na(int_pov_line) %>%
drop_na(lower_secondary) %>%
arrange(-date) %>%
slice(1)
per_slums = data_set %>%
group_by(country) %>%
drop_na(int_pov_line) %>%
drop_na(slums) %>%
arrange(-date) %>%
slice(1)
per_undernourishment2 = data_set %>%
group_by(country) %>%
drop_na(nat_pov_line) %>%
drop_na(undernourishment) %>%
arrange(-date) %>%
slice(1)
completion_lower_sec2 = data_set %>%
group_by(country) %>%
drop_na(nat_pov_line) %>%
drop_na(lower_secondary) %>%
arrange(-date) %>%
slice(1)
per_slums2 = data_set %>%
group_by(country) %>%
drop_na(nat_pov_line) %>%
drop_na(slums) %>%
arrange(-date) %>%
slice(1)
per_undernourishment %>%
ggplot(aes(
x = int_pov_line,
y = undernourishment
)) +
geom_point(aes(size=pop, color=region)) +
geom_smooth(se = FALSE) +
scale_size_continuous(
labels = scales::number_format(scale = 1/1e6, suffix = "m"),
breaks = seq(1e8,1e9, 2e8),
range = c(1,20)
) +
theme_minimal() +
labs(
title = "Undernourishment according to
International Poverty Line",
x = "International Poverty Line",
y = "Undernourishment",
size = "Population",
color = NULL,
caption = "Source: World Bank, through `wbstats`"
)
completion_lower_sec %>%
ggplot(aes(
x = int_pov_line,
y = lower_secondary
)) +
geom_point(aes(size=pop, color=region)) +
geom_smooth(se = FALSE) +
scale_size_continuous(
labels = scales::number_format(scale = 1/1e6, suffix = "m"),
breaks = seq(1e8,1e9, 2e8),
range = c(1,20)
) +
theme_minimal() +
labs(
title = "Completion of lower secondary according to
International Poverty Line",
x = "International Poverty Line",
y = "Lower Secondary Completion Rate",
size = "Population",
color = NULL,
caption = "Source: World Bank, through `wbstats`"
)
per_slums %>%
ggplot(aes(
x = int_pov_line,
y = slums
)) +
geom_point(aes(size=pop, color=region)) +
geom_smooth(se = FALSE) +
scale_size_continuous(
labels = scales::number_format(scale = 1/1e6, suffix = "m"),
breaks = seq(1e8,1e9, 2e8),
range = c(1,20)
) +
theme_minimal() +
labs(
title = "People living in inadequate housing according to
International Poverty Line",
x = "International Poverty Line",
y = "People living in slums",
size = "Population",
color = NULL,
caption = "Source: World Bank, through `wbstats`"
)
These graphs aim at finding a correlation between the proportion of the population living below the international poverty line and (1) the prevalence of undernourishment, (2) the completion rate of lower secondary education; and (3) the proportion of the population living in inadequate housing. All three of the graphs show a strong correlation between the variables. Correlation is positive in the case of undernourishment and people living in slums, and negative in the case of the completion rate of lower secondary education.
per_undernourishment2 %>%
ggplot(aes(
x = nat_pov_line,
y = undernourishment
)) +
geom_point(aes(size=pop, color=region)) +
geom_smooth(se = FALSE) +
scale_size_continuous(
labels = scales::number_format(scale = 1/1e6, suffix = "m"),
breaks = seq(1e8,1e9, 2e8),
range = c(1,20)
) +
theme_minimal() +
labs(
title = "Undernourishment according to
National Poverty Line",
x = "National Poverty Line",
y = "Undernourishment",
size = "Population",
color = NULL,
caption = "Source: World Bank, through `wbstats`"
)
completion_lower_sec2 %>%
ggplot(aes(
x = nat_pov_line,
y = lower_secondary
)) +
geom_point(aes(size=pop, color=region)) +
geom_smooth(se = FALSE) +
scale_size_continuous(
labels = scales::number_format(scale = 1/1e6, suffix = "m"),
breaks = seq(1e8,1e9, 2e8),
range = c(1,20)
) +
theme_minimal() +
labs(
title = "Completion of lower secondary according to
National Poverty Line",
x = "National Poverty Line",
y = "Lower secondary completion rate",
size = "Population",
color = NULL,
caption = "Source: World Bank, through `wbstats`"
)
per_slums2 %>%
ggplot(aes(
x = nat_pov_line,
y = slums
)) +
geom_point(aes(size=pop, color=region)) +
geom_smooth(se = FALSE) +
scale_size_continuous(
labels = scales::number_format(scale = 1/1e6, suffix = "m"),
breaks = seq(1e8,1e9, 2e8),
range = c(1,20)
) +
theme_minimal() +
labs(
title = "People living in inadequate housing according to
National Poverty Line",
x = "National Poverty Line",
y = "People living in slums",
size = "Population",
color = NULL,
caption = "Source: World Bank, through `wbstats`"
)
Using these graphs, looking at the relation of the national poverty line and the other three variables, we can see that the trends found in the other set of graphs are intensified. In other words: if we compare the percentage of the population living under the national poverty line to the three variables presented, we can see that there exist even stronger correlations than for the case of the international poverty line.
This project has helped us to extract 4 main conclusions:
The proportion of population living in poverty follows the expected trends: it remains low in the Global North (developed and developing countries) while reaching maximums along the Global South (developing and under-developed countries). The worst results can be located in Sub-Saharan Africa.
There exists a strong correlation between poverty and GDP per capita. This explains the difference between developed and under-developed countries, since the ratio of population living in poverty diminishes as GDP increases.
There exists a huge difference between the standards of international and national poverty lines. We have seen this difference through our indicator POVMIX and the several tables and graphs that we crafted. The international poverty line sets a very lax and accommodating threshold of 1.90$ a day. Thus, it only tackles the most extreme of the extreme poverty. This explains the generalized low percentages of poverty throughout the world, only reaching important figures (more than 5%) in the most under-developed countries of the Global South. On the other hand, national poverty lines are usually way stricter, but they are generally neglected by global surveys and reports. This shows that there is a generalized interest in showing that poverty is in fact lower than we may think, but this is a trick, as the international poverty line is set extremely low on purpose. Thus, the national poverty line is a more reliable indicator of poverty.
Poverty is at the root of other major problems found in the Global South, like the prevalence of undernourishment, the low completion rates of middle and higher education and the proportion of the population living in inadequate housing. This means that poverty still constitutes a major structural problem that needs to be seriously considered and addressed in order for the international community to overcome the challenges set out in the 2030 Agenda. Bu tracking and covering the root causes of poverty, a solid basis can be laid down in order to build a rigid structure that holds other important Sustainable Development Goals, like ending hunger or promoting quality education.