The Correlation between Poverty and Various Internet Infrastructure Indicators
Team 2, School of Professional Studies, CUNY
5/24/2019
Question 1: What economic indicators (race, occupation, community poverty rate) are most strongly correlated with internet access rates? Can we build a model that accurately predicts said rates?
Question 2: Are internet access rates a stronger predictor of poverty rates than other forms of social investment (ie roads, schools, hospitals)?
Question 3: Do these effects extend across internet technologies (cell phone sand broadband internet)? If not, which type of infrastructure investment is better.
Team Members: Charlie, Olga, Julian, Hantz, and Mike
Motivation, Literature Review, Methodology, and Hypothesis
Correlation between Various Technology Indicators and Poverty Rates
Data Initialization, Plots, and Models (SVM, Neural Networks, and GLM)
Finance Analysis
Conclusion and Next Steps
| Model | RMSE.train | RSquared.train | RMSE.test | RSquared.test |
|---|---|---|---|---|
| Support Vector Machine | 2.8493 | 0.8711891 | 1.5309 | 0.5936714 |
| Model | RMSE.train | RSquared.train | RMSE.test |
|---|---|---|---|
| Random Forest | 1.876554 | 0.9704061 | 1.655401 |
| Model | RMSE.train | RSquared.train | RMSE.test |
|---|---|---|---|
| Neural Network | 5.997393 | 0.2715674 | 3.939332 |
| Model | RMSE.train | RSquared.train | RMSE.test | RSquared.test |
|---|---|---|---|---|
| GLM | 0.722634 | 0.9854818 | 2.196015 | 0.8323232 |
| Model | RMSE.train | RSquared.train | RMSE.test | RSquared.test |
|---|---|---|---|---|
| GLM, excl. racial ind. | 0.8127761 | 0.9816339 | 2.903837 | 0.6964135 |
| Model | RMSE.train | RSquared.train | RMSE.test | RSquared.test |
|---|---|---|---|---|
| Random Forest | 1.083638 | 0.9656586 | 2.220804 | 0.8002246 |
| Model | RMSE.train | RSquared.train | RMSE.test | RSquared.test |
|---|---|---|---|---|
| GLM | 1.957209 | 0.6317318 | 2.005328 | 0.7493103 |