setwd("/Users/diegodearmas/Downloads")
install.packages("ipumsr", repos = "https://cloud.r-project.org")
##
## The downloaded binary packages are in
## /var/folders/5_/389qrkvs1sd7nkp792bslx5r0000gn/T//Rtmpw3LESC/downloaded_packages
library(ipumsr)
install.packages("dplyr", repos = "https://cloud.r-project.org")
##
## The downloaded binary packages are in
## /var/folders/5_/389qrkvs1sd7nkp792bslx5r0000gn/T//Rtmpw3LESC/downloaded_packages
library(dplyr)
##
## Attaching package: 'dplyr'
## The following objects are masked from 'package:stats':
##
## filter, lag
## The following objects are masked from 'package:base':
##
## intersect, setdiff, setequal, union
install.packages("psych", repos = "https://cloud.r-project.org")
##
## The downloaded binary packages are in
## /var/folders/5_/389qrkvs1sd7nkp792bslx5r0000gn/T//Rtmpw3LESC/downloaded_packages
library(psych)
install.packages("ggplot2", repos = "https://cloud.r-project.org")
##
## The downloaded binary packages are in
## /var/folders/5_/389qrkvs1sd7nkp792bslx5r0000gn/T//Rtmpw3LESC/downloaded_packages
library(ggplot2)
##
## Attaching package: 'ggplot2'
## The following objects are masked from 'package:psych':
##
## %+%, alpha
1) What is the population of interest in the Current Population
Survey (CPS)? Be precise – are there any age criteria, occupation
criteria, and/or geographic criteria?
The population of interest is the U.S. civilian noninstitutional
population aged 16 and older. It excludes people living in institutions
and active-duty members of the Armed Forces. There is no upper age limit
and no occupation criterion; geographically, the survey covers the 50
states and the District of Columbia.
2) What is the sample used to estimate the population
parameters/headline statistics like the unemployment rate and the
employment-to-population ratio? Be precise - how many households and/or
people are included?
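The Current Population Survey (CPS) is a monthly sample survey of about
60,000 eligible households conducted by the U.S. Census Bureau. A
distinctive aspect of the methodology is its rotating panel design: a
household is interviewed for four consecutive months, leaves the sample
for eight months, and then returns for four more months.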
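The rotating panel can be sketched in a few lines of R. The CPS follows a 4-8-4 pattern: four months in the sample, eight months out, then four more months in. The function name `in_sample` and the month indexing (month 1 is the first interview) are illustrative assumptions, not part of any CPS or IPUMS tooling.

```r
# Minimal sketch of the CPS 4-8-4 rotation pattern.
# `months_since_entry` starts at 1 for a household's first interview.
# The function name and indexing are hypothetical, chosen for illustration.
in_sample <- function(months_since_entry) {
  m <- months_since_entry
  (m >= 1 & m <= 4) | (m >= 13 & m <= 16)
}

# Interview schedule for one household over its 16-month span.
schedule <- data.frame(month = 1:16, interviewed = in_sample(1:16))
print(schedule)

# Total number of interviews per household: 4 + 4 = 8.
sum(schedule$interviewed)
```

Because a household's interview months overlap with those of other rotation groups, a large share of the sample is common to consecutive months, which helps when estimating month-to-month change.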
3) Do you think the CPS is a representative sample of the entire US
population after reading about its methodology or doing your own online
research?
The CPS is representative of its target population because it draws a
multistage probability sample of households in the United States. The
sample size is also set to meet reliability requirements for estimating
the unemployment rate at the national and state levels. It is not
designed to represent people outside the target population, such as
those under 16, people living in institutions, or active-duty military.
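To make the idea of a multistage probability sample concrete, here is a minimal two-stage sketch in base R on a synthetic frame. Every name and number in it (20 areas, 50 households per area, the sample sizes) is made up for illustration; the actual CPS design is considerably more elaborate and is not reproduced here.

```r
# Minimal sketch of a two-stage probability sample on synthetic data.
# All names and sizes are hypothetical; this is not the CPS design itself.
set.seed(42)

# Synthetic frame: 20 geographic areas, 50 households in each.
frame <- data.frame(
  psu = rep(1:20, each = 50),
  household = 1:1000
)

# Stage 1: randomly select 5 of the 20 areas.
selected_psus <- sample(unique(frame$psu), size = 5)

# Stage 2: randomly select 10 households within each selected area.
stage2 <- lapply(selected_psus, function(p) {
  hh <- frame$household[frame$psu == p]
  data.frame(psu = p, household = sample(hh, size = 10))
})
sample_df <- do.call(rbind, stage2)

# Each household has inclusion probability (5/20) * (10/50) = 0.05,
# so each sampled household stands in for 1 / 0.05 = 20 households.
sample_df$weight <- 1 / ((5 / 20) * (10 / 50))

nrow(sample_df)        # number of sampled households
sum(sample_df$weight)  # weights add up to the size of the synthetic frame
```

Because every household has a known, nonzero chance of selection, the weights let the 50 sampled households represent all 1,000 in the frame. That known-probability property is what supports the claim of representativeness.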
4)
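# Paths to the IPUMS CPS extract: DDI codebook (.xml) and data file (.dat)
cps_ddi_file <- "cps_00001.xml"
cps_data_file <- "cps_00001.dat"
# Read the codebook once, then reuse it to parse the microdata
cps_ddi <- read_ipums_ddi(cps_ddi_file)
cps_data <- read_ipums_micro(cps_ddi, data_file = cps_data_file)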
## Use of data from IPUMS CPS is subject to conditions including that users should
## cite the data appropriately. Use command `ipums_conditions()` for more details.
Yes, the graph makes sense: we can observe that households that
reported employment have higher incomes than households that did not
report employment.
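The same comparison can be checked numerically as well as visually. The sketch below uses a small synthetic data frame, because the extract's variable names are not shown here. The column names `employed` and `hh_income` are hypothetical stand-ins, and the values are invented purely to show the mechanics.

```r
library(dplyr)

# Synthetic stand-in for the CPS extract; values are invented for illustration.
toy <- data.frame(
  employed  = c(TRUE, TRUE, TRUE, FALSE, FALSE, FALSE),
  hh_income = c(52000, 61000, 70000, 18000, 25000, 32000)
)

# Mean and median household income by employment status.
income_summary <- toy %>%
  group_by(employed) %>%
  summarise(
    mean_income   = mean(hh_income),
    median_income = median(hh_income),
    n             = n()
  )

print(income_summary)
```

With the real extract, the same `group_by()` and `summarise()` pattern would apply once the actual employment and income variables are substituted in.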