Access - Microsoft's database - tool of preference for
combining datasets into queries
ask for access data??
Primary data = outcome of interest is the vitals data (still
waiting for IRB approval)
sociodemographic data and home address
everyone who has died in Galveston County since 2010, about 30,000
observations
ICD codes up to 3 levels deep
Keys
- Age at death
- 65-year threshold (under and over)
- crude death rate
- five-year intervals/frames in dataset
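
A minimal sketch of how the keys above could be derived from a death record. The function names are hypothetical, and three choices are assumptions not stated in the notes: age 65 itself counts as "over", ages are capped in an open-ended top bracket at 85, and the five-year frames start in 2010.

```python
def over_under_65(age_at_death):
    """Flag a death as under or over the 65-year threshold.
    Assumption: age 65 itself is counted as '65 and over'."""
    return "65 and over" if age_at_death >= 65 else "under 65"


def five_year_age_bracket(age_at_death, top=85):
    """Place a whole-number age into a five-year bracket such as '65-69'.
    Assumption: ages at or above `top` share one open-ended bracket."""
    if age_at_death >= top:
        return f"{top}+"
    low = (age_at_death // 5) * 5
    return f"{low}-{low + 4}"


def five_year_frame(year_of_death, start=2010):
    """Place a year of death into a five-year frame such as '2010-2014'.
    Assumption: frames are counted from `start` (2010 here)."""
    low = start + ((year_of_death - start) // 5) * 5
    return f"{low}-{low + 4}"
```

For example, a death at age 67 in 2013 would fall in "65 and over", the "65-69" age bracket, and the "2010-2014" frame.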
Secondary data
- A lot of different data sources
- Has a couple of different datasets where he has aggregated secondary
and primary data (will need to wait to access)
- working on narrowing focus when looking at what variables to include
in the models we want to explore
- public datasets that we are using for the exploratory analysis:
- Social capital data
- Social Vulnerability Index (SVI) data
- link to SVI data
- census tracts for Galveston - CSV file has labels for variables; will
have to look at the data dictionary
- summary Excel file to figure out the big picture
- CDC PLACES data
- link to CDC PLACES data
- for multiple years - longitudinal analysis??
- aggregate data into five-year frames to match the death data
- NaNDA data
- link to NaNDA dataset
- Green space, polluting sites, land cover, environment data, social
services data
- In the files, "tract" is census tract and "zc" is zip code
- Census data - denominator for the study
- received file in email
- Use 2020 and possibly 2010 data
- looking for total population count (by census tract, zip code,
county) and by age bracket (5-year intervals)
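
A rough sketch of how the census counts could serve as the denominator for a crude death rate by tract. This is only an illustration: the function name and the tract IDs in the usage note are made up, and it assumes that person-years can be approximated as one census population count multiplied by the number of years in the frame.

```python
from collections import Counter


def crude_death_rates(death_tracts, population_by_tract, years=5, per=1000):
    """Crude death rate per `per` person-years for each tract.

    death_tracts: one tract ID per death record in the frame.
    population_by_tract: census population count for each tract.
    Assumption: person-years are approximated as population * years.
    Tracts with zero population are skipped to avoid dividing by zero.
    """
    deaths = Counter(death_tracts)
    rates = {}
    for tract, population in population_by_tract.items():
        if population > 0:
            person_years = population * years
            rates[tract] = deaths.get(tract, 0) / person_years * per
    return rates
```

With made-up inputs, `crude_death_rates(["tract_A", "tract_A", "tract_B"], {"tract_A": 1000, "tract_B": 500})` gives about 0.4 deaths per 1,000 person-years for both tracts. The same function would work for zip code or county totals, or within one age bracket, if the death list and the population counts are restricted to that group first.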