In this final project we will analyze the full data set from the GSS.
Please install the gssr
package and read the instruction
and example here.
In this project, you will be required to finish the following tasks:
Install the package gssr
, then read its mainpage to understand the
basic structure of the data set. Obtain basic information about the data
set from the GSS mainpage.
I am going to give a tutorial in class and share the videos with all of you.
The gssr
package main page has an tutorial of how to
obtain the meaning for each variable in R.
You can use the tool to search for useful variables for your analysis.
Full documentation about the data set is a place for reference. You want to scan through the “Introduction” and “Index to Data Set” to get an idea about what variables are included.
GSS data analysis are often quoted by media. One example is this article. You can get some hints from these studies and you can also pursue to explore some of the finding in new directions/depth. This article is written for a media report and you can choose this style or a more academic writting style.
Key trends section lists many interesting trends which should give you some hints about where to start.
To find questions in depth and of practical importance, you need to do some research to find a meaningful topic of your analysis report. You are required to cite at least one article (giving link to the article is good enough) in your Introduction part to describe what motivates your study.
Pick up at least three questions of your interest that can be
explored by the gss
data set. The questions should meet the
following requirements:
At least one question needs to explore data from the past 10 years (2012-2022).
At least one question should study a trend with respect to time (compare the same variable from different years since 1970s up to now).
At least one question needs to involve geospatial data analysis. Check this link for the definition of regions.
The questions should be related to each other in some way such that you can draw an overall conclusion in the Conclusion section.
You need to create powerpoint slides to present your results in the presentation session. Your presentation should cover the main contents of your report (Introduction, Questions and Findings, Conclusion) and should be between 10-15 minutes. You must be well prepared for the talk as an exercise to publicly presenting technical reports.
You are required to submit a pdf (text + plot) with code hidden
along with the rmd file that can be knitted to generate the pdf report.
You can put this chunk in a code cell at top of Rmd will hide code in
knitted doc: opts_chunk$set(echo=FALSE)
The final report should be at least 8 pages long and formatted including sections of Introduction, Questions and Findings, and Conclusion:
Rubrics:
Rubrics: