Environment_Subset_Data <- "Environment_Subset.RData"
load(Environment_Subset_Data)
colnames(Environment_Subset) <- c("Environment_Related_Question",
"Year_Of_survey",
"Country_or_Region",
"Age",
"Highest_educational_level")
# Subset to the US (840), India (356), and China (156),
# keeping only answered responses (codes > 0) to the environment question
Environment_Subset_US_India_China <- subset(
  Environment_Subset,
  Country_or_Region %in% c(356, 156, 840) & Environment_Related_Question > 0
)
You should phrase your research question in a way that matches up with the scope of inference your dataset allows for.
How do people in three countries, China, the US, and India (the world’s biggest polluters**), think about the environment?
If time permits, other variables such as age and level of education will also be explored.
What are the cases, and how many are there?
Each case is one survey respondent. After filtering, there are 3791 cases in total.
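The case count can be checked directly from the filtered data frame. The sketch below is only an illustration: it uses a small made-up data frame (`toy`, with invented values) in place of the real `Environment_Subset`, since the RData file is not reproduced here. The column names follow the ones assigned in the loading code, and the filter is equivalent to the one used there.

```r
# Toy stand-in for Environment_Subset; all values are invented for illustration.
toy <- data.frame(
  Environment_Related_Question = c(1, 2, -1, 3, 1, 4, 2),
  Country_or_Region            = c(156, 356, 840, 840, 36, 156, 356)
)

# Equivalent filter to the report's: three countries, answered responses only.
toy_subset <- subset(
  toy,
  Country_or_Region %in% c(156, 356, 840) & Environment_Related_Question > 0
)

# Total number of cases and the number of cases per country code.
n_cases <- nrow(toy_subset)
cases_by_country <- table(toy_subset$Country_or_Region)
print(n_cases)
print(cases_by_country)
```

On the real data, `nrow(Environment_Subset_US_India_China)` would give the total reported above, and the `table()` call would give the sample size for each country.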
Describe the method of data collection.
According to the “Collection Procedures” section at http://www.worldvaluessurvey.org/WVSContents.jsp:
“The mode of data collection for WVS surveys is face-to-face interviewing. Other modes (e.g., telephone, mail, internet) are not acceptable except under very exceptional circumstances and only on an experimental basis”
The RData file was downloaded from http://www.worldvaluessurvey.org/WVSDocumentationWVL.jsp and then filtered to the variables and countries needed for this project.
What type of study is this (observational/experiment)?
This is an observational study.
Link: http://www.worldvaluessurvey.org/WVSDocumentationWVL.jsp
Citation: WORLD VALUES SURVEY 1981-2014 LONGITUDINAL AGGREGATE v.20150418. World Values Survey Association (www.worldvaluessurvey.org). Aggregate File Producer: JDSystems, Madrid SPAIN.
The response variable is the answer to the question “Environmental problems in the world: Global warming or the greenhouse effect.” It is an ordinal categorical variable, coded as follows:
| Value | Description |
|---|---|
| 1 | Very serious |
| 2 | Somewhat serious |
| 3 | Not very serious |
| 4 | Not serious at all |
| -5 | Missing; Unknown |
| -4 | Not asked in survey |
| -3 | Not applicable |
| -2 | No answer |
| -1 | Don’t know |
We will check how the answer to this question varies with region and, possibly, with age and education.
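Because the response is ordinal, it can help to store it as an ordered factor with the labels from the table above. The following is a minimal sketch on made-up codes (`response_codes` is a hypothetical vector, not the survey data); codes outside 1 to 4 become `NA`.

```r
# Made-up response codes for illustration, including two missing-value codes.
response_codes <- c(1, 2, 2, 4, -1, 3, 1, -2)

# Map the substantive codes 1-4 to ordered labels; codes not listed in
# `levels` (the negative ones here) become NA.
response <- factor(
  response_codes,
  levels  = 1:4,
  labels  = c("Very serious", "Somewhat serious",
              "Not very serious", "Not serious at all"),
  ordered = TRUE
)

# Frequency table, including the count of missing values.
response_counts <- table(response, useNA = "ifany")
print(response_counts)
```

A frequency table of this kind, split by country, is one way to compare the distributions without treating the codes as numbers.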
What is the explanatory variable, and what type is it (numerical/categorical)?
The explanatory variables are region (US, China, and India; categorical), highest educational level (ordinal categorical), and age (numerical).
| Value | Description |
|---|---|
| 156 | China |
| 356 | India |
| 840 | US |
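For readable output, the numeric country codes can be translated to names with a lookup vector. This is a small sketch on made-up codes; `country_labels` and `country_codes` are hypothetical names introduced only for illustration.

```r
# Lookup from the numeric country codes in the table above to readable labels.
country_labels <- c("156" = "China", "356" = "India", "840" = "US")

# Made-up codes for illustration.
country_codes <- c(840, 156, 356, 156)

# Index the lookup by the code converted to character.
country <- unname(country_labels[as.character(country_codes)])
print(country)
```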
| Value | Description |
|---|---|
| 1 | Inadequately completed elementary education |
| 2 | Completed (compulsory) elementary education |
| 3 | Incomplete secondary school: technical/vocational type/(Compulsory) elementary education and basic vocational qualification |
| 4 | Complete secondary school: technical/vocational type/Secondary, intermediate vocational qualification |
| 5 | Incomplete secondary: university-preparatory type/Secondary, intermediate general qualification |
| 6 | Complete secondary: university-preparatory type/Full secondary, maturity level certificate |
| 7 | Some university without degree/Higher education - lower-level tertiary certificate |
| 8 | University with degree/Higher education - upper-level tertiary certificate |
| -5 | Missing; Unknown |
| -4 | Not asked in survey |
| -3 | Not applicable; No formal education |
| -2 | No answer |
| -1 | Don’t know |
| Value | Description |
|---|---|
| 15 to 98 | Age in years (ranges from 15 to 98) |
| -5 | Missing; Unknown |
| -4 | Not asked in survey |
| -3 | Not applicable |
| -2 | No answer |
| -1 | Don’t know |
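The negative codes in these tables denote missing values, so they should not enter numeric summaries such as means or quartiles. One possible approach, sketched here on made-up values (the data frame `toy` is hypothetical), is to recode them to `NA` first. Note that for education, code -3 also covers “No formal education”, so treating it as missing is a simplifying assumption.

```r
# Made-up values for illustration; negative numbers are missing-value codes.
toy <- data.frame(
  Age                       = c(31, -2, 54, 42),
  Highest_educational_level = c(5, 8, -3, 2)
)

# Replace every negative code with NA in the listed columns.
# Assumption: -3 in education is treated as missing, although the codebook
# also uses it for "No formal education".
cols <- c("Age", "Highest_educational_level")
toy[cols] <- lapply(toy[cols], function(x) replace(x, x < 0, NA))

# Summaries must then skip the NA values explicitly.
mean_age <- mean(toy$Age, na.rm = TRUE)
print(toy)
print(mean_age)
```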
Provide summary statistics relevant to your research question. For example, if you’re comparing means across groups provide means, SDs, sample sizes of each group. This step requires the use of R, hence a code chunk is provided below. Insert more code chunks as needed.
The available data support the following summaries.
##  Environment_Related_Question Year_Of_survey Country_or_Region
##  Min.   :1.000                Min.   :2006   Min.   :156.0
##  1st Qu.:1.000                1st Qu.:2006   1st Qu.:156.0
##  Median :2.000                Median :2006   Median :356.0
##  Mean   :1.767                Mean   :2006   Mean   :446.4
##  3rd Qu.:2.000                3rd Qu.:2007   3rd Qu.:840.0
##  Max.   :4.000                Max.   :2007   Max.   :840.0
##       Age        Highest_educational_level
##  Min.   :-2.00   Min.   :-3.000
##  1st Qu.:31.00   1st Qu.: 2.000
##  Median :42.00   Median : 5.000
##  Mean   :43.57   Mean   : 3.838
##  3rd Qu.:54.00   3rd Qu.: 6.000
##  Max.   :93.00   Max.   : 8.000
The mean response to the question “Environmental problems in the world: Global warming or the greenhouse effect” is 1.817 in China, 1.710 in India, and 1.780 in the US. Lower values indicate that the problem is seen as more serious.
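Per-country means, standard deviations, and sample sizes can be computed in one step with `aggregate()`. The sketch below runs on a made-up data frame (`toy`, with invented values) rather than the survey data, and, like the means reported above, it treats the ordinal response codes as numbers.

```r
# Made-up responses for illustration; 156 = China, 356 = India, 840 = US.
toy <- data.frame(
  Environment_Related_Question = c(1, 2, 3, 1, 1, 2, 2, 2, 4),
  Country_or_Region            = c(156, 156, 156, 356, 356, 356, 840, 840, 840)
)

# Mean, standard deviation, and sample size of the response per country.
group_stats <- aggregate(
  Environment_Related_Question ~ Country_or_Region,
  data = toy,
  FUN  = function(x) c(mean = mean(x), sd = sd(x), n = length(x))
)

# aggregate() returns the three statistics as one matrix column; flatten it
# into ordinary columns and give them short names.
group_stats <- do.call(data.frame, group_stats)
names(group_stats) <- c("Country_or_Region", "mean", "sd", "n")
print(group_stats)
```

On the real data, the same call with `data = Environment_Subset_US_India_China` would give the group means alongside their SDs and sample sizes.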
**World’s biggest polluters - China, U.S. and India :