Instructions

You must follow the instructions below to get credit for this assignment.

Q1 Describe the Early Childhood Longitudinal Study that the U.S. Department of Education undertook in the late 1990s.

Hint: Make sure to discuss the study’s goal, subjects, and variables in the data.

The ECLS was a study that followed more than 20,000 children from kindergarten through fifth grade, with the goal of measuring their academic progress. It recorded basic demographic information about each student: race, gender, family structure, socioeconomic status, parents’ level of education, and more. Further information was gathered from teachers, principals, and parents: whether and how often the children were spanked, how much TV they watched, whether they were ever taken to educational recreation sites, and so on. This study has hundreds of variables to account for, where a typical study has only a few. That many variables would be difficult to analyze by ordinary means, but regression analysis tools make the data much easier to read and comprehend.

Q2 Describe regression analysis in your own words.

Hint: A correct answer must include a discussion of a main concept of regression, often described as “all else being equal”, “controlling for other variables”, or “ceteris paribus”. The author explained this concept using “the circuit board analogy”.

Regression analysis is a way of sorting through many variables to see whether and how they correlate. In an experiment, usually every variable is held fixed except the one being altered, and the subjects are chosen at random. Regression analysis works from existing data instead: it compares subjects who are alike in most characteristics and differ in the one being studied, which is what “all else being equal” means. Regression analysis doesn’t answer just any question, though; it answers whether X is correlated with Y and vice versa. Even then it doesn’t prove that the relationship is 100% true. It says that there is reason to believe that X is correlated with Y because the numbers say so.
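The idea of comparing like with like can be sketched in a few lines of Python. This is only an illustration: the records, the variable names, and the scores below are invented and are not ECLS data. The sketch compares the raw score gap between students with and without many books to the gap within each income level.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical records, invented for illustration (not ECLS data):
# (income_level, has_many_books, test_score)
students = [
    ("low", False, 49), ("low", False, 50), ("low", False, 51), ("low", True, 52),
    ("high", False, 70), ("high", True, 71), ("high", True, 72), ("high", True, 73),
]

def books_gap(records):
    """Mean score of students with many books minus mean score of those without."""
    with_books = [score for _, books, score in records if books]
    without_books = [score for _, books, score in records if not books]
    return mean(with_books) - mean(without_books)

# Raw comparison: ignores income entirely.
raw_gap = books_gap(students)

# "All else being equal": compare only students at the same income level.
by_income = defaultdict(list)
for record in students:
    by_income[record[0]].append(record)
controlled_gaps = {income: books_gap(recs) for income, recs in by_income.items()}

print("raw gap:", raw_gap)
print("gap within each income level:", controlled_gaps)
```

In this made-up data most of the raw gap comes from income, so the gap shrinks once students are compared only with others at the same income level. A real regression does the same kind of comparison across many variables at once.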

Q3 What is a drawback of regression analysis? What type of questions can regression analysis not answer?

Hint: A correct answer must have a discussion on causality versus correlation.

Regression analysis cannot answer causality questions such as the one used in the text: “does having a lot of books in your home lead your child to do well in school?” This is a question about cause and effect, and it can’t be answered with regression analysis. Regression analysis is meant for questions that ask whether variable 1 is related to variable 2, such as “does a child with a lot of books in his home tend to do better than a child with no books?” Using data, this question can be answered in terms of correlation: a child with books may do better than a child with no books. But there can be hundreds of other variables that explain why the child with books does better in school, such as family structure and socioeconomic status.
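A small simulation can make the gap between correlation and causation concrete. The sketch below is a made-up model, not the ECLS data: a hypothetical `parent_edu` value drives both the number of books and the test score, while books have no effect on scores at all. The variable names, coefficients, and sample size are all assumptions chosen for illustration.

```python
import math
import random

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length lists."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)

rng = random.Random(0)  # fixed seed so the run is repeatable
n = 5000

# Assumed model: parent education influences both books and scores.
parent_edu = [rng.gauss(0, 1) for _ in range(n)]
books = [e + rng.gauss(0, 1) for e in parent_edu]
# Note that books does not appear here: it has zero causal effect on scores.
scores = [2 * e + rng.gauss(0, 1) for e in parent_edu]

# Correlation across all children.
raw_r = pearson(books, scores)

# Correlation among children whose parents have nearly the same education.
similar = [i for i in range(n) if abs(parent_edu[i]) < 0.1]
within_r = pearson([books[i] for i in similar], [scores[i] for i in similar])

print("correlation, all children:", round(raw_r, 2))
print("correlation, similar parent education:", round(within_r, 2))
```

Books and scores are clearly correlated across all children, even though buying more books would change nothing in this model. Among children with similar parents the correlation mostly disappears, which points to the hidden variable as the source.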

Q4 What role does the quality of schools play in the academic performance of students?

Hint: See page 150.

The quality of the school plays a significant role in academic performance. In the survey, the typical white student attended a school that was only 6% black, came from a better neighborhood, and saw a much lower rate of “troublesome indicators”. The typical black student surveyed attended a school that was 60% black and had a significantly higher rate of troublesome indicators such as gang problems, non-student loitering, and a lack of PTA funding. In terms of technology, teacher education, and money, the schools of the two populations were very similar; they differed mainly in the troublesome indicators. This shows that the environment a student is in plays a significant role in how well they will perform academically.

Q5 Continued from Q4. How could you control for the quality of schools in the study?

Hint: For this question, you may need information beyond the assigned reading. You may Google search something like “how does regression control for variables”.

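Well, in this case you cannot directly control the quality of schools, because this is an observational study. Nothing about the students is altered; they are only examined for their academic ability. Observational studies do not have a control group and usually have confounds. A confounding variable is a variable that could affect both X and Y but is not necessarily measured. What regression can do is control for school quality statistically: if measures of school quality, such as the troublesome indicators, are included as variables, each student is in effect compared with students who attend similar schools.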
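One common way to control statistically for something like school quality in observational data is to include it as an extra variable in the regression. The sketch below is a minimal illustration under stated assumptions: the eight records are invented and noise-free, `group` and `troubled` are hypothetical 0/1 variables, and `ols` is a small hand-written least-squares helper, not a function from any library.

```python
def ols(rows, y):
    """Least-squares coefficients; each row must start with 1 for the intercept."""
    k = len(rows[0])
    # Augmented normal equations: (X'X | X'y).
    a = [
        [sum(r[i] * r[j] for r in rows) for j in range(k)]
        + [sum(r[i] * t for r, t in zip(rows, y))]
        for i in range(k)
    ]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(k):
        pivot = max(range(col, k), key=lambda i: abs(a[i][col]))
        a[col], a[pivot] = a[pivot], a[col]
        for row in range(k):
            if row != col:
                factor = a[row][col] / a[col][col]
                a[row] = [v - factor * p for v, p in zip(a[row], a[col])]
    return [a[i][k] / a[i][i] for i in range(k)]

# Invented data: group is a 0/1 student group, troubled is 1 when the
# school shows "troublesome indicators". Scores depend only on the school.
group    = [0, 0, 0, 0, 1, 1, 1, 1]
troubled = [0, 0, 0, 1, 0, 1, 1, 1]
score    = [70 - 10 * t for t in troubled]

# Regression without the school variable: score ~ group.
naive = ols([[1, g] for g in group], score)

# Regression that controls for school quality: score ~ group + troubled.
controlled = ols([[1, g, t] for g, t in zip(group, troubled)], score)

print("group coefficient, no control:", naive[1])
print("group coefficient, controlling for school:", controlled[1])
print("school coefficient:", controlled[2])
```

Without the school variable, the group coefficient absorbs the difference between schools. Once `troubled` is included, the group coefficient falls to essentially zero in this invented data, because the gap was entirely a school effect.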

Q6 The author says that regression is more art than science. What does he mean?

He means that a study can produce enormous amounts of data, and a skilled practitioner can draw correlations from it for almost any question about how variables relate. It is more of a skill than a mechanical task, and the author thinks it is useful for determining how variables are related to one another.

Q7 What are major takeaways from the reading?

The thing that stuck out the most to me was that you can run a regression analysis on any correlation question you can think of. It is crazy to think that we have the capability, using statistics, to find out whether X is related to Y, or Y to X, even if the question is absurd. One of the findings later in the study was that Head Start, an early-entry schooling program, does not affect the child’s future test scores. It does not prepare them for a better education. Regression analysis can be used for almost anything, which is what surprises me the most.

Q8 Hide the messages, but display the code and its results on the webpage.

Hint: Use message, echo and results in the chunk options. Refer to the RMarkdown Reference Guide.

Q9 Display the title and your name correctly at the top of the webpage.

Q10 Use the correct slug.