Question 1

Suppose I conduct a study and publish my findings. Which of the following is an example of replication of my study ?

Answer : An investigator at another institution conducts a study addressing the same question, collects her own data, analyses it separately from me, and publishes her own findings.

Question 2

Which of the following is a requirement for a published data analysis to be reproducible ?

Answer : The full computer code for doing the data analysis is made publicly available.

Question 3

Which of the following is an example or reproducible study ?

Answer : The study’s analytic data and computer code for the data analysis are publicly available. When the code is run on the analytic data, the findings are identical to the published results.

Question 4

Which of the following is a reason that a study migth NOT be fully replicated ?

Answer : The original study was opportunistic in its timing and it would be difficult to find a similar context in which to repeat it.

Question 5

Which of the following is a reason why publishing reproducible research is increasingly important ?

Answer : New technologies are increasing the rate of data collection, creating datasets that are more complex and extremely high dimensional.

Question 6

What is the role of processing code in research pipeline ?

Answer : It transforms the measured data in to analytic data.

Question 7

Which is a goal of literate statistical programming ?

Answer : Combine explanatory text and data analysis in a single document.

Question 8

What does it mean to weave a literate statistical program ?

Answer : Transform the literate program into a human readable document.

Question 9

Which of the following is required to implement a literate programming system ?

Answer : A programming language like R.

Question 10

What is one way in which the knitr system differs from Sweave ?

Answer : knitr allows for the use of markdown instead of LaTeX.