Albert Y. Kim
Monday 2015/01/26
When I came up with the name for this class “MATH 241 Case Studies in Statistical Analysis” last year, I was still skeptical of the term “Data Science” as I felt it too buzzwordy.
I've since changed my stance. This class should be in fact “MATH 241 Introductory Data Science”
From a presentation by former Institute of Mathematical Statistics president Bin Yu:
Venn Diagram 2.0:
From the introduction to the OpenIntro Statistics textbook from MATH 141:
In MATH 141, we tended to focus more on 4 and 5.
For the first few lectures, we will work on developing our data toolbox. These tools are absolutely necessary before we can pursue any kind of meaningful analysis. Specifically
ggplot2 packagedplyr packageThe beauty of these two R packages is there is a deep philosophy underlying how to use them.
RStudio is an integrated development environment that acts as a user interface for R.
To see them all, press alt+shift+k.