Full stack data analysis with R's ecosystem

Tho Duy Nguyen
2016

Tools in data analysis process

  • Data analysis software: Eview, Crystal Ball, Stata, SPSS, SAS…
  • Text editor: Word, Open Office, ….
  • Data visualization: usually Excel then paste to text editor.
  • Presentation: Power Point
  • and more: Latex, …

Problems

  • Have to learn many GUI tools
  • Commercial tools
  • Specific platform: Windows, Linux or MacOS?
  • Reproducible research?
  • Productivity
  • Up to date techonology?

R's Ecosystem is the question for all problem!

This is not a joke!

R language

  • The richest data analysis libraries
  • Open source

RStudio as IDE

  • RStudio is the open source IDE, tailors for data analysis with R
  • RStudio Server: cloud solution
  • Easy to extend function with plugins writing in R

"Traditional R"

  • Data cleaning and data manipulating
  • Data exploration
  • Data visualiztion
  • Data modeling

"Modern R" with additional power

  • Reproducible research is the core value.

rmarkdown

  • A simple mark up language, easy to learn
  • Intergrate code and plot into research document
  • Uses in other places
  • For more information
install.packages("rmarkdown")
install.packages("knitr")

slidify

  • Make HTML5 presentation: support all devices have modern web browser
  • Easy to customize with CSS, JavaScript
  • 12 templates
install.packages("devtool")
library(devtool)
install_github('ramnathv/slidify')
install_github('ramnathv/slidifyLibraries')

flexdashboard

  • Make dashboard with interactive data visualization powered by R's library htmlwidgets
  • Modern way to present data visualization
  • Easy customize with CSS, JavaScript
  • Example: here or here
  • For more information
devtools::install_github("rstudio/flexdashboard")

More?

The list is still going …

Blogging

Develop web data product