Hsin-Yu Cheng
Updata : June 29, 2015
Data Science Specialization Certificate - John Hopkins University / Coursera (In Pursuit)
John Hopkins University holds data science courses on Coursera. A Specialization is a group of related courses, including 9 data analysis related courses and a Capstone project that allows you to use all skills learned throughout the 9 courses. After finising requirements of quizzes and projects, you will get the certificate for the corresponding course. The report records content and projects I have done in the Data Science course.
The introduction for the Data Science Specialization.
In this course, the instructor gives brief introduction of all 9 data science courses. It covers concepts of data analysis and big data and the related tools, namely git, github, RStudio, Rtools amd R markdown.
Github account
Certificate : The Data Scientist’s Toolbox by Johns Hopkins University on Coursera. Certificate earned on May 31, 2015
In this course, learn how to program in R and how to use R for effective data analysis. The course covers practical issues in statistical computing which includes programming in R, reading data into R, accessing R packages, writing R functions, debugging, profiling R code, and organizing and commenting R code. Topics in statistical data analysis will provide working examples.
Project 2 : Lexical Scoping
Project 3 : Air Pollution
Certificate : R Programming by Johns Hopkins University on Coursera. Certificate earned on May 31, 2015
In this course, it covers how to obtain different kind of data type, such as from the web, from APIs, from databases and from colleagues in various formats. Also, I have learned the process of cleaning data and how to make data “tidy”.
Project : A tidy data set of Human Activity Recognition Using Smartphones
Certificate : Getting and Cleaning Data by Johns Hopkins University on Coursera. Certificate earned on June 28, 2015
In this course, I have learn the essential exploratory techniques for summarizing data and three plotting systems in R, namely Base Plotting System, Lattice Plotting System and ggplot2.
Project 1 : Exploration of electric power consumption data
Project 2 : Exploration of PM2.5 emissions Data
Certificate : Exploratory Data Analysis by Johns Hopkins University on Coursera. Certificate earned on May 31, 2015
This course focus on literate statistical analysis tools which allow one to publish data analyses in a single document that allows others to easily execute the same analysis to obtain the same results. I have learned the concepts and tools behind reporting modern data analyses in a reproducible manner, namely R markdown, knitr and Rpubs.
Project 1 : Reproducible research on activity monitoring data
Project 2 : Report on storm data in the United States
Certificate : Reproducible Research by Johns Hopkins University on Coursera. Certificate earned on June 28, 2015
In the course, I have learned statistical linear regression models for prediction, redidual variance, inference in regression, multivariable regression, dummy variable, interactions and model selection.
Project : Analysis on motor trend data
Certificate : Regression Models by Johns Hopkins University on Coursera. Certificate earned on May 31, 2015
Learn how to communicate a statistical analysis output with a data product. The course covers shiny which is a web APP, R packages, rchart, Googlevis ,interactive graphics and slidify. The project is to create a data product to tell a story about the data to a mass audience.
Project : Interactive Web Application - Shiny
Certificate : Developing Data Products by Johns Hopkins University on Coursera. Certificate earned on June 28, 2015