Reproducibility, RMarkdown, Presentation, Webpages…., pdf, html, word, Latex and much more
Basic feature of this workshop will be to start exploring data using R along-with hands-on experience. All workshop and related data will be provided to all the participants once they will register for the workshop.
Understanding datasets requires many hours/days or in some cases weeks.There are many commercially available softwares but open source community based softwares have now dominated and R is one of these. R makes data understanding process as easy as possible through the dplyr package. It is one of the easiest solution for code-based data analysis. We will learn in this training how to do it. I have already uploaded videos for downloading packages and relevant information. Installing R and RStudio,
I have discussed the Gapminder dataset in my videos and we shall use it throughout this training. It’s available through CRAN, so make sure to install it. Here’s how to load in all required packages:
Average life expectancy is calculated by_grouping continents:
## # A tibble: 5 x 2
## continent avglife
## <fct> <dbl>
## 1 Africa 54.8
## 2 Americas 73.6
## 3 Asia 70.7
## 4 Europe 77.6
## 5 Oceania 80.7
Understanding percentiles w.r.t lifeexpectancy and gdp per capita for 200 countries using gapminder data.
Creating awesome graphs using themes of popular magzines like The Economist, Financial Times and many ohters.
##
## Attaching package: 'plotly'
## The following object is masked from 'package:ggplot2':
##
## last_plot
## The following object is masked from 'package:stats':
##
## filter
## The following object is masked from 'package:graphics':
##
## layout
## Warning: Ignoring unknown aesthetics: ids
## Warning: All elements of `...` must be named.
## Did you want `key = c(key)`?
## `geom_smooth()` using formula 'y ~ x'
## Setting the `off` event (i.e., 'plotly_doubleclick') to match the `on` event (i.e., 'plotly_hover'). You can change this default via the `highlight()` function.
After exploratory analysis, we shall discuss modeling of data in this section.