Introduction to data science in R: Data management and the art of wrangling
Overview: In this weekly workshop we will explore the management of data using Program R. We will focus on the structure and linguistics of data in R. This workshop is not intended to teach specific R tasks. Rather than addressing how to solve particular data problems, our goal is to explore how to think about data in new ways. Through this process, participants are expected to develop a flexible skillset for managing and exploring data. Each lesson will last roughly one hour and consist of a lecture followed by guided activities. The workshop is designed to be useful for participants at all skill levels. No previous experience with R is necessary.
Instructor: Brian S. Evans, Ph.D., Migratory Bird Center, Smithsonian Conservation Biology Institute, evansbr@si.edu
Topics and schedule
Base R I: An introduction to the linguistics of Base R and the RStudio environment
Base R II: Querying objects in Base R
Writing functions: Writing custom functions for repeated tasks
The dreaded for loop: Writing for loops and confronting myths on their implementation
Introduction to the tidyverse: A primer on tidy data: Learning the principles of tidy data and using tidyverse functions to reshape and join data
Manipulating and summarizing data in the tidyverse: Learning how to use tidyverse functions to manipulate and summarize data
This is a pipe: Using pipes to streamline and communicate your code
Data queries in the tidyverse: Using dplyr to subset data
Visualizing data: Using ggplot2 to plot data
Putting it all together: Developing a data science workflow
Course details
Target audience: Anyone who currently works with data or hopes to in the future
Where: The Science Building conference room. A teleconference link will be provided to off-site participants.
When: Every Wednesday at 3 pm from January 17 to March 21, 2018. Each lesson will last roughly one hour.
Preparation: Please bring your own laptop to each class. Please visit this link prior to attending your first lesson.
If you are interested in attending this workshop, please apply at this link by January 10, 2018.
About the Instructor: Brian is an avian ecologist and data scientist at the Smithsonian Migratory Bird Center. He received his Ph.D. in quantitative ecology from the University of North Carolina in 2015. He has been using R for roughly 10 years and has been teaching classes and workshops in R and data management since 2012.