Data Wrangling & Visualization: Comprehensive Guide

Josh Carrell

For more of my code: https://rpubs.com/GeospatialEcologist


Introduction

Too often I found myself googling simple R code I had learned and quickly forgotten. While googling code examples and tutorial is a daily pursuit for 99% of coders, I needed more consistancy. So I have made my own Data Wrangling & Visualization page! This markdown contains all wrangling and visual codes and examples that I find useful and necessary. All code will NOT be hid but printed for example. Enjoy.

Data Wrangling: What is it & Why is it important?

“Data wrangling, sometimes referred to as data munging, is the process of transforming and mapping data from one”raw" data form into another format with the intent of making it more appropriate and valuable for a variety of downstream purposes such as analytics". - Wikipedia (Yes, wikipedia.).

Essentially, it is the imperative preprocessing step of cleaning your data. It is the basics and already assumed skillset of researchers, data scientists, and data visualization. You need to know this stuff.

Visualization: What makes a “good” visualization?

Good data visualizations come in all shapes and sizes, but all of them have certain traits that help ensure you produce something with important insights. The following table, sourced from Mico Yuk and Stephanie Diamond from their book “Data Visualization for Dummies”, shows these key items:

Trait Description
Useful People use it on a regular basis and can make relevant decisions by viewing all the information they need in one place.
Desirable It’s not only easy to use, but also pleasurable to use.
Usable People who use it can accomplish their goals quickly and easily.

Simply put, you want your visuals to be appealing, simple yet critical, and only using information the consumer needs. Sometimes less is more and sometimes more is just right. All visualizations should be tailor made to your audiance.

Packages: Which, What, Why

The following packages will prove helpful: