Directions

In this chapter we discussed why well-designed data graphics are important, and we described a taxonomy for understanding their composition.

The objective of this assignment is for you to understand what characteristics you can use to develop a great data graphic.

Each question is worth 5 points.

To submit this homework, you will create the document in RStudio using the knitr package (button included in RStudio) and then publish the document to your RPubs account. Once it is uploaded, you will submit the link to that document on Canvas. Please make sure that this link is hyperlinked and that I can see the visualization and the code required to create it.

Question #1

Answer the following questions for this graphic: Relationship between ages and psychosocial maturity

  1. Identify the visual cues, coordinate system, and scale(s)

  2. How many variables are depicted in the graph? Explicitly link each variable to a visual cue that you listed above.

  3. Critique this data graphic using the taxonomy described in the lecture.

    1. The visual cues are position, length, and color; the coordinate system is Cartesian; the y scale is numeric and the x scale is time.
    2. Age and time period are depicted by the position cue; age span is depicted by the length cue; the categorical variable (Menarche/Psychosocial Maturation) is depicted by color.
    3. This data graphic uses position, length, and color as visual cues to show how the pattern of menarche and psychosocial maturation changes across time periods for different age spans. Context is used to improve the visualization.

Question #2

Answer the following questions for this graphic: World’s top 10 best selling cigarette brands 2004-2007

  1. Identify the visual cues, coordinate system, and scale(s)

  2. How many variables are depicted in the graph? Explicitly link each variable to a visual cue that you listed above.

  3. Critique this data graphic using the taxonomy described in the lecture.

    1. The visual cues are length and color; the coordinate system is Cartesian; the x scale is numeric and the y scale is categorical.
    2. The categorical variable, cigarette brand, is linked to the color cue; the numeric variable, sales in billions, is linked to the length cue.
    3. The graphic uses length and color as visual cues to present sales in billions for different cigarette brands. The context in the title and labels improves the visualization.

Question #3

Find two data graphics published in a newspaper or on the internet in the last two years.

  1. Identify a graphical display that you find compelling. What aspects of the display work well, and how do these relate to the principles that we have just gone over in lecture? Include a screenshot of the display along with your solution (Hint: use the following in a code chunk: knitr::include_graphics("your_graphic")).
# The European City Liveability Index graph is compelling. It uses a geographic coordinate system, with location and color as visual cues. The visualization is very clear, and the good context also improves it.

knitr::include_graphics("EuropeCityIndex.jpg")

  2. Identify a graphical display that you find less compelling. What aspects of the display don’t work well? Are there ways that the display might be improved? Include a screenshot of the display along with your solution (Hint: use the following in a code chunk: knitr::include_graphics("your_graphic")).
# The market analysis graph looks less compelling. It does not provide enough context, such as labels and a title, which would help readers understand the visualization better. Adding context would help improve the graphical display.

knitr::include_graphics("MarketAnalysis.jpg")
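As a minimal sketch of how the screenshots above could be included more defensively, the chunk below wraps knitr::include_graphics() in a small check so that a missing image produces a message instead of stopping the knit. The helper name include_if_present is hypothetical and not part of knitr, and the file names are assumed to sit in the same folder as the document.

```r
# Hypothetical helper (not part of knitr): include an image only when the
# file is present, so a missing screenshot does not stop the knit.
include_if_present <- function(path) {
  if (file.exists(path)) {
    # knitr::include_graphics() embeds the image in the knitted output
    knitr::include_graphics(path)
  } else {
    # Report the problem and return NULL invisibly instead of failing
    message("Image not found: ", path)
    invisible(NULL)
  }
}

# File names assumed to be in the same folder as the .Rmd file
include_if_present("EuropeCityIndex.jpg")
include_if_present("MarketAnalysis.jpg")
```

In an R Markdown document, each call would normally go in its own code chunk so that each image is rendered next to the answer it supports.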

Question #4

Briefly (one paragraph) critique the designer’s choices. Would you have made different choices? Why or why not? Note: Link contains a collection of many data graphics, and I don’t expect (or want) you to write a full report on each individual graphic. But each collection shares some common stylistic elements. You should comment on a few things that you notice about the design of the collection.

What is a Data Scientist

Answer:

The blue palette the designer uses is clear and consistent. There is a lot of context to help us understand the visualizations. The visual cues, such as color, length, and number, are straightforward. Adding a coordinate system might improve some of the graphics, such as the one titled "Data scientists are significantly more likely to have advanced degrees than BI professionals", where it is hard for the reader to identify the meaning of the y-axis.

Question #5

Briefly (one paragraph) critique the designer’s choices. Would you have made different choices? Why or why not? Note: Link contains a collection of many data graphics, and I don’t expect (or want) you to write a full report on each individual graphic. But each collection shares some common stylistic elements. You should comment on a few things that you notice about the design of the collection.

Charts that explain food in America

Answer:

Most of the graphics use a geographic coordinate system and color as a visual cue. They give enough context to help the reader understand the visualization. I would make the same choices as the designer, since this is a great way to visualize geographically related information.