Directions

In this chapter we discussed wy well-designed data graphics are important and we described a taxonomy for understanding their composition.

The objective of this assignment is for you to understand what characteristics you can use to develop a great data graphic.

Each question is worth 5 points.

To submit this homework you will create the document in Rstudio, using the knitr package (button included in Rstudio) and then submit the document to your Rpubs account. Once uploaded you will submit the link to that document on Canvas. Please make sure that this link is hyper linked and that I can see the visualization and the code required to create it.

Question #1

Answer the following questions for this graphic Relationship between ages and psychosocial maturity

  1. Identify the visual cues, coordinate system, and scale(s)

A: Visual cues: color, position, length, and direction, labels; Coordinate system: Cartesian; Scale(s): Time(years)

  1. How many variables are depicted in the graph? Explicitly link each variable to a visual cue that you listed above.

A: 4 pair of variable to show the different between menarche and psychosocial maturation 1. 20,000 years ago: length for ages 2. 2,000 years ago: position in term of Argiculture settlement 3. 200 years ago: direction in term of Industrial revolution and social improvement 4. Present: magenta and green color to distinguish menarche and psychosocial maturation, mismatched in social complexity and nutritional overload.

  1. Critique this data graphic using the taxonomy described in the lecture.

A: The data graphic has color visual cues to distinguish menarche and psychosocial maturation. It tries to show there is a mismatch between menarche and psychosocial maturation in the present time. Also, the scale of the graphic shows a time scale where there is a numeric quantity that has some special properties. it’s easy to distinguish the relationship between those two objects, however, the bar length is not defined by scale, so it’s not easy to see the absolute number of each objects.

Question #2

Answer the following questions for this graphic World’s top 10 best selling cigarette brands 2004-2007

  1. Identify the visual cues, coordinate system, and scale(s)

A: Visual cues: color and length. Coordinate system: Cartesian. Scale(s): Linear

  1. How many variables are depicted in the graph? Explicitly link each variable to a visual cue that you listed above.

A: 10 variables. Marlboro: color & length. Mild Seven: color & length. L&M: color & length. Winston: color & length .Camel: color & length. Cleopatra: color & length. Derby: color & length.Pall Mall: color & length.Kent: color & length.Wills Gold flake: color & length

  1. Critique this data graphic using the taxonomy described in the lecture.

A: The data graphic has visual cues of color and length. We can see that Marlboro was the number 1 top best selling cigarette brand between 2004 and 2007. With the title and x-axis label, it gives us a clear context of what the purpose of the data graphics is to make meaningful comparison. With that said, it has a linear scale where x-axis is sales in billions($).

Question #3

Find two data graphics published in a newspaper on on the internet in the last two years.

  1. Identify a graphical display that you find compelling. What aspects of the display work well, and how do these relate to the principles that we have just gone over in lecture. Include a screenshot of the display along with your solution (Hint:use the following in a code chunk: knitr::include_graphics(“your_graphic”).
knitr::include_graphics("ushousingshortage.jpg")

A: This map shows Housing supply in US 2022 June. It clearly displayed the supply level at main metro areas by visual cue of color. The brownish color indicate shortage most and blueish color indicate Surplus. the geographic map, labels and color difference is easier to help viewers to understand the state of current housing supply. The context is provided by color scale and legend.

  1. Identify a graphical display that you find less compelling. What aspects of the display don’t work well? Are there ways that the display might be improved? Include a screenshot of the display along with your solution (Hint:use the following in a code chunk: knitr::include_graphics(“your_graphic”).
knitr::include_graphics("housing-bad.png")

A: This graphic less compelling because it’s lack of information and explanations. The data graphic has visual cues. We know there are three categories, it trying to show the trends of house sale, inventory and HPI, also their relationship. However, it’s lack of labels such as title, and the axis label is not clear enough. the x-axis of time could be more simple and short, the display shows very dense and not easy to specify the difference.

Question #4

Briefly (one paragraph) critique the designer’s choices. Would you have made different choices? Why or why not? Note: Link contains a collection of many data graphics, and I don’t expect (or want) you to write a full report on each individual graphic. But each collection shares some common stylistic elements. You should comment on a few things that you notice about the design of the collection.

What is a Data Scientist

Answer: I like the graphic with more clear context and contrast. Contrast refers to how different elements are in a design, particularly adjacent elements. These differences make various elements stand out. Contrast is also a very important aspect of creating accessible designs. Insufficient contrast can make text content in particular very difficult to read, especially for people with visual impairments. hence, I would like to add more color contrast instead of blue and gray in this graphic. Also, the balance is also important, every element of a design—typography, colors, images, shapes, patterns, etc.—carries a visual weight. Some elements are heavy and draw the eye, while other elements are lighter. The way these elements are laid out on a page should create a feeling of balance.There are two basic types of balance: symmetrical and asymmetrical. Symmetrical designs layout elements of equal weight on either side of an imaginary center line. Asymmetrical balance uses elements of differing weights, often laid out in relation to a line that is not centered within the overall design.

Question #5

Briefly (one paragraph) critique the designer’s choices. Would you have made different choices? Why or why not? Note: Link contains a collection of many data graphics, and I don’t expect (or want) you to write a full report on each individual graphic. But each collection shares some common stylistic elements. You should comment on a few things that you notice about the design of the collection.

Charts that explain food in America

Answer: I like the design of each categories are separated and being charted well, since this layout avoid lots overlapping and confusion. To improve the visualization, i would like to normalize the coordinate system since multiple system will cause the misunderstanding. Furthermore, a clear visual cue using color to classify different data point could be improved, such as add more contrast, and label the scale. Another difference choice could be the focus of those graphics, without read the side context, it’s not easy to understand what is the key statement or point for viewers. such as the 14th graph, the color did make the difference, but lack of legend to help viewer understand that which meat does the color represented for. I would try add more intutive lable and legend to describe the graphic instead of using side text to explain.