In this chapter we discussed why well-designed data graphics are important and we described a taxonomy for understanding their composition.
The objective of this assignment is for you to understand what characteristics you can use to develop a great data graphic.
Each question is worth 5 points.
To submit this homework you will create the document in Rstudio, using the knitr package (button included in Rstudio) and then submit the document to your Rpubs account. Once uploaded you will submit the link to that document on Canvas. Please make sure that this link is hyper linked and that I can see the visualization and the code required to create it.
Question #1
Answer the following questions for this graphic Relationship between ages and psychosocial maturity
Identify the visual cues, coordinate system, and scale(s) Visual cues: length, color, position, and direction Coordinate system: Cartesian Scale(s): Linear
How many variables are depicted in the graph? Explicitly link each variable to a visual cue that you listed above. Four pairs of two identical variables menarche and psychosocial maturation 20,000 years ago: length (length of menarche and psychosocial maturation are different) 2,000 years ago: position (compared to the 20,000-years-ago variable pair, it’s in a different position) 200 years ago: direction (compared to the other variable pairs, it’s slightly going up) Present:color to distinguish menarche and psychosocial maturation
Critique this data graphic using the taxonomy described in the lecture. The data graphic has color visual cues to distinguish menarche and psychosocial maturation. It tries to show there is a mismatch between menarche and psychosocial maturation in the present time. Also, the scale of the graphic shows a time scale where there is a numeric quantity that has some special properties.
Question #2
Answer the following questions for this graphic World’s top 10 best selling cigarette brands 2004-2007
Identify the visual cues, coordinate system, and scale(s) Visual cues: color and length Coordinate system: Cartesian Scale(s): Linear
How many variables are depicted in the graph? Explicitly link each variable to a visual cue that you listed above. 10 variables Marlboro: color & length Mild Seven: color & length L&M: color & length Winston: color & length Camel: color & length Cleopatra: color & length Derby: color & length Pall Mall: color & length Kent: color & length Wills Gold flake: color & length
Critique this data graphic using the taxonomy described in the lecture. The data graphic has visual cues of color and length. We can see that Marlboro was the number 1 top best selling cigarette brand between 2004 and 2007. With the title and x-axis label, it gives us a clear context of what the purpose of the data graphics is to make meaningful comparison. With that said, it has a linear scale where x-axis is sales in billions($).
Question #3
Find two data graphics published in a newspaper on on the internet in the last two years.
knitr::include_graphics("https://www.mining.com/wp-content/uploads/2017/11/Bloomberg-chart-1-Nov-29.png")
This is a line chart that represents the change in price of various precious metals (Gold, Silver, Platinum, and Palladium) over a period of time. This type of graph is commonly used to represent changes in prices or values over time, making it an effective way to show trends and patterns in the data.
The aspects of the display that work well include:
The use of different colors to differentiate between the different metals, making it easy to identify the specific line representing each metal The clear labeling of the x-axis with dates, which provides context for the time period being displayed The y-axis is labeled with a clear unit of measurement (dollars per ounce) The use of a grid to help readers easily compare values across different points in time These aspects of the display follow the principles of graphical displays, such as: Clearly labeling the axes and providing context for the data being displayed Using colors effectively to differentiate between categories of data Providing a clear and easy-to-read visual representation of the data.
knitr::include_graphics("https://www.oldstreetsolutions.com/wp-content/uploads/2021/05/3D-Chart.png")
A 3D chart is an example of a display that may not work well in some cases. The use of three dimensions can make it more difficult for the viewer to interpret the data, especially if the chart has a lot of data points or is viewed from an angle that does not clearly show the relationships between data points. Additionally, the 3D effect can be distracting and take away from the actual data being presented. To improve this type of display, it may be better to use a 2D chart or a simple data visualization that better communicates the information in a clear and concise manner. Another solution could be to provide interactive tools such as zoom or rotate options to allow the viewer to better understand the data.
Question #4
Briefly (one paragraph) critique the designer’s choices. Would you have made different choices? Why or why not? Note: Link contains a collection of many data graphics, and I don’t expect (or want) you to write a full report on each individual graphic. But each collection shares some common stylistic elements. You should comment on a few things that you notice about the design of the collection.
Answer: The data graphic “What is a Data Scientist” effectively uses simple illustrations and typography to explain the different skills and responsibilities of a data scientist. The designer’s choice to use a limited color palette of blue and gray creates a professional and clean look, while the illustrations help to visually break up the information and make it easier to understand. The typography is clear and easy to read, but the use of different font sizes and weights helps to distinguish between the different sections of text. However, the designer could have used data visualizations to further emphasize the importance of certain skills, or used annotations to provide additional information. Overall, the designer’s choices result in a clear and straightforward explanation of the role of a data scientist, but some additional data visualizations could have further enhanced the graphic.
Question #5
Briefly (one paragraph) critique the designer’s choices. Would you have made different choices? Why or why not? Note: Link contains a collection of many data graphics, and I don’t expect (or want) you to write a full report on each individual graphic. But each collection shares some common stylistic elements. You should comment on a few things that you notice about the design of the collection.
Charts that explain food in America
Answer: The data graphics in the collection “Charts that explain food in America” effectively use color, typography, and simple illustrations to convey complex information about the American food system. The designer’s choice to use a minimal color palette of blue, orange, and gray helps to create a consistent look throughout the collection, while also making it easy to distinguish different data points. The typography is clear and easy to read, while the illustrations, such as the images of fruit and vegetables, add a playful touch to the graphics and help to break up large amounts of data. Overall, the designer’s choices work well in making the information accessible and easy to understand, but a few additions such as annotated axes or interactive elements might help to further engage the reader.