In this chapter we discussed why well-designed data graphics are important and we described a taxonomy for understanding their composition.
The objective of this assignment is for you to understand what characteristics you can use to develop a great data graphic.
Each question is worth 5 points.
To submit this homework you will create the document in RStudio, using the knitr package (button included in RStudio), and then submit the document to your RPubs account. Once uploaded, you will submit the link to that document on Canvas. Please make sure that this link is hyperlinked and that I can see the visualization and the code required to create it.
Question #1
Answer the following questions for this graphic: Relationship between ages and psychosocial maturity
knitr::include_graphics("http://ars.els-cdn.com/content/image/1-s2.0-S1043276005002602-gr2.jpg")
#### Question - 1 Answer part (a):
# Answer (a)
## Visual cues: The graph's visual cues include color, position, shape, size (length), and direction.
## Coordinate System: The graph uses a Cartesian (x, y) coordinate system.
## Scale(s): The y-axis uses a linear numeric scale running from 0 to 20 years of age. The x-axis uses a reversed logarithmic numeric scale running from 20,000 years ago to the present.
#### Question - 1 Answer part (b):
# Answer (b): The graph presents four pairs of the same two variables, "menarche" and "psychosocial maturation", and the pairs differ in length, direction, and position. At 20,000 years ago, the lengths of the two variables differ. At around 2,000 years ago, their positions differ as well as their lengths. At around 200 years ago, the direction also differs, with "menarche" moving up relative to the other variable. At the present, color clearly marks the mismatch between the two variables.
#### Question - 1 Answer part (c):
# Answer (c): The graphic uses color to distinguish the "menarche" and "psychosocial maturation" variables, and it conveys its main message. However, the title "The trends in Endocrinology and Metabolism" would be better placed at the top than at the bottom. The graph also appears to lack an x-axis label and a legend, and colors other than green and pink would have given it a more professional appearance.
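The improvements suggested in answer (c) can be sketched in base R graphics. This is only an illustration under stated assumptions: the data frame `maturity` and every value in it are invented placeholders (they are not read from the original figure), the title wording is hypothetical, and the two colors are taken from the Okabe-Ito colorblind-friendly palette as one possible alternative to green and pink.

```r
# Placeholder data: these ages are invented for illustration only and are
# NOT taken from the original figure. 20 stands in for "the present",
# because a logarithmic axis cannot show zero.
maturity <- data.frame(
  years_ago    = c(20000, 2000, 200, 20),
  menarche     = c(12, 13, 16, 12),
  psychosocial = c(12, 14, 17, 19)
)

# Two Okabe-Ito palette colors (blue and vermillion).
series_cols <- c(menarche = "#0072B2", psychosocial = "#D55E00")

plot(
  maturity$years_ago, maturity$menarche,
  type = "b", pch = 16, col = series_cols["menarche"],
  log  = "x",
  xlim = rev(range(maturity$years_ago)),  # reversed: distant past on the left
  ylim = c(0, 20),
  main = "Age at menarche and psychosocial maturation (placeholder data)",
  xlab = "Years before present (log scale)",
  ylab = "Age (years)"
)
lines(
  maturity$years_ago, maturity$psychosocial,
  type = "b", pch = 17, col = series_cols["psychosocial"]
)

# Explicit legend so the reader does not have to guess what each color means.
legend(
  "bottomright",
  legend = c("Menarche", "Psychosocial maturation"),
  col = unname(series_cols), pch = c(16, 17), lty = 1,
  title = "Measure"
)
```

Here `main` places the title above the plot, `xlab` supplies the missing x-axis label, and `legend()` adds the missing key.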
Question #2
Answer the following questions for this graphic: World’s top 10 best selling cigarette brands 2004-2007
knitr::include_graphics("https://farm3.static.flickr.com/2695/4149541331_482fbb0aaf_o.png")
#### Question - 2 Answer part (a):
# Answer (a)
## Visual cues: The graph's visual cues include color and length.
## Coordinate System: Cartesian (x, y) coordinate system.
## Scale(s): The x-axis uses a linear numeric scale running from 0 to 500 dollars. The y-axis is categorical and lists the different cigarette brands.
#### Question - 2 Answer part (b):
# Answer (b):
# The graph depicts ten variables, each encoded with the visual cues of length and color:
# Variable 1: Marlboro, Visual Cues: Color and length.
# Variable 2: Mild Seven, Visual Cues: Color and length.
# Variable 3: L&M, Visual Cues: Color and length.
# Variable 4: Winston, Visual Cues: Color and length.
# Variable 5: Camel, Visual Cues: Color and length.
# Variable 6: Cleopatra, Visual Cues: Color and length.
# Variable 7: Derby, Visual Cues: Color and length.
# Variable 8: Pall Mall, Visual Cues: Color and length.
# Variable 9: Kent, Visual Cues: Color and length.
# Variable 10: Wills Gold Flake, Visual Cues: Color and length.
#### Question - 2 Answer part (c):
# Answer part (c):
# The graphic uses color and length to distinguish the sales figures (in billions) of some of the world's top cigarette brands between 2004 and 2007. It presents its primary message in a well-organized manner and clearly shows "Marlboro" as the best-selling brand over that period. Each brand is assigned a distinct color, and the length of each bar represents that brand's total sales in billions between 2004 and 2007. Although the elements of the graph are presented in a structured manner, there is still no y-axis label or legend.
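A minimal base R sketch of a horizontal bar chart that carries both axis labels is shown here. The brand names come from answer (b); the `sales` values are a placeholder descending sequence and are NOT the real sales figures, and the title and label wording are assumptions made for illustration.

```r
# Brand names as listed in answer (b).
brands <- c("Marlboro", "Mild Seven", "L&M", "Winston", "Camel",
            "Cleopatra", "Derby", "Pall Mall", "Kent", "Wills Gold Flake")

# Placeholder values: a simple descending sequence, NOT the real figures.
sales <- seq(from = 10, to = 1)
names(sales) <- brands

# Widen the left margin so the brand names and the y-axis label both fit.
old_par <- par(mar = c(5, 9, 4, 2))

barplot(
  rev(sales),            # reversed so the first brand is drawn at the top
  horiz = TRUE, las = 1, # horizontal bars, horizontal tick labels
  col   = "grey40",
  main  = "Top 10 cigarette brands, 2004-2007 (placeholder values)",
  xlab  = "Sales (placeholder units)"
)

# The y-axis label that answer (c) reports as missing.
title(ylab = "Cigarette brand", line = 7.5)

par(old_par)  # restore the previous margins
```

Because each bar is labelled directly on the categorical axis, this sketch uses a single bar color and does not add a separate legend; a per-brand color scheme with a legend would be an alternative design.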
Question #3
Find two data graphics published in a newspaper on the internet in the last two years.
Identify a graphical display that you find compelling. What aspects of the display work well, and how do these relate to the principles that we have just gone over in lecture? Include a screenshot of the display along with your solution (Hint: use the following in a code chunk: knitr::include_graphics("your_graphic")).
Identify a graphical display that you find less compelling. What aspects of the display don’t work well? Are there ways that the display might be improved? Include a screenshot of the display along with your solution (Hint: use the following in a code chunk: knitr::include_graphics("your_graphic")).
#### Question - 3 Answer part (a): More Compelling graphical Display Analysis:
knitr::include_graphics("https://www.census.gov/content/dam/Census/library/stories/2022/03/united-states-deaths-spiked-as-covid-19-continued-figure-3.jpg")
## Answer-3 (part a): Several elements of the "Covid-19 Pandemic: Monthly deaths and Milestones" display are compelling. Distinct colors separate the two periods, blue for 2020-2021 and grey for 2018-2019, and the plot includes a legend. In the Cartesian (x, y) coordinate system, the y-axis shows monthly deaths in thousands and the x-axis shows the twelve months of the year, so the blue and grey lines trace the rise and fall in monthly deaths for each period precisely. The display also annotates key milestones of the Covid-19 pandemic during 2020-2021.
#### Question - 3 Answer part (b): Less Compelling graphical Display Analysis:
knitr::include_graphics("https://miro.medium.com/max/1228/1*2XZ667k6b_haq-jrHb58HQ.webp")
## Answer-3 (part b): Several elements of the "Number of Covid-19 Tests per Million" display are less compelling. Every country is drawn in the same color, blue, so color does not help to distinguish the countries. The display has no clear x-axis label, and it does not state the year to which the tests-per-million figures refer, which is one of its most important omissions. Finally, the title nearly duplicates the y-axis label; a title worded differently from the y-axis label would have been more appealing.
Question #4
Briefly (one paragraph) critique the designer’s choices. Would you have made different choices? Why or why not? Note: Link contains a collection of many data graphics, and I don’t expect (or want) you to write a full report on each individual graphic. But each collection shares some common stylistic elements. You should comment on a few things that you notice about the design of the collection.
Answer:
#### Question - 4 Answer:
# From my perspective, the author's design choices in these displays are reasonable for a high-level understanding of each graphic's context, the statistics shown, and the comparisons between categories. However, I would have made different choices. First, most of the graphics rely on a monochromatic scheme of varying shades of blue and grey; distinct colors for different categories would have been more compelling, professional, and appealing than the similar colors used in, for example, the "Lack of training and resources as biggest obstacle" graph. Second, the fonts and the sizes of the percentages and text vary considerably between graphs, for example between "The best source of new data science talent is" and "Data scientists are significantly more likely to have advanced degrees than BI Professional", which also differ in graphical dimensions. Some consistency across the graphs would have been better.
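One way to get the consistency in text sizes discussed in this answer is to route every chart in a collection through a single helper that applies the same settings. The sketch below uses base R; `shared_style` and `draw_bar` are hypothetical names introduced here, and the chart titles and percentages are placeholders, not values from the collection being critiqued.

```r
# One shared set of text sizes, applied to every chart in the collection.
shared_style <- list(cex.main = 1.2, cex.lab = 1, cex.axis = 0.9)

# Hypothetical helper: draws a horizontal bar chart with the shared style.
draw_bar <- function(values, title, xlab) {
  old_par <- par(shared_style)  # apply the shared sizes, keep the old ones
  on.exit(par(old_par))         # restore the old settings when done
  barplot(values, horiz = TRUE, las = 1,
          main = title, xlab = xlab, col = "#0072B2")
}

# Placeholder percentages, invented for illustration only.
mids_a <- draw_bar(c("Option A" = 40, "Option B" = 60),
                   "First chart (placeholder)", "Share of respondents (%)")
mids_b <- draw_bar(c("Group X" = 25, "Group Y" = 35, "Group Z" = 40),
                   "Second chart (placeholder)", "Share of respondents (%)")
```

Both charts now share the same title, label, and tick-label sizes, so changing `shared_style` in one place updates the whole collection.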
Question #5
Briefly (one paragraph) critique the designer’s choices. Would you have made different choices? Why or why not? Note: Link contains a collection of many data graphics, and I don’t expect (or want) you to write a full report on each individual graphic. But each collection shares some common stylistic elements. You should comment on a few things that you notice about the design of the collection.
Charts that explain food in America
Answer:
#### Question - 5 Answer:
# I found it compelling that the author of the charts explaining food in America used several coordinate systems (geographic, Cartesian, and polar), a varied range of appealing colors, and dynamic visual elements in a few select graphs. If I had to make different choices to improve these graphs, I would add clearly defined x-axis and y-axis labels to most of the Cartesian graphs, since some of the bar graphs have no axis labels at all. Most of the Cartesian graphs that use several colors also lack a legend, and a clearly defined legend would make them more compelling and professional; most of the geographic graphs, by contrast, do have properly defined legends for their categories. I would also use different colors in the last graph, "The rise of bottled water", to distinguish the categories of beverages.