In this chapter we discussed why well-designed data graphics are important and we described a taxonomy for understanding their composition.
The objective of this assignment is for you to understand what characteristics you can use to develop a great data graphic.
Each question is worth 5 points.
To submit this homework you will create the document in Rstudio, using the knitr package (button included in Rstudio) and then submit the document to your Rpubs account. Once uploaded you will submit the link to that document on Canvas. Please make sure that this link is hyper linked and that I can see the visualization and the code required to create it.
Question #1
Answer the following questions for this graphic Relationship between ages and psychosocial maturity
Question #2
Answer the following questions for this graphic World’s top 10 best selling cigarette brands 2004-2007
Question #3
Find two data graphics published in a newspaper on on the internet in the last two years.
knitr::include_graphics("Student loans.png")
- The above graphic is
compelling to me since it is the bar plot that is easy to interpret. I
can see how many borrowers belong to each amount owed bracket. It is
pretty simple as there are only two variables.
knitr::include_graphics("Tax Filings.png")
- This graphic is bit harder
to read as it has a few shades of green depending on the returns count.
It needs a little more time to interpret compared to the one before. I
don’t really have a better solution though. I think it is just the
specific of information represented here.
Question #4
Briefly (one paragraph) critique the designer’s choices. Would you have made different choices? Why or why not? Note: Link contains a collection of many data graphics, and I don’t expect (or want) you to write a full report on each individual graphic. But each collection shares some common stylistic elements. You should comment on a few things that you notice about the design of the collection.
Answer: I think the choice of colors is fine. Some graphics feel a bit overwhelming with the amount of information on them, for example the one comparing proportions of BI professionals vs Data scientists in learning backgrounds.
Question #5
Briefly (one paragraph) critique the designer’s choices. Would you have made different choices? Why or why not? Note: Link contains a collection of many data graphics, and I don’t expect (or want) you to write a full report on each individual graphic. But each collection shares some common stylistic elements. You should comment on a few things that you notice about the design of the collection.
Charts that explain food in America
Answer: Graphics 2 and 4 are simple and easy to read. I would do these in a similar way. Graphic 3 is also pretty easy to interpret and see the trends based on colors depending on geography. At the same time, if we take a look at graphic 5, it seems harder to read as there is a multiple choice of colors, shades and transparency thanks to the gradient. Graphic 38 has similar problem, especially when there is no obvious leading name (soda, pop or coke). However, figures 28 and 29 are even harder to work with. For graphic 28, according to its description, each point has the colors of the three most popular restaurants at that location. Personally, for me it is extremely hard to name three most popular restaurants at any point on the map. On graphic 29, it isn’t easy to determine state borders or major cities. It is creative but its interpretation would be easier if the base map was shown as on the figure 3 and density be visualized by color shade, not the “waffle” candles’ height.