Click the Original, Code and Reconstruction tabs to read about the issues and how they were fixed.
Objective
The objective of the original data visualisation is to showcase the contribution of e-commerce sales as a percentage of the total retail sales made in each given year over a specified duration of time. The aim is to gain insights on whether there is a boom in e-commerce retail sales in the past decades. This information is beneficial to companies who operate within the U.S. retail industry. Information interpreted from the visualisations will benefit any existing player within the U.S. retail industry or any new potential entrants looking to enter the U.S. market with their short-term and long-term business strategies. To name a few of the target audiences within the above mentioned market spaces includes but are not limited to company strategists such as analysts, C-suite executives and stakeholders.
The visualisation chosen had the following three main issues:
the visual bombardment due to saturation of the colours blue and red
sizes of the red and blue portions of the chart are not measured to scale. Since the chart is not measured a scale, it makes it difficult for the reader to understand, interpret and visualise the percentage growth over the given period of time. As a result of this, the data visualisation seems to be misleading which means it is deceptive in nature and less effective in conveying the objective of the visualisation. Therefore, a bar graph would be more indicative when comparing percentages of e-commerce sales in different years in either an ascending or descending order. Moreover, adding grids to the bar graph will be helpful as it will help with highlighting the small percentage changes, assuring accuracy in reading the graph.
Although the visualisation has percentage of e-commerce sales out of the total retail sales mentioned, the absence of an axis makes it more difficult for the human eye to elucidate the objective of the chart. This is because the human eye is better trained and equipped to interpret visualisations along graphs plotted against the y- and x- axes. Thus, the absence of axes creates a bias and leads to misinformation that is misleading judgement and/or interpretations.
The objective of the visualisation revolves around the percentage of e-commerce boom of the total retail sales made each year, which is represented by the various shades of red as seen in the visualisation. There are two main issues with this colour representation which are as follows:
the four shades of red ranging from light red which comes off as pink to the naked eye to a stark, bright dark red colour. This is an issue as it can impair the ability of any human being that suffers from red colour blindness. This increases the probability of misreading the data visualisation resulting in making wrong conclusions; which is vital as it is makes up for the essence of the aim of the visualisation. This is why it is essential to keep in mind a colourblind-safe colour palette when producing visualisations. Moreover, red-blind readers perceive the colour as blue and/or olive whilst blue-blind readers perceive the colour as teal and/or pink. This, with regards to our visualisation will create immense confusion given the interchange of colours due to colour blindness, therefore resulting in misleading information thus defeating the purpose of the objective.
Since the primary focus is the data presented in shades of the colour red in the visualisation, the different shades of blue showcasing total retail sales is not required. It results in unnecessary information and colour explosion, especially where it is not essential or required with regards to the objective of the visualisation.
References
Martinčević, I 2021, Charting Over 20 Years of Retail Sales and E-Commerce in the U.S. Retail Industry (Update), HowMuch, viewed 19 September 2022, https://howmuch.net/articles/timeline-retail-sales-growth-US
McCready, R 2018, 5 Ways Writers Use Misleading Graphs To Manipulate You [INFOGRAPHIC] - Venngage, Venngage, viewed 19 September 2022, https://venngage.com/blog/misleading-graphs/
Nguyen, VT, Jung, K & Gupta, V 2021, ‘Examining data visualization pitfalls in scientific publications’, Visual Computing for Industry, Biomedicine, and Art, vol. 4, no. 1
Muth, LC 2020, What to consider when visualizing data for colorblind readers - Datawrapper Blog, blog.datawrapper.de, viewed 19 September 2022, https://blog.datawrapper.de/colorblindness-part2/
The following code was used to fix the issues identified in the original.
library(dplyr)
library(ggplot2)
retailsales <- data.frame(Year = c("2020", "2019", "2018", "2017", "2016",
"2015", "2014", "2013", "2012", "2011"),
Percentage_of_Ecommerce_Total_Retail_Sales = c(14.0, 10.9, 9.8, 9.0,
8.2, 7.3, 6.5, 5.9, 5.4, 4.9))
View(retailsales)
retailsales1 <- retailsales[order(retailsales$Year),]
View(retailsales1)
plot1 <- ggplot(data = retailsales1, aes(group = 1, x = Year,y = Percentage_of_Ecommerce_Total_Retail_Sales))
plot1 <- plot1 + geom_bar(stat= "identity", fill= "purple", colour= "black") +
geom_text(aes(label = paste(Percentage_of_Ecommerce_Total_Retail_Sales, "%", sep='')),
nudge_y = 1, nudge_x = 0.05) +
labs(
title = "Change in Growth of E-commerce Retail Sales in the U.S. Over a Decade",
y = "% of Ecommerce Retail Sales") + theme(plot.title = element_text(hjust = 0.5),
panel.background = element_rect(fill = "white", colour = "black", linetype="solid", size=1),
panel.grid.major = element_line(size = 0.5, linetype = 'solid', colour = "grey"),
panel.grid.minor = element_line(size = 0.25, linetype ='solid', colour ="grey")) +
scale_y_continuous(limits = c(0,16))
Data Reference
The following plot fixes the main issues in the original.