About
In this section, we will be using Tableau to learn concepts on data outliers, seasonality effect, and the relationships and impacts. There is no R coding in this lab session.
Setup
This worksheet will be used to capture your images from Tableau and to share your observations. Example of capturing and including an image is included at the end of this sheet for your reference. You will need to log onto Tableau and Connect/Import the file EuroStore.xls found in the ‘bsad_lab10’ folder.
Remember to always set your working directory to the source file location. Go to ‘Session’, scroll down to ‘Set Working Directory’, and click ‘To Source File Location’. Read carefully the below and follow the instructions to complete the tasks and answer any questions. Submit your work to RPubs as detailed in previous notes.
Note
For your assignment you may be using different data sets than what is included here. Always read carefully the instructions on Sakai. Tasks/questions to be completed/answered are highlighted in larger bolded fonts and numbered according to their particular placement in the task section.
mydata = read.csv("data/EuroStore.csv")
head(mydata)
## Week Sales TV Radio Fuel.Volume Fuel.Price Temp Holiday
## 1 26 24864 74.5 66.5 61825 104.24 27.9 1
## 2 27 23809 74.5 66.5 62617 103.97 27.7 1
## 3 28 24476 90.0 75.0 60227 107.48 29.1 1
## 4 29 25279 90.0 75.0 63273 111.75 30.0 1
## 5 30 26263 90.0 75.0 65196 109.08 29.3 1
## 6 31 24299 90.0 75.0 64789 105.36 28.1 1
PART A: DATA OUTLIERS AND SEASONALITY EFFECT
First get familiar with the data and what each columns represent. A description of the data is provided in a seperate sheet called ‘Desc’ in the same Excel file. Refer to Lab05 for early exercise using Tableau.
In a new Tableau sheet
TASK 1: Plot Sales (Rows) versus Week (Columns). Include a snapshot here. (To save the image go to Worksheet, select Export, Image, Save.) Analyse the data source and explain in clear words the behavior you observe.
ANSWER TASK 1: From the plotting we can observe that sales tended to slowly increase until week 22 where sales dropped to about half of its peak (50K to 23K). The sales increased again at week 25 and came back to their peak at week 26, increased a bit more until week 30 and then went slowly decreased.
TASK 2: Switch from SUM(Sales) to Average AVG(Sales) by clicking the arrow on the right of Sum(Sales), select Measure and Average. Change the Sales scale to be more reflective of the data (double click on the Avg. Sales axis, under Range select Fixed, change the Fixed Start and Fixed End then close the window). Include a snapshot here. Explain the new behavior relative to Task 1).
ANSWER TASK 2: The new behavior is an overall increase in the average sales with a lot of variations in the curve. The sales peak is still on week 30 and the average sales has a big drop between week 22 and 24 which is in accordance to the behavior observed in task 1
TASK 3: Add Temp to the Color scale found in Marks. Change SUM(Temp) to AVG(Temp). Edit the color legend to be more reflective of hot and cold temperatures (by clicking on Color, Edit Color, select relevant color, Applu, OK). Include a snapshot here. Explain the combined behavior of sales and temperature.
ANSWER TASK 3: As the temperature increases, the sales are getting closer to the sales peak
PART B: RELATIONSHIPS AND IMPACTS
In a seperate Tableau sheet
TASK 4: Plot Sales (Rows) versus TV (Columns). Switch both measures from SUM() to Dimension. The plot should look more like a scatter plot. Include a snapshot here. Explain the behavior of Sales versus TV. How much you think is the upper limit amount that should be invested in TV ads?
ASNWER TASK 4: As we can see on the scatter plot, spending on TV ads increase the sales, however, having spending over 100K doesn’t seem to have a significant increase in the sales.
TASK 5: Overlay Radio to the previous plot using the Size scale found in Marks. Include a snapshot here. Explain how the additional Radio ads to Tv ads is impacting Sales.
ANSWER TASK 5: Radio ads spending seems to vary less so it is harder to interpret, however, we can see that high spending in TV ads is usually associated with high spendings in Radio ads.The radio ads don’t seem as significant as the TV ads looking at the patterns.
In a separate Tableau sheet
TASK 6: Plot Sales versus Fuel Volume. Switch both measures from SUM() to Dimension. Explain behavior.
ANSWER TASK 7: As we can see with the temperature, the Fuel Volume tends to increase as temperature increase which shows a positive correlation between sales, Fuel Volume and temperature.
TASK 8: Overlay Holiday using the Label scale. Include a snapshot here. Explain the new combined behavior and the impact of Holiday.
ASNWER TASK 8: Most sales are made during Holiday and has the same behavior has the variables we just added
In a separate sheet
TASK 9: Use a Tree Map to best show the combined effect of Sales, Fuel Volume, Temp, and Holiday. A sample view is shown below. Consider using the Quick Filter on Holiday and Temp to isolate and better view the impact of each. You can have more than one filter at a time. Include a snapshot here.
TASK 10: Write a small paragraph summarizing your final conclusions on what you think most affect Sales and under what conditions.
ASNWER TASK 10: I believe the increase in sales is mainly due to the increase in temperature as well as TV ads and Radio ads, fuel consumption is a cause of the increase in Sales rather than merely correlated I believe. Holiday has an impact but its impact might actually be due to the increase in temperature, which would mean that the effect of temperature is being absorbed by holiday in a regression. Concerning TV and Radios ads, the effect of radios seem more ambiguous than the effect of TV ads.