In this section, we will be using Tableau to learn concepts on data outliers, seasonality effect, and the relationships and impacts. There is no R coding in this lab session.
This worksheet will be used to capture your images from Tableau and to share your observations. Example of capturing and including an image is included at the end of this sheet for your reference. You will need to log onto Tableau and Connect/Import the file EuroStore.xls found in the ‘bsad_lab10’ folder.
Remember to always set your working directory to the source file location. Go to ‘Session’, scroll down to ‘Set Working Directory’, and click ‘To Source File Location’. Read carefully the below and follow the instructions to complete the tasks and answer any questions. Submit your work to RPubs as detailed in previous notes.
For your assignment you may be using different data sets than what is included here. Always read carefully the instructions on Sakai. Tasks/questions to be completed/answered are highlighted in larger bolded fonts and numbered according to their particular placement in the task section.
### mydata = read.csv("data/EuroStore.csv")
### head(mydata)
First get familiar with the data and what each columns represent. A description of the data is provided in a seperate sheet called ‘Desc’ in the same Excel file. Refer to Lab05 for early exercise using Tableau.
In a new Tableau sheet
Beginning with Week 22 and ending with Week 25 there is a sharp decrease in sales. This is because the dataset contains duplicates for all of the weeks expect weeks 22-25. Therefore, the sales number is much lower, about half that of the other weeks as sales are only counted once, not twice as with all the other 52 weeks.
The sales data is represented as a AVG now instead of SUM therefore the values do not drop during weeks 22-25. This change accounts for the duplicate weeks and shows that sales do not decrease during weeks 22-25 as displayed in Task 1.
In a seperate Tableau sheet
Based off this plot, there are many weeks where there is no investment in TV ads yet there are still high sales amounts. TV ads have a relatively low correlation with sales and therefore a large amount should not be invested in TV ads. The upper limit should be 100. There are few sales numbers above 100 and they are very dispersed. It would be best to keep the value invested below 100.
From this plot it is observed that radio ads are more beneficial to sales than tv ads. Adding radio ads increases sales.
In a separate Tableau sheet
There is a slight positive correlation between Sales and Fuel Volume. Generally, as Sales increase, fuel volume does as well.
Sales correlates with temperature. As the temperature increases, so do sales as portrayed in the plot. When the temperature is cooler, there are generally fewer sales.
When we overlay Holiday, we can see that Holidays have a large impact on Sales. Many of the days with highest sales are days that are also holidays.
In a separate sheet
I believe temperature has the greatest impact on sales for this gas station. This is because for every different task, upon adding temperature, it was evident that the warmer the temperature the more sales were made. TV Ads and Radio Ads do not have a large impact on sales. The second largest variable that has an impact on sales is holidays. The highest selling days were generally warm holidays.