South Dakota State University

Assignment 12

Ecuadorian Sales Analysis

Judge Eli and Valerie Janis

STAT-442-ST1

FA2024 Semester

November 25, 2024

Abstract

Provide a concise snapshot summary of the entire document, giving the reader a quick overview of the contents. This helps readers understand the scope and significance before diving into details. It typically covers the main objectives, methods, results, and conclusions. Offers enough information to help readers determine whether the content is relevant to their needs. It should be able to convey the essence of the document without requiring further context.





Dataset Overview

This dataset contains information about millions of sales sold at Favorita stores located in Ecuador. The data includes dates, store and product information, whether that item was being promoted, as well as the sales and transaction numbers.

Variables

Additional Notes

  • Wages in the public sector are paid every two weeks on the 15th and on the last day of the month. Supermarket sales could be affected by this.

  • A magnitude 7.8 earthquake struck Ecuador on April 16, 2016. People rallied in relief efforts donating water and other first need products which greatly affected supermarket sales for several weeks after the earthquake.


Dataset Structure

Data Frame Summary

dataset

Dimensions: 3000888 x 14
Duplicates: 0
Variable Stats / Values Freqs (% of Valid) Graph Missing
id [character]
1. 0
2. 1
3. 10
4. 100
5. 1000
6. 10000
7. 100000
8. 1000000
9. 1000001
10. 1000002
[ 3000878 others ]
1(0.0%)
1(0.0%)
1(0.0%)
1(0.0%)
1(0.0%)
1(0.0%)
1(0.0%)
1(0.0%)
1(0.0%)
1(0.0%)
3000878(100.0%)
0 (0.0%)
date [Date]
min : 2013-01-01
med : 2015-04-24
max : 2017-08-15
range : 4y 7m 14d
1684 distinct values 0 (0.0%)
store_nbr [numeric]
Mean (sd) : 27.5 (15.6)
min ≤ med ≤ max:
1 ≤ 27.5 ≤ 54
IQR (CV) : 27 (0.6)
54 distinct values 0 (0.0%)
product_category [character]
1. AUTOMOTIVE
2. BABY CARE
3. BEAUTY
4. BEVERAGES
5. BOOKS
6. BREAD/BAKERY
7. CELEBRATION
8. CLEANING
9. DAIRY
10. DELI
[ 23 others ]
90936(3.0%)
90936(3.0%)
90936(3.0%)
90936(3.0%)
90936(3.0%)
90936(3.0%)
90936(3.0%)
90936(3.0%)
90936(3.0%)
90936(3.0%)
2091528(69.7%)
0 (0.0%)
sales [numeric]
Mean (sd) : 357.8 (1102)
min ≤ med ≤ max:
0 ≤ 11 ≤ 124717
IQR (CV) : 195.8 (3.1)
379610 distinct values 0 (0.0%)
items_promoted [numeric]
Mean (sd) : 2.6 (12.2)
min ≤ med ≤ max:
0 ≤ 0 ≤ 741
IQR (CV) : 0 (4.7)
362 distinct values 0 (0.0%)
transactions [numeric]
Mean (sd) : 1694.6 (963.3)
min ≤ med ≤ max:
5 ≤ 1393 ≤ 8359
IQR (CV) : 1033 (0.6)
4993 distinct values 245784 (8.2%)
city [character]
1. Quito
2. Guayaquil
3. Cuenca
4. Santo Domingo
5. Ambato
6. Latacunga
7. Machala
8. Manta
9. Babahoyo
10. Cayambe
[ 12 others ]
1000296(33.3%)
444576(14.8%)
166716(5.6%)
166716(5.6%)
111144(3.7%)
111144(3.7%)
111144(3.7%)
111144(3.7%)
55572(1.9%)
55572(1.9%)
666864(22.2%)
0 (0.0%)
state [character]
1. Pichincha
2. Guayas
3. Azuay
4. Manabi
5. Santo Domingo de los Tsac
6. Cotopaxi
7. El Oro
8. Los Rios
9. Tungurahua
10. Bolivar
[ 6 others ]
1055868(35.2%)
611292(20.4%)
166716(5.6%)
166716(5.6%)
166716(5.6%)
111144(3.7%)
111144(3.7%)
111144(3.7%)
111144(3.7%)
55572(1.9%)
333432(11.1%)
0 (0.0%)
store_type [character]
1. A
2. B
3. C
4. D
5. E
500148(16.7%)
444576(14.8%)
833580(27.8%)
1000296(33.3%)
222288(7.4%)
0 (0.0%)
store_cluster [character]
1. 3
2. 10
3. 6
4. 15
5. 13
6. 14
7. 1
8. 11
9. 4
10. 8
[ 7 others ]
389004(13.0%)
333432(11.1%)
333432(11.1%)
277860(9.3%)
222288(7.4%)
222288(7.4%)
166716(5.6%)
166716(5.6%)
166716(5.6%)
166716(5.6%)
555720(18.5%)
0 (0.0%)
oil_price [numeric]
Mean (sd) : 67.9 (25.7)
min ≤ med ≤ max:
26.2 ≤ 53.3 ≤ 110.6
IQR (CV) : 49.4 (0.4)
1352 distinct values 0 (0.0%)
event_type [character]
1. Disaster
2. Holiday
3. Shopping Event
4. Sport Event
3069(1.5%)
142725(71.8%)
28281(14.2%)
24783(12.5%)
2802030 (93.4%)
event_description [character]
1. Navidad
2. Independencia
3. Mundial de futbol
4. Carnaval
5. Primer dia del ano
6. Dia de la Madre
7. Fundacion
8. Dia de Difuntos
9. Viernes Santo
10. Dia del Trabajo
[ 5 others ]
35508(17.9%)
32736(16.5%)
24783(12.5%)
17820(9.0%)
17820(9.0%)
17589(8.8%)
9636(4.8%)
8910(4.5%)
8877(4.5%)
8811(4.4%)
16368(8.2%)
2802030 (93.4%)

Generated by summarytools 1.0.1 (R version 4.4.0)
2024-11-21





Exploratory Data Analysis



1) Map of Ecuador Showing Store Locations by City Population

Geographically map the store locations across Ecuador while incorporating population data for each city.

Interpretation

The map displays store locations across Ecuador, with point sizes representing city populations. Larger points, concentrated in cities like Quito and Guayaquil, indicate higher populations and likely strategic placement to maximize customer reach. Smaller points highlight stores in less populated areas, suggesting limited market demand or potential opportunities for expansion. By combining store locations and population data, the visualization effectively shows coverage and identifies regions that may benefit from further market exploration.



3) Average Daily Sales by Product Category

Identify the contribution of different product categories to average daily sales.

Interpretation

The bar chart illustrates the average daily sales for various product categories, highlighting Grocery I as the top-performing category, with significantly higher sales compared to others. Categories such as Beverages, Produce, and Cleaning also contribute substantially, while some categories like Automotive and Baby Care have minimal sales. This breakdown helps prioritize key product categories that drive revenue and identify low-performing categories that may require strategic adjustments or promotion.



4) Average Daily Sales Per Year by Store Type

Compare yearly growth in average daily sales across different store types.

Interpretation

This line chart shows the trend of average daily sales for each store type from 2013 to 2017. Store Type A consistently outperforms others, with significant growth over the years, while Types B, C, and D follow with more gradual increases. Store Type E shows the slowest growth, maintaining relatively low sales levels. This indicates that Store Type A may cater to higher demand or operate in more profitable locations, while lower-performing types may benefit from targeted strategies to boost sales. The chart highlights the importance of store type in overall revenue growth.



Quantitative Analysis



6) Relationship Between Sales and Promotions

Determine the correlation between average sales and promotional activity over time.

## [1] "Correlation between sales and promotions: 0.575"

Interpretation

The scatter plot shows a positive correlation (r = 0.575) between average sales and the number of promotions, indicating that higher promotional activity generally leads to increased sales. The line chart on the right demonstrates the growth of promotions over time, with a noticeable increase starting around the middle of 2014. While promotions are effective in boosting sales, the correlation suggests other factors may also influence sales. This analysis highlights the importance of strategic promotions in driving revenue but also suggests the need to explore complementary strategies.


7) Relationship Between Sales and Oil Prices

Assess the impact of oil price fluctuations on average sales over time.

## [1] "Correlation between sales and oil prices: -0.628"

Interpretation

The scatter plot indicates a negative correlation (r = -0.628) between average sales and oil prices, suggesting that higher oil prices are associated with lower sales. The line chart shows a significant drop in oil prices from mid 2014 to 2016, which may have contributed to economic challenges influencing consumer spending. This relationship highlights the sensitivity of sales to fluctuations in oil prices, particularly in oil-dependent economies like Ecuador. Understanding this correlation can help businesses anticipate and adapt to market conditions during periods of volatile oil prices.





Conclusion

Recaps the key findings, interpretations, and implications at the end of the document, tying everything together. This provides readers with a final understanding of the work and its contributions. Summarize the main findings and emphasize their importance. Explain the broader significance of the results, highlighting how they address the original objectives or questions. Offer closure by summarizing the conclusions drawn from the analysis, suggesting potential future directions or applications.