South Dakota State University

Assignment 12

Ecuadorian Sales Analysis

Judge Eli and Valerie Janis

STAT-442-ST1

FA2024 Semester

November 25, 2024

Abstract

This report explores sales patterns and trends for Favorita stores across Ecuador using extensive transactional data. The analysis examines factors such as geographical distribution of stores, temporal sales trends, product category performance, and store type growth. Quantitative analyses investigate the impact of promotions and oil price fluctuations on sales. Key findings include the importance of population density in store location strategy, the seasonal and yearly trends driving sales, and the significant influence of promotional activity on revenue. Additionally, oil price volatility is shown to negatively impact sales, underscoring the economic sensitivity of the region. This report provides actionable insights for optimizing marketing strategies, inventory management, and regional store performance.





Dataset Overview

This dataset contains information about millions of sales sold at Favorita stores located in Ecuador. The data includes dates, store and product information, whether that item was being promoted, as well as the sales and transaction numbers.

Variables

Additional Notes

  • Wages in the public sector are paid every two weeks on the 15th and on the last day of the month. Supermarket sales could be affected by this.

  • A magnitude 7.8 earthquake struck Ecuador on April 16, 2016. People rallied in relief efforts donating water and other first need products which greatly affected supermarket sales for several weeks after the earthquake.


Dataset Structure

Data Frame Summary

dataset

Dimensions: 3000888 x 14
Duplicates: 0
Variable Stats / Values Freqs (% of Valid) Graph Missing
id [character]
1. 0
2. 1
3. 10
4. 100
5. 1000
6. 10000
7. 100000
8. 1000000
9. 1000001
10. 1000002
[ 3000878 others ]
1(0.0%)
1(0.0%)
1(0.0%)
1(0.0%)
1(0.0%)
1(0.0%)
1(0.0%)
1(0.0%)
1(0.0%)
1(0.0%)
3000878(100.0%)
0 (0.0%)
date [Date]
min : 2013-01-01
med : 2015-04-24
max : 2017-08-15
range : 4y 7m 14d
1684 distinct values 0 (0.0%)
store_nbr [numeric]
Mean (sd) : 27.5 (15.6)
min ≤ med ≤ max:
1 ≤ 27.5 ≤ 54
IQR (CV) : 27 (0.6)
54 distinct values 0 (0.0%)
product_category [character]
1. AUTOMOTIVE
2. BABY CARE
3. BEAUTY
4. BEVERAGES
5. BOOKS
6. BREAD/BAKERY
7. CELEBRATION
8. CLEANING
9. DAIRY
10. DELI
[ 23 others ]
90936(3.0%)
90936(3.0%)
90936(3.0%)
90936(3.0%)
90936(3.0%)
90936(3.0%)
90936(3.0%)
90936(3.0%)
90936(3.0%)
90936(3.0%)
2091528(69.7%)
0 (0.0%)
sales [numeric]
Mean (sd) : 357.8 (1102)
min ≤ med ≤ max:
0 ≤ 11 ≤ 124717
IQR (CV) : 195.8 (3.1)
379610 distinct values 0 (0.0%)
items_promoted [numeric]
Mean (sd) : 2.6 (12.2)
min ≤ med ≤ max:
0 ≤ 0 ≤ 741
IQR (CV) : 0 (4.7)
362 distinct values 0 (0.0%)
transactions [numeric]
Mean (sd) : 1694.6 (963.3)
min ≤ med ≤ max:
5 ≤ 1393 ≤ 8359
IQR (CV) : 1033 (0.6)
4993 distinct values 245784 (8.2%)
city [character]
1. Quito
2. Guayaquil
3. Cuenca
4. Santo Domingo
5. Ambato
6. Latacunga
7. Machala
8. Manta
9. Babahoyo
10. Cayambe
[ 12 others ]
1000296(33.3%)
444576(14.8%)
166716(5.6%)
166716(5.6%)
111144(3.7%)
111144(3.7%)
111144(3.7%)
111144(3.7%)
55572(1.9%)
55572(1.9%)
666864(22.2%)
0 (0.0%)
state [character]
1. Pichincha
2. Guayas
3. Azuay
4. Manabi
5. Santo Domingo de los Tsac
6. Cotopaxi
7. El Oro
8. Los Rios
9. Tungurahua
10. Bolivar
[ 6 others ]
1055868(35.2%)
611292(20.4%)
166716(5.6%)
166716(5.6%)
166716(5.6%)
111144(3.7%)
111144(3.7%)
111144(3.7%)
111144(3.7%)
55572(1.9%)
333432(11.1%)
0 (0.0%)
store_type [character]
1. A
2. B
3. C
4. D
5. E
500148(16.7%)
444576(14.8%)
833580(27.8%)
1000296(33.3%)
222288(7.4%)
0 (0.0%)
store_cluster [character]
1. 3
2. 10
3. 6
4. 15
5. 13
6. 14
7. 1
8. 11
9. 4
10. 8
[ 7 others ]
389004(13.0%)
333432(11.1%)
333432(11.1%)
277860(9.3%)
222288(7.4%)
222288(7.4%)
166716(5.6%)
166716(5.6%)
166716(5.6%)
166716(5.6%)
555720(18.5%)
0 (0.0%)
oil_price [numeric]
Mean (sd) : 67.9 (25.7)
min ≤ med ≤ max:
26.2 ≤ 53.3 ≤ 110.6
IQR (CV) : 49.4 (0.4)
1352 distinct values 0 (0.0%)
event_type [character]
1. Disaster
2. Holiday
3. Shopping Event
4. Sport Event
3069(1.5%)
142725(71.8%)
28281(14.2%)
24783(12.5%)
2802030 (93.4%)
event_description [character]
1. Navidad
2. Independencia
3. Mundial de futbol
4. Carnaval
5. Primer dia del ano
6. Dia de la Madre
7. Fundacion
8. Dia de Difuntos
9. Viernes Santo
10. Dia del Trabajo
[ 5 others ]
35508(17.9%)
32736(16.5%)
24783(12.5%)
17820(9.0%)
17820(9.0%)
17589(8.8%)
9636(4.8%)
8910(4.5%)
8877(4.5%)
8811(4.4%)
16368(8.2%)
2802030 (93.4%)

Generated by summarytools 1.0.1 (R version 4.4.0)
2024-11-21





Exploratory Data Analysis



1) Map of Ecuador Showing Store Locations by City Population

Geographically map the store locations across Ecuador while incorporating population data for each city.

Interpretation

The map displays store locations across Ecuador, with point sizes representing city populations. Larger points, concentrated in cities like Quito and Guayaquil, indicate higher populations and likely strategic placement to maximize customer reach. Smaller points highlight stores in less populated areas, suggesting limited market demand or potential opportunities for expansion. By combining store locations and population data, the visualization effectively shows coverage and identifies regions that may benefit from further market exploration.



3) Average Daily Sales by Product Category

Identify the contribution of different product categories to average daily sales.

Interpretation

The bar chart illustrates the average daily sales for various product categories, highlighting Grocery I as the top-performing category, with significantly higher sales compared to others. Categories such as Beverages, Produce, and Cleaning also contribute substantially, while some categories like Automotive and Baby Care have minimal sales. This breakdown helps prioritize key product categories that drive revenue and identify low-performing categories that may require strategic adjustments or promotion.



4) Average Daily Sales Per Year by Store Type

Compare yearly growth in average daily sales across different store types.

Interpretation

This line chart shows the trend of average daily sales for each store type from 2013 to 2017. Store Type A consistently outperforms others, with significant growth over the years, while Types B, D, and E follow with more gradual increases. Store Type C shows the slowest growth, maintaining relatively low sales levels. This indicates that Store Type A may cater to higher demand or operate in more profitable locations, while lower-performing types may benefit from targeted strategies to boost sales. The chart highlights the importance of store type in overall revenue growth.



Quantitative Analysis



6) Relationship Between Sales and Promotions

Determine the correlation between average sales and promotional activity over time.

## [1] "Correlation between sales and promotions: 0.575"

Interpretation

The scatter plot shows a positive correlation (r = 0.575) between average sales and the number of promotions, indicating that higher promotional activity generally leads to increased sales. The line chart on the right demonstrates the growth of promotions over time, with a noticeable increase starting around the middle of 2014. While promotions are effective in boosting sales, the correlation suggests other factors may also influence sales. This analysis highlights the importance of strategic promotions in driving revenue but also suggests the need to explore complementary strategies.


7) Relationship Between Sales and Oil Prices

Assess the impact of oil price fluctuations on average sales over time.

## [1] "Correlation between sales and oil prices: -0.628"

Interpretation

The scatter plot indicates a negative correlation (r = -0.628) between average sales and oil prices, suggesting that higher oil prices are associated with lower sales. The line chart shows a significant drop in oil prices from mid 2014 to 2016, which may have contributed to economic challenges influencing consumer spending. This relationship highlights the sensitivity of sales to fluctuations in oil prices, particularly in oil-dependent economies like Ecuador. Understanding this correlation can help businesses anticipate and adapt to market conditions during periods of volatile oil prices.





Conclusion

This analysis highlights several critical insights for Favorita stores in Ecuador. Store locations are strategically aligned with population density, but underserved regions offer potential for expansion. Temporal trends reveal the importance of weekends and December for maximizing sales, while year-over-year growth reflects successful long-term strategies. Product categories like Grocery I and store types like Type A consistently outperform others, providing a focus for revenue optimization. Promotions are shown to positively correlate with sales, but oil price volatility poses challenges to consumer spending. Addressing these economic sensitivities and leveraging high-performing categories and strategies can drive future growth. Further research could explore specific regional needs and consumer behaviors to refine these strategies.