Running Ahead: An Examination of Adidas’ Sales Trends Across US Regions

2023-03-08

Giyani Shangase


This analysis seeks to learn valuable insights from the Adidas Sales dataset, which shows sales performance of different product categories across the 5 regions of the USA and 6 retailers which are represented in the data.

The goal of this analysis is to understand consumer buying trends, gain insights into which product categories perform well in different retailers, and to understand how these trends differ throughout the USA.

The key metrics for this study are:
• Total Sales by retailer
• Total Profit by retailers
• Sales performance of different sales categories
• Operating margin of different products 

Business Questions
There are two business questions to be answered in this analysis:
1. Which retailer had the highest total sales and profit for each region and what was their profit margin for Adidas products?
2. Which product categories have the highest sales and profit margin in each region? 

The answers to these questions can be used to:
• Identify high-performing retailers and understand what drives their sales performance, and assess whether this information can be used to improve sales and profitability across all retailers.
• Inform sales and marketing strategy based on performance of different product categories across all regions.
• Inform inventory and staffing decisions based on past sales performance and costs of doing business for each retailer.

Data Source
The data used in this analysis is available publicly on Kaggle.com. This dataset contains all of the key information needed to answer the business questions above, and to provide further insights into sales performance and profit across all retailers and regions.

Source: https://www.kaggle.com/datasets/heemalichaudhari/adidas-sales-dataset

The data is organised as follows:
• Retailer ID
• Invoice date
• Region
• State
• City
• Product category
• Price per unit
• Units sold
• Total sales
• Operating profit
• Operating margin
• Sales method 

There are no privacy, security or licensing issues with this dataset as it is publicly available. The dataset has been used by many other analysts and cited repeatedly which is a verification of the integrity of the data.
This dataset helps to answer the business question as it has the key metrics, measured by retailer and region for all sales across the United States.
The dataset contains data for two years, 2020 and 2021. I have decided to use only 2021 sales as I feel this is a more accurate representation of sales under normal conditions. Sales in 2020 were heavily skewed towards retailers who operated online across all industries due to the pandemic, making the sales data from the year less useful for predicting consumer behaviour in other years.

Exploratory Data Analysis

Tools

All analysis and visualisations in this project was done on R and R Studio.

The following packages were used in the data cleaning process:
• Tidyverse
• Lubridate
• Skimr
• Here
• Janitor

Load data

library(readxl)
sales_data <- read_excel("Adidas US Sales Datasets.xlsx")

Data Cleaning

Data cleaning for this project involved the following steps:

Loading Packages for cleaning and analysis

library(tidyverse)
library(skimr)
library(here)
library(janitor)
library(lubridate)
library(knitr)
library(scales)
library(gridExtra)

• Renaming column names using the rename() function to ensure uniformity and consistency, making it clear which variable each column is showing.

sales_data <- rename(sales_data, total_sales = `Total Sales`,
                     operating_profit = `Operating Profit`,
                     retailer_id = `Retailer ID`, 
                     invoice_date = `Invoice Date`,
                     price_per_unit = `Price per Unit`,
                     units_sold = `Units Sold`,
                     operating_margin = `Operating Margin`,
                     sales_method = `Sales Method`)

sales_data <- rename(sales_data, region = Region,
                     retailer = Retailer,
                     state = State,
                     city = City,
                     product = Product)

• Creating a new column showing the operating margin as a percentage. The original column shows the value as a decimal value with 100% being 1.0.

sales_data <- sales_data %>% 
  mutate(margin_percentage = operating_margin*100)

• Using as.Date() function to change date to correct format.

 sales_data <- sales_data %>% 
  select(retailer, total_sales, region, product, operating_margin, invoice_date, sales_method, operating_profit, retailer_id) %>% 
  filter(invoice_date >= as.Date("2021-01-01"))

• Using unique() function to remove duplicates.

 sales_data <- unique(sales_data) 

•Viewing column names and structure of data with colnames() and skim_without_charts() functions.

colnames(sales_data)
skim_without_charts(sales_data)
Retailer Sales Performance Analysis

• Creating sales_2021 data frame

sales_2021 <- sales_data %>% 
  select(retailer, total_sales, region, product, operating_margin, invoice_date, 
         sales_method, operating_profit, retailer_id) %>% 
  filter(invoice_date >= as.Date("2021-01-01"))
View(sales_2021)

• Calculating total sales and profit for each retailer by region

retailer_sales_2021 <- sales_2021 %>% 
  group_by(region, retailer) %>% 
  summarize(total_sales_millions = sum(total_sales)/1000000,
            operating_profit_millions = sum(operating_profit/1000000),
            operating_margin_percent = mean(operating_margin)*100) %>% 
  print(n = 30)

• Calculating retailers with the highest sales in each region

top_product_sales_2021 <- sales_2021 %>% 
  group_by(region, retailer) %>% 
  summarize(total_sales_millions = sum(total_sales)/1000000,
            operating_profit_millions = sum(operating_profit)/1000000,
            operating_margin_percent = mean(operating_margin*100)) %>% 
  group_by(region) %>% 
  slice_max(total_sales_millions, n = 1, with_ties = FALSE) %>% 
  select(region, retailer, total_sales_millions, operating_margin_percent, operating_profit_millions) %>% 
  arrange(-total_sales_millions)
 knitr::kable(top_product_sales_2021)
region retailer total_sales_millions operating_margin_percent operating_profit_millions
West Kohl’s 62.30212 40.26727 21.72853
South Sports Direct 60.44007 50.58555 28.06469
Southeast Foot Locker 50.10478 41.93243 18.69212
Midwest Foot Locker 41.36783 43.49574 15.87856
Northeast Foot Locker 40.10161 41.88905 14.97412

• Calculating retailers with the highest operating margin in each region

top_product_margin_2021 <- sales_2021 %>% 
  group_by(region, retailer) %>% 
  summarize(total_sales_millions = sum(total_sales)/1000000,
            operating_profit_millions = sum(operating_profit)/1000000,
            operating_margin_percent = mean(operating_margin*100)) %>% 
  group_by(region) %>% 
  slice_max(operating_margin_percent, n = 1, with_ties = FALSE) %>% 
  select(region, retailer, total_sales_millions, operating_margin_percent, operating_profit_millions) %>% 
  arrange(-total_sales_millions)
 knitr::kable(top_product_margin_2021)
region retailer total_sales_millions operating_margin_percent operating_profit_millions
South Sports Direct 60.44007 50.58555 28.064692
Northeast West Gear 31.56155 43.51724 12.265058
Midwest Amazon 16.83587 45.83824 6.833800
West Sports Direct 12.12904 43.32800 4.568511
Southeast Amazon 10.82633 45.05224 4.295094
Product category sales analysis

• Calculating total sales, operating profit and operating margin for each product category across all regions

product_sales_2021 <- sales_2021 %>% 
  group_by(region, product) %>% 
  summarize(total_sales_m = sum(total_sales)/1000000,
            total_profit_m = sum(operating_profit)/1000000,
            operating_margin_percent = mean(operating_margin)*100) %>% 
  group_by(region) %>% 
  select(product, region, operating_margin_percent,total_sales_m, total_profit_m)

• Calculating product categories with the highest sales in each region

top_product_sales_2021 <- sales_2021 %>% 
  group_by(region, product) %>%
summarize(product_sales_millions = sum(total_sales/1000000),
          product_profit_millions = sum(operating_profit/1000000),
          margin_percent = mean(operating_margin*100)) %>% 
  group_by(region) %>% 
  slice_max(product_sales_millions, n = 1, with_ties = FALSE) %>% 
  select(region, product, product_sales_millions, product_profit_millions, margin_percent)
knitr::kable(top_product_sales_2021)
region product product_sales_millions product_profit_millions margin_percent
Midwest Men’s Street Footwear 35.82406 14.09810 44.37500
Northeast Men’s Street Footwear 42.57670 17.15452 45.31373
South Women’s Apparel 24.63541 12.14695 53.57937
Southeast Men’s Street Footwear 29.07879 11.49408 44.58333
West Men’s Street Footwear 39.31607 14.48931 42.32803

• Calculating product categories with highest operating margin in each region

region product product_sales_millions product_profit_millions margin_percent
South Women’s Apparel 24.63541 12.14695 53.57937
Midwest Women’s Apparel 26.73033 11.65632 49.11458
Southeast Women’s Apparel 25.20245 10.71479 45.52778
Northeast Men’s Street Footwear 42.57670 17.15452 45.31373
West Men’s Street Footwear 39.31607 14.48931 42.32803

Key Findings

The insights gained from the analysis above were crucial in answering both business questions posed in this project, as the they provided a clear summary of the story that the data was telling.

  1. Which retailer had the highest total sales and profit for each region? What is their profit margin?


ggplot(data = retailer_sales_2021, aes(x = retailer, y = total_sales_millions, fill = retailer)) +
  geom_col() +
  scale_y_continuous(labels = dollar_format()) +
  labs(x = "Retailer", y = "Sales (millions)", fill = "Retailer") +
  ggtitle("Total Retailer Sales by Region") +
  theme_minimal() +
  theme(legend.position = "bottom",
        plot.title = element_text(size = 14, face = "bold"),
        axis.title.x = element_blank(),
        axis.text.x = element_blank(),
        axis.title.y = element_text(size = 12),
        legend.title = element_blank(),
        legend.text = element_text(size = 10)) +
  facet_wrap(~region)



Kohl’s in the West region had the highest total sales by a single retailer at $62.3 million., with a profit margin of 40.3%.
Footlocker has the most total sales across across Midwest, Southeast and Northeast regions, with a combined value of over $130 million.
Sports Direct in the South region has total sales of $60.4 million, with an operating margin of 50.6% and operating profit of $28.1 million from Adidas products during 2021.

Retailers with the top sales across each region:

• West: Kohl's -    $62.3 million
• South: Sports Direct -    $60.4 million
• Southeast: Foot Locker -    $50.1 million
• Midwest: Foot Locker -    $41.4 million
• Northeast: Foot Locker -    $40.1 million  




Retailers with the top operating margin across each region:

• South: Sports Direct -    50.6%
• Northeast: West Gear -    43.5%
• Midwest: Amazon -   45.8%
• West: Sports Direct -   43.2%
• Southeast: Amazon -   45.1%  




Retailers with the highest profit across each region:

• South: Sports Direct -    $28.1 million
• West: Kohl's -    $21.7 million
• Southeast: Foot Locker -    $18.7 million
• Midwest: Foot Locker -    $15.9 million
• Northeast: Foot Locker -    $15 million 




2. Which product categories have the highest sales and profit margin in each region?




Men’s Street Footwear had the highest sales across 4 of 5 regions. - Northeast, West, Midwest, and Southeast.
Women’s apparel had the highest sales in the Southeast region with $13.1 million in sales

Product categories with the highest sales across each region:

• Northeast: Men's Street Footwear -    $42.6 million
• West: Men's Street Footwear -   $39.3 million
• Midwest: Men's Street Footwear -    $35.8 million
• Southeast: Men's Street Footwear -    $29.1 million
• South: Women's Apparel -    $24.6 million





Women’s apparel is the product category with the highest operating margin across 3 of 5 regions – Northeast, South, Southeast, West
Women’s apparel in the South region had an operating margin of 53.6%.

Product categories with the highest operating margin across each region:

• South: Women's Apparel -    53.6%
• Midwest: Women's Apparel -    49.1%
• Southeast: Women's Apparel -    45.5%
• Northeast: Men's street footwear -    45.3% 
• West: : Men's Street Footwear -   42.3%



Conclusions



My final conclusions based on my analysis are:

• Footlocker has the highest total sales across all regions and retailers. Footlocker sells the most product in 3 of 5 regions.
• Sports direct in the South region has the highest profits among all retailers.
• Kohl’s in the West region had the highest total sales of all retailers.
• Men’s street footwear has the most sales, performing better than other categories in 4 of 5 regions.
• Women’s apparel sells extremely well in the South region, outselling men’s street footwear in this region.
• Women’s has the highest profit margins in 3 of 5 regions


Recommendations



RETAILER INSIGHTS:
• Research what makes Footlocker the retailer with the most sales and seek to develop an understanding of their systems vs other retailers to see why there is a discrepancy. This discrepancy could be caused by external factors, such as:
o Competition
o Regional economy, and
o Product fit and demand for each region.
 
It could also include internal factors that Adidas and the retailer have control of such as:
o Supply chain issues, and
o Marketing, promotion, and sales strategies.


Studying these factors can help the team to understand why footlocker performs better than other retailers, and how parts of their systems and practices can be adopted by other retailers across all regions to improve their sales.

• Investigate which systems and factors make Sports Direct in the South region the most profitable retailer across all regions. There are other factors, apart from the points listed above, that could make Sports Direct the most profitable retailer including:
o Cost of doing business
o Variations in the regional supply chain, transportation, and labour costs
o Distinct customer engagement and sales tactics
o Variation in customer demographics, and product placement strategies
o Difference in pricing strategies.
 
• Study Kohl’s in the West region to understand how they achieve such high sales. This study would need to take into consideration factors about the environment that the business operates in, such as the economy of the region and the business’s target demographic, and factors that the retailer controls such as promotions, marketing, and sales strategies to understand how these contribute to the sales performance.

Studying these factors can help the team to understand why these retailers perform better than other retailers, and how parts of their systems and practices can be adopted by other retailers across all regions to improve their sales and profitability where this is possible.


PRODUCT CATEGORY INSIGHTS:

• Study the way men’s street footwear is approached to understand why it is the highest-selling product range across 4 of the 5 regions. This could help the team understand how other product category sales can be improved. These insights could be used to improve demand for other product categories through the application of similar marketing and promotion strategies.

Factors that could affect the performance of other product categories include:
o Variations in sales and promotions strategies
o Different approaches to product placement and targeting consumer demographics.
o Unique customer engagement methods

• Study women’s apparel to understand why it has the highest profit margin in 3 of 5 regions. Understanding what makes this product category as profitable as it is could help to improve total profitability across all product categories and regions. The study would also need to investigate what drives sales of women’s apparel in the South region as this product category has the highest sales across the South region.

Additional data that could be included in this further analysis includes data about:
•   Regional labour cost and supply chain cost data
•   Data on marketing and promotions for different product categories and regions. 

In conclusion, the next steps that I would suggest our stakeholders take are to study the topics brought up by the sales data insights to understand which factors external factors are affecting sales and which factors are within the control of Adidas and retailers and can be improved. This should give a clear picture of steps that can be taken to improve overall product sales and sales of specific product categories in different regions.



Giyani Shangase
Portfolio website:https://giyanishangase.wixsite.com/portfolio
Linkedin: https://www.linkedin.com/in/giyani-shangase-725055261/
Source: https://www.kaggle.com/datasets/heemalichaudhari/adidas-sales-dataset