Introduction

Regork grocery chain caters to a wide demographic. Based on the data, it would be best to change that strategy so that the optimizations can improve our customer satisfaction and financial metrics. We have analyzed the most important customer demographics based on income bracket. I propose we optimize various departments of this particular store to cater to the main customer demographic.

Packages Used

tidyverse - a collection of packages for data wrangling, exploration and visualization

completejourney - data set for Regork grocery store

Lubridate - allows date/time calculations

ggplot2 - visualization tool for graphs and charts

library(completejourney)
library(tidyverse)
library(lubridate)
library(ggplot2)
library(dplyr)

Data Preparation/Exploration

Sorting total sales by customer income

Preparing dataset

Prod <- products
tran <- get_transactions()
demo <- demographics

X1 <- inner_join(tran, Prod, by = "product_id") 
D1 <- inner_join(X1, demo, by = "household_id") 

Analyzing Data

D2 <- D1 %>%
  group_by(income) %>%
  summarise(total_sales = sum(sales_value, na.rm = TRUE)) %>%
  arrange(desc(total_sales)) %>% 
  mutate(SUM = sum(total_sales), Percent = (total_sales/SUM)*100)

Visualize Data

Summary

The problem identified through exploratory analysis was that this particular store was not catering it’s main customer demographic. Customers making $35K-$99k make up 55% of total sales. Customers making less than $35k made up 21% of total sales. Customers making more than $99k made up 26% of total sales. The solution is to optimize inventory, schedule, and other attributes of the store to focus on low-middle class customers. From now on the target demographic are customers making $1k-99k. Our strategy should focus on conforming our product types to this income bracket with decent packaging to appeal to higher income brackets. We need to reevaluate the products in soft drinks, beef, milk, cheese, frozen dinners, snacks, bread, beer, frozen pizza and deli meats and make sure that it appeals to our target income bracket. And optimize our staff scheduling/supply chain management to account for the most common times our target demographic shops which is - Sunday, Monday, Friday and Saturday from 4PM - 7PM. We could improve this analysis by obtaining more attributes regarding product inventory and customer demographics.