Loading and Looking at Data
Channel Region Fresh Milk Grocery Frozen Detergents_Paper Delicassen
1 2 3 12669 9656 7561 214 2674 1338
2 2 3 7057 9810 9568 1762 3293 1776
3 2 3 6353 8808 7684 2405 3516 7844
4 1 3 13265 1196 4221 6404 507 1788
5 2 3 22615 5410 7198 3915 1777 5185
6 2 3 9413 8259 5126 666 1795 1451
7 2 3 12126 3199 6975 480 3140 545
8 2 3 7579 4956 9426 1669 3321 2566
9 1 3 5963 3648 6192 425 1716 750
10 2 3 6006 11093 18881 1159 7425 2098
Channel Region Fresh Milk Grocery Frozen Detergents_Paper Delicassen
431 1 3 3097 4230 16483 575 241 2080
432 1 3 8533 5506 5160 13486 1377 1498
433 1 3 21117 1162 4754 269 1328 395
434 1 3 1982 3218 1493 1541 356 1449
435 1 3 16731 3922 7994 688 2371 838
436 1 3 29703 12051 16027 13135 182 2204
437 1 3 39228 1431 764 4510 93 2346
438 2 3 14531 15488 30243 437 14841 1867
439 1 3 10290 1981 2232 1038 168 2125
440 1 3 2787 1698 2510 65 477 52
'data.frame': 440 obs. of 8 variables:
$ Channel : int 2 2 2 1 2 2 2 2 1 2 ...
$ Region : int 3 3 3 3 3 3 3 3 3 3 ...
$ Fresh : int 12669 7057 6353 13265 22615 9413 12126 7579 5963 6006 ...
$ Milk : int 9656 9810 8808 1196 5410 8259 3199 4956 3648 11093 ...
$ Grocery : int 7561 9568 7684 4221 7198 5126 6975 9426 6192 18881 ...
$ Frozen : int 214 1762 2405 6404 3915 666 480 1669 425 1159 ...
$ Detergents_Paper: int 2674 3293 3516 507 1777 1795 3140 3321 1716 7425 ...
$ Delicassen : int 1338 1776 7844 1788 5185 1451 545 2566 750 2098 ...
Descriptive Statistics for Frozen and Fresh
vars n mean sd median trimmed mad min max range skew
X1 1 440 12000.3 12647.33 8504 9864.61 8776.25 3 112151 112148 2.54
kurtosis se
X1 11.33 602.94
vars n mean sd median trimmed mad min max range skew
X1 1 440 3071.93 4854.67 1526 2144.07 1607.88 25 60869 60844 5.87
kurtosis se
X1 53.8 231.44
Final Chart
Interpretation
Both Fresh and Frozen have some customers who bought from only one category, but most customers bought a combination. The majority of spend was between 30,000 dollars for Fresh and 10,000 dollars for Frozen. Additionally, the hotel channel had the vast majority of high dollar customers. Medium and low spend customers were very split between the two channels.
Revisions
Updated legend with more detailed information for hotel.
Experimented with scale and decided on scale_x_sqrt() (scaled both the x and y axis).
Added str(customers) to the qmd file so that the structure was visible.
Changed alpha level for better visual clarity.
Edited caption for clarity.
Revised graph
Revised Interpretation
Customers tend to buy a combination of fresh and frozen products. The majority of spend was between 30,000 dollars for Fresh and 20,000 dollars for Frozen. Additionally, the hotel channel had the vast majority of the highest spend customers. Medium and low spend customers were very split between the two channels.However, as seen by the line, Hotel spend appears to be higher regardless of food type or spend level.This makes sense given that restaurant ingredients and pricing tend to be higher than say a grocery store.Additionally wealthier idndivials would have greater means to purchase form the hotel channel.