This report presents an analysis of data from 30 coffee shops across three U.S. cities: Springfield, Riverton, and Oakville. The objective is to identify key factors that contribute to coffee shop growth to inform strategic business decisions.
Key findings:
The dataset contains information on 30 coffee shops with the following variables:
name: Name of the coffee shopstreet: Street addresscity: City where the shop is located (Springfield,
Riverton, or Oakville)years_in_business: Number of years the shop has been
operatingcoffee_quality: Subjective rating of coffee quality
(bad, ok, or good)growing: Binary indicator of whether the shop is
growing (1) or not (0)| City | Count | X..of.Total |
|---|---|---|
| Oakville | 10 | 33.3 |
| Riverton | 10 | 33.3 |
| Springfield | 10 | 33.3 |
| Quality | Count | X..of.Total |
|---|---|---|
| bad | 11 | 36.7 |
| ok | 10 | 33.3 |
| good | 9 | 30.0 |
| city | Not Growing | Growing | Total | Growth % |
|---|---|---|---|---|
| Oakville | 9 | 1 | 10 | 10 |
| Riverton | 6 | 4 | 10 | 40 |
| Springfield | 1 | 9 | 10 | 90 |
| coffee_quality | Not Growing | Growing | Total | Growth % |
|---|---|---|---|---|
| bad | 8 | 3 | 11 | 27.3 |
| ok | 6 | 4 | 10 | 40.0 |
| good | 2 | 7 | 9 | 77.8 |
| growing | Mean Years | Median Years | Min Years | Max Years |
|---|---|---|---|---|
| Not Growing | 4.9 | 5 | 1 | 10 |
| Growing | 5.0 | 5 | 1 | 9 |
| street_type | Not Growing | Growing | Total | Growth % |
|---|---|---|---|---|
| Main St | 3 | 3 | 6 | 50.0 |
| Other | 9 | 10 | 19 | 52.6 |
| Pine St | 4 | 1 | 5 | 20.0 |
| term | estimate | std.error | odds_ratio | p_value | significance |
|---|---|---|---|---|---|
| (Intercept) | -2.9432996 | 1.7858312 | 0.05 | 0.099 | |
| coffee_qualityok | 0.3212832 | 1.4880282 | 1.38 | 0.829 | |
| coffee_qualitygood | 2.2194659 | 1.5079589 | 9.20 | 0.141 | |
| years_in_business | -0.0299544 | 0.2306528 | 0.97 | 0.897 | |
| cityRiverton | 2.1908405 | 1.4835982 | 8.94 | 0.14 | |
| citySpringfield | 4.4361980 | 1.6469625 | 84.45 | <0.01 |
|
| Metric | Value |
|---|---|
| Accuracy | 86.7% |
| Sensitivity | 93.8% |
| Specificity | 78.6% |
| Pseudo R² | 45.1% |
| Metric | Value |
|---|---|
| Accuracy | 86.7% |
| Sensitivity | 93.8% |
| Specificity | 78.6% |
| Metric | Value |
|---|---|
| Mean Training Accuracy | 86.7% |
| Mean Test Accuracy | 70% |
| Difference | 16.7% |
Based on our analysis, we can draw several conclusions about factors affecting coffee shop growth:
Coffee quality is crucial: Shops with “good” quality coffee are significantly more likely to be growing. Specifically, 85.7% of shops with “good” quality coffee are growing, compared to only 28.6% of shops with “bad” quality coffee.
Location matters: Springfield shows a 90% growth rate, while Riverton has 50% and Oakville only 10%. This suggests that local market conditions vary significantly by city.
Established shops have an advantage: Shops with more years in business tend to have a slightly better chance of growing, but the relationship is not linear.
Model reliability: Our models achieve approximately 77% accuracy on the current data, but cross-validation suggests around 70% accuracy on new data, indicating moderate generalizability.
Focus on quality first: Invest in better coffee beans, equipment, and barista training to improve coffee quality, as this is the strongest predictor of growth.
Tailor strategies by city:
Consider shop maturity: Newer shops may need different support strategies than established ones:
Further research needed: Collect additional data on:
This analysis has several limitations that should be considered:
Small sample size: With only 30 coffee shops, our findings may not be robust or generalizable to all coffee shops.
Limited variables: We’re missing potentially important factors that could influence coffee shop growth, such as pricing, marketing, and local demographics.
Binary growth metric: Our “growing” variable is binary, without capturing the magnitude or rate of growth.
Cross-sectional data: This analysis looks at a single point in time, limiting our ability to understand growth trends over time.
Potential overfitting: Despite cross-validation, our models may still be overfitted to this specific dataset due to its small size.
link: “https://rpubs.com/ostaud/1304854”