## # A tibble: 6 × 12
## store_id type store_type type2 size price price_per_oz price_per_oz_c taxed
## <dbl> <chr> <dbl> <chr> <dbl> <dbl> <dbl> <dbl> <dbl>
## 1 16 WATER 2 <NA> 33.8 1.69 0.05 5 0
## 2 16 TEA 2 <NA> 23 0.99 0.0430 4.30 1
## 3 16 TEA 2 <NA> 23 0.99 0.0430 4.30 1
## 4 16 WATER 2 <NA> 33.8 1.69 0.05 5 0
## 5 16 MILK 2 LOW F… 128 3.79 0.0296 2.96 0
## 6 16 MILK 2 LOW F… 64 2.79 0.0436 4.36 0
## # ℹ 3 more variables: supp <dbl>, time <chr>, product_id <dbl>
The product information was recorded by trained data collectors who entered the pre-sales tax and pre-bottle fee price information into a database using a tablet computer and paper forms (December 2014) or only paper forms (June 2015 and March 2016). The researchers took several measures to ensure that the data was accurate and representative of the treatment group, such as:
Selecting a targeted sample of stores based on the top six stores where participants most frequently shopped, as reported in the Dietary and Shopping Behavior (DSB) telephone survey, and adding additional stores from random selection within their categories and neighborhoods to ensure diversity and coverage.
Using a standard panel of 70 beverage items based on information on top selling beverages in the Bay Area and nationally and representing the range of beverage categories, and collecting additional beverages in a supplemental panel to account for variations in store inventory.
Using a systematic, standardized process to collect the prices, and double-entering the data collected on paper forms by trained research project assistants into a database and comparing the results.
Limiting the analysis to items matched as to product and store in all three rounds, which comprised 55 products and 313 prices across stores. Some of the data collection issues that they encountered were:
Some stores refused to allow the data collectors to enter prices, and had to be replaced by other stores of the same type and neighborhood.
None of the stores sold all the beverages in the panel, and some stores did not stock any beverages in the standard panel, so the data collectors had to collect prices for beverages in the supplemental panel.
The tablet computers were not used during June 2015 and March 2016 because the data collection was more efficient and conducted more quickly using paper forms.
| Product_ID | Frequency |
|---|---|
| 1 | 3 |
| 2 | 3 |
| 3 | 3 |
| 4 | 3 |
| 5 | 10 |
| 6 | 6 |
| 7 | 3 |
| 8 | 6 |
| 9 | 3 |
| 10 | 2 |
| 11 | 4 |
| 12 | 2 |
| 13 | 2 |
| 14 | 2 |
| 15 | 2 |
| 16 | 15 |
| 17 | 2 |
| 18 | 5 |
| 19 | 3 |
| 20 | 3 |
| 21 | 2 |
| 22 | 3 |
| 23 | 3 |
| 24 | 2 |
| 25 | 3 |
| 26 | 2 |
| 27 | 2 |
| 28 | 1 |
| 29 | 22 |
| 30 | 2 |
| 31 | 1 |
| 32 | 32 |
| 33 | 30 |
| 34 | 1 |
| 35 | 1 |
| 36 | 1 |
| 37 | 2 |
| 38 | 16 |
| 39 | 1 |
| 40 | 17 |
| 41 | 22 |
| 42 | 28 |
| 43 | 36 |
| 44 | 12 |
| 45 | 1 |
| 46 | 1 |
| 47 | 7 |
| 48 | 3 |
| 49 | 2 |
| 50 | 18 |
| 51 | 4 |
| 52 | 2 |
| 53 | 1 |
| 54 | 1 |
| 55 | 1 |
| 56 | 1 |
| 57 | 19 |
| 58 | 39 |
| 59 | 55 |
| 60 | 43 |
| 61 | 38 |
| 62 | 3 |
| 63 | 2 |
| 64 | 2 |
| 65 | 1 |
| 66 | 1 |
| 67 | 2 |
| 68 | 3 |
| 69 | 2 |
| 70 | 1 |
| 71 | 2 |
| 72 | 6 |
| 73 | 2 |
| 74 | 1 |
| 75 | 1 |
| 76 | 18 |
| 77 | 16 |
| 78 | 1 |
| 79 | 18 |
| 80 | 1 |
| 81 | 18 |
| 82 | 4 |
| 83 | 1 |
| 84 | 2 |
| 85 | 7 |
| 86 | 3 |
| 87 | 3 |
| 88 | 3 |
| 89 | 3 |
| 90 | 3 |
| 91 | 14 |
| 92 | 3 |
| 93 | 2 |
| 94 | 1 |
| 95 | 1 |
| 96 | 1 |
| 97 | 42 |
| 98 | 14 |
| 99 | 6 |
| 100 | 30 |
| 101 | 5 |
| 102 | 8 |
| 103 | 8 |
| 104 | 28 |
| 105 | 51 |
| 106 | 10 |
| 107 | 1 |
| 108 | 40 |
| 109 | 3 |
| 110 | 19 |
| 111 | 17 |
| 112 | 44 |
| 113 | 49 |
| 114 | 47 |
| 115 | 2 |
| 116 | 2 |
| 117 | 11 |
| 118 | 2 |
| 119 | 8 |
| 120 | 3 |
| 121 | 1 |
| 122 | 3 |
| 123 | 2 |
| 124 | 2 |
| 125 | 21 |
| 126 | 17 |
| 127 | 1 |
| 128 | 3 |
| 129 | 9 |
| 130 | 23 |
| 131 | 1 |
| 132 | 1 |
| 133 | 10 |
| 134 | 1 |
| 135 | 1 |
| 136 | 2 |
| 137 | 4 |
| 138 | 1 |
| 139 | 1 |
| 140 | 34 |
| 141 | 1 |
| 142 | 2 |
| 143 | 1 |
| 144 | 9 |
| 145 | 2 |
| 146 | 16 |
| 147 | 3 |
| 148 | 3 |
| 149 | 3 |
| 150 | 1 |
| 151 | 3 |
| 152 | 1 |
| 153 | 9 |
| 154 | 1 |
| 155 | 3 |
| 156 | 2 |
| 157 | 2 |
| 158 | 1 |
| 159 | 1 |
| 160 | 22 |
| 161 | 1 |
| 162 | 49 |
| 163 | 9 |
| 164 | 3 |
| 165 | 1 |
| 166 | 1 |
| 167 | 1 |
| 168 | 3 |
| 169 | 3 |
| 170 | 15 |
| 171 | 14 |
| 172 | 42 |
| 173 | 2 |
| 174 | 2 |
| 175 | 3 |
| 176 | 2 |
| 177 | 3 |
| 178 | 1 |
| 179 | 4 |
| 180 | 1 |
| 181 | 15 |
| 182 | 3 |
| 183 | 1 |
| 184 | 1 |
| 185 | 4 |
| 186 | 1 |
| 187 | 4 |
| 188 | 26 |
| 189 | 49 |
| 190 | 35 |
| 191 | 4 |
| 192 | 3 |
| 193 | 53 |
| 194 | 2 |
| 195 | 52 |
| 196 | 35 |
| 197 | 1 |
| 198 | 1 |
| 199 | 1 |
| 200 | 1 |
| 201 | 1 |
| 202 | 1 |
| 203 | 2 |
| 204 | 1 |
| 205 | 1 |
| 206 | 11 |
| 207 | 26 |
| 208 | 2 |
| 209 | 1 |
| 210 | 1 |
| 211 | 2 |
| 212 | 1 |
| 213 | 29 |
| 214 | 43 |
| 215 | 35 |
| 216 | 1 |
| 217 | 1 |
| 218 | 3 |
| 219 | 40 |
| 220 | 8 |
| 221 | 1 |
| 222 | 1 |
| 223 | 2 |
| 224 | 1 |
| 225 | 2 |
| 226 | 1 |
| 227 | 1 |
| 228 | 3 |
| 229 | 3 |
| 230 | 20 |
| 231 | 21 |
| 232 | 40 |
| 233 | 3 |
| 234 | 3 |
| 235 | 3 |
| 236 | 2 |
| 237 | 1 |
| 238 | 1 |
| 239 | 3 |
| 240 | 3 |
| 241 | 1 |
| 242 | 1 |
| 243 | 3 |
| 244 | 4 |
| 245 | 1 |
| 246 | 1 |
| 247 | 1 |
##
## DEC2014 JUN2015
## 1 177 209
## 3 87 102
While the observations are not similar, they are also not far apart (or far apart in this case due to the small data size).
##
## 0 1
## 1 291 253
## 3 132 130
In store 3, yes, the number of taxed and non-taxed beverages are similar, while, in store 1, no, the number of taxed and non-taxed beverages are not similar.
##
## DEC2014 JUN2015
## ENERGY 56 58
## ENERGY-DIET 49 54
## JUICE 70 64
## JUICE DRINK 19 17
## MILK 63 61
## SODA 239 262
## SODA-DIET 128 174
## SPORT 11 16
## SPORT-DIET 2 2
## TEA 52 45
## TEA-DIET 6 6
## WATER 48 38
## WATER-SWEET 1 1
Soda and Soda-Diet have the highest number of observations in both the time period.
The products in the standard panel were based on the top selling beverages in the Bay Area and nationally, which means that some products might be more popular and widely available than others. Secondly, the products in the supplemental panel were collected to account for variations in store inventory, which means that some products might be more specific and rare than others. For example, water-sweet had only one observation in both rounds, which suggests that it was a very uncommon beverage in the stores. Additionally, the data collection process involved replacing some stores that refused to allow the data collectors to enter prices, which means that some products might have different availability and prices in the new stores. Lastly, the data analysis process involved limiting the analysis to items matched as to product and store in both the rounds (DEC2014 and JUN2015), which means that some products might have fewer observations than others due to missing data or unmatched items.
| Store type | Total | DEC2014 | JUN2015 | DEC2014 | JUN2015 |
|---|---|---|---|---|---|
| 1 | 36 | 11.19195 | 11.4804 | 15.61744 | 16.92966 |
| 3 | 18 | 15.19575 | 16.0754 | 18.18177 | 19.07878 |
The average price per ounce (in cents) for both taxed and untaxed beverages increased for store type 1 but increased at a decreasing number for store type 3, with taxed beverages being more expensive and having a larger price gap than untaxed beverages for both store types.
No, because the tax is not the only factor that affects the average price of the products in each group. The groups are not the same to begin with, as shown by the difference in average price before the tax was introduced in December 2014. The difference in June 2015 could reflect the pre-existing differences between the groups, not the impact of the tax. One of the obvious examples could be owner’s will to earn extra profits.
| Store type | DEC2014 | JUN2015 | DEC2014 | JUN2015 | Difference |
|---|---|---|---|---|---|
| Large Supermarket | 11.19195 | 11.4804 | NA | NA | 0.2884493 |
| Pharmacy | 15.19575 | 16.0754 | NA | NA | 0.8796482 |
| Large Supermarket | NA | NA | 15.61744 | 16.92966 | 1.3122212 |
| Pharmacy | NA | NA | 18.18177 | 19.07878 | 0.8970093 |
The p-value for large supermarkets is quite small, indicating we can reject the null hypotheses showing that there is enough evidence against our assumption that there are no differences in the populations mean (before- and after-tax prices), as long as other assumptions about the data (e.g. stores were really sampled at random) were correct. Thus it is likely that the sugar tax had some effect on prices.
On the other hand, the p-value for pharmacies is quite large, showing that the the populations mean (before- and after-tax price differences) is not statistically significant, thus, we do not have strong evidence against our assumption. In this case, it is unlikely, that the sugar tax had some effect on prices.
## # A tibble: 6 × 8
## year quarter month location beverage_group tax price under_report
## <dbl> <dbl> <dbl> <chr> <chr> <chr> <dbl> <dbl>
## 1 2013 1 1 Berkeley soda Non-taxed 4.85 NA
## 2 2013 1 1 Non-Berkeley soda Non-taxed 3.51 NA
## 3 2013 1 1 Non-Berkeley soda Non-taxed 3.89 NA
## 4 2013 1 1 Berkeley soda Taxed 3.68 NA
## 5 2013 1 1 Non-Berkeley soda Taxed 3.52 NA
## 6 2013 1 1 Non-Berkeley soda Taxed 3.91 NA
In doing a difference-in-difference analysis, the characteristics of both the locations should be relative. In other words, it is safe to compare two locations with similar characteristics to see if one event (sugar tax) in a location, Berkeley in this case, (and not the other) would be the major reason for the change in the beverage prices or not.
Looking at the table, yes, the researchers chose suitable comparison stores, as the stores’ characteristics are very similar to those in Berkeley.
| Year | Month | Berkeley | Non-Berkeley | Berkeley | Non-Berkeley |
|---|---|---|---|---|---|
| 2013 | 1 | 5.722480 | 5.348640 | 8.692803 | 7.991574 |
| 2013 | 2 | 5.806468 | 5.363863 | 8.654572 | 8.180877 |
| 2013 | 3 | 5.858251 | 5.424512 | 8.822691 | 8.186872 |
| 2013 | 4 | 5.858342 | 5.643459 | 9.022075 | 8.246447 |
| 2013 | 5 | 5.791333 | 5.181730 | 8.678542 | 7.756793 |
| 2013 | 6 | 5.761845 | 5.031972 | 8.573449 | 7.426397 |
| 2013 | 7 | 5.902819 | 5.098175 | 8.233288 | 7.193616 |
| 2013 | 8 | 5.831730 | 5.080378 | 8.821654 | 7.488764 |
| 2013 | 9 | 5.834511 | 5.082787 | 9.015901 | 7.804120 |
| 2013 | 10 | 5.800953 | 5.176406 | 8.982419 | 7.753430 |
| 2013 | 11 | 5.983980 | 5.292364 | 9.090955 | 7.951408 |
| 2013 | 12 | 6.016732 | 5.273657 | 8.950652 | 7.798748 |
| 2014 | 1 | 6.029485 | 5.296528 | 8.850165 | 7.879612 |
| 2014 | 2 | 6.081301 | 5.427840 | 9.027708 | 7.797533 |
| 2014 | 3 | 6.359673 | 5.740176 | 9.370788 | 8.028207 |
| 2014 | 4 | 6.247272 | 5.681016 | 9.439623 | 8.172451 |
| 2014 | 5 | 6.415236 | 5.639642 | 9.512144 | 8.368728 |
| 2014 | 6 | 6.470002 | 5.718528 | 9.058231 | 8.000090 |
| 2014 | 7 | 6.398847 | 5.526962 | 9.181642 | 8.144422 |
| 2014 | 8 | 6.289867 | 5.587100 | 9.224782 | 8.000210 |
| 2014 | 9 | 6.577316 | 5.775283 | 9.222504 | 8.507376 |
| 2014 | 10 | 6.338737 | 5.559348 | 9.416063 | 8.520110 |
| 2014 | 11 | 5.839390 | 5.641029 | 8.023078 | 8.462292 |
| 2014 | 12 | 6.164794 | 5.493931 | 9.618825 | 8.484427 |
| 2015 | 1 | 6.373576 | 5.824950 | 9.968138 | 8.778852 |
| 2015 | 2 | 6.380764 | 5.654277 | 9.222594 | 8.471740 |
| 2015 | 3 | 6.482227 | 5.688276 | 9.973454 | 8.776122 |
| 2015 | 4 | 6.461740 | 5.810564 | 10.353428 | 8.733491 |
| 2015 | 5 | 6.623755 | 5.806210 | 10.310571 | 8.918695 |
| 2015 | 6 | 6.630742 | 5.818193 | 10.410781 | 8.682467 |
| 2015 | 7 | 6.371787 | 5.735532 | 10.588301 | 8.910685 |
| 2015 | 8 | 6.454268 | 5.701793 | 11.109456 | 9.040408 |
| 2015 | 9 | 6.515334 | 5.680509 | 10.452208 | 8.717005 |
| 2015 | 10 | 6.562519 | 5.691359 | 10.730807 | 8.818358 |
| 2015 | 11 | 6.659933 | 5.838232 | 10.697508 | 8.967660 |
| 2015 | 12 | 6.551349 | 5.750560 | 10.521530 | 8.571704 |
| 2016 | 1 | 6.555559 | 5.848632 | 10.525640 | 8.926457 |
| 2016 | 2 | 6.546415 | 5.764717 | 10.815509 | 8.729508 |
| 2016 | 3 | NA | NA | NA | NA |
The prices of beverages that are not taxed are higher in Berkeley than in other areas. This shows that there are some obivous factors that make Berkeley different from its neighbours, regardless of the tax policy. The tax does not seem to affect the prices of non-taxed beverages, as the gap between Berkeley and other areas remains stable after the tax.
The prices of beverages that are taxed are also higher in Berkeley than in other areas. The tax makes this gap bigger, as the prices of taxed beverages in Berkeley go up after the tax is introduced (from March 2015 onwards).
The prices of both taxed and non-taxed beverages in other areas follow a similar pattern over time, and this pattern is also similar to the one in Berkeley for non-taxed beverages. However, if we remove the effect of time, the prices of both types of beverages in other areas do not change much before and after the tax. This implies that nothing else happened during that period that influenced the prices in Berkeley and its surroundings, except for the tax that affected the prices of taxed beverages in Berkeley.
Yes, it is reasonable to conclude that the sugar tax had an effect on prices. The chart shows that the prices of taxed goods in Berkeley increased more than the prices of taxed goods outside Berkeley after the tax was introduced. This means that the tax made the goods more expensive in Berkeley than in other areas. However, the chart also shows that the prices of all goods in Berkeley were higher than the prices of all goods outside Berkeley before the tax was implemented. This means that there were some other factors that made Berkeley different from other areas, regardless of the tax policy. To isolate the effect of the tax, we need to subtract the difference in prices before the tax from the difference in prices after the tax. This gives us the change in the difference in prices due to the tax. The chart shows that this change is positive, which suggests that the tax has a positive effect on prices.
| Usual intake | Pre-tax (Nov–Dec 2014) | Post-tax (Nov–Dec 2015) | Pre-tax–post-tax difference |
|---|---|---|---|
| Caloric intake (kilocalories/capita/day) | NA, n = 623 | NA, n = 613 | NA |
| Taxed beverages | 45.1 | 38.7 | -6.4, p = 0.56 |
| Non-taxed beverages | 115.7 | 147.6 | 31.9, p = 0.04 |
| Volume of intake (grams/capita/day) | NA, n = 623 | NA, n = 613 | NA |
| Taxed beverages | 121 | 97 | -24.0, p = 0.24 |
| Non-taxed beverages | 1839.4 | 1896.5 | 57.1, p = 0.22 |
The p-value for the caloric intake of non-taxed beverages is below 0.05, while the p-values for the other groups are fairly large. This implies that we can reject the null hypothesis that the tax did not change their consumption behavior from sugary to non-sugary beverages for the non-taxed beverages group, but we cannot reject it for the other groups, at the 5% significance level. The difference in the mean caloric intake of sugary beverages between the treatment and control groups is not very large.
It would be interesting to explore what other factors might have contributed to the reduction in the intake of sugary beverages after the sugar tax, even though the difference is not very large. One possible explanation is that water and sugar are essential for biological survival, making the demand for beverages relatively inelastic. Moreover, some people may have developed a habit of consuming certain beverages as part of their lifestyle and routine, such as sugary drinks from Starbucks. However, since people take time to change their consumption habits, which is beyond the scope of this project, a longer-term analysis might provide a stronger basis for the reasoning of the change in consumption of both types of drinks.
The study has several strengths, such as using multiple data sources and methods, comparing Berkeley with nearby non-Berkeley areas, and adjusting for inflation and seasonality. However, the study also has some limitations that may affect the validity and generalizability of the findings. For example, the study only observed 26 stores in Berkeley, which may not represent the diversity of the retail environment and consumer behavior. The study also relied on self-reported consumption data from a telephone survey with a low response rate, which may introduce measurement error and selection bias including calorie and gram intakes (as opposed to frequency). Moreover, the study did not account for the possible substitution effects of the tax, such as cross-border shopping, online purchasing, or switching to other unhealthy products.
Future studies on the SSB tax in Berkeley could address these problems by expanding the sample size and scope of the data collection, using more objective and reliable measures of consumption, and controlling for the potential confounding factors and behavioral responses of the tax. For example, future studies could use scanner data from a larger and more representative sample of stores in Berkeley and other areas, including convenience stores, restaurants, and vending machines. Future studies could also use technologies, such as urinary sucrose or fructose, to measure the actual intake of SSBs and other sugars (if they are still going to use grams/calories as a measurement tool), rather than relying on self-reports. Furthermore, future studies could use an additional econometric method (synthetic control in DiD), to estimate the causal effect of the tax, while accounting for the possible spillover effects, price endogeneity, and heterogeneity of the impact. Nevertheless, as mentioned, this study was tight on time and thus a perfect foundation for future studies.