Part 2
tips$newcol <- paste(tips$day, tips$time, sep=" ")
head(tips)
## X total_bill tip sex smoker day time size newcol
## 1 1 16.99 1.01 Female No Sun Dinner 2 Sun Dinner
## 2 2 10.34 1.66 Male No Sun Dinner 3 Sun Dinner
## 3 3 21.01 3.50 Male No Sun Dinner 3 Sun Dinner
## 4 4 23.68 3.31 Male No Sun Dinner 2 Sun Dinner
## 5 5 24.59 3.61 Female No Sun Dinner 4 Sun Dinner
## 6 6 25.29 4.71 Male No Sun Dinner 4 Sun Dinner
newdata <- subset(tips, select = c(total_bill, tip, sex, smoker, newcol, size))
head(newdata)
## total_bill tip sex smoker newcol size
## 1 16.99 1.01 Female No Sun Dinner 2
## 2 10.34 1.66 Male No Sun Dinner 3
## 3 21.01 3.50 Male No Sun Dinner 3
## 4 23.68 3.31 Male No Sun Dinner 2
## 5 24.59 3.61 Female No Sun Dinner 4
## 6 25.29 4.71 Male No Sun Dinner 4
colnames (newdata) <- c("Total", "Tip", "Gender", "Smoker", "Meal", "Party Size")
head (newdata)
## Total Tip Gender Smoker Meal Party Size
## 1 16.99 1.01 Female No Sun Dinner 2
## 2 10.34 1.66 Male No Sun Dinner 3
## 3 21.01 3.50 Male No Sun Dinner 3
## 4 23.68 3.31 Male No Sun Dinner 2
## 5 24.59 3.61 Female No Sun Dinner 4
## 6 25.29 4.71 Male No Sun Dinner 4
Part 3
scatterplot <- subset(newdata, select = c(Tip, Total))
head(scatterplot)
## Tip Total
## 1 1.01 16.99
## 2 1.66 10.34
## 3 3.50 21.01
## 4 3.31 23.68
## 5 3.61 24.59
## 6 4.71 25.29
attach(scatterplot)
plot(Total, Tip, main="Tip Ratio",
xlab="Total Bill ", ylab="Tip Amount ")

boxplotdata <- subset(newdata, select = c(Meal, Total))
head (boxplotdata)
## Meal Total
## 1 Sun Dinner 16.99
## 2 Sun Dinner 10.34
## 3 Sun Dinner 21.01
## 4 Sun Dinner 23.68
## 5 Sun Dinner 24.59
## 6 Sun Dinner 25.29
boxplot(Total~Meal, data = boxplotdata, main="Price of Meals at Specific Times", xlab = "Meal Time", ylab = "Total Price" )

hist(newdata$`Party Size`, main="Number of Diners", xlab = "Party Size")

##Part 4
###I'd like to find out, based on subsetting data, if larger than 2 tend to tip more than just two dinners. I'd also like to see if smokers tend to tip more and if males or females tend to tip higher based on the total price of the meal.
####create a percentage of tip amount in a new column
newdata <- transform(newdata, new = Tip / Total)
newdata
## Total Tip Gender Smoker Meal Party.Size new
## 1 16.99 1.01 Female No Sun Dinner 2 0.05944673
## 2 10.34 1.66 Male No Sun Dinner 3 0.16054159
## 3 21.01 3.50 Male No Sun Dinner 3 0.16658734
## 4 23.68 3.31 Male No Sun Dinner 2 0.13978041
## 5 24.59 3.61 Female No Sun Dinner 4 0.14680765
## 6 25.29 4.71 Male No Sun Dinner 4 0.18623962
## 7 8.77 2.00 Male No Sun Dinner 2 0.22805017
## 8 26.88 3.12 Male No Sun Dinner 4 0.11607143
## 9 15.04 1.96 Male No Sun Dinner 2 0.13031915
## 10 14.78 3.23 Male No Sun Dinner 2 0.21853857
## 11 10.27 1.71 Male No Sun Dinner 2 0.16650438
## 12 35.26 5.00 Female No Sun Dinner 4 0.14180374
## 13 15.42 1.57 Male No Sun Dinner 2 0.10181582
## 14 18.43 3.00 Male No Sun Dinner 4 0.16277808
## 15 14.83 3.02 Female No Sun Dinner 2 0.20364127
## 16 21.58 3.92 Male No Sun Dinner 2 0.18164968
## 17 10.33 1.67 Female No Sun Dinner 3 0.16166505
## 18 16.29 3.71 Male No Sun Dinner 3 0.22774708
## 19 16.97 3.50 Female No Sun Dinner 3 0.20624632
## 20 20.65 3.35 Male No Sat Dinner 3 0.16222760
## 21 17.92 4.08 Male No Sat Dinner 2 0.22767857
## 22 20.29 2.75 Female No Sat Dinner 2 0.13553475
## 23 15.77 2.23 Female No Sat Dinner 2 0.14140774
## 24 39.42 7.58 Male No Sat Dinner 4 0.19228818
## 25 19.82 3.18 Male No Sat Dinner 2 0.16044400
## 26 17.81 2.34 Male No Sat Dinner 4 0.13138686
## 27 13.37 2.00 Male No Sat Dinner 2 0.14958863
## 28 12.69 2.00 Male No Sat Dinner 2 0.15760441
## 29 21.70 4.30 Male No Sat Dinner 2 0.19815668
## 30 19.65 3.00 Female No Sat Dinner 2 0.15267176
## 31 9.55 1.45 Male No Sat Dinner 2 0.15183246
## 32 18.35 2.50 Male No Sat Dinner 4 0.13623978
## 33 15.06 3.00 Female No Sat Dinner 2 0.19920319
## 34 20.69 2.45 Female No Sat Dinner 4 0.11841469
## 35 17.78 3.27 Male No Sat Dinner 2 0.18391451
## 36 24.06 3.60 Male No Sat Dinner 3 0.14962594
## 37 16.31 2.00 Male No Sat Dinner 3 0.12262416
## 38 16.93 3.07 Female No Sat Dinner 3 0.18133491
## 39 18.69 2.31 Male No Sat Dinner 3 0.12359551
## 40 31.27 5.00 Male No Sat Dinner 3 0.15989767
## 41 16.04 2.24 Male No Sat Dinner 3 0.13965087
## 42 17.46 2.54 Male No Sun Dinner 2 0.14547537
## 43 13.94 3.06 Male No Sun Dinner 2 0.21951220
## 44 9.68 1.32 Male No Sun Dinner 2 0.13636364
## 45 30.40 5.60 Male No Sun Dinner 4 0.18421053
## 46 18.29 3.00 Male No Sun Dinner 2 0.16402406
## 47 22.23 5.00 Male No Sun Dinner 2 0.22492128
## 48 32.40 6.00 Male No Sun Dinner 4 0.18518519
## 49 28.55 2.05 Male No Sun Dinner 3 0.07180385
## 50 18.04 3.00 Male No Sun Dinner 2 0.16629712
## 51 12.54 2.50 Male No Sun Dinner 2 0.19936204
## 52 10.29 2.60 Female No Sun Dinner 2 0.25267250
## 53 34.81 5.20 Female No Sun Dinner 4 0.14938236
## 54 9.94 1.56 Male No Sun Dinner 2 0.15694165
## 55 25.56 4.34 Male No Sun Dinner 4 0.16979656
## 56 19.49 3.51 Male No Sun Dinner 2 0.18009236
## 57 38.01 3.00 Male Yes Sat Dinner 4 0.07892660
## 58 26.41 1.50 Female No Sat Dinner 2 0.05679667
## 59 11.24 1.76 Male Yes Sat Dinner 2 0.15658363
## 60 48.27 6.73 Male No Sat Dinner 4 0.13942407
## 61 20.29 3.21 Male Yes Sat Dinner 2 0.15820601
## 62 13.81 2.00 Male Yes Sat Dinner 2 0.14482259
## 63 11.02 1.98 Male Yes Sat Dinner 2 0.17967332
## 64 18.29 3.76 Male Yes Sat Dinner 4 0.20557682
## 65 17.59 2.64 Male No Sat Dinner 3 0.15008528
## 66 20.08 3.15 Male No Sat Dinner 3 0.15687251
## 67 16.45 2.47 Female No Sat Dinner 2 0.15015198
## 68 3.07 1.00 Female Yes Sat Dinner 1 0.32573290
## 69 20.23 2.01 Male No Sat Dinner 2 0.09935739
## 70 15.01 2.09 Male Yes Sat Dinner 2 0.13924051
## 71 12.02 1.97 Male No Sat Dinner 2 0.16389351
## 72 17.07 3.00 Female No Sat Dinner 3 0.17574692
## 73 26.86 3.14 Female Yes Sat Dinner 2 0.11690246
## 74 25.28 5.00 Female Yes Sat Dinner 2 0.19778481
## 75 14.73 2.20 Female No Sat Dinner 2 0.14935506
## 76 10.51 1.25 Male No Sat Dinner 2 0.11893435
## 77 17.92 3.08 Male Yes Sat Dinner 2 0.17187500
## 78 27.20 4.00 Male No Thur Lunch 4 0.14705882
## 79 22.76 3.00 Male No Thur Lunch 2 0.13181019
## 80 17.29 2.71 Male No Thur Lunch 2 0.15673800
## 81 19.44 3.00 Male Yes Thur Lunch 2 0.15432099
## 82 16.66 3.40 Male No Thur Lunch 2 0.20408163
## 83 10.07 1.83 Female No Thur Lunch 1 0.18172790
## 84 32.68 5.00 Male Yes Thur Lunch 2 0.15299878
## 85 15.98 2.03 Male No Thur Lunch 2 0.12703379
## 86 34.83 5.17 Female No Thur Lunch 4 0.14843526
## 87 13.03 2.00 Male No Thur Lunch 2 0.15349194
## 88 18.28 4.00 Male No Thur Lunch 2 0.21881838
## 89 24.71 5.85 Male No Thur Lunch 2 0.23674626
## 90 21.16 3.00 Male No Thur Lunch 2 0.14177694
## 91 28.97 3.00 Male Yes Fri Dinner 2 0.10355540
## 92 22.49 3.50 Male No Fri Dinner 2 0.15562472
## 93 5.75 1.00 Female Yes Fri Dinner 2 0.17391304
## 94 16.32 4.30 Female Yes Fri Dinner 2 0.26348039
## 95 22.75 3.25 Female No Fri Dinner 2 0.14285714
## 96 40.17 4.73 Male Yes Fri Dinner 4 0.11774956
## 97 27.28 4.00 Male Yes Fri Dinner 2 0.14662757
## 98 12.03 1.50 Male Yes Fri Dinner 2 0.12468828
## 99 21.01 3.00 Male Yes Fri Dinner 2 0.14278915
## 100 12.46 1.50 Male No Fri Dinner 2 0.12038523
## 101 11.35 2.50 Female Yes Fri Dinner 2 0.22026432
## 102 15.38 3.00 Female Yes Fri Dinner 2 0.19505852
## 103 44.30 2.50 Female Yes Sat Dinner 3 0.05643341
## 104 22.42 3.48 Female Yes Sat Dinner 2 0.15521855
## 105 20.92 4.08 Female No Sat Dinner 2 0.19502868
## 106 15.36 1.64 Male Yes Sat Dinner 2 0.10677083
## 107 20.49 4.06 Male Yes Sat Dinner 2 0.19814544
## 108 25.21 4.29 Male Yes Sat Dinner 2 0.17017057
## 109 18.24 3.76 Male No Sat Dinner 2 0.20614035
## 110 14.31 4.00 Female Yes Sat Dinner 2 0.27952481
## 111 14.00 3.00 Male No Sat Dinner 2 0.21428571
## 112 7.25 1.00 Female No Sat Dinner 1 0.13793103
## 113 38.07 4.00 Male No Sun Dinner 3 0.10506961
## 114 23.95 2.55 Male No Sun Dinner 2 0.10647182
## 115 25.71 4.00 Female No Sun Dinner 3 0.15558149
## 116 17.31 3.50 Female No Sun Dinner 2 0.20219526
## 117 29.93 5.07 Male No Sun Dinner 4 0.16939526
## 118 10.65 1.50 Female No Thur Lunch 2 0.14084507
## 119 12.43 1.80 Female No Thur Lunch 2 0.14481094
## 120 24.08 2.92 Female No Thur Lunch 4 0.12126246
## 121 11.69 2.31 Male No Thur Lunch 2 0.19760479
## 122 13.42 1.68 Female No Thur Lunch 2 0.12518629
## 123 14.26 2.50 Male No Thur Lunch 2 0.17531557
## 124 15.95 2.00 Male No Thur Lunch 2 0.12539185
## 125 12.48 2.52 Female No Thur Lunch 2 0.20192308
## 126 29.80 4.20 Female No Thur Lunch 6 0.14093960
## 127 8.52 1.48 Male No Thur Lunch 2 0.17370892
## 128 14.52 2.00 Female No Thur Lunch 2 0.13774105
## 129 11.38 2.00 Female No Thur Lunch 2 0.17574692
## 130 22.82 2.18 Male No Thur Lunch 3 0.09553024
## 131 19.08 1.50 Male No Thur Lunch 2 0.07861635
## 132 20.27 2.83 Female No Thur Lunch 2 0.13961519
## 133 11.17 1.50 Female No Thur Lunch 2 0.13428827
## 134 12.26 2.00 Female No Thur Lunch 2 0.16313214
## 135 18.26 3.25 Female No Thur Lunch 2 0.17798467
## 136 8.51 1.25 Female No Thur Lunch 2 0.14688602
## 137 10.33 2.00 Female No Thur Lunch 2 0.19361084
## 138 14.15 2.00 Female No Thur Lunch 2 0.14134276
## 139 16.00 2.00 Male Yes Thur Lunch 2 0.12500000
## 140 13.16 2.75 Female No Thur Lunch 2 0.20896657
## 141 17.47 3.50 Female No Thur Lunch 2 0.20034345
## 142 34.30 6.70 Male No Thur Lunch 6 0.19533528
## 143 41.19 5.00 Male No Thur Lunch 5 0.12138869
## 144 27.05 5.00 Female No Thur Lunch 6 0.18484288
## 145 16.43 2.30 Female No Thur Lunch 2 0.13998783
## 146 8.35 1.50 Female No Thur Lunch 2 0.17964072
## 147 18.64 1.36 Female No Thur Lunch 3 0.07296137
## 148 11.87 1.63 Female No Thur Lunch 2 0.13732098
## 149 9.78 1.73 Male No Thur Lunch 2 0.17689162
## 150 7.51 2.00 Male No Thur Lunch 2 0.26631158
## 151 14.07 2.50 Male No Sun Dinner 2 0.17768301
## 152 13.13 2.00 Male No Sun Dinner 2 0.15232292
## 153 17.26 2.74 Male No Sun Dinner 3 0.15874855
## 154 24.55 2.00 Male No Sun Dinner 4 0.08146640
## 155 19.77 2.00 Male No Sun Dinner 4 0.10116338
## 156 29.85 5.14 Female No Sun Dinner 5 0.17219430
## 157 48.17 5.00 Male No Sun Dinner 6 0.10379905
## 158 25.00 3.75 Female No Sun Dinner 4 0.15000000
## 159 13.39 2.61 Female No Sun Dinner 2 0.19492158
## 160 16.49 2.00 Male No Sun Dinner 4 0.12128563
## 161 21.50 3.50 Male No Sun Dinner 4 0.16279070
## 162 12.66 2.50 Male No Sun Dinner 2 0.19747235
## 163 16.21 2.00 Female No Sun Dinner 3 0.12338063
## 164 13.81 2.00 Male No Sun Dinner 2 0.14482259
## 165 17.51 3.00 Female Yes Sun Dinner 2 0.17133067
## 166 24.52 3.48 Male No Sun Dinner 3 0.14192496
## 167 20.76 2.24 Male No Sun Dinner 2 0.10789981
## 168 31.71 4.50 Male No Sun Dinner 4 0.14191107
## 169 10.59 1.61 Female Yes Sat Dinner 2 0.15203022
## 170 10.63 2.00 Female Yes Sat Dinner 2 0.18814675
## 171 50.81 10.00 Male Yes Sat Dinner 3 0.19681165
## 172 15.81 3.16 Male Yes Sat Dinner 2 0.19987350
## 173 7.25 5.15 Male Yes Sun Dinner 2 0.71034483
## 174 31.85 3.18 Male Yes Sun Dinner 2 0.09984301
## 175 16.82 4.00 Male Yes Sun Dinner 2 0.23781213
## 176 32.90 3.11 Male Yes Sun Dinner 2 0.09452888
## 177 17.89 2.00 Male Yes Sun Dinner 2 0.11179430
## 178 14.48 2.00 Male Yes Sun Dinner 2 0.13812155
## 179 9.60 4.00 Female Yes Sun Dinner 2 0.41666667
## 180 34.63 3.55 Male Yes Sun Dinner 2 0.10251227
## 181 34.65 3.68 Male Yes Sun Dinner 4 0.10620491
## 182 23.33 5.65 Male Yes Sun Dinner 2 0.24217745
## 183 45.35 3.50 Male Yes Sun Dinner 3 0.07717751
## 184 23.17 6.50 Male Yes Sun Dinner 4 0.28053517
## 185 40.55 3.00 Male Yes Sun Dinner 2 0.07398274
## 186 20.69 5.00 Male No Sun Dinner 5 0.24166264
## 187 20.90 3.50 Female Yes Sun Dinner 3 0.16746411
## 188 30.46 2.00 Male Yes Sun Dinner 5 0.06565988
## 189 18.15 3.50 Female Yes Sun Dinner 3 0.19283747
## 190 23.10 4.00 Male Yes Sun Dinner 3 0.17316017
## 191 15.69 1.50 Male Yes Sun Dinner 2 0.09560229
## 192 19.81 4.19 Female Yes Thur Lunch 2 0.21150934
## 193 28.44 2.56 Male Yes Thur Lunch 2 0.09001406
## 194 15.48 2.02 Male Yes Thur Lunch 2 0.13049096
## 195 16.58 4.00 Male Yes Thur Lunch 2 0.24125452
## 196 7.56 1.44 Male No Thur Lunch 2 0.19047619
## 197 10.34 2.00 Male Yes Thur Lunch 2 0.19342360
## 198 43.11 5.00 Female Yes Thur Lunch 4 0.11598237
## 199 13.00 2.00 Female Yes Thur Lunch 2 0.15384615
## 200 13.51 2.00 Male Yes Thur Lunch 2 0.14803849
## 201 18.71 4.00 Male Yes Thur Lunch 3 0.21378942
## 202 12.74 2.01 Female Yes Thur Lunch 2 0.15777080
## 203 13.00 2.00 Female Yes Thur Lunch 2 0.15384615
## 204 16.40 2.50 Female Yes Thur Lunch 2 0.15243902
## 205 20.53 4.00 Male Yes Thur Lunch 4 0.19483682
## 206 16.47 3.23 Female Yes Thur Lunch 3 0.19611415
## 207 26.59 3.41 Male Yes Sat Dinner 3 0.12824370
## 208 38.73 3.00 Male Yes Sat Dinner 4 0.07745933
## 209 24.27 2.03 Male Yes Sat Dinner 2 0.08364236
## 210 12.76 2.23 Female Yes Sat Dinner 2 0.17476489
## 211 30.06 2.00 Male Yes Sat Dinner 3 0.06653360
## 212 25.89 5.16 Male Yes Sat Dinner 4 0.19930475
## 213 48.33 9.00 Male No Sat Dinner 4 0.18621974
## 214 13.27 2.50 Female Yes Sat Dinner 2 0.18839488
## 215 28.17 6.50 Female Yes Sat Dinner 3 0.23074192
## 216 12.90 1.10 Female Yes Sat Dinner 2 0.08527132
## 217 28.15 3.00 Male Yes Sat Dinner 5 0.10657194
## 218 11.59 1.50 Male Yes Sat Dinner 2 0.12942192
## 219 7.74 1.44 Male Yes Sat Dinner 2 0.18604651
## 220 30.14 3.09 Female Yes Sat Dinner 4 0.10252157
## 221 12.16 2.20 Male Yes Fri Lunch 2 0.18092105
## 222 13.42 3.48 Female Yes Fri Lunch 2 0.25931446
## 223 8.58 1.92 Male Yes Fri Lunch 1 0.22377622
## 224 15.98 3.00 Female No Fri Lunch 3 0.18773467
## 225 13.42 1.58 Male Yes Fri Lunch 2 0.11773472
## 226 16.27 2.50 Female Yes Fri Lunch 2 0.15365704
## 227 10.09 2.00 Female Yes Fri Lunch 2 0.19821606
## 228 20.45 3.00 Male No Sat Dinner 4 0.14669927
## 229 13.28 2.72 Male No Sat Dinner 2 0.20481928
## 230 22.12 2.88 Female Yes Sat Dinner 2 0.13019892
## 231 24.01 2.00 Male Yes Sat Dinner 4 0.08329863
## 232 15.69 3.00 Male Yes Sat Dinner 3 0.19120459
## 233 11.61 3.39 Male No Sat Dinner 2 0.29198966
## 234 10.77 1.47 Male No Sat Dinner 2 0.13649025
## 235 15.53 3.00 Male Yes Sat Dinner 2 0.19317450
## 236 10.07 1.25 Male No Sat Dinner 2 0.12413108
## 237 12.60 1.00 Male Yes Sat Dinner 2 0.07936508
## 238 32.83 1.17 Male Yes Sat Dinner 2 0.03563814
## 239 35.83 4.67 Female No Sat Dinner 3 0.13033771
## 240 29.03 5.92 Male No Sat Dinner 3 0.20392697
## 241 27.18 2.00 Female Yes Sat Dinner 2 0.07358352
## 242 22.67 2.00 Male Yes Sat Dinner 2 0.08822232
## 243 17.82 1.75 Male No Sat Dinner 2 0.09820426
## 244 18.78 3.00 Female No Thur Dinner 2 0.15974441
newerdata <- subset(newdata, select = c(Total, Tip, Gender, Smoker, Meal, Party.Size, new))
head(newerdata)
## Total Tip Gender Smoker Meal Party.Size new
## 1 16.99 1.01 Female No Sun Dinner 2 0.05944673
## 2 10.34 1.66 Male No Sun Dinner 3 0.16054159
## 3 21.01 3.50 Male No Sun Dinner 3 0.16658734
## 4 23.68 3.31 Male No Sun Dinner 2 0.13978041
## 5 24.59 3.61 Female No Sun Dinner 4 0.14680765
## 6 25.29 4.71 Male No Sun Dinner 4 0.18623962
colnames (newerdata) <- c("Total", "Tip", "Gender", "Smoker", "Meal", "Party Size", "Percentage of Tip Left")
head(newerdata)
## Total Tip Gender Smoker Meal Party Size Percentage of Tip Left
## 1 16.99 1.01 Female No Sun Dinner 2 0.05944673
## 2 10.34 1.66 Male No Sun Dinner 3 0.16054159
## 3 21.01 3.50 Male No Sun Dinner 3 0.16658734
## 4 23.68 3.31 Male No Sun Dinner 2 0.13978041
## 5 24.59 3.61 Female No Sun Dinner 4 0.14680765
## 6 25.29 4.71 Male No Sun Dinner 4 0.18623962
female.tips <- subset(newerdata, Gender == "Female", select = c("Percentage of Tip Left"))
mean(female.tips$`Percentage of Tip Left`)
## [1] 0.1664907
male.tips <- subset(newerdata, Gender == "Male", select = c("Percentage of Tip Left"))
mean(male.tips$`Percentage of Tip Left`)
## [1] 0.1576505
smokers.tips <- subset(newerdata, Smoker == "Yes", select = c("Percentage of Tip Left"))
mean(smokers.tips$`Percentage of Tip Left`)
## [1] 0.163196
nonsmokers.tips <- subset(newerdata, Smoker == "No", select = c("Percentage of Tip Left"))
mean(nonsmokers.tips$`Percentage of Tip Left`)
## [1] 0.1593285
largeparty.tips <- subset(newerdata, `Party Size` >= 3, select = c("Percentage of Tip Left"))
mean(largeparty.tips$`Percentage of Tip Left`)
## [1] 0.1489818
smallparty.tips <- subset(newerdata, `Party Size` <= 2, select = c("Percentage of Tip Left"))
mean(smallparty.tips$`Percentage of Tip Left`)
## [1] 0.1670085
based on the information above, it appears that smaller parties (2 people of less) left the highest percentage of tips based on the price of their meals compared to larger parties. Parties with a female paying the bill tended to tip more than male check payers. Smokers tended to tip higher than non-smokers. Of course, in this analysis I did not isolate these variables, so there is a chance that a male nonsmoker from a large party tipped well. I think if I could figure out how to use if/and function in R, this analysis would have went a little smoother. I could continue to analyze this data and make a box plot to illustrate the amount of tips left by the time of the meal.