This project investigates whether there is a statistically significant difference in average body weight between male and female respondents. Understanding differences in body weight across sexes is important in fields such as public health, medicine, and human biology, as body weight is often associated with health outcomes and risk factors.
The dataset used in this analysis is bdims.csv, which contains 507 observations and 25 variables. Each observation represents an individual respondent, and the variables include demographic information (such as age and sex), body weight, height, and various body measurements. While the dataset includes many body dimension variables, this analysis focuses specifically on sex (coded as 0 = female, 1 = male) and weight (in kilograms), as these variables are directly relevant to the research question.
bdims <- read.csv("../bdims.csv")
str(bdims)
## 'data.frame': 507 obs. of 25 variables:
## $ bia_di: num 42.9 43.7 40.1 44.3 42.5 43.3 43.5 44.4 43.5 42 ...
## $ bii_di: num 26 28.5 28.2 29.9 29.9 27 30 29.8 26.5 28 ...
## $ bit_di: num 31.5 33.5 33.3 34 34 31.5 34 33.2 32.1 34 ...
## $ che_de: num 17.7 16.9 20.9 18.4 21.5 19.6 21.9 21.8 15.5 22.5 ...
## $ che_di: num 28 30.8 31.7 28.2 29.4 31.3 31.7 28.8 27.5 28 ...
## $ elb_di: num 13.1 14 13.9 13.9 15.2 14 16.1 15.1 14.1 15.6 ...
## $ wri_di: num 10.4 11.8 10.9 11.2 11.6 11.5 12.5 11.9 11.2 12 ...
## $ kne_di: num 18.8 20.6 19.7 20.9 20.7 18.8 20.8 21 18.9 21.1 ...
## $ ank_di: num 14.1 15.1 14.1 15 14.9 13.9 15.6 14.6 13.2 15 ...
## $ sho_gi: num 106 110 115 104 108 ...
## $ che_gi: num 89.5 97 97.5 97 97.5 ...
## $ wai_gi: num 71.5 79 83.2 77.8 80 82.5 82 76.8 68.5 77.5 ...
## $ nav_gi: num 74.5 86.5 82.9 78.8 82.5 80.1 84 80.5 69 81.5 ...
## $ hip_gi: num 93.5 94.8 95 94 98.5 95.3 101 98 89.5 99.8 ...
## $ thi_gi: num 51.5 51.5 57.3 53 55.4 57.5 60.9 56 50 59.8 ...
## $ bic_gi: num 32.5 34.4 33.4 31 32 33 42.4 34.1 33 36.5 ...
## $ for_gi: num 26 28 28.8 26.2 28.4 28 32.3 28 26 29.2 ...
## $ kne_gi: num 34.5 36.5 37 37 37.7 36.6 40.1 39.2 35.5 38.3 ...
## $ cal_gi: num 36.5 37.5 37.3 34.8 38.6 36.1 40.3 36.7 35 38.6 ...
## $ ank_gi: num 23.5 24.5 21.9 23 24.4 23.5 23.6 22.5 22 22.2 ...
## $ wri_gi: num 16.5 17 16.9 16.6 18 16.9 18.8 18 16.5 16.9 ...
## $ age : int 21 23 28 23 22 21 26 27 23 21 ...
## $ wgt : num 65.6 71.8 80.7 72.6 78.8 74.8 86.4 78.4 62 81.6 ...
## $ hgt : num 174 175 194 186 187 ...
## $ sex : int 1 1 1 1 1 1 1 1 1 1 ...
head(bdims)
## bia_di bii_di bit_di che_de che_di elb_di wri_di kne_di ank_di sho_gi che_gi
## 1 42.9 26.0 31.5 17.7 28.0 13.1 10.4 18.8 14.1 106.2 89.5
## 2 43.7 28.5 33.5 16.9 30.8 14.0 11.8 20.6 15.1 110.5 97.0
## 3 40.1 28.2 33.3 20.9 31.7 13.9 10.9 19.7 14.1 115.1 97.5
## 4 44.3 29.9 34.0 18.4 28.2 13.9 11.2 20.9 15.0 104.5 97.0
## 5 42.5 29.9 34.0 21.5 29.4 15.2 11.6 20.7 14.9 107.5 97.5
## 6 43.3 27.0 31.5 19.6 31.3 14.0 11.5 18.8 13.9 119.8 99.9
## wai_gi nav_gi hip_gi thi_gi bic_gi for_gi kne_gi cal_gi ank_gi wri_gi age
## 1 71.5 74.5 93.5 51.5 32.5 26.0 34.5 36.5 23.5 16.5 21
## 2 79.0 86.5 94.8 51.5 34.4 28.0 36.5 37.5 24.5 17.0 23
## 3 83.2 82.9 95.0 57.3 33.4 28.8 37.0 37.3 21.9 16.9 28
## 4 77.8 78.8 94.0 53.0 31.0 26.2 37.0 34.8 23.0 16.6 23
## 5 80.0 82.5 98.5 55.4 32.0 28.4 37.7 38.6 24.4 18.0 22
## 6 82.5 80.1 95.3 57.5 33.0 28.0 36.6 36.1 23.5 16.9 21
## wgt hgt sex
## 1 65.6 174.0 1
## 2 71.8 175.3 1
## 3 80.7 193.5 1
## 4 72.6 186.5 1
## 5 78.8 187.2 1
## 6 74.8 181.5 1
bdims$sex <- factor(bdims$sex,
levels = c(0, 1),
labels = c("Female", "Male"))
table(bdims$sex)
##
## Female Male
## 260 247
First I loaded and explored the dataset to see the number of colums, rows, labels, etc… .I also recoded the “sex” column so the categories are male/female. In the original dataset, male is 0 and female is 1.
library(dplyr)
##
## Attaching package: 'dplyr'
## The following objects are masked from 'package:stats':
##
## filter, lag
## The following objects are masked from 'package:base':
##
## intersect, setdiff, setequal, union
bdims_clean <- bdims |>
select(sex, wgt)
bdims_clean
## sex wgt
## 1 Male 65.6
## 2 Male 71.8
## 3 Male 80.7
## 4 Male 72.6
## 5 Male 78.8
## 6 Male 74.8
## 7 Male 86.4
## 8 Male 78.4
## 9 Male 62.0
## 10 Male 81.6
## 11 Male 76.6
## 12 Male 83.6
## 13 Male 90.0
## 14 Male 74.6
## 15 Male 71.0
## 16 Male 79.6
## 17 Male 93.8
## 18 Male 70.0
## 19 Male 72.4
## 20 Male 85.9
## 21 Male 78.8
## 22 Male 77.8
## 23 Male 66.2
## 24 Male 86.4
## 25 Male 81.8
## 26 Male 89.6
## 27 Male 82.8
## 28 Male 76.4
## 29 Male 63.2
## 30 Male 60.9
## 31 Male 74.8
## 32 Male 70.0
## 33 Male 72.4
## 34 Male 84.1
## 35 Male 69.1
## 36 Male 59.5
## 37 Male 67.2
## 38 Male 61.3
## 39 Male 68.6
## 40 Male 80.1
## 41 Male 87.8
## 42 Male 84.7
## 43 Male 73.4
## 44 Male 72.1
## 45 Male 82.6
## 46 Male 88.7
## 47 Male 84.1
## 48 Male 94.1
## 49 Male 74.9
## 50 Male 59.1
## 51 Male 75.6
## 52 Male 86.2
## 53 Male 75.3
## 54 Male 87.1
## 55 Male 55.2
## 56 Male 57.0
## 57 Male 61.4
## 58 Male 76.8
## 59 Male 86.8
## 60 Male 72.2
## 61 Male 71.6
## 62 Male 84.8
## 63 Male 68.2
## 64 Male 66.1
## 65 Male 72.0
## 66 Male 64.6
## 67 Male 74.8
## 68 Male 70.0
## 69 Male 101.6
## 70 Male 63.2
## 71 Male 79.1
## 72 Male 78.9
## 73 Male 67.7
## 74 Male 66.0
## 75 Male 68.2
## 76 Male 63.9
## 77 Male 72.0
## 78 Male 56.8
## 79 Male 74.5
## 80 Male 90.9
## 81 Male 93.0
## 82 Male 80.9
## 83 Male 72.7
## 84 Male 68.0
## 85 Male 70.9
## 86 Male 72.5
## 87 Male 72.5
## 88 Male 83.4
## 89 Male 75.5
## 90 Male 73.0
## 91 Male 70.2
## 92 Male 73.4
## 93 Male 70.5
## 94 Male 68.9
## 95 Male 102.3
## 96 Male 68.4
## 97 Male 65.9
## 98 Male 75.7
## 99 Male 84.5
## 100 Male 87.7
## 101 Male 86.4
## 102 Male 73.2
## 103 Male 53.9
## 104 Male 72.0
## 105 Male 55.5
## 106 Male 58.4
## 107 Male 83.2
## 108 Male 72.7
## 109 Male 64.1
## 110 Male 72.3
## 111 Male 65.0
## 112 Male 86.4
## 113 Male 65.0
## 114 Male 88.6
## 115 Male 84.1
## 116 Male 66.8
## 117 Male 75.5
## 118 Male 93.2
## 119 Male 82.7
## 120 Male 58.0
## 121 Male 79.5
## 122 Male 78.6
## 123 Male 71.8
## 124 Male 116.4
## 125 Male 72.2
## 126 Male 83.6
## 127 Male 85.5
## 128 Male 90.9
## 129 Male 85.9
## 130 Male 89.1
## 131 Male 75.0
## 132 Male 77.7
## 133 Male 86.4
## 134 Male 90.9
## 135 Male 73.6
## 136 Male 76.4
## 137 Male 69.1
## 138 Male 84.5
## 139 Male 64.5
## 140 Male 69.1
## 141 Male 108.6
## 142 Male 86.4
## 143 Male 80.9
## 144 Male 87.7
## 145 Male 94.5
## 146 Male 80.2
## 147 Male 72.0
## 148 Male 71.4
## 149 Male 72.7
## 150 Male 84.1
## 151 Male 76.8
## 152 Male 63.6
## 153 Male 80.9
## 154 Male 80.9
## 155 Male 85.5
## 156 Male 68.6
## 157 Male 67.7
## 158 Male 66.4
## 159 Male 102.3
## 160 Male 70.5
## 161 Male 95.9
## 162 Male 84.1
## 163 Male 87.3
## 164 Male 71.8
## 165 Male 65.9
## 166 Male 95.9
## 167 Male 91.4
## 168 Male 81.8
## 169 Male 96.8
## 170 Male 69.1
## 171 Male 82.7
## 172 Male 75.5
## 173 Male 79.5
## 174 Male 73.6
## 175 Male 91.8
## 176 Male 84.1
## 177 Male 85.9
## 178 Male 81.8
## 179 Male 82.5
## 180 Male 80.5
## 181 Male 70.0
## 182 Male 81.8
## 183 Male 84.1
## 184 Male 90.5
## 185 Male 91.4
## 186 Male 89.1
## 187 Male 85.0
## 188 Male 69.1
## 189 Male 73.6
## 190 Male 80.5
## 191 Male 82.7
## 192 Male 86.4
## 193 Male 67.7
## 194 Male 92.7
## 195 Male 93.6
## 196 Male 70.9
## 197 Male 75.0
## 198 Male 93.2
## 199 Male 93.2
## 200 Male 77.7
## 201 Male 61.4
## 202 Male 94.1
## 203 Male 75.0
## 204 Male 83.6
## 205 Male 85.5
## 206 Male 73.9
## 207 Male 66.8
## 208 Male 87.3
## 209 Male 72.3
## 210 Male 88.6
## 211 Male 75.5
## 212 Male 101.4
## 213 Male 91.1
## 214 Male 67.3
## 215 Male 77.7
## 216 Male 81.8
## 217 Male 75.5
## 218 Male 84.5
## 219 Male 76.6
## 220 Male 85.0
## 221 Male 102.5
## 222 Male 77.3
## 223 Male 71.8
## 224 Male 87.9
## 225 Male 94.3
## 226 Male 70.9
## 227 Male 64.5
## 228 Male 77.3
## 229 Male 72.3
## 230 Male 87.3
## 231 Male 80.0
## 232 Male 82.3
## 233 Male 73.6
## 234 Male 74.1
## 235 Male 85.9
## 236 Male 73.2
## 237 Male 76.3
## 238 Male 65.9
## 239 Male 90.9
## 240 Male 89.1
## 241 Male 62.3
## 242 Male 82.7
## 243 Male 79.1
## 244 Male 98.2
## 245 Male 84.1
## 246 Male 83.2
## 247 Male 83.2
## 248 Female 51.6
## 249 Female 59.0
## 250 Female 49.2
## 251 Female 63.0
## 252 Female 53.6
## 253 Female 59.0
## 254 Female 47.6
## 255 Female 69.8
## 256 Female 66.8
## 257 Female 75.2
## 258 Female 55.2
## 259 Female 54.2
## 260 Female 62.5
## 261 Female 42.0
## 262 Female 50.0
## 263 Female 49.8
## 264 Female 49.2
## 265 Female 73.2
## 266 Female 47.8
## 267 Female 68.8
## 268 Female 50.6
## 269 Female 82.5
## 270 Female 57.2
## 271 Female 87.8
## 272 Female 72.8
## 273 Female 54.5
## 274 Female 59.8
## 275 Female 67.3
## 276 Female 67.8
## 277 Female 47.0
## 278 Female 46.2
## 279 Female 55.0
## 280 Female 83.0
## 281 Female 54.4
## 282 Female 45.8
## 283 Female 53.6
## 284 Female 73.2
## 285 Female 52.1
## 286 Female 67.9
## 287 Female 56.6
## 288 Female 62.3
## 289 Female 58.5
## 290 Female 54.5
## 291 Female 50.2
## 292 Female 60.3
## 293 Female 58.3
## 294 Female 56.2
## 295 Female 50.2
## 296 Female 72.9
## 297 Female 59.8
## 298 Female 61.0
## 299 Female 69.1
## 300 Female 55.9
## 301 Female 46.5
## 302 Female 54.3
## 303 Female 54.8
## 304 Female 60.7
## 305 Female 60.0
## 306 Female 62.0
## 307 Female 60.3
## 308 Female 52.7
## 309 Female 74.3
## 310 Female 62.0
## 311 Female 73.1
## 312 Female 80.0
## 313 Female 54.7
## 314 Female 53.2
## 315 Female 75.7
## 316 Female 61.1
## 317 Female 55.7
## 318 Female 48.7
## 319 Female 52.3
## 320 Female 50.0
## 321 Female 59.3
## 322 Female 62.5
## 323 Female 55.7
## 324 Female 54.8
## 325 Female 45.9
## 326 Female 70.6
## 327 Female 67.2
## 328 Female 69.4
## 329 Female 58.2
## 330 Female 64.8
## 331 Female 71.6
## 332 Female 52.8
## 333 Female 59.8
## 334 Female 49.0
## 335 Female 50.0
## 336 Female 69.2
## 337 Female 55.9
## 338 Female 63.4
## 339 Female 58.2
## 340 Female 58.6
## 341 Female 45.7
## 342 Female 52.2
## 343 Female 48.6
## 344 Female 57.8
## 345 Female 55.6
## 346 Female 66.8
## 347 Female 59.4
## 348 Female 53.6
## 349 Female 73.2
## 350 Female 53.4
## 351 Female 69.0
## 352 Female 58.4
## 353 Female 56.2
## 354 Female 70.6
## 355 Female 59.8
## 356 Female 72.0
## 357 Female 65.2
## 358 Female 56.6
## 359 Female 105.2
## 360 Female 51.8
## 361 Female 63.4
## 362 Female 59.0
## 363 Female 47.6
## 364 Female 63.0
## 365 Female 55.2
## 366 Female 45.0
## 367 Female 54.0
## 368 Female 50.2
## 369 Female 60.2
## 370 Female 44.8
## 371 Female 58.8
## 372 Female 56.4
## 373 Female 62.0
## 374 Female 49.2
## 375 Female 67.2
## 376 Female 53.8
## 377 Female 54.4
## 378 Female 58.0
## 379 Female 59.8
## 380 Female 54.8
## 381 Female 43.2
## 382 Female 60.5
## 383 Female 46.4
## 384 Female 64.4
## 385 Female 48.8
## 386 Female 62.2
## 387 Female 55.5
## 388 Female 57.8
## 389 Female 54.6
## 390 Female 59.2
## 391 Female 52.7
## 392 Female 53.2
## 393 Female 64.5
## 394 Female 51.8
## 395 Female 56.0
## 396 Female 63.6
## 397 Female 63.2
## 398 Female 59.5
## 399 Female 56.8
## 400 Female 64.1
## 401 Female 50.0
## 402 Female 72.3
## 403 Female 55.0
## 404 Female 55.9
## 405 Female 60.4
## 406 Female 69.1
## 407 Female 84.5
## 408 Female 55.9
## 409 Female 55.5
## 410 Female 69.5
## 411 Female 76.4
## 412 Female 61.4
## 413 Female 65.9
## 414 Female 58.6
## 415 Female 66.8
## 416 Female 56.6
## 417 Female 58.6
## 418 Female 55.9
## 419 Female 59.1
## 420 Female 81.8
## 421 Female 70.7
## 422 Female 56.8
## 423 Female 60.0
## 424 Female 58.2
## 425 Female 72.7
## 426 Female 54.1
## 427 Female 49.1
## 428 Female 75.9
## 429 Female 55.0
## 430 Female 57.3
## 431 Female 55.0
## 432 Female 65.5
## 433 Female 65.5
## 434 Female 48.6
## 435 Female 58.6
## 436 Female 63.6
## 437 Female 55.2
## 438 Female 62.7
## 439 Female 56.6
## 440 Female 53.9
## 441 Female 63.2
## 442 Female 73.6
## 443 Female 62.0
## 444 Female 63.6
## 445 Female 53.2
## 446 Female 53.4
## 447 Female 55.0
## 448 Female 70.5
## 449 Female 54.5
## 450 Female 54.5
## 451 Female 55.9
## 452 Female 59.0
## 453 Female 63.6
## 454 Female 54.5
## 455 Female 47.3
## 456 Female 67.7
## 457 Female 80.9
## 458 Female 70.5
## 459 Female 60.9
## 460 Female 63.6
## 461 Female 54.5
## 462 Female 59.1
## 463 Female 70.5
## 464 Female 52.7
## 465 Female 62.7
## 466 Female 86.3
## 467 Female 66.4
## 468 Female 67.3
## 469 Female 63.0
## 470 Female 73.6
## 471 Female 62.3
## 472 Female 57.7
## 473 Female 55.4
## 474 Female 104.1
## 475 Female 55.5
## 476 Female 77.3
## 477 Female 80.5
## 478 Female 64.5
## 479 Female 72.3
## 480 Female 61.4
## 481 Female 58.2
## 482 Female 81.8
## 483 Female 63.6
## 484 Female 53.4
## 485 Female 54.5
## 486 Female 53.6
## 487 Female 60.0
## 488 Female 73.6
## 489 Female 61.4
## 490 Female 55.5
## 491 Female 63.6
## 492 Female 60.9
## 493 Female 60.0
## 494 Female 46.8
## 495 Female 57.3
## 496 Female 64.1
## 497 Female 63.6
## 498 Female 67.3
## 499 Female 75.5
## 500 Female 68.2
## 501 Female 61.4
## 502 Female 76.8
## 503 Female 71.8
## 504 Female 55.5
## 505 Female 48.6
## 506 Female 66.4
## 507 Female 67.3
bdims_clean |>
group_by(sex) |>
summarise(
mean_weight = mean(wgt),
max_weight = max(wgt),
min_weight = min(wgt)
)
## # A tibble: 2 × 4
## sex mean_weight max_weight min_weight
## <fct> <dbl> <dbl> <dbl>
## 1 Female 60.6 105. 42
## 2 Male 78.1 116. 53.9
par(mfrow = c(1,2))
hist(bdims_clean$wgt[bdims$sex == "Female"], main = "Female Weight", xlab = "Weight")
hist(bdims$wgt[bdims_clean$sex == "Male"], main = "Male Weight", xlab = "Weight")
par(mfrow = c(1,1))
boxplot(wgt ~ sex,
data = bdims_clean,
main = "Body Weight by Sex",
xlab = "Sex",
ylab = "Weight (kg)")
To address the research question, exploratory data analysis (EDA) was
conducted to understand the distribution and characteristics of body
weight for males and females. The dataset was cleaned by converting the
sex variable into a labeled factor and selecting only the variables
relevant to the analysis. Summary statistics were calculated to compare
average weight by sex. Visualizations, including histograms and
boxplots, were created to examine the distribution of weight within each
group and to identify differences in center, spread, and potential
outliers. These steps help assess assumptions and provide context before
conducting the statistical test.
t.test(wgt ~ sex, data = bdims)
##
## Welch Two Sample t-test
##
## data: wgt by sex
## t = -19.577, df = 495.27, p-value < 2.2e-16
## alternative hypothesis: true difference in means between group Female and group Male is not equal to 0
## 95 percent confidence interval:
## -19.30486 -15.78344
## sample estimates:
## mean in group Female mean in group Male
## 60.60038 78.14453
The following hypotheses were tested using a two-sample t-test:
Null hypothesis (H₀): μ₍Female₎ = μ₍Male₎
Alternative hypothesis (H₁): μ₍Female₎ ≠ μ₍Male₎
The significance level was set at α = 0.05.
The results indicate a statistically significant difference in mean body weight between females (M = 60.60 kg) and males (M = 78.14 kg), with a p-value less than 0.001. Because the p-value is smaller than the chosen significance level, the null hypothesis is rejected. This provides strong evidence that average body weight differs between males and females in this sample.
This analysis found a statistically significant difference in average body weight between male and female respondents, with males weighing more on average than females. The results directly answer the research question and are consistent with general biological differences observed between sexes.
These findings are relevant to public health and medical research, where sex-based differences in body weight may influence health risk assessments and treatment decisions. Future research could extend this analysis by examining how body weight interacts with other variables in the dataset, such as height, age, or specific body measurements. Additionally, analyzing body mass index (BMI) rather than weight alone could provide a more nuanced understanding of health-related differences.
list.files()
## [1] "project 2.Rmd" "project-2_files" "project-2.Rmd"