Student Details
Shipren Jayadev (s3744421)
Problem Statement
For this assignment, I have chosen the variable bicep girth. I will be investigating the normality of bicep girth among both men and women to determine if they fit a normal distribution.
Load Packages
rr # This is a chunk where you can load the necessary packages required to reproduce the report library(readr) library(magrittr) library(dplyr)
Data
Import the body measurements data and prepare it for analysis. Show your code.
rr library(readr) bdims = read_csv(.csv)
Parsed with column specification:
cols(
.default = col_double(),
age = col_integer(),
sex = col_integer()
)
See spec(...) for full column specifications.
rr bic = bdims %>% select(bic.gi, sex) male = bic %>% filter(sex==1) female = bic %>% filter(sex==0) View(male) View(female) View(bic)
Summary Statistics
Calculate descriptive statistics (i.e., mean, median, standard deviation, first and third quartile, interquartile range, minimum and maximum values) of the selected measurement grouped by sex.
rr summary(female$bic.gi)
Min. 1st Qu. Median Mean 3rd Qu. Max.
22.4 26.4 27.8 28.1 29.8 40.3
rr female$bic.gi %>% IQR()
[1] 3.4
rr female$bic.gi %>% sd()
[1] 2.709477
rr summary(male$bic.gi)
Min. 1st Qu. Median Mean 3rd Qu. Max.
25.6 32.5 34.4 34.4 36.4 42.4
rr male$bic.gi %>% IQR()
[1] 3.9
rr male$bic.gi %>% sd()
[1] 2.982037
Distribution Fitting
Compare the empirical distribution of selected body measurement to a normal distribution separately in men and in women. You need to do this visually by plotting the histogram with normal distribution overlay. Show your code.
rr #Female Distribution F = female\(bic.gi HF = hist(F, col=\pink\, xlab=\Bicep Girth\, xlim = c(20,45),main = paste(\Female Bicep Girth Distribution\), breaks = 30) xfitF = seq(min(F),max(F),length.out = 260) yfitF = dnorm(xfitF, mean = mean(F), sd = sd(F)) yfitF = yfitF*diff(HF\)mids[1:2])*length(F) lines(xfitF, yfitF, col=, lwd=\2)

rr #Male Distribution M = male\(bic.gi HM = hist(M, col=\blue\, xlab=\Bicep Girth\, xlim = c(20,45),main = paste(\Male Bicep Girth Distribution\), breaks = 30) xfitM = seq(min(M),max(M),length.out = 247) yfitM = dnorm(xfitM, mean = mean(M), sd = sd(M)) yfitM = yfitM*diff(HM\)mids[1:2])*length(M) lines(xfitM, yfitM, col=, lwd=\2)

Interpretation
Based on the plotted histogram with a normal distribution curve overlay, I can conclude that for women the distribution for bicep girth is right skewed whereas for the men, the distribution is apporximately normal.
LS0tDQp0aXRsZTogIk1BVEgxMzI0IEFzc2lnbm1lbnQgMSINCnN1YnRpdGxlOiBNb2RlbGluZyBCb2R5IE1lYXN1cmVtZW50cw0Kb3V0cHV0Og0KICBodG1sX25vdGVib29rOiBkZWZhdWx0DQotLS0NCg0KIyMgU3R1ZGVudCBEZXRhaWxzDQoNCg0KU2hpcHJlbiBKYXlhZGV2IChzMzc0NDQyMSkNCg0KDQojIyBQcm9ibGVtIFN0YXRlbWVudA0KDQpGb3IgdGhpcyBhc3NpZ25tZW50LCBJIGhhdmUgY2hvc2VuIHRoZSB2YXJpYWJsZSBiaWNlcCBnaXJ0aC4gSSB3aWxsIGJlIGludmVzdGlnYXRpbmcgdGhlIG5vcm1hbGl0eSBvZiBiaWNlcCBnaXJ0aCBhbW9uZyBib3RoIG1lbiBhbmQgd29tZW4gdG8gZGV0ZXJtaW5lIGlmIHRoZXkgZml0IGEgbm9ybWFsIGRpc3RyaWJ1dGlvbi4gDQoNCg0KIyMgTG9hZCBQYWNrYWdlcw0KDQpgYGB7cn0NCiMgVGhpcyBpcyBhIGNodW5rIHdoZXJlIHlvdSBjYW4gbG9hZCB0aGUgbmVjZXNzYXJ5IHBhY2thZ2VzIHJlcXVpcmVkIHRvIHJlcHJvZHVjZSB0aGUgcmVwb3J0DQpsaWJyYXJ5KHJlYWRyKQ0KbGlicmFyeShtYWdyaXR0cikNCmxpYnJhcnkoZHBseXIpDQpgYGANCg0KIyMgRGF0YQ0KDQpJbXBvcnQgdGhlIGJvZHkgbWVhc3VyZW1lbnRzIGRhdGEgYW5kIHByZXBhcmUgaXQgZm9yIGFuYWx5c2lzLiBTaG93IHlvdXIgY29kZS4NCg0KDQpgYGB7cn0NCmxpYnJhcnkocmVhZHIpDQpiZGltcyA9IHJlYWRfY3N2KCJiZGltcy5jc3YiKQ0KDQpiaWMgPSBiZGltcyAlPiUgc2VsZWN0KGJpYy5naSwgc2V4KQ0KDQptYWxlID0gYmljICU+JSBmaWx0ZXIoc2V4PT0xKQ0KZmVtYWxlID0gYmljICU+JSBmaWx0ZXIoc2V4PT0wKQ0KDQpWaWV3KG1hbGUpDQpWaWV3KGZlbWFsZSkNClZpZXcoYmljKQ0KYGBgDQoNCg0KDQojIyBTdW1tYXJ5IFN0YXRpc3RpY3MNCg0KDQpDYWxjdWxhdGUgZGVzY3JpcHRpdmUgc3RhdGlzdGljcyAoaS5lLiwgbWVhbiwgbWVkaWFuLCBzdGFuZGFyZCBkZXZpYXRpb24sIGZpcnN0IGFuZCB0aGlyZCBxdWFydGlsZSwgaW50ZXJxdWFydGlsZSByYW5nZSwgbWluaW11bSBhbmQgbWF4aW11bSB2YWx1ZXMpIG9mIHRoZSBzZWxlY3RlZCBtZWFzdXJlbWVudCBncm91cGVkIGJ5IHNleC4NCg0KYGBge3J9DQpzdW1tYXJ5KGZlbWFsZSRiaWMuZ2kpDQpmZW1hbGUkYmljLmdpICU+JSBJUVIoKQ0KZmVtYWxlJGJpYy5naSAlPiUgc2QoKQ0Kc3VtbWFyeShtYWxlJGJpYy5naSkNCm1hbGUkYmljLmdpICU+JSBJUVIoKQ0KbWFsZSRiaWMuZ2kgJT4lIHNkKCkNCmBgYA0KDQoNCiMjIERpc3RyaWJ1dGlvbiBGaXR0aW5nDQoNCkNvbXBhcmUgdGhlIGVtcGlyaWNhbCBkaXN0cmlidXRpb24gb2Ygc2VsZWN0ZWQgYm9keSBtZWFzdXJlbWVudCB0byBhIG5vcm1hbCBkaXN0cmlidXRpb24gc2VwYXJhdGVseSBpbiBtZW4gYW5kIGluIHdvbWVuLiBZb3UgbmVlZCB0byBkbyB0aGlzIHZpc3VhbGx5IGJ5IHBsb3R0aW5nIHRoZSBoaXN0b2dyYW0gd2l0aCBub3JtYWwgZGlzdHJpYnV0aW9uIG92ZXJsYXkuIFNob3cgeW91ciBjb2RlLg0KDQoNCmBgYHtyfQ0KI0ZlbWFsZSBEaXN0cmlidXRpb24NCkYgPSBmZW1hbGUkYmljLmdpDQoNCkhGID0gaGlzdChGLCBjb2w9InBpbmsiLCB4bGFiPSJCaWNlcCBHaXJ0aCIsDQogICAgICAgICAgeGxpbSA9IGMoMjAsNDUpLG1haW4gPSBwYXN0ZSgiRmVtYWxlIEJpY2VwIEdpcnRoIERpc3RyaWJ1dGlvbiIpLCBicmVha3MgPSAzMCkNCnhmaXRGID0gc2VxKG1pbihGKSxtYXgoRiksbGVuZ3RoLm91dCA9IDI2MCkNCnlmaXRGID0gZG5vcm0oeGZpdEYsIG1lYW4gPSBtZWFuKEYpLCBzZCA9IHNkKEYpKQ0KeWZpdEYgPSB5Zml0RipkaWZmKEhGJG1pZHNbMToyXSkqbGVuZ3RoKEYpDQpsaW5lcyh4Zml0RiwgeWZpdEYsIGNvbD0icmVkIiwgbHdkPSIyIikNCg0KI01hbGUgRGlzdHJpYnV0aW9uDQpNID0gbWFsZSRiaWMuZ2kNCg0KSE0gPSBoaXN0KE0sIGNvbD0iYmx1ZSIsIHhsYWI9IkJpY2VwIEdpcnRoIiwNCiAgICAgICAgICB4bGltID0gYygyMCw0NSksbWFpbiA9IHBhc3RlKCJNYWxlIEJpY2VwIEdpcnRoIERpc3RyaWJ1dGlvbiIpLCBicmVha3MgPSAzMCkNCnhmaXRNID0gc2VxKG1pbihNKSxtYXgoTSksbGVuZ3RoLm91dCA9IDI0NykNCnlmaXRNID0gZG5vcm0oeGZpdE0sIG1lYW4gPSBtZWFuKE0pLCBzZCA9IHNkKE0pKQ0KeWZpdE0gPSB5Zml0TSpkaWZmKEhNJG1pZHNbMToyXSkqbGVuZ3RoKE0pDQpsaW5lcyh4Zml0TSwgeWZpdE0sIGNvbD0icmVkIiwgbHdkPSIyIikNCmBgYA0KDQojIyBJbnRlcnByZXRhdGlvbg0KDQpCYXNlZCBvbiB0aGUgcGxvdHRlZCBoaXN0b2dyYW0gd2l0aCBhIG5vcm1hbCBkaXN0cmlidXRpb24gY3VydmUgb3ZlcmxheSwgSSBjYW4gY29uY2x1ZGUgdGhhdCBmb3Igd29tZW4gdGhlIGRpc3RyaWJ1dGlvbiBmb3IgYmljZXAgZ2lydGggaXMgcmlnaHQgc2tld2VkIHdoZXJlYXMgZm9yIHRoZSBtZW4sIHRoZSBkaXN0cmlidXRpb24gaXMgYXBwb3J4aW1hdGVseSBub3JtYWwuIA0KDQoNCg==