Exploring the MLB data
Questions:
- How many observations are in the dataset?
- Who is the tallest / smallest player?
- Are players older than 32 significantly heavier than younger players?
t.test(weight ~ old,data = mlb)
Welch Two Sample t-test
data: weight by old
t = -3.9723, df = 396.06, p-value = 8.456e-05
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
-8.567759 -2.894750
sample estimates:
mean in group FALSE mean in group TRUE
200.4798 206.2110
lapply / anonymous function: Who is the average size by position?
- split the dataset by position to obtain a list
- run
mean on the list
lapply(by_pos,nrow)
$Catcher
[1] 76
$Designated_Hitter
[1] 18
$First_Baseman
[1] 55
$Outfielder
[1] 194
$Relief_Pitcher
[1] 315
$Second_Baseman
[1] 58
$Shortstop
[1] 52
$Starting_Pitcher
[1] 221
$Third_Baseman
[1] 45
A custom function: What’s the size of these players in centimeters?
inchToCm <- function(size_in_inch,meters = T,digits=2){
out <- size_in_inch * 2.54
if(meters) out <- round(out/100,digits=2)
out
}
inchToCm(sizes)
Catcher Designated_Hitter
1.85 1.89
First_Baseman Outfielder
1.88 1.85
Relief_Pitcher Second_Baseman
1.89 1.81
Shortstop Starting_Pitcher
1.83 1.90
Third_Baseman
1.86
LS0tCnRpdGxlOiAiV29ya2luZyB3aXRoIGRhdGEgYW5kIGZ1bmN0aW9ucyIKb3V0cHV0OiBodG1sX25vdGVib29rCi0tLQoKIyBFeHBsb3JpbmcgdGhlIE1MQiBkYXRhCgpgYGB7cn0KbG9hZCgiLi4vLi4vZGF0YS9tbGIuUkRhdGEiKQpscygpCmhlYWQobWxiKQpgYGAKCiMjIFF1ZXN0aW9uczogCgogIC0gSG93IG1hbnkgb2JzZXJ2YXRpb25zIGFyZSBpbiB0aGUgZGF0YXNldD8KICAtIFdobyBpcyB0aGUgdGFsbGVzdCAvIHNtYWxsZXN0IHBsYXllcj8KICAtIEFyZSBwbGF5ZXJzIG9sZGVyIHRoYW4gMzIgc2lnbmlmaWNhbnRseSBoZWF2aWVyIHRoYW4geW91bmdlciBwbGF5ZXJzPwogIAogIApgYGB7cn0KIyBubyBvZiBvYnNlcnZhdGlvbnMKbnJvdyhtbGIpCm1sYlt3aGljaC5tYXgobWxiJGhlaWdodCksIk5hbWUiXQptbGIkb2xkIDwtIG1sYiRBZ2UgPiAzMgpvbGRfcGxheWVycyA8LSBtbGJbbWxiJG9sZCxdCnlvdW5nX3BsYXllcnMgPC0gbWxiWyFtbGIkb2xkLF0KdC50ZXN0KG9sZF9wbGF5ZXJzJHdlaWdodCx5b3VuZ19wbGF5ZXJzJHdlaWdodCkKdC50ZXN0KHdlaWdodCB+IG9sZCxkYXRhID0gbWxiKQpgYGAKCiMjIGxhcHBseSAvIGFub255bW91cyBmdW5jdGlvbjogV2hvIGlzIHRoZSBhdmVyYWdlIHNpemUgYnkgcG9zaXRpb24/CgoxLiBzcGxpdCB0aGUgZGF0YXNldCBieSBwb3NpdGlvbiB0byBvYnRhaW4gYSBsaXN0CjIuIHJ1biBgbWVhbmAgb24gdGhlIGxpc3QKCmBgYHtyfQpieV9wb3MgPC0gc3BsaXQobWxiLGZhY3RvcihtbGIkUG9zaXRpb24pKQpsYXBwbHkoYnlfcG9zLG5yb3cpCiMgYnlfcG9zJENhdGNoZXIKc2l6ZXMgPC0gc2FwcGx5KGJ5X3BvcyxmdW5jdGlvbih4KSBtZWFuKHgkaGVpZ2h0KSkKc2l6ZXMKYGBgCgojIyBBIGN1c3RvbSBmdW5jdGlvbjogV2hhdCdzIHRoZSBzaXplIG9mIHRoZXNlIHBsYXllcnMgaW4gY2VudGltZXRlcnM/CgpgYGB7cn0KaW5jaFRvQ20gPC0gZnVuY3Rpb24oc2l6ZV9pbl9pbmNoLG1ldGVycyA9IFQsZGlnaXRzPTIpewogIG91dCA8LSBzaXplX2luX2luY2ggKiAyLjU0CiAgaWYobWV0ZXJzKSBvdXQgPC0gcm91bmQob3V0LzEwMCxkaWdpdHM9MikKICBvdXQKfQppbmNoVG9DbShzaXplcykKYGBgCgoKCgoK