Exploring the MLB data

Questions:

  • How many observations are in the dataset?
  • Who is the tallest / smallest player?
  • Are players older than 32 significantly heavier than younger players?
t.test(weight ~ old,data = mlb)

    Welch Two Sample t-test

data:  weight by old
t = -3.9723, df = 396.06, p-value = 8.456e-05
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
 -8.567759 -2.894750
sample estimates:
mean in group FALSE  mean in group TRUE 
           200.4798            206.2110 

lapply / anonymous function: Who is the average size by position?

  1. split the dataset by position to obtain a list
  2. run mean on the list
lapply(by_pos,nrow)
$Catcher
[1] 76

$Designated_Hitter
[1] 18

$First_Baseman
[1] 55

$Outfielder
[1] 194

$Relief_Pitcher
[1] 315

$Second_Baseman
[1] 58

$Shortstop
[1] 52

$Starting_Pitcher
[1] 221

$Third_Baseman
[1] 45

A custom function: What’s the size of these players in centimeters?

inchToCm <- function(size_in_inch,meters = T,digits=2){
  out <- size_in_inch * 2.54
  if(meters) out <- round(out/100,digits=2)
  out
}
inchToCm(sizes)
          Catcher Designated_Hitter 
             1.85              1.89 
    First_Baseman        Outfielder 
             1.88              1.85 
   Relief_Pitcher    Second_Baseman 
             1.89              1.81 
        Shortstop  Starting_Pitcher 
             1.83              1.90 
    Third_Baseman 
             1.86 
LS0tCnRpdGxlOiAiV29ya2luZyB3aXRoIGRhdGEgYW5kIGZ1bmN0aW9ucyIKb3V0cHV0OiBodG1sX25vdGVib29rCi0tLQoKIyBFeHBsb3JpbmcgdGhlIE1MQiBkYXRhCgpgYGB7cn0KbG9hZCgiLi4vLi4vZGF0YS9tbGIuUkRhdGEiKQpscygpCmhlYWQobWxiKQpgYGAKCiMjIFF1ZXN0aW9uczogCgogIC0gSG93IG1hbnkgb2JzZXJ2YXRpb25zIGFyZSBpbiB0aGUgZGF0YXNldD8KICAtIFdobyBpcyB0aGUgdGFsbGVzdCAvIHNtYWxsZXN0IHBsYXllcj8KICAtIEFyZSBwbGF5ZXJzIG9sZGVyIHRoYW4gMzIgc2lnbmlmaWNhbnRseSBoZWF2aWVyIHRoYW4geW91bmdlciBwbGF5ZXJzPwogIAogIApgYGB7cn0KIyBubyBvZiBvYnNlcnZhdGlvbnMKbnJvdyhtbGIpCm1sYlt3aGljaC5tYXgobWxiJGhlaWdodCksIk5hbWUiXQptbGIkb2xkIDwtIG1sYiRBZ2UgPiAzMgpvbGRfcGxheWVycyA8LSBtbGJbbWxiJG9sZCxdCnlvdW5nX3BsYXllcnMgPC0gbWxiWyFtbGIkb2xkLF0KdC50ZXN0KG9sZF9wbGF5ZXJzJHdlaWdodCx5b3VuZ19wbGF5ZXJzJHdlaWdodCkKdC50ZXN0KHdlaWdodCB+IG9sZCxkYXRhID0gbWxiKQpgYGAKCiMjIGxhcHBseSAvIGFub255bW91cyBmdW5jdGlvbjogV2hvIGlzIHRoZSBhdmVyYWdlIHNpemUgYnkgcG9zaXRpb24/CgoxLiBzcGxpdCB0aGUgZGF0YXNldCBieSBwb3NpdGlvbiB0byBvYnRhaW4gYSBsaXN0CjIuIHJ1biBgbWVhbmAgb24gdGhlIGxpc3QKCmBgYHtyfQpieV9wb3MgPC0gc3BsaXQobWxiLGZhY3RvcihtbGIkUG9zaXRpb24pKQpsYXBwbHkoYnlfcG9zLG5yb3cpCiMgYnlfcG9zJENhdGNoZXIKc2l6ZXMgPC0gc2FwcGx5KGJ5X3BvcyxmdW5jdGlvbih4KSBtZWFuKHgkaGVpZ2h0KSkKc2l6ZXMKYGBgCgojIyBBIGN1c3RvbSBmdW5jdGlvbjogV2hhdCdzIHRoZSBzaXplIG9mIHRoZXNlIHBsYXllcnMgaW4gY2VudGltZXRlcnM/CgpgYGB7cn0KaW5jaFRvQ20gPC0gZnVuY3Rpb24oc2l6ZV9pbl9pbmNoLG1ldGVycyA9IFQsZGlnaXRzPTIpewogIG91dCA8LSBzaXplX2luX2luY2ggKiAyLjU0CiAgaWYobWV0ZXJzKSBvdXQgPC0gcm91bmQob3V0LzEwMCxkaWdpdHM9MikKICBvdXQKfQppbmNoVG9DbShzaXplcykKYGBgCgoKCgoK