Let us continue getting started with R as we begin discussing
important statistical concepts in sports analytics.
Case-scenario 1
This is the fourth season of outfielder Luis Robert with the Chicago
White Sox. If he hit 11, 13, and 12 home runs during his first three
seasons, how many does he need this season for his overall average
to be at least 20?
# Given that x1 = 11, x2 = 13, x3 = 12
# We want to find x4 such that the mean (average) number of home runs satisfies x̄ >= 20
# Notice that in this case n = 4
# According to the information above: 20 × 4 = 11 + 13 + 12 + x4, so x4 = 80 - 36 = 44; with x4 = 44 the home-run average is exactly 20.
#Answer Here
# Home-runs so far
HR_before <- c(11, 13, 12)
# Average Number of Home-runs per season wanted
wanted_HR <- 20
# Number of seasons
n_seasons <- 4
# Home runs needed in season 4
x_4 <- n_seasons*wanted_HR - sum(HR_before)
# Minimum number of home runs Robert needs
x_4
[1] 44
According to the calculation above, Robert must hit at least 44 home runs
this season for his average number of home runs per season
to be at least 20.
We can confirm this using the mean() function in R:
# Robert's home runs over four seasons, with 44 in season 4
Robert_HRs <- c(11, 13, 12, 44)
# Find the mean
mean(Robert_HRs)
[1] 20
sd(Robert_HRs)
[1] 16.02082
max(Robert_HRs)
[1] 44
min(Robert_HRs)
[1] 11
summary(Robert_HRs)
   Min. 1st Qu.  Median    Mean 3rd Qu.    Max.
  11.00   11.75   12.50   20.00   20.75   44.00
# Same calculation for Juan Soto's walks: five seasons so far
walks_before <- c(79, 108, 41, 145, 135)
# Average number of walks per season wanted
wanted_walks <- 100
# Number of seasons
n_seasons <- 6
# Walks needed in season 6
x_6 <- n_seasons*wanted_walks - sum(walks_before)
# Minimum number of walks Juan Soto needs
x_6
[1] 92
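Both calculations above follow the same pattern: multiply the target average by the total number of seasons, then subtract what has already been accumulated. A minimal sketch of that pattern as a reusable function follows; the name `needed_in_next_season` is hypothetical and not part of the original activity.

```r
# Hypothetical helper: value needed in the next season so that the
# overall per-season mean reaches target_mean.
needed_in_next_season <- function(previous, target_mean) {
  n_total <- length(previous) + 1      # seasons played so far plus the next one
  n_total * target_mean - sum(previous)
}

needed_in_next_season(c(11, 13, 12), 20)              # Robert's home runs: 44
needed_in_next_season(c(79, 108, 41, 145, 135), 100)  # Soto's walks: 92
```

A result at or below zero would mean the target average is already secured before the next season is played.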
Case-scenario 2
The average salary of 10 baseball players is 72,000 dollars a week
and the average salary of 4 soccer players is 84,000. Find the mean
salary of all 14 professional players.
Solution
- We can find the combined mean by multiplying each group's mean by its
group size, adding the two totals, and dividing by the total number of
people. Simply averaging the two means would ignore the different group sizes.
Let n1=10 denote the number of baseball players, and y1=72000 their
mean salary. Let n2=4 denote the number of soccer players and y2=84000 their
mean salary. Then the mean salary of all 14 individuals is:
(n1y1+n2y2)/(n1+n2)
n_1 <- 10
n_2 <- 4
y_1 <- 72000
y_2 <- 84000
# Mean salary overall
salary_ave <- (n_1*y_1 + n_2*y_2)/(n_1+n_2)
salary_ave
[1] 75428.57
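The same combined mean can also be obtained with base R's weighted.mean(), treating the group sizes as weights. This is only an alternative sketch of the calculation above; the variable names `group_means`, `group_sizes`, and `combined_mean` are chosen here for illustration.

```r
group_means <- c(72000, 84000)  # mean salary: baseball, soccer
group_sizes <- c(10, 4)         # number of players in each group

# Weighted mean: sum(w * x) / sum(w)
combined_mean <- weighted.mean(group_means, w = group_sizes)
combined_mean

# For comparison, the unweighted average of the two means ignores group sizes
mean(group_means)
```

The unweighted average comes out higher than the combined mean because it gives the 4 soccer players the same influence as the 10 baseball players.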
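# Read the data and extract the doubles_hit column
doubles_hit <- read.csv("doubles_hit.csv", header = TRUE, sep = ",")
doubles_hits <- doubles_hit$doubles_hit
# View(doubles_hit)  # opens the data viewer in an interactive session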
# Mean
doubles_mean <- mean(doubles_hits)
doubles_mean
[1] 23.55
# Median
doubles_median <- median(doubles_hits)
doubles_median
[1] 23.5
# Find number of observations
doubles_n <- length(doubles_hits)
# Find standard deviation
doubles_sd <- sd(doubles_hits)
doubles_sd
[1] 13.37371
What percentage of the data lies within one standard deviation of the
mean?
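# Proportion of observations with a z-score below 1. As written, the check is one-sided:
# values more than one standard deviation below the mean are counted too.
# Wrapping the difference in abs() would make it two-sided.
doubles_w1sd <- sum((doubles_hits - doubles_mean)/doubles_sd < 1)/doubles_n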
doubles_w1sd
[1] 0.79
## Difference from the empirical rule (68%)
doubles_w1sd - 0.68
[1] 0.11
What percentage of the data lies within two standard deviations of
the mean?
## Within 2 sd (abs() makes the check two-sided)
doubles_w2sd <- sum(abs(doubles_hits - doubles_mean)/doubles_sd < 2)/doubles_n
doubles_w2sd
[1] 1
## Difference from the empirical rule (95%)
doubles_w2sd - 0.95
[1] 0.05
What percentage of the data lies within three standard deviations of the
mean?
## Within 3 sd (abs() makes the check two-sided)
doubles_w3sd <- sum(abs(doubles_hits - doubles_mean)/doubles_sd < 3)/doubles_n
doubles_w3sd
[1] 1
## Difference from the empirical rule (99.73%)
doubles_w3sd - 0.9973
[1] 0.0027
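Being within k standard deviations of the mean is a two-sided condition, |z| < k. The sketch below uses a small made-up vector rather than the doubles data, and a hypothetical helper named `share_within`, to show how a one-sided and a two-sided count can differ.

```r
# Hypothetical helper: proportion of x whose |z-score| is below k
share_within <- function(x, k) {
  z <- (x - mean(x)) / sd(x)
  mean(abs(z) < k)   # the mean of a logical vector is the proportion of TRUEs
}

toy <- c(1, 2, 3, 4, 5)                 # made-up data, not the doubles file
z_toy <- (toy - mean(toy)) / sd(toy)

mean(z_toy < 1)        # one-sided: 0.8, also counts the value 1
share_within(toy, 1)   # two-sided: 0.6
share_within(toy, 2)   # 1
```

For non-negative data with a large spread, the two versions can agree at 2 and 3 standard deviations, because the lower bound mean - k*sd falls below zero and no observation can be smaller than that.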
# Open the help page for hist()
?hist
# Create histogram
hist(doubles_hits, xlab = "Number of Doubles Hits", col = "green", border = "red",
     xlim = c(0, 50), ylim = c(0, 28), breaks = 5)
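In hist(), a single number passed to breaks is treated as a suggestion, so the plot may not have exactly that many bins. A minimal sketch with explicit bin edges follows. It uses a made-up vector named `toy_doubles` (not the doubles data) and plot = FALSE so the counts can be inspected directly.

```r
toy_doubles <- c(3, 8, 12, 15, 22, 27, 31, 38, 44, 49)  # made-up values

# Explicit bin edges every 10 units from 0 to 50
h <- hist(toy_doubles, breaks = seq(0, 50, by = 10), plot = FALSE)

h$breaks  # bin edges: 0 10 20 30 40 50
h$counts  # observations per bin: 2 2 2 2 2
```

Passing a vector of edges fixes the bins exactly, which makes histograms of different samples easier to compare.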
