Phillips 5.4 Q1-Q9
- Create the vector [1, 2, 3, 4, 5, 6, 7, 8, 9, 10] in three ways: once using c(), once using a:b, and once using seq().
x<- c(1,2,3,4,5,6,7,8,9,10) #Using the c function
x
[1] 1 2 3 4 5 6 7 8 9 10
y<- 1:10 #Using the a:b (colon) operator
y
[1] 1 2 3 4 5 6 7 8 9 10
z<- seq(from=1, to=10, by=1) #Using the seq function
z
[1] 1 2 3 4 5 6 7 8 9 10
zm<-seq(1,10) #Argument names can be omitted when the arguments are given in their default order (from, to)
zm
[1] 1 2 3 4 5 6 7 8 9 10
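The printed output looks the same for every version, but the vectors are not all stored the same way. The sketch below checks this; the object names x_c, x_colon and x_seq are illustrative and not part of the exercise.

```r
# Illustrative object names; not part of the original exercise.
x_c     <- c(1, 2, 3, 4, 5, 6, 7, 8, 9, 10)  # typed literals are stored as doubles
x_colon <- 1:10                               # a:b with whole numbers gives integers
x_seq   <- seq(from = 1, to = 10, by = 1)

typeof(x_c)      # "double"
typeof(x_colon)  # "integer"

# identical() also compares the storage type, so it is stricter than ==
identical(x_c, x_colon)        # FALSE
all(x_c == x_colon)            # TRUE
isTRUE(all.equal(x_c, x_seq))  # TRUE
```

For this exercise the difference does not matter, but it explains why identical() can report FALSE for two vectors that print the same.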
- Create the vector [2.1, 4.1, 6.1, 8.1] in two ways, once using c() and once using seq()
a<- c(2.1,4.1,6.1,8.1) #Using the c function
a
[1] 2.1 4.1 6.1 8.1
b<- seq(from= 2.1, to= 8.1, by= 2) #Using the seq function
b
[1] 2.1 4.1 6.1 8.1
- Create the vector [0, 5, 10, 15] in 3 ways: using c(), seq() with a by argument, and seq() with a length.out argument.
c<- c(0,5,10,15) #Using the c function
c
[1] 0 5 10 15
d<- seq(from= 0, to=15, by=5) #Using the seq function and by argument
d
[1] 0 5 10 15
e<- seq(from= 0, to= 15, length.out =4) #Using the seq function and length.out argument
e
[1] 0 5 10 15
With the by argument, seq() steps from the start value in fixed increments of x. With the length.out argument, seq() creates a sequence of exactly x evenly spaced values between the start and end values.
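A small sketch of how the two arguments relate, assuming the same start and end values as above (the object names are illustrative). The step implied by length.out is (to - from) / (length.out - 1).

```r
from <- 0
to   <- 15

by_version  <- seq(from = from, to = to, by = 5)
len_version <- seq(from = from, to = to, length.out = 4)

# Step size implied by length.out = 4
implied_step <- (to - from) / (4 - 1)
implied_step                                  # 5
isTRUE(all.equal(by_version, len_version))    # TRUE

# When by does not divide the range evenly, seq() stops at the last value <= to
stops_short <- seq(from = 0, to = 16, by = 5)         # 0 5 10 15
# length.out always includes both end points, so the step adapts instead
hits_end    <- seq(from = 0, to = 16, length.out = 4)
```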
- Create the vector [101, 102, 103, 200, 205, 210, 1000, 1100, 1200] using a combination of the c() and seq() functions
f<- c(seq(from= 101, to=103, by= 1), seq(from=200, to=210, length.out =3), seq(from= 1000, to=1200, by=100)) #Three separate seq() calls are needed because each segment has a different step; a single seq() call cannot produce all three
f
[1] 101 102 103 200 205 210 1000 1100 1200
r<- seq(101,103)
m<-seq(200,210,5)
n<-seq(1000,1200,100)
o<- c(r,m,n)
o
[1] 101 102 103 200 205 210 1000 1100 1200
I used both the length.out and by arguments to show that they can be combined and still yield the same result. The second version is a bit longer, but it makes it easier to see how several vectors are joined into one.
- A new batch of 100 pirates are boarding your ship and need new swords. You have 10 scimitars, 40 broadswords, and 50 cutlasses that you need to distribute evenly to the 100 pirates as they board. Create a vector of length 100 where there is 1 scimitar, 4 broadswords, and 5 cutlasses in each group of 10. That is, in the first 10 elements there should be exactly 1 scimitar, 4 broadswords and 5 cutlasses. The next 10 elements should also have the same number of each sword (and so on).
swords10 <- c(rep("s", times= 1), rep("b", times= 4), rep("c", times =5)) #creating a vector that gives 10 pirates the exact number of each sword, where "s" stands for scimitars, "b" for broadswords, and "c" for cutlasses
swords10
[1] "s" "b" "b" "b" "b" "c" "c" "c" "c" "c"
swords100 <- rep(swords10, times=10) #repeating the previous vector 10 times so all 100 pirates have a sword
swords100
[1] "s" "b" "b" "b" "b" "c" "c" "c" "c" "c" "s" "b" "b" "b" "b" "c" "c"
[18] "c" "c" "c" "s" "b" "b" "b" "b" "c" "c" "c" "c" "c" "s" "b" "b" "b"
[35] "b" "c" "c" "c" "c" "c" "s" "b" "b" "b" "b" "c" "c" "c" "c" "c" "s"
[52] "b" "b" "b" "b" "c" "c" "c" "c" "c" "s" "b" "b" "b" "b" "c" "c" "c"
[69] "c" "c" "s" "b" "b" "b" "b" "c" "c" "c" "c" "c" "s" "b" "b" "b" "b"
[86] "c" "c" "c" "c" "c" "s" "b" "b" "b" "b" "c" "c" "c" "c" "c"
swords10 creates a vector with 10 swords and the right amount of each type per 10 pirates. swords100 repeats the previous vector 10 times to give all pirates swords. These are alternative ways of solving the problem:
swordstry2 <-rep(c(rep("s", times= 1), rep("b", times= 4), rep("c", times =5)), times=10)
swordstry2
[1] "s" "b" "b" "b" "b" "c" "c" "c" "c" "c" "s" "b" "b" "b" "b" "c" "c"
[18] "c" "c" "c" "s" "b" "b" "b" "b" "c" "c" "c" "c" "c" "s" "b" "b" "b"
[35] "b" "c" "c" "c" "c" "c" "s" "b" "b" "b" "b" "c" "c" "c" "c" "c" "s"
[52] "b" "b" "b" "b" "c" "c" "c" "c" "c" "s" "b" "b" "b" "b" "c" "c" "c"
[69] "c" "c" "s" "b" "b" "b" "b" "c" "c" "c" "c" "c" "s" "b" "b" "b" "b"
[86] "c" "c" "c" "c" "c" "s" "b" "b" "b" "b" "c" "c" "c" "c" "c"
In swordstry2, I joined the two previous lines of code into one. To do that, I simply pasted the code for the swords10 vector into the rep() call used to create swords100. This code is less wordy, but a little harder to understand at first.
swordstry3 <-rep(c("s","b","c"), c(1,4,5)) #This is a shorter version of swords10: the second vector gives how many times each element of the first is repeated.
swordstry3
[1] "s" "b" "b" "b" "b" "c" "c" "c" "c" "c"
swordstry4 <-rep(rep(c("s","b","c"), c(1,4,5)), times=10) #This is swords100 written more compactly.
swordstry4
[1] "s" "b" "b" "b" "b" "c" "c" "c" "c" "c" "s" "b" "b" "b" "b" "c" "c"
[18] "c" "c" "c" "s" "b" "b" "b" "b" "c" "c" "c" "c" "c" "s" "b" "b" "b"
[35] "b" "c" "c" "c" "c" "c" "s" "b" "b" "b" "b" "c" "c" "c" "c" "c" "s"
[52] "b" "b" "b" "b" "c" "c" "c" "c" "c" "s" "b" "b" "b" "b" "c" "c" "c"
[69] "c" "c" "s" "b" "b" "b" "b" "c" "c" "c" "c" "c" "s" "b" "b" "b" "b"
[86] "c" "c" "c" "c" "c" "s" "b" "b" "b" "b" "c" "c" "c" "c" "c"
swordstry5<- rep(c("scimitar","broadsword","cutlasses"),times=c(1,4,5))
swordstry6<-rep(swordstry5,length.out=100)
swordstry6
[1] "scimitar" "broadsword" "broadsword" "broadsword" "broadsword"
[6] "cutlasses" "cutlasses" "cutlasses" "cutlasses" "cutlasses"
[11] "scimitar" "broadsword" "broadsword" "broadsword" "broadsword"
[16] "cutlasses" "cutlasses" "cutlasses" "cutlasses" "cutlasses"
[21] "scimitar" "broadsword" "broadsword" "broadsword" "broadsword"
[26] "cutlasses" "cutlasses" "cutlasses" "cutlasses" "cutlasses"
[31] "scimitar" "broadsword" "broadsword" "broadsword" "broadsword"
[36] "cutlasses" "cutlasses" "cutlasses" "cutlasses" "cutlasses"
[41] "scimitar" "broadsword" "broadsword" "broadsword" "broadsword"
[46] "cutlasses" "cutlasses" "cutlasses" "cutlasses" "cutlasses"
[51] "scimitar" "broadsword" "broadsword" "broadsword" "broadsword"
[56] "cutlasses" "cutlasses" "cutlasses" "cutlasses" "cutlasses"
[61] "scimitar" "broadsword" "broadsword" "broadsword" "broadsword"
[66] "cutlasses" "cutlasses" "cutlasses" "cutlasses" "cutlasses"
[71] "scimitar" "broadsword" "broadsword" "broadsword" "broadsword"
[76] "cutlasses" "cutlasses" "cutlasses" "cutlasses" "cutlasses"
[81] "scimitar" "broadsword" "broadsword" "broadsword" "broadsword"
[86] "cutlasses" "cutlasses" "cutlasses" "cutlasses" "cutlasses"
[91] "scimitar" "broadsword" "broadsword" "broadsword" "broadsword"
[96] "cutlasses" "cutlasses" "cutlasses" "cutlasses" "cutlasses"
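swordstry3 is a simpler format that uses a single rep() call, with the values and their repetition counts given as two c() vectors. swordstry4 repeats swordstry3 10 times. swordstry5 and swordstry6 do the same with full sword names, using length.out=100 instead of times=10 to reach 100 elements.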
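Rather than reading through 100 printed elements, the result can be checked in code. This is only a sketch: the object names swords and groups are illustrative, and it relies on matrix() filling column by column, so each column holds one group of 10 pirates.

```r
# Same construction as swordstry4 above
swords <- rep(rep(c("s", "b", "c"), times = c(1, 4, 5)), times = 10)

length(swords)   # 100
table(swords)    # overall totals: b 40, c 50, s 10

# One column per group of 10 pirates (matrix() fills column-wise)
groups <- matrix(swords, nrow = 10)

# Count each sword type within every group
all(colSums(groups == "s") == 1)   # TRUE
all(colSums(groups == "b") == 4)   # TRUE
all(colSums(groups == "c") == 5)   # TRUE
```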
- Create a vector that repeats the integers from 1 to 5, 10 times. That is [1, 2, 3, 4, 5, 1, 2, 3, 4, 5, …]. The length of the vector should be 50!
g<- rep(1:5, times =10)
g
[1] 1 2 3 4 5 1 2 3 4 5 1 2 3 4 5 1 2 3 4 5 1 2 3 4 5 1 2 3 4 5 1 2 3 4 5
[36] 1 2 3 4 5 1 2 3 4 5 1 2 3 4 5
galt <- c(1:5)
galt
[1] 1 2 3 4 5
galt2 <- rep(galt, times =10) #wordier code, but clearer to read if I forget the gist of it
galt2
[1] 1 2 3 4 5 1 2 3 4 5 1 2 3 4 5 1 2 3 4 5 1 2 3 4 5 1 2 3 4 5 1 2 3 4 5
[36] 1 2 3 4 5 1 2 3 4 5 1 2 3 4 5
galt3<-rep(c(1,2,3,4,5),length.out=50)
galt3
[1] 1 2 3 4 5 1 2 3 4 5 1 2 3 4 5 1 2 3 4 5 1 2 3 4 5 1 2 3 4 5 1 2 3 4 5
[36] 1 2 3 4 5 1 2 3 4 5 1 2 3 4 5
galt is used within galt2 to create the same result as g. The code is wordier, but it is easier to see how the vectors come together. galt3 reaches the same result by recycling the vector until length.out=50 elements are produced.
- Now, create the same vector as before, but this time repeat 1, 10 times, then 2, 10 times, etc. That is [1, 1, 1, …, 2, 2, 2, …, 5, 5, 5]. The length of the vector should also be 50.
h <- rep(c(1:5), each =10) #no need to add times=1, since repeating the whole vector once is the default
h
[1] 1 1 1 1 1 1 1 1 1 1 2 2 2 2 2 2 2 2 2 2 3 3 3 3 3 3 3 3 3 3 4 4 4 4 4
[36] 4 4 4 4 4 5 5 5 5 5 5 5 5 5 5
w<-rep(c(1,2,3,4,5),each=10,length.out=50)
w
[1] 1 1 1 1 1 1 1 1 1 1 2 2 2 2 2 2 2 2 2 2 3 3 3 3 3 3 3 3 3 3 4 4 4 4 4
[36] 4 4 4 4 4 5 5 5 5 5 5 5 5 5 5
This is the same call as galt2, except that the times argument is left at its default of 1 and I have added each=10 to repeat each number 10 times. In w, length.out=50 is redundant, since each=10 already yields 50 elements.
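The difference between times and each is easier to see on a shorter vector. A minimal sketch with illustrative object names:

```r
cycle_small <- rep(1:3, times = 2)            # 1 2 3 1 2 3  (whole vector repeated)
block_small <- rep(1:3, each = 2)             # 1 1 2 2 3 3  (each element repeated)
both_small  <- rep(1:3, each = 2, times = 2)  # 1 1 2 2 3 3 1 1 2 2 3 3

# The two 50-element vectors hold the same values in a different order
cycled  <- rep(1:5, times = 10)
blocked <- rep(1:5, each = 10)
identical(sort(cycled), blocked)   # TRUE
```

When both arguments are supplied, each is applied first and the result is then repeated times times.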
- Create a vector containing 50 samples from a Normal distribution with a population mean of 20 and standard deviation of 2.
i<- rnorm(50, mean= 20, sd=2)
i
[1] 18.83234 21.01646 19.82972 21.29045 18.81231 21.94318 18.84860
[8] 19.45026 21.72308 21.14108 18.00189 19.65106 17.80373 18.94642
[15] 20.22401 22.12111 23.72048 19.52390 19.02718 18.09501 22.84299
[22] 16.97512 20.41328 21.96479 18.96943 20.57759 17.81581 21.37442
[29] 22.09976 20.89112 21.92272 20.45444 20.90367 20.25653 17.51852
[36] 18.46248 20.28065 20.71817 22.93471 20.04638 21.88571 19.33634
[43] 22.07797 17.26762 23.67309 18.98197 20.10185 21.05183 22.72083
[50] 19.60263
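Because rnorm() draws random values, rerunning the line above gives different numbers each time. If reproducible output is wanted, one option is to set a seed first. This is a sketch only; the seed value 42 and the object names are arbitrary choices for illustration.

```r
set.seed(42)                      # arbitrary seed, chosen only for illustration
draws_a <- rnorm(50, mean = 20, sd = 2)

set.seed(42)                      # resetting the same seed replays the same draws
draws_b <- rnorm(50, mean = 20, sd = 2)

identical(draws_a, draws_b)       # TRUE

# The sample mean and sd should be close to 20 and 2, but not exactly equal
length(draws_a)
mean(draws_a)
sd(draws_a)
```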
- Create a vector containing 25 samples from a Uniform distribution with a lower bound of -100 and an upper bound of -50.
j<- runif(25, -100, -50) #I don't need to write min= and max=, since runif() matches unnamed arguments by position (n, min, max).
j
[1] -78.22388 -95.21501 -96.19566 -95.46194 -65.86185 -94.98939 -68.98670
[8] -87.11942 -64.96901 -59.96831 -79.39296 -59.20089 -59.83845 -78.76075
[15] -97.63543 -69.06691 -61.44043 -91.18795 -84.28111 -62.61370 -74.85105
[22] -96.76955 -70.47684 -53.02273 -96.20513
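The positional and named forms can be compared directly by fixing the seed before each call. A small sketch, with an arbitrary seed and illustrative object names:

```r
set.seed(1)   # arbitrary seed so the three calls draw the same numbers
u_positional <- runif(25, -100, -50)

set.seed(1)
u_named <- runif(n = 25, min = -100, max = -50)

set.seed(1)
u_shuffled <- runif(max = -50, n = 25, min = -100)  # named arguments can go in any order

identical(u_positional, u_named)      # TRUE
identical(u_positional, u_shuffled)   # TRUE

# Every draw should lie between the two bounds
range(u_positional)
all(u_positional >= -100 & u_positional <= -50)   # TRUE
```

Naming the arguments is wordier, but it avoids accidentally swapping the lower and upper bounds.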
Drennan Ch.1
KRV <- data.frame(Area=c(12.8, 11.5, 14, 1.3, 10.3, 9.8, 2.3, 15.3, 11.2, 3.4, 12.8, 13.9, 9, 10.6, 9.9, 13.4, 8.7, 3.8, 11.7, 1.7, 12.3, 11, 2.9, 10.7, 7.4, 8.2, 2, 2.2, 4.5))
This is the data from the Drennan chapter. The line above must be run in the console before the data can be used to create a histogram. I also loaded the data set called Scrapers.
- Recreate the Kiskiminetas River Valley histogram in the chapter. Note the number of bins and how the bin members are decided. Is the cutoff at the bottom or top of the range? How can you adjust this?
KRV$Area #Extract only the Area column of the data frame so the histogram receives a numeric vector
[1] 12.8 11.5 14.0 1.3 10.3 9.8 2.3 15.3 11.2 3.4 12.8 13.9 9.0 10.6
[15] 9.9 13.4 8.7 3.8 11.7 1.7 12.3 11.0 2.9 10.7 7.4 8.2 2.0 2.2
[29] 4.5
KRVhist<- hist(KRV$Area, main= "Areas of 29 Sites in the Kiskiminetas River Valley", xlab= "Area", breaks = c(1:16)) #Create a histogram. Making it an object and then running the object gives a breakdown summary of the histogram

KRVhist
$breaks
[1] 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16
$counts
[1] 3 3 2 1 0 0 1 3 2 4 3 3 3 0 1
$density
[1] 0.10344828 0.10344828 0.06896552 0.03448276 0.00000000 0.00000000
[7] 0.03448276 0.10344828 0.06896552 0.13793103 0.10344828 0.10344828
[13] 0.10344828 0.00000000 0.03448276
$mids
[1] 1.5 2.5 3.5 4.5 5.5 6.5 7.5 8.5 9.5 10.5 11.5 12.5 13.5 14.5
[15] 15.5
$xname
[1] "KRV$Area"
$equidist
[1] TRUE
attr(,"class")
[1] "histogram"
This histogram has 15 bins and 16 breaks (16 break points always enclose 15 bins). I had to set the breaks to 1:16 for the histogram to have the same cuts as the one in the book. By default, hist() uses right = TRUE, so each bin is a right-closed interval (a, b]: a value that falls exactly on a break, such as 2.0 or 14.0, is counted in the bin below the break, not the one above it. This is why the histogram is not exactly like the one in the book. Changing the right argument moves those boundary values into the bin above the break.


Setting right = FALSE makes the bins left-closed intervals [a, b), so a value that falls exactly on a break is counted in the bin above the break.
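The effect of the right argument can be seen by comparing the bin counts without drawing anything. This is a sketch: area is an illustrative copy of the KRV$Area values listed above, and plot = FALSE only suppresses the plot.

```r
# Illustrative copy of KRV$Area
area <- c(12.8, 11.5, 14, 1.3, 10.3, 9.8, 2.3, 15.3, 11.2, 3.4, 12.8, 13.9,
          9, 10.6, 9.9, 13.4, 8.7, 3.8, 11.7, 1.7, 12.3, 11, 2.9, 10.7, 7.4,
          8.2, 2, 2.2, 4.5)

closed_right <- hist(area, breaks = 1:16, plot = FALSE)                 # default, right = TRUE
closed_left  <- hist(area, breaks = 1:16, right = FALSE, plot = FALSE)

closed_right$counts   # 3 3 2 1 0 0 1 3 2 4 3 3 3 0 1
closed_left$counts    # 2 4 2 1 0 0 1 2 3 3 4 3 2 1 1

# Only values sitting exactly on a break change bins
on_break <- area[area %in% 1:16]
on_break              # 14 9 11 2
```

The four values that sit exactly on a break (2, 9, 11 and 14) are the only ones that move, which accounts for every difference between the two sets of counts.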
- Make two histograms of the scraper length data with different bin sizes. Do you notice anything different in the data distribution when you change the number of breaks?
Scrapers$Length #Subset only the length data to have numeric values and plot them on a histogram
[1] 25.8 6.3 44.6 21.3 25.7 20.6 22.2 10.5 18.9 25.9 23.8 22.0 10.6 33.2
[15] 16.8 21.8 48.3 15.8 39.4 43.5 39.8 16.3 40.5 91.7 21.7 17.9 29.3 39.1
[29] 42.5 49.6 13.7 19.1 40.6 49.1 41.7 15.2 21.2 30.2 40.0 20.2 31.9 42.3
[43] 47.2 50.5 10.6 23.1 44.1 45.8
Scrapershist <- hist(Scrapers$Length, main= "Scraper Lengths from Pine Ridge Cave and Willow Flats Site",xlab = "Scraper Length") #Create histogram. You can also run the object to find the breakdown of the histogram (i.e., the breaks for the bins)

Scrapershist5 <- hist(Scrapers$Length, main= "Scraper Lengths 5 breaks",xlab = "Scraper Length", breaks= 5) #A single number for breaks is only a suggestion; R picks nearby "pretty" break points

Scrapershist2 <- hist(Scrapers$Length, main= "Scraper Lengths 2 breaks",xlab = "Scraper Length", breaks= 2)

Scrapershist20 <- hist(Scrapers$Length,main= "Scraper Lengths 20 breaks",xlab = "Scraper Length", breaks= 20)

Alternative way to subset the data:
slength<- Scrapers[,3]
slength
[1] 25.8 6.3 44.6 21.3 25.7 20.6 22.2 10.5 18.9 25.9 23.8 22.0 10.6 33.2
[15] 16.8 21.8 48.3 15.8 39.4 43.5 39.8 16.3 40.5 91.7 21.7 17.9 29.3 39.1
[29] 42.5 49.6 13.7 19.1 40.6 49.1 41.7 15.2 21.2 30.2 40.0 20.2 31.9 42.3
[43] 47.2 50.5 10.6 23.1 44.1 45.8
hist(slength,main= "Scraper Lengths 48 breaks",xlab = "Scraper Length",breaks = 48)

hist(slength,main= "Scraper Lengths 30 breaks",xlab = "Scraper Length",breaks = 30)
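When breaks is a single number, hist() treats it as a suggestion and chooses rounded break points, so the number of bins drawn may not match the number requested. The sketch below inspects the breaks that were actually used and shows one way to force exact bins with seq(). Here scraper_len is an illustrative copy of the lengths printed above, and the 0 to 100 range with a width of 10 is an assumption chosen to cover the data.

```r
# Illustrative copy of the scraper lengths
scraper_len <- c(25.8, 6.3, 44.6, 21.3, 25.7, 20.6, 22.2, 10.5, 18.9, 25.9,
                 23.8, 22.0, 10.6, 33.2, 16.8, 21.8, 48.3, 15.8, 39.4, 43.5,
                 39.8, 16.3, 40.5, 91.7, 21.7, 17.9, 29.3, 39.1, 42.5, 49.6,
                 13.7, 19.1, 40.6, 49.1, 41.7, 15.2, 21.2, 30.2, 40.0, 20.2,
                 31.9, 42.3, 47.2, 50.5, 10.6, 23.1, 44.1, 45.8)

# A single number is only a suggestion; inspect what R actually used
suggested <- hist(scraper_len, breaks = 20, plot = FALSE)
suggested$breaks
length(suggested$counts)   # number of bins actually drawn

# A vector of break points is used exactly as given
exact <- hist(scraper_len, breaks = seq(from = 0, to = 100, by = 10), plot = FALSE)
length(exact$counts)       # 10 bins, each 10 units wide
```

Passing an explicit vector of breaks makes two histograms directly comparable, since the bin edges no longer depend on how R rounds the suggestion.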

