There are two main goals of this first homework: 1) For you to download R, RStudio, and LaTex so that you are set up for all future homework and labs and 2) To introduce R and RStudio with some basic exploratory work.
If you have any trouble with either of these tasks, please go to a TA’s office hours for help. It is crucial that you are properly set up for this class by the time the first homework is due so that you do not fall behind.
For this class, homework will be uploaded onto Blackboard. For Homework 1 the following should be uploaded onto Blackboard:
A screen shot of RStudio from your computer where the “Files” tab is selected and the folder in which you are saving all of your files for HW 1 is open. This folder should include the files created for your first homework. This folder should be nested within another folder labeled “BTRY6010”. Creating these folders is part of your first assignment under “File Management” below.
Your LastF-HW1.Rmd and LastF-HW1.pdf files where LastF is replaced by your last name and the first letter of your first name. In this homework, you will be given instructions on how to create these files.
For information on how to upload files to blackboard see http://bbhelp.cit.cornell.edu/assignments/.
In this class, we will be downloading and uploading data and files from Blackboard and possibly other sources. It is important that you know where these files are located. Having a good system for organizing your files on the computer can go a long way to ensuring that you can find what you need from old projects years later (imagine a reader asks you for your code five years after you published a paper). Here we will create a folder for BTRY 6010 on your computer.
Please complete the following tasks using the appropriate instructions for your computer (i.e. Mac or Windows).
For Macs: Every file you have on your computer can be referenced by a file path. If you open Finder which should be on your toolbar. It will look something like this, where your home directory will be labeled differently than the one shown here and you may have different folders listed.
All of your computer files can be accessed through Finder. The home directory here is called “ceciliaearls”. In this directory is a folder labeled “Documents”. For most of you, you likely have your documents folder under your home directory. If not, please create a folder under your home directory called “Documents.” You can do this by clicking on your home directory on the left column of your Finder window. Then from the Finder menu at the top of your screen, choose File -> New Folder. This will create a new folder in your home directory. You will need to rename it by clicking on the name it currently has and changing it to “Documents”.
Once you have a “Documents” folder, create a new folder within “Documents” called “BTRY6010.” Again this is done by double clicking on the folder “Documents” to open it and choosing File -> New Folder from the menu bar. Again, rename this folder as indicated above.
We suggest you create a new folder for every assignment in this class where all the files for that assignment are kept. Use the directions in step 2 to create a folder for homework 1 in your BTRY 6010 folder.
On the left hand side of your Finder window, the folder “Downloads” should be listed. If not, it should be found under your home directory. This is an important folder as everything you download will be put in that folder. Now open your internet browser. We suggest that you use Safari.
In Blackboard, go to the folder for homework 1 and right click on the HW1.Rmd file. Choose “Download Linked File.” This will send this file to your “Downloads” folder.
Now put HW1.Rmd in your homework 1 folder. You can do this by opening two finder windows. A second Finder window can be opened by choosing File -> New Finder Window from the menu bar for Finder. Open one Finder window to “Downloads”. Open the other Finder window to your folder for HW1. Click on the HW1.Rmd file in “Downloads” and drag it into the folder for homework 1. Rename this file as “LastF-HW1.Rmd”.
Proceed to the instructions for Macs to download R, RStudio, and LaTex.
For Windows: Every file you have on your computer can be referenced by a file path.
Clicking on this icon gives you access to all of the files on your computer. On the right hand side after you click on Start will be your “Documents” folder. Open this folder by clicking on “Documents”.
Create a new folder within “Documents” called “BTRY6010.” You can do this by clicking on “New Folder” on the menu on the top of the window for your “Documents” folder. After the folder is created, change the name of this folder to “BTRY 6010 - Your Initials.”
Click on this new folder. We suggest you create a new folder for every assignment in this class where all the files for that assignment are kept. Use the directions in step 2 to create a folder for homework 1 in your BTRY 6010 folder.
Note that in the window containing your “Documents” folder, on the left hand side are a list of other folders that you can also access. One of these folders is called “Downloads”. Anything you download off of the internet will be placed in this folder unless you specify otherwise. We recommend using Explorer or Firefox as your browser.
In Blackboard, go to the folder for homework 1 and right click on the HW1.Rmd file. Choose Save target as…. A window will pop up in which you can specify where this file is saved. If you are not already in the folder you created for homework 1, select “Documents” from the left hand side of this window. Then double click on the BTRY 6010 folder you created within “Documents”, and finally double click on the homework 1 folder that you created within your BTRY 6010 folder. Change the name of this .Rmd file to “LastF-HW1.Rmd” before you save it. Save this document in your folder for homework 1 by clicking on “Save” at the bottom of this window.
Proceed to the instructions for Windows to download R, RStudio, and LaTeX.
In this class we will primarily be using RStudio for data analysis. RStudio depends on two other applications, R and LaTex. Here we will go through the steps to download all three of these applications. Note: If you already have these applications, please download the latest versions so that RStudio has the same look and functionality for everyone.
For Macs:
We will first download R. Go to the wesite, http://cran.r-project.org/bin/macosx.
Click on either R-3.1.1-snowleopard.pkg or R-3.1.1-mavericks.pkg depending on your operating system, see descriptions. This will download the R package on your computer. It will take a few minutes or significantly more depending on your downloading speed.
Go to your downloads folder and double click on the package you downloaded. This will start the installer for R. Go through the steps to start the installation. When you have choices, unless you prefer otherwise, choose the defaults. This will take a couple of minutes.
R should now be in your “Applications” folder which can again be found on the left hand side of your Finder window.
Next we will download MacTex, the LaTex editor for Mac. The steps are similar to those used for downloading R. You can find MacTex at http://www.tug.org/mactex/index.html.
To download MacTeX, click on MacTeX.pkg on this page. Again, this will take a few minutes or longer to download.
Once MacTeX.pkg is downloaded, you will again see it in “Downloads”. Double click on this package to start the installation.
Now MacTex will also be in your “Applications” folder.
Finally, we will download RStudio, our main tool for this class. Go to the website http://www.rstudio.com/products/rstudio/download/.
Click on RStudio 0.98.1028 - Mac OS X 10.6+ (64-bit). This will download RStudio and a folder will pop up that includes both the “Applications” folder and RStudio. Drag RStudio into the “Applications” folder and close this window.
Likely after step 10, you will have an RStudio icon on your desktop. You can right click on this icon and choose Eject to get rid of this icon.
You are now ready to use RStudio.
For Windows:
We will first download R. Go to the wesite, http://cran.r-project.org/bin/windows/base/.
Click on Download R-3.1.1 for Windows. At the bottom of the Explorer window, you will be asked whether you want to run or save this file. You can go ahead and run it unless you would like to start the installation at a later time. After selecting to run this program, you may be asked if you want this program to make changes to your computer. Click “Yes.”
This will start the installer for R. Go through the steps to start the installation. When you have choices, unless you prefer otherwise, choose the defaults. This will take a few minutes.
R should now be in your “Programs” folder. It can be accessed by clicking on the “Start” icon in the bottom left corner of your screen and choosing “All Programs”. You should see a folder for R.
Next we will download MiKTex, the LaTex editor for Windows. The steps are similar to those used for downloading R. You can find MiKTex at http://miktex.org/download.
To download MiKTeX, click on the blue box under “Recommended Download.” Again, you will be asked whether you want to run or save this file. After selecting to run the program, you may be asked whether you would like it to make changes to your computer. Choose “Yes.”
This will start the installer for MiKTeX. Go through the steps to start the installation. When you have choices, unless you prefer otherwise, choose the defaults. This will take a few minutes.
MiKTeX should now be in your “Programs” folder. It can be accessed by clicking on the “Start” icon in the bottom left corner of your screen and choosing “All Programs”. You should see a folder for MiKTeX.
Finally, we will download RStudio, our main tool for this class. Go to the website http://www.rstudio.com/products/rstudio/download/.
Click on RStudio 0.98.994 - Windows XP/Vista/7/8. Again, at the bottom of the Explorer window you will be asked whether you want to run or save this file. Go ahead and run it. You will likely be asked if you want this program to make changes to your computer. Choose, “Yes.”
This will start an installer for RStudio. Unless you prefer otherwise, choose the defaults for the installation process. This will take a few minutes.
Once RStudio is installed, it also should be found in your program files.
You are now ready to use RStudio.
Please note (for later): When you create your first .pdf file using R Markdown, you may need to download more files for MiKTeX. You will have an option to download them from the internet. Choose this option. It will give you a number of places where these files can be downloaded from. Choose something in the US.
The goal of this homework is to introduce you to R and RStudio, which you’ll be using throughout the course both to learn the statistical concepts discussed in the texbook and also to analyze real data and come to informed conclusions. To straighten out which is which: R is the name of the programming language itself and RStudio is a convenient environment for writing and running R code.
As the course progresses, you are encouraged to explore beyond what the labs and homework dictate; a willingness to experiment will make you a much better programmer. Before we get to that stage, however, you need to build some basic fluency in R. Today we begin with the fundamental building blocks of R and RStudio: the interface and basic commands.
The panel in the upper right contains your workspace as well as a history of the commands that you’ve previously entered. Any plots that you generate will show up in the panel in the lower right corner.
The panel on the left is where the action happens. It’s called the console. Everytime you launch RStudio, it will have the same text at the top of the console telling you the version of R that you’re running. Below that information is the prompt, >. As its name suggests, this prompt is really a request, a request for a command. Initially, interacting with R is all about typing commands and interpreting the output. These commands and their syntax have evolved over decades (literally) and now provide what many users feel is a fairly natural way to access data and organize, describe, and invoke statistical computations.
If you have used R before, the console in RStudio has the same functionality as the basic R console. Any command in R can be used in the exact same way in the RStudio console. RStudio, however, is designed to be more user friendly and to provide an easy way to reproduce any previous work in R through R Markdown. In this class, your homework will consist of pdf files “knitted” by R Markdown. In this section of your homework, you will produce your first homework file through R Markdown.
Complete the following steps to create your first homework document.
For this homework, you will modify a previously created R Markdown document. We will use the HW1.Rmd file downloaded in the first section of this homework. This file should now be named “LastF-HW1.Rmd” and be found in your folder for homework 1.
In RStudio, in the lower right window click on the tab “Files.” In “Files” navigate to your .Rmd file for homework 1. Click the highlighted name of this file to open it in RStudio. This file will now be opened in RStudio in a new window on the left above your R Console.
Click on the window of your .Rmd file. You can edit this file. In lines 13 and 14 of this file, put in your name and your NetID.
R Markdown can create (or “knit”) .html, .pdf, and Word documents from a .Rmd file. On the menu for your .Rmd file, click the arrow next to . This dropdown menu allows you to choose whether you want to knit a pdf, html, or Word document. For now, choose “Knit PDF” to knit this document into a .pdf file. The .pdf file will have the same name as your homework file with the new extension, .pdf. By default it is saved in the same folder as the .Rmd file from which it was generated. Look in your folder for your first homework using the lower right window of RStudio to verify your .pdf homework file is now there. This .pdf file is saved as the newest version every time you re-knit your pdf.
For this homework, you will perform a number of basic tasks in R through R Markdown. After each step (or maybe after a couple of steps), you should knit your .pdf file to make sure your code ran correctly. At the end of this homework you will upload both your homework .Rmd and .pdf files to blackboard.
R is a programming language primarily used for data analysis. To use R, you need to use the proper syntax for coding in R. In this class we do not assume prior knowledge of this syntax, and you will be given a lot of code that you only need to run. In lab, you may be given code to use for a particular dataset that you can edit for use in your homework when applying this code to a different dataset.
Many “libraries” or “packages” have been developed for R. These libraries contain functions that can be used directly. For example, the function plot(x,y) will plot points on a graph where the domain and range have been user defined.
You will not have all packages for R installed on your computer, but many basic functions are available after an initial installation of R. If we need more functions as the semester progresses, you will be told how to download and access these functions.
For the rest of homework 1, follow the instructions in your .Rmd file for homework 1.