There are two main goals of this first homework: 1) For you to download R and RStudio so that you are set up for all future homework and labs and 2) To introduce R and RStudio with some basic exploratory work.
If you have any trouble with either of these tasks, please go to a TA’s office hours for help. It is crucial that you are properly set up for this class by the time the first homework is due so that you do not fall behind.
Throughout this class, homework will be uploaded onto Blackboard. For Homework 1 the following should be uploaded onto Blackboard:
Your LastF-HW1.R file where LastF is replaced by your last name and the first letter of your first name.
Your LastF-HW1.pdf file
In this homework, you will be given instructions on how to create these files. For information on how to upload files to blackboard see http://bbhelp.cit.cornell.edu/assignments/.
In this class, we will be downloading and uploading data and files from Blackboard and possibly other sources. It is important that you know where these files are located. Here we will create a folder for BTRY 3010 on your computer.
The instructions below are for students who have personal computers that they will use for this course. If you are going to use lab computers, create a “BTRY 3010” folder on your thumb drive. Then within this folder, create a folder for homework 1. It is best if every week you create new folders to store all of the files for the homework and lab for that week.
Please complete the following tasks using the appropriate instructions for your computer (i.e. Mac or Windows).
For Macs: Every file you have on your computer can be referenced by a file path. If you open Finder which should be on your toolbar. It will look something like this, where your home directory will be labeled differently than the one shown here and you may have different folders listed.
All of your computer files can be accessed through Finder. The home directory here is called “ceciliaearls”. In this directory is found a folder labeled “Documents”. For most of you, you likely have your documents folder under your home directory. If not, please create a folder under your home directory called “Documents.” You can do this by clicking on your home directory on the left column of your Finder window. Then from the Finder menu at the top of your screen, choose File -> New Folder. This will create a new folder in your home directory. You will need to rename it by clicking on the name it currently has and changing it to “Documents”.
Once you have a “Documents” folder, create a new folder within “Documents” called “BTRY 3010.” Again, this is done by double clicking on the folder “Documents” to open it and choosing File -> New Folder from the menu bar. Again, rename this folder as indicated above.
We suggest you create a new folder for every assignment in this class where all the files for that assignment are kept. Use the directions in step 2 to create a folder for homework 1 in your BTRY 3010 folder.
On the left hand side of your Finder window, the folder “Downloads” should be listed. If not, it should be found under your home directory. This is an important folder as everything you download will be put in that folder. Now open your internet browser. We suggest that you use Safari.
In Blackboard, go to the folder for homework 1 and right click on the HW1.R file. Choose “Download Linked File.” This will send this file to your “Downloads” folder. Perform the same steps to download the HW1.doc file.
Now put HW1.R in your homework 1 folder. You can do this by opening two finder windows. A second Finder window can be opened by choosing File -> New Finder Window from the menu bar for Finder. Open one Finder window to “Downloads”. Open the other Finder window to your folder for HW1. Click on the HW1.R and HW1.doc files in “Downloads” and drag them into the folder for homework 1. Rename this file as “LastF-HW1.R” and “LastF-HW1.doc” .
Proceed to the instructions for Macs to download R and RStudio.
For Windows: Every file you have on your computer can be referenced by a file path.
Clicking on this icon gives you access to all of the files on your computer. On the right hand side after you click on Start will be your “Documents” folder. Open this folder by clicking on “Documents”.
Create a new folder within “Documents” called “BTRY 3010.” You can do this by clicking on “New Folder” on the menu on the top of the window for your “Documents” folder. After the folder is created, change the name of this folder to “BTRY 3010.”
Click on this new folder. We suggest you create a new folder for every assignment in this class where all the files for that assignment are kept. Use the directions in step 2 to create a folder for homework 1 in your BTRY 3010 folder.
Note that in the window containing your “Documents” folder, on the left hand side are a list of other folders that you can also access. One of these folders is called “Downloads”. Anything you download off of the internet will be placed in this folder unless you specify otherwise. We recommend using Explorer or Firefox as your browser.
In Blackboard, go to the folder for homework 1 and right click on the HW1.R file. Choose Save target as…. A window will pop up in which you can specify where this file is saved. If you are not already in the folder you created for homework 1, select “Documents” from the left hand side of this window. Then double click on the BTRY 3010 folder you created within “Documents”, and finally double click on the homework 1 folder that you created within your BTRY 3010 folder. Change the name of this .R file to “LastF-HW1.R” before you save it. Save this document in your folder for homework 1 by clicking on “Save” at the bottom of this window. Repeat these same steps to download HW1.doc and rename it “LastF-HW1.doc”. Both files should be in your folder for homework 1.
Proceed to the instructions for Windows to download R and RStudio.
In this class we will primarily be using RStudio for data analysis. RStudio is an convenient interface for the R programming language. Here we will go through the steps to download R and RStudio. Note: If you already have these applications, please download the latest versions so that RStudio has the same look and functionality for everyone.
For Macs:
We will first download R. Go to the wesite, http://cran.r-project.org/bin/macosx.
Click on either R-3.1.1-snowleopard.pkg or R-3.1.1-mavericks.pkg depending on your operating system, see descriptions. This will download the R package on your computer. It will take a few minutes or significantly more depending on your downloading speed.
Go to your downloads folder and double click on the package you downloaded. This will start the installer for R. Go through the steps to start the installation. When you have choices, unless you prefer otherwise, choose the defaults. This will take a couple of minutes.
R should now be in your “Applications” folder which can again be found on the left hand side of your Finder window.
Next, we will download RStudio, our main tool for this class. Go to the website http://www.rstudio.com/products/rstudio/download/.
Click on RStudio 0.98.994 - Mac OS X 10.6+ (64-bit). This will download RStudio and a folder will pop up that includes both the “Applications” folder and RStudio. Drag RStudio into the “Applications” folder and close this window.
Likely after step 6, you will have an RStudio icon on your desktop. You can right click on this icon and choose Eject to get rid of this icon.
You are now ready to use RStudio.
For Windows:
We will first download R. Go to the wesite, http://cran.r-project.org/bin/windows/base/.
Click on Download R-3.1.1 for Windows. At the bottom of the Explorer window, you will be asked whether you want to run or save this file. You can go ahead and run it unless you would like to start the installation at a later time. After selecting to run this program, you may be asked if you want this program to make changes to your computer. Click “Yes.”
This will start the installer for R. Go through the steps to start the installation. When you have choices, unless you prefer otherwise, choose the defaults. This will take a few minutes.
R should now be in your “Programs” folder. It can be accessed by clicking on the “Start” icon in the bottom left corner of your screen and choosing “All Programs”. You should see a folder for R.
Finally, we will download RStudio, our main tool for this class. Go to the website http://www.rstudio.com/products/rstudio/download/.
Click on RStudio 0.98.994 - Windows XP/Vista/7/8. Again, at the bottom of the Explorer window you will be asked whether you want to run or save this file. Go ahead and run it. You will likely be asked if you want this program to make changes to your computer. Choose, “Yes.”
This will start an installer for RStudio. Unless you prefer otherwise, choose the defaults for the installation process. This will take a few minutes.
Once RStudio is installed, it also should be found in your program files.
You are now ready to use RStudio.
The goal of this homework is to introduce you to R and RStudio, which you’ll be using throughout the course both to learn the statistical concepts discussed in the texbook and also to analyze real data and come to informed conclusions. To straighten out which is which: R is the name of the programming language itself and RStudio is a convenient environment for writing and running R code.
As the course progresses, you are encouraged to explore beyond what the labs and homework dictate; a willingness to experiment will make you a much better programmer. Before we get to that stage, however, you need to build some basic fluency in R. Today we begin with the fundamental building blocks of R and RStudio: the interface and basic commands.
The panel in the upper right contains your workspace as well as a history of the commands that you’ve previously entered. Any plots that you generate will show up in the panel in the lower right corner.
The panel on the left is where the action happens. It’s called the console. Everytime you launch RStudio, it will have the same text at the top of the console telling you the version of R that you’re running. Below that information is the prompt, >. As its name suggests, this prompt is really a request, a request for a command. Initially, interacting with R is all about typing commands and interpreting the output. These commands and their syntax have evolved over decades (literally) and now provide what many users feel is a fairly natural way to access data and organize, describe, and invoke statistical computations.
Complete the following steps to create your first homework document.
For this homework, you will modify the previously created HW1.R and HW1.doc documents. These files should now be named “LastF-HW1.R” and “LastF-HW1.doc” and be found in your folder for homework 1. Your final submission for this homework that will be updloaded into blackboard will be your edited “LastF-HW1.R” document and your edited “LastF-HW1.doc” saved as a pdf. Note, this means you can use any text editor available to you after initially copying the homework from the .doc file.
In RStudio, in the lower right window click on the tab “Files.” In “Files” navigate to your .R file for homework 1. Click the highlighted name of this file to open it in RStudio. This file will now be opened in a new window on the left above your R Console.
Click on the window of your .R file. You can edit this file. In lines 4 and 5 of this file, put in your name and your NetID.
For this homework, you will perform a number of basic tasks in R through RStudio. At the end of this homework you will upload both your homework .R and .pdf files to blackboard.
R is an application primarily used for data analysis. To use R, you need to use the proper syntax for coding in R. In this class we do not assume prior knowledge of this syntax, and you will be given a lot of code that you only need to run. In lab, you may be given code to use for a particular dataset that you can edit for use in your homework when applying this code to a different dataset.
Many “libraries” or “packages” have been developed for R. These libraries contain functions that can be used directly. For example, the function ‘plot(x,y)’ will plot points on a graph where the domain and range have been user defined.
You will not have all packages for R installed on your computer, but many basic functions are available after an initial installation of R. If we need more functions as the semester progresses, you will be told how to download and access these functions.
For the rest of homework 1, follow the instructions in your .doc file for homework 1.