Introductory Statistics (CRN: 6896)



Objective

Today, we’ll learn how to make reports in R. RStudio gives us the ability to analyse our data and write a report in the same file. The file can be updated, published online, and distributed in different formats. You have been receiving the past three lab manuals in this format. Now it’s time for you to learn how to create one of these reports.


Open a new file

The file format we use in R to create reports is called R Markdown. To create a blank R Markdown file, go to the File menu, select New File, and then select R Markdown…. See the path below:


This new file opens up in the top-left pane. The file is not empty. It comes with some code and text that you can delete. Everything past the second --- line (the end of the header) is just there to provide examples.

The first time you try to save the file, RStudio asks you to give it a name, so go ahead and call it Lab4-YourName. Once you’ve saved the file, you can click the Knit button (the one with the ball of yarn) to knit the file. Knit “will run each chunk of R code in the document and append the results of the code to the document next to the code chunk.” What does this mean?


Knit

If you recall from previous labs, every report contains text, code, and the output of that code. Knit goes through your markdown file and puts everything in the right place. If it’s text, Knit formats it and places it in the report. If it’s code, the code is placed in a box, and if it’s code output, the output is included in the report too. Here is how you separate code from text:

  • text: just type it in. There are many formatting options that you can look up and use.
  • code: you need to place it between ```{r} and ``` (each made of three backticks, the ` character, not apostrophes). The code between these markers will be run and the output will be shown next to it.

So for example, look at the line of code below:

# Read nc.csv from the report's folder into a data frame called nc.data
nc.data <- read.csv("./nc.csv")


In the markdown file, it is written like this:
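
As a rough sketch of the source (not a copy of the lab's own file), the same line sits inside a chunk whose opening and closing markers are each on their own line. It assumes that nc.csv is saved in the same folder as the report:

````markdown
```{r}
nc.data <- read.csv("./nc.csv")
```
````

When the file is knitted, everything between the two markers is run as R code, and everything outside them is treated as text.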


Note that the text and code in the new file are just there to give you examples. For example, one or more # characters in front of a line of text turn it into a heading, and more # characters make the heading smaller. A * on each side of some text makes it italic, and ** on each side makes it bold. You can find other formatting options here: https://github.com/rstudio/cheatsheets/raw/master/rmarkdown-2.0.pdf
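
A short sketch of these formatting marks as they would be typed in the .Rmd file. The heading and sentence text are made up for illustration:

```markdown
# A large heading

### A smaller heading

This word is *italic* and this word is **bold**.
```

Note the space after the # characters and the blank lines around the headings. Without them the line may be treated as ordinary text.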

Tips

  • These code chunks are highlighted in grey.
  • You can use command + option + i (Ctrl + Alt + I on Windows) to insert new code chunks.
  • You can also show the code output without showing the code by using the option echo = FALSE ({r, echo = FALSE}).
  • If you don’t want to run the code but want to have the code shown, you can use eval = FALSE ({r, eval = FALSE}).
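
The two options from the tips above might be used like this. This is only a sketch, and it assumes that nc.data has already been read in by an earlier chunk of the same file:

````markdown
```{r, echo = FALSE}
# This code is run and its output appears in the report, but the code itself is hidden
summary(nc.data)
```

```{r, eval = FALSE}
# This code is shown in the report but is not run, so it produces no output
summary(nc.data)
```
````

The first form is handy for a report aimed at readers who only care about the results. The second is useful for showing code that would take too long to run or that is only meant as an illustration.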

R Markdown gives you many options and choices to format your report (see more options here: https://rmarkdown.rstudio.com/index.html). You can also produce your output as a Word document, a PDF, and many other formats.
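
The output format is usually chosen in the header at the top of the file, between the --- lines. Below is a sketch of such a header that asks for a Word document. The title is a placeholder, and html_document and pdf_document are other common values (PDF output additionally needs a LaTeX installation on your computer):

```markdown
---
title: "Lab4-YourName"
output: word_document
---
```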


Summary of what we have learned so far

  • How to read a csv file
  • How many rows and columns the dataset has (dataset dimensions)
  • How to see the structure of the dataset
  • How to see the first few rows
  • How to see a summary of the dataset
  • How to get the mean of a numeric variable
  • How to calculate the median of a numeric variable
  • How to get the standard deviation of a variable
  • How to get the range of a numeric variable
  • How to give a value to a variable
  • How to access a column (variable) in our dataset
  • How to access a row in our dataset
  • How to create tables of values of a variable
  • How to deal with missing values (NA)
  • How to apply a function to a variable separately for different groups
  • How to plot the distribution of numeric variables
  • How to plot boxplots of numeric variables separated by groups (another variable)
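
As a quick refresher, the list above can be sketched in one short script. This is only an illustration: it uses R's built-in airquality dataset as a stand-in for nc.csv, the names csv.path, aq, avg.temp and ozone.by.month are made up for this example, and the earlier labs may have used different functions for some of the steps.

```r
# Stand-in data: write R's built-in airquality dataset to a temporary
# csv file so that read.csv() has something to read (the labs use nc.csv).
csv.path <- tempfile(fileext = ".csv")
write.csv(airquality, csv.path, row.names = FALSE)

aq <- read.csv(csv.path)      # read a csv file
dim(aq)                       # number of rows and columns
str(aq)                       # structure of the dataset
head(aq)                      # first few rows
summary(aq)                   # summary of every column

mean(aq$Temp)                 # mean of a numeric variable
median(aq$Temp)               # median
sd(aq$Temp)                   # standard deviation
range(aq$Temp)                # smallest and largest value

avg.temp <- mean(aq$Temp)     # give a value to a variable
aq$Wind                       # access a column (variable)
aq[3, ]                       # access a row (here the third one)
table(aq$Month)               # table of the values of a variable

mean(aq$Ozone)                # NA, because Ozone has missing values
mean(aq$Ozone, na.rm = TRUE)  # remove the missing values first

# Apply a function separately for different groups (here: each month)
ozone.by.month <- tapply(aq$Ozone, aq$Month, mean, na.rm = TRUE)

hist(aq$Temp)                      # distribution of a numeric variable
boxplot(Temp ~ Month, data = aq)   # boxplots separated by groups
```

Each of these lines can go into its own code chunk in your report, with a sentence of text before or after it explaining what the output shows.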

Now we want to use our knowledge of R to create our first report. In the files for this section, you will find an item called Lab Exercise. Open it and get started.
