R Markdown

Loading different datasets (csv, txt, xlsx, SPSS, WWW, json, SQL, noSQL) - this is just for practice & to know that it is possible to read data from multiple sources using R.

head(Ranking_csv)
##                      University         Country Overall Teaching Research
## 1     Michigan State University   United States    63.9     56.6     57.4
## 2 University of Texas at Austin United States      75.4     68.2     76.2
## 3  George Washington University   United States    53.8     48.6     32.4
## 4            Indiana University   United States    58.4     50.8     49.7
## 5          Princeton University   United States    93.2     90.3     96.3
## 6       University of Rochester   United States    55.6     43.4     30.3
##   Citations IndInc IntOut
## 1      80.0   37.8   63.3
## 2      93.2   46.6   39.1
## 3      81.5   35.0   55.5
## 4      76.7   49.8   53.5
## 5      98.8   58.6   81.1
## 6      91.4   40.9   67.2
head(HR_excel)
## # A tibble: 6 × 4
##   DepartmentID Name                     GroupName            ModifiedDate       
##          <dbl> <chr>                    <chr>                <dttm>             
## 1            1 Engineering              Research and Develo… 2008-04-30 00:00:00
## 2            2 Tool Design              Research and Develo… 2008-04-30 00:00:00
## 3            3 Sales                    Sales and Marketing  2008-04-30 00:00:00
## 4            4 Marketing                Sales and Marketing  2008-04-30 00:00:00
## 5            5 Purchasing               Inventory Management 2008-04-30 00:00:00
## 6            6 Research and Development Research and Develo… 2008-04-30 00:00:00
head(df_web)
##   sepal_length sepal_width petal_length petal_width species
## 1          5.1         3.5          1.4         0.2  setosa
## 2          4.9         3.0          1.4         0.2  setosa
## 3          4.7         3.2          1.3         0.2  setosa
## 4          4.6         3.1          1.5         0.2  setosa
## 5          5.0         3.6          1.4         0.2  setosa
## 6          5.4         3.9          1.7         0.4  setosa
head(df_web)
##   sepal_length sepal_width petal_length petal_width species
## 1          5.1         3.5          1.4         0.2  setosa
## 2          4.9         3.0          1.4         0.2  setosa
## 3          4.7         3.2          1.3         0.2  setosa
## 4          4.6         3.1          1.5         0.2  setosa
## 5          5.0         3.6          1.4         0.2  setosa
## 6          5.4         3.9          1.7         0.4  setosa
head(text_data)
## [1] "*** START OF THE PROJECT GUTENBERG EBOOK 11 ***"
## [2] ""                                               
## [3] "[Illustration]"                                 
## [4] ""                                               
## [5] ""                                               
## [6] ""
head(as.data.frame(continent))
##   BD BE BF BG BA BB WF BL BM BN BO BH BI BJ BT JM BV BW WS BQ BR BS JE BY BZ RU
## 1 AS EU AF EU EU NA OC NA NA AS SA AS AF AF AS NA AN AF OC NA SA NA EU EU NA EU
##   RW RS TL RE TM TJ RO TK GW GU GT GS GR GQ GP JP GY GG GF GE GD GB GA SV GN GM
## 1 AF EU OC AF AS AS EU OC AF OC NA AN EU AF NA AS SA EU SA AS NA EU AF NA AF AF
##   GL GI GH OM TN JO HR HT HU HK HN HM VE PR PS PW PT SJ PY IQ PA PF PG PE PK PH
## 1 NA EU AF AS AF AS EU NA EU AS NA AN SA NA AS OC EU EU SA AS NA OC OC SA AS AS
##   PN PL PM ZM EH EE EG ZA EC IT VN SB ET SO ZW SA ES ER ME MD MG MF MA MC UZ MM
## 1 OC EU NA AF AF EU AF AF SA EU AS OC AF AF AF AS EU AF EU EU AF NA AF EU AS AS
##   ML MO MN MH MK MU MT MW MV MQ MP MS MR IM UG TZ MY MX IL FR IO SH FI FJ FK FM
## 1 AF AS AS OC EU AF EU AF AS NA OC NA AF EU AF AF AS NA AS EU AS AF EU OC SA OC
##   FO NI NL NO NA. VU NC NE NF NG NZ NP NR NU CK XK CI CH CO CN CM CL CC CA CG
## 1 EU NA EU EU  AF OC OC AF OC AF OC AS OC OC OC EU AF EU SA AS AF SA AS NA AF
##   CF CD CZ CY CX CR CW CV CU SZ SY SX KG KE SS SR KI KH KN KM ST SK KR SI KP KW
## 1 AF AF EU EU AS NA NA AF NA AF AS NA AS AF AF SA OC AS NA AF AF EU AS EU AS AS
##   SN SM SL SC KZ KY SG SE SD DO DM DJ DK VG DE YE DZ US UY YT UM LB LC LA TV TW
## 1 AF EU AF AF AS NA AS EU AF NA NA AF EU NA EU AS AF NA SA AF OC AS NA AS OC AS
##   TT TR LK LI LV TO LT LU LR LS TH TF TG TD TC LY VA VC AE AD AG AF AI VI IS IR
## 1 NA AS AS EU EU OC EU EU AF AF AS AN AF AF NA AF EU NA AS EU NA AS NA NA EU AS
##   AM AL AO AQ AS AR AU AT AW IN AX AZ IE ID UA QA MZ
## 1 AS EU AF AN OC SA OC EU NA AS EU AS EU AS EU AS AF
head(df_sqlite)
##                      University         Country Overall Teaching Research
## 1     Michigan State University   United States    63.9     56.6     57.4
## 2 University of Texas at Austin United States      75.4     68.2     76.2
## 3  George Washington University   United States    53.8     48.6     32.4
## 4            Indiana University   United States    58.4     50.8     49.7
## 5          Princeton University   United States    93.2     90.3     96.3
## 6       University of Rochester   United States    55.6     43.4     30.3
##   Citations IndInc IntOut
## 1      80.0   37.8   63.3
## 2      93.2   46.6   39.1
## 3      81.5   35.0   55.5
## 4      76.7   49.8   53.5
## 5      98.8   58.6   81.1
## 6      91.4   40.9   67.2
sum(is.na(Ranking_csv))
## [1] 0
colSums(is.na(Ranking_csv))
## University    Country    Overall   Teaching   Research  Citations     IndInc 
##          0          0          0          0          0          0          0 
##     IntOut 
##          0
colMeans(is.na(Ranking_csv)) * 100
## University    Country    Overall   Teaching   Research  Citations     IndInc 
##          0          0          0          0          0          0          0 
##     IntOut 
##          0
Ranking_csv[!complete.cases(Ranking_csv), ]
## [1] University Country    Overall    Teaching   Research   Citations  IndInc    
## [8] IntOut    
## <0 rows> (or 0-length row.names)
stats_summary
##             Mean Median  Variance        SD  Min   Max Range    IQR
## Overall   67.162  62.70 131.16812 11.452865 53.8  94.3  40.5 16.450
## Teaching  55.516  55.40 286.09974 16.914483 27.5  92.8  65.3 22.600
## Research  59.486  58.95 369.96653 19.234514 30.1  96.4  66.3 29.900
## Citations 86.964  89.90  91.78031  9.580204 63.0  99.9  36.9 13.575
## IndInc    59.418  55.80 421.78681 20.537449 35.0 100.0  65.0 31.375
## IntOut    67.738  67.35 356.83342 18.890035 35.8  98.6  62.8 29.900

Including Plots

You can also embed plots, for example:

Note that the echo = FALSE parameter was added to the code chunk to prevent printing of the R code that generated the plot.