Use Anscombe's quartet to see why it is important to use tables, listings, and graphs to understand the whole data set
November 29, 2016
## Warning: package 'ggplot2' was built under R version 3.3.2
## Warning: package 'gridExtra' was built under R version 3.3.2
## Warning: package 'plotly' was built under R version 3.3.2
##    x1 x2 x3 x4    y1   y2    y3    y4
## 1  10 10 10  8  8.04 9.14  7.46  6.58
## 2   8  8  8  8  6.95 8.14  6.77  5.76
## 3  13 13 13  8  7.58 8.74 12.74  7.71
## 4   9  9  9  8  8.81 8.77  7.11  8.84
## 5  11 11 11  8  8.33 9.26  7.81  8.47
## 6  14 14 14  8  9.96 8.10  8.84  7.04
## 7   6  6  6  8  7.24 6.13  6.08  5.25
## 8   4  4  4 19  4.26 3.10  5.39 12.50
## 9  12 12 12  8 10.84 9.13  8.15  5.56
## 10  7  7  7  8  4.82 7.26  6.42  7.91
## 11  5  5  5  8  5.68 4.74  5.73  6.89
##        x1             x2             x3             x4
##  Min.   : 4.0   Min.   : 4.0   Min.   : 4.0   Min.   : 8
##  1st Qu.: 6.5   1st Qu.: 6.5   1st Qu.: 6.5   1st Qu.: 8
##  Median : 9.0   Median : 9.0   Median : 9.0   Median : 8
##  Mean   : 9.0   Mean   : 9.0   Mean   : 9.0   Mean   : 9
##  3rd Qu.:11.5   3rd Qu.:11.5   3rd Qu.:11.5   3rd Qu.: 8
##  Max.   :14.0   Max.   :14.0   Max.   :14.0   Max.   :19
##        y1               y2              y3              y4
##  Min.   : 4.260   Min.   :3.100   Min.   : 5.39   Min.   : 5.250
##  1st Qu.: 6.315   1st Qu.:6.695   1st Qu.: 6.25   1st Qu.: 6.170
##  Median : 7.580   Median :8.140   Median : 7.11   Median : 7.040
##  Mean   : 7.501   Mean   :7.501   Mean   : 7.50   Mean   : 7.501
##  3rd Qu.: 8.570   3rd Qu.:8.950   3rd Qu.: 7.98   3rd Qu.: 8.190
##  Max.   :10.840   Max.   :9.260   Max.   :12.74   Max.   :12.500
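The listing and summary above can be reproduced with a few lines of R. This is a minimal sketch, assuming the page used the `anscombe` data frame that ships with base R's `datasets` package (the `lm` calls further down refer to it by that name); the variable `col_means` is introduced here only for illustration.

```r
# anscombe is a data frame bundled with base R's datasets package.
data(anscombe)

# Listing: all 11 rows of the four x/y pairs.
print(anscombe)

# Table: five-number summary plus the mean for every column.
print(summary(anscombe))

# Column means on their own (col_means is an illustrative name).
col_means <- colMeans(anscombe)
print(round(col_means, 2))
```

The summary already hints that x4 is unusual (every quartile is 8), but it says little about how each y relates to its x.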
Correlation between x and y in each of the four sets:
## [1] 0.8164205 0.8162365 0.8162867 0.8165214
Variance of y in each of the four sets:
## [1] 4.127269 4.127629 4.122620 4.123249
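The two vectors above are the correlation of each x/y pair and the variance of each y column. One way to compute them is sketched below; the original code is not shown on the page, so the helper names `cors` and `y_vars` are assumptions made for this example.

```r
data(anscombe)

# Correlation between x_i and y_i for each of the four sets.
cors <- sapply(1:4, function(i) {
  cor(anscombe[[paste0("x", i)]], anscombe[[paste0("y", i)]])
})

# Sample variance of y_i for each of the four sets.
y_vars <- sapply(1:4, function(i) var(anscombe[[paste0("y", i)]]))

print(cors)
print(y_vars)
```

Both statistics agree across the four sets to about two decimal places, which is exactly why they cannot tell the sets apart.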
##
## Call:
## lm(formula = y1 ~ x1, data = anscombe)
##
## Coefficients:
## (Intercept)           x1
##      3.0001       0.5001

##
## Call:
## lm(formula = y2 ~ x2, data = anscombe)
##
## Coefficients:
## (Intercept)           x2
##       3.001        0.500

##
## Call:
## lm(formula = y3 ~ x3, data = anscombe)
##
## Coefficients:
## (Intercept)           x3
##      3.0025       0.4997

##
## Call:
## lm(formula = y4 ~ x4, data = anscombe)
##
## Coefficients:
## (Intercept)           x4
##      3.0017       0.4999
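The four regressions can also be fitted in a loop and their coefficients compared side by side. This is a sketch rather than the page's own code: `fits` and `coefs` are illustrative names, and the formulas are built with `reformulate()` instead of being typed out as in the `Call:` lines above.

```r
data(anscombe)

# Fit y_i ~ x_i for each of the four sets.
fits <- lapply(1:4, function(i) {
  f <- reformulate(paste0("x", i), response = paste0("y", i))
  lm(f, data = anscombe)
})

# Collect intercepts and slopes into one 4 x 2 matrix.
coefs <- t(sapply(fits, coef))
dimnames(coefs) <- list(paste0("set", 1:4), c("intercept", "slope"))

print(round(coefs, 4))
```

Every set yields nearly the same fitted line, with an intercept close to 3 and a slope close to 0.5.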
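Even when datasets share nearly identical descriptive statistics, correlations, and regression coefficients, as the four sets above do, plot the data to understand it better: summary numbers alone cannot show how differently the points are arranged.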
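A quick way to draw the four scatter plots is sketched below. It uses base R graphics to stay free of package dependencies, whereas the page itself loads ggplot2 and gridExtra, so this is a swapped-in approach and not the original plotting code. The function name `plot_quartet` and the axis limits are assumptions chosen for this example.

```r
data(anscombe)

# Draw the four sets in a 2 x 2 grid, each with its least-squares line.
# plot_quartet is a hypothetical helper, not part of any package.
plot_quartet <- function() {
  old <- par(mfrow = c(2, 2), mar = c(4, 4, 2, 1))
  on.exit(par(old))  # restore the previous graphics settings
  for (i in 1:4) {
    x <- anscombe[[paste0("x", i)]]
    y <- anscombe[[paste0("y", i)]]
    # Shared axis limits make the panels directly comparable.
    plot(x, y, xlim = c(3, 20), ylim = c(2, 14), pch = 19,
         xlab = paste0("x", i), ylab = paste0("y", i),
         main = paste("Set", i))
    abline(lm(y ~ x), col = "red")
  }
  invisible(NULL)
}
```

Calling `plot_quartet()` in an R session draws the grid. The fitted lines look the same in all four panels while the point clouds do not; in set 4, for instance, the listing shows x4 equal to 8 in every row but one, so a single point determines the slope.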
Thank you!