Data wrangling: In-Class Exercise 5

2020-Spring [Data Management] Instructor: SHEU, Ching-Fan

CHIU, Ming-Tzu

2020-04-11

The ‘MASS’ library has these two data sets: ‘Animals’ and ‘mammals’. Merge the two files and remove duplicated observations using ‘duplicated’.

讀取資料

library(MASS)
str(Animals)
#> 'data.frame':    28 obs. of  2 variables:
#>  $ body : num  1.35 465 36.33 27.66 1.04 ...
#>  $ brain: num  8.1 423 119.5 115 5.5 ...
str(mammals)
#> 'data.frame':    62 obs. of  2 variables:
#>  $ body : num  3.38 0.48 1.35 465 36.33 ...
#>  $ brain: num  44.5 15.5 8.1 423 119.5 ...

合併資料

dta <- rbind(Animals, mammals)
str(dta)
#> 'data.frame':    90 obs. of  2 variables:
#>  $ body : num  1.35 465 36.33 27.66 1.04 ...
#>  $ brain: num  8.1 423 119.5 115 5.5 ...

去除重複觀察值

duplicated(dta)
#>  [1] FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE
#> [13] FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE
#> [25] FALSE FALSE FALSE FALSE FALSE FALSE  TRUE  TRUE  TRUE  TRUE FALSE  TRUE
#> [37] FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE  TRUE FALSE
#> [49]  TRUE  TRUE FALSE  TRUE  TRUE FALSE FALSE  TRUE  TRUE FALSE FALSE  TRUE
#> [61]  TRUE FALSE  TRUE  TRUE FALSE  TRUE  TRUE FALSE FALSE FALSE  TRUE  TRUE
#> [73]  TRUE  TRUE FALSE FALSE FALSE FALSE FALSE  TRUE FALSE  TRUE FALSE  TRUE
#> [85] FALSE FALSE FALSE FALSE FALSE FALSE
result <- dta[!duplicated(dta),]
str(result)
#> 'data.frame':    65 obs. of  2 variables:
#>  $ body : num  1.35 465 36.33 27.66 1.04 ...
#>  $ brain: num  8.1 423 119.5 115 5.5 ...

除去重複值後,剩下 65 種不同的物種。

result
#>                                body   brain
#> Mountain beaver               1.350    8.10
#> Cow                         465.000  423.00
#> Grey wolf                    36.330  119.50
#> Goat                         27.660  115.00
#> Guinea pig                    1.040    5.50
#> Dipliodocus               11700.000   50.00
#> Asian elephant             2547.000 4603.00
#> Donkey                      187.100  419.00
#> Horse                       521.000  655.00
#> Potar monkey                 10.000  115.00
#> Cat                           3.300   25.60
#> Giraffe                     529.000  680.00
#> Gorilla                     207.000  406.00
#> Human                        62.000 1320.00
#> African elephant           6654.000 5712.00
#> Triceratops                9400.000   70.00
#> Rhesus monkey                 6.800  179.00
#> Kangaroo                     35.000   56.00
#> Golden hamster                0.120    1.00
#> Mouse                         0.023    0.40
#> Rabbit                        2.500   12.10
#> Sheep                        55.500  175.00
#> Jaguar                      100.000  157.00
#> Chimpanzee                   52.160  440.00
#> Rat                           0.280    1.90
#> Brachiosaurus             87000.000  154.50
#> Mole                          0.122    3.00
#> Pig                         192.000  180.00
#> Arctic fox                    3.385   44.50
#> Owl monkey                    0.480   15.50
#> Roe deer                     14.830   98.20
#> Verbet                        4.190   58.00
#> Chinchilla                    0.425    6.40
#> Ground squirrel               0.101    4.00
#> Arctic ground squirrel        0.920    5.70
#> African giant pouched rat     1.000    6.60
#> Lesser short-tailed shrew     0.005    0.14
#> Star-nosed mole               0.060    1.00
#> Nine-banded armadillo         3.500   10.80
#> Tree hyrax                    2.000   12.30
#> N.A. opossum                  1.700    6.30
#> Big brown bat                 0.023    0.30
#> European hedgehog             0.785    3.50
#> Galago                        0.200    5.00
#> Genet                         1.410   17.50
#> Grey seal                    85.000  325.00
#> Rock hyrax-a                  0.750   12.30
#> Water opossum                 3.500    3.90
#> Yellow-bellied marmot         4.050   17.00
#> Little brown bat              0.010    0.25
#> Slow loris                    1.400   12.50
#> Okapi                       250.000  490.00
#> Baboon                       10.550  179.50
#> Desert hedgehog               0.550    2.40
#> Giant armadillo              60.000   81.00
#> Rock hyrax-b                  3.600   21.00
#> Raccoon                       4.288   39.20
#> E. American mole              0.075    1.20
#> Musk shrew                    0.048    0.33
#> Echidna                       3.000   25.00
#> Brazilian tapir             160.000  169.00
#> Tenrec                        0.900    2.60
#> Phalanger                     1.620   11.40
#> Tree shrew                    0.104    2.50
#> Red fox                       4.235   50.40