R-markdown ini berisi tentang cara menganalisis data dengan metode Algortima K-means clustering. Akan diberikan penjelasan mengenai package yang dipakai dan step by step untuk analisisnya. Data diambil dari :
Dari dataset tersebut akan dilakukan clustering berdasarkan angka harapan hidup dari 179 negara pada tahun 2015. Variabel yang dipakai yaitu Country, Infant_deaths, Under_five_deaths, Adult_mortality, Economy_status_develop, Economy_status_developing, dan Life_expetancy. Dari variabel yang dipakai tersebut akan di analisis setiap negara akan berada di klaster apa, lalu setiap klaster akan mendeskripsikan sebuah negara termasuk negara yang mempunyai kondisi Angka Harapan Hidup yang seperti apa.
Langkah-langkah untuk melakukan K-means clustering yaitu :
Menyiapkan data yang sesuai untuk dianalisis menggunakan algoritma k-means. Pastikan data berisi hanya variabel numerik atau berskala interval/rasio.
Melakukan cleaning data dan preprocessing data seperti normalisasi atau standarisasi agar variabel memiliki skala yang serupa.
Menentukan nilai k dengan membandingkan nilai koefisien sillhoute jika tiap metode menghasilkan k yang berbeda.
Menjalankan algoritma K-means, dengan memakai nilai k yang dihasilkan dari tiap-tiap metode, lalu pastikan memilih k dengan koefisien silhouette paling besar.
Interpretasikan hasil analisisnya, nilai koefisien sillhouette (euclidean) berada di antara -1 s.d. 1, semakin mendekati 1 maka semakin baik hasil klsterisasi.
Step 1
dalam r-markdown ini, untuk melakukan clustering kita harus menginstall beberapa paket lalu meloadnya.
readxl : untuk import data dari file xls dan xlxs
cluster : untuk melakukan perintah analisis clustering
factoextra : melakukan visualisasi hasil dari analisis data multivariat
dplyr : untuk melakukan manipulasi data
library(readxl)
library(cluster)
library(factoextra)
## Loading required package: ggplot2
## Welcome! Want to learn more? See two factoextra-related books at https://goo.gl/ve3WBa
library(dplyr)
##
## Attaching package: 'dplyr'
## The following objects are masked from 'package:stats':
##
## filter, lag
## The following objects are masked from 'package:base':
##
## intersect, setdiff, setequal, union
Step 2
Melakukan import data, disini menggunakan data dari excel dengan nama “dataset.xlsx”. Gunakan function read_excel untuk mengimort data dari file excel.
dataset <- read_excel("D:/Cooleyah & Organization/UNS/SMT 4/SIM/RMarkdown Case Method/dataset.xlsx") #mengimpor data dari file excel
Step 3
Dalam step ini dilakukan filter data hanya untuk tahun 2015, lalu melakukan slicing data hanya untuk memilih kolom ke 2,5,6,7,20,21,22. Setelah dilakukan cleaning data akan dilakukan standarisasi data menggunakan function scale.
data <- subset(dataset, Year == 2015) #filter data pada kolom 'Year' pada tahun 2015
data <- data[,c(2,5,6,7,20,21,22)] #seleksi kolom untuk hanya memilih kolom ke-2,5,6,7,20,21,22
data <- data.frame(data) #membuat data yang sudah di manipulasi ke format data frame
rownames(data) <- data[,1] #membuat kolom "Country" menjadi nama baris
data <- data[,-1] #menghilangkan kolom "Country"
std <- scale(data) #melakukan standarisasi data
datastd <- data.frame(std) #membuat data terstandarisasi menjadi format data frame
head(data)
## Infant_deaths Under_five_deaths Adult_mortality
## Turkiye 11.1 13.0 105.8240
## Spain 2.7 3.3 57.9025
## Russian Federation 6.6 8.2 223.0000
## Cameroon 57.0 88.0 340.1265
## Gambia The 39.7 59.8 261.7065
## Algeria 21.6 25.2 95.8155
## Economy_status_Developed Economy_status_Developing
## Turkiye 0 1
## Spain 1 0
## Russian Federation 0 1
## Cameroon 0 1
## Gambia The 0 1
## Algeria 0 1
## Life_expectancy
## Turkiye 76.5
## Spain 82.8
## Russian Federation 71.2
## Cameroon 57.6
## Gambia The 60.9
## Algeria 76.1
head(datastd)
## Infant_deaths Under_five_deaths Adult_mortality
## Turkiye -0.57980940 -0.5798303 -0.6430316
## Spain -0.97073421 -0.8809126 -1.1757622
## Russian Federation -0.78923341 -0.7288195 0.6595829
## Cameroon 1.55631545 1.7481263 1.9616472
## Gambia The 0.75119650 0.8728147 1.0898728
## Algeria -0.09115339 -0.2011493 -0.7542935
## Economy_status_Developed Economy_status_Developing
## Turkiye -0.5090262 0.5090262
## Spain 1.9535602 -1.9535602
## Russian Federation -0.5090262 0.5090262
## Cameroon -0.5090262 0.5090262
## Gambia The -0.5090262 0.5090262
## Algeria -0.5090262 0.5090262
## Life_expectancy
## Turkiye 0.64302080
## Spain 1.44738527
## Russian Federation -0.03366676
## Cameroon -1.77007258
## Gambia The -1.34873881
## Algeria 0.59195005
Step 4
Dengan menggunakan function fviz_nbclust akan didapatkan k optimum untuk masing-masing metode. Algotima K-means memiliki 3 metode yaitu :
Silhouette
Gap Statistics (Elbow)
Within Cluster Sums of Squares (WSS)
Di dapatkan k untuk Silhouette adalah 3, Gap statistics adalah 5, dan WSS adalah 3
Step 5
Setelah didapatkan kemungkinan klaster optimum yaitu 3 dan 5, lalu dengan function kmeans dilakukan 2 kali running dengan centers = 3 dan centers = 5
result1 <- kmeans(datastd, centers = 3)
result2 <- kmeans(datastd, centers = 5)
result1
## K-means clustering with 3 clusters of sizes 37, 93, 49
##
## Cluster means:
## Infant_deaths Under_five_deaths Adult_mortality Economy_status_Developed
## 1 -0.9293525 -0.8495377 -0.9321917 1.9535602
## 2 -0.3596765 -0.3963305 -0.3148997 -0.5090262
## 3 1.3844073 1.3937068 1.3015667 -0.5090262
## Economy_status_Developing Life_expectancy
## 1 -1.9535602 1.1102492
## 2 0.5090262 0.2726205
## 3 0.5090262 -1.3557741
##
## Clustering vector:
## Turkiye Spain
## 2 1
## Russian Federation Cameroon
## 2 3
## Gambia The Algeria
## 3 2
## Oman Madagascar
## 2 3
## Norway Vietnam
## 1 2
## Eswatini Botswana
## 3 3
## Latvia Nepal
## 1 2
## Congo Dem. Rep. Belarus
## 3 2
## Angola Ukraine
## 3 2
## Costa Rica Israel
## 2 1
## New Zealand Chad
## 1 3
## Solomon Islands Iraq
## 2 2
## Guinea-Bissau Honduras
## 3 2
## Guinea Sweden
## 3 1
## Indonesia Dominican Republic
## 2 2
## Mexico Czechia
## 2 1
## Benin St. Vincent and the Grenadines
## 3 2
## Kiribati Brazil
## 3 2
## Guyana China
## 2 2
## Eritrea Kuwait
## 3 2
## Canada Grenada
## 1 2
## Albania Lesotho
## 2 3
## Saudi Arabia India
## 2 2
## Lithuania Turkmenistan
## 1 2
## Antigua and Barbuda Malawi
## 2 3
## Myanmar Seychelles
## 3 2
## Netherlands Burundi
## 1 3
## Mozambique Bulgaria
## 3 1
## Sierra Leone Syrian Arab Republic
## 3 2
## El Salvador Panama
## 2 2
## Jordan Tanzania
## 2 3
## Gabon Azerbaijan
## 3 2
## Nigeria Thailand
## 3 2
## Mongolia Portugal
## 2 1
## Maldives Comoros
## 2 3
## Trinidad and Tobago United Arab Emirates
## 2 2
## Uganda Sao Tome and Principe
## 3 2
## Pakistan Ghana
## 3 3
## Finland Afghanistan
## 1 3
## Belize Lebanon
## 2 2
## Slovenia Jamaica
## 1 2
## Belgium Equatorial Guinea
## 1 3
## Barbados Japan
## 2 1
## North Macedonia Morocco
## 2 2
## Slovak Republic Senegal
## 1 2
## St. Lucia Colombia
## 2 2
## Tajikistan Austria
## 2 1
## Moldova Congo Rep.
## 2 3
## Namibia Zimbabwe
## 3 3
## Malta Kenya
## 1 3
## Montenegro Greece
## 2 1
## Bolivia Ireland
## 2 1
## Croatia Kyrgyz Republic
## 1 2
## Switzerland Guatemala
## 1 2
## Qatar Mauritania
## 2 3
## Yemen Rep. Zambia
## 3 3
## Kazakhstan Serbia
## 2 2
## Paraguay Micronesia Fed. Sts.
## 2 2
## Bosnia and Herzegovina Brunei Darussalam
## 2 2
## United Kingdom Niger
## 1 3
## Armenia Bangladesh
## 2 2
## Somalia Uzbekistan
## 3 2
## Uruguay Chile
## 2 2
## Germany Liberia
## 1 3
## Denmark Rwanda
## 1 2
## Mauritius Papua New Guinea
## 2 3
## Singapore Iran Islamic Rep.
## 2 2
## Poland Italy
## 1 1
## Cuba Peru
## 2 2
## Vanuatu Cambodia
## 2 2
## Hungary Central African Republic
## 1 3
## Australia Egypt Arab Rep.
## 1 2
## Georgia Bahamas The
## 2 2
## Suriname Luxembourg
## 2 1
## Libya United States
## 2 1
## Argentina Bhutan
## 2 2
## Cyprus Cabo Verde
## 1 2
## Lao PDR Iceland
## 3 1
## Togo Estonia
## 3 1
## Philippines Romania
## 2 1
## Haiti Cote d'Ivoire
## 3 3
## Burkina Faso Ecuador
## 3 2
## Timor-Leste Fiji
## 2 2
## Mali Samoa
## 3 2
## Sri Lanka Ethiopia
## 2 3
## France Malaysia
## 1 2
## Tonga South Africa
## 2 3
## Tunisia Venezuela RB
## 2 2
## Djibouti Nicaragua
## 3 2
## Bahrain
## 2
##
## Within cluster sum of squares by cluster:
## [1] 9.090027 61.441648 100.105389
## (between_SS / total_SS = 84.0 %)
##
## Available components:
##
## [1] "cluster" "centers" "totss" "withinss" "tot.withinss"
## [6] "betweenss" "size" "iter" "ifault"
result2
## K-means clustering with 5 clusters of sizes 10, 87, 27, 20, 35
##
## Cluster means:
## Infant_deaths Under_five_deaths Adult_mortality Economy_status_Developed
## 1 -0.8692799 -0.8023829 -0.4186765 1.9535602
## 2 -0.4216314 -0.4507848 -0.3543244 -0.5090262
## 3 -0.9516017 -0.8670024 -1.1223826 1.9535602
## 4 2.0587004 2.1718144 1.7810647 -0.5090262
## 5 0.8541134 0.7775680 0.8484578 -0.5090262
## Economy_status_Developing Life_expectancy
## 1 -1.9535602 0.5996107
## 2 0.5090262 0.3214805
## 3 -1.9535602 1.2993746
## 4 0.5090262 -1.8671070
## 5 0.5090262 -0.9058824
##
## Clustering vector:
## Turkiye Spain
## 2 3
## Russian Federation Cameroon
## 2 4
## Gambia The Algeria
## 5 2
## Oman Madagascar
## 2 5
## Norway Vietnam
## 3 2
## Eswatini Botswana
## 4 5
## Latvia Nepal
## 1 2
## Congo Dem. Rep. Belarus
## 4 2
## Angola Ukraine
## 4 2
## Costa Rica Israel
## 2 3
## New Zealand Chad
## 3 4
## Solomon Islands Iraq
## 2 2
## Guinea-Bissau Honduras
## 4 2
## Guinea Sweden
## 4 3
## Indonesia Dominican Republic
## 2 2
## Mexico Czechia
## 2 3
## Benin St. Vincent and the Grenadines
## 4 2
## Kiribati Brazil
## 5 2
## Guyana China
## 5 2
## Eritrea Kuwait
## 5 2
## Canada Grenada
## 3 2
## Albania Lesotho
## 2 4
## Saudi Arabia India
## 2 5
## Lithuania Turkmenistan
## 1 5
## Antigua and Barbuda Malawi
## 2 5
## Myanmar Seychelles
## 5 2
## Netherlands Burundi
## 3 5
## Mozambique Bulgaria
## 4 1
## Sierra Leone Syrian Arab Republic
## 4 2
## El Salvador Panama
## 2 2
## Jordan Tanzania
## 2 5
## Gabon Azerbaijan
## 5 2
## Nigeria Thailand
## 4 2
## Mongolia Portugal
## 2 3
## Maldives Comoros
## 2 5
## Trinidad and Tobago United Arab Emirates
## 2 2
## Uganda Sao Tome and Principe
## 5 2
## Pakistan Ghana
## 5 5
## Finland Afghanistan
## 3 5
## Belize Lebanon
## 2 2
## Slovenia Jamaica
## 3 2
## Belgium Equatorial Guinea
## 3 4
## Barbados Japan
## 2 3
## North Macedonia Morocco
## 2 2
## Slovak Republic Senegal
## 1 5
## St. Lucia Colombia
## 2 2
## Tajikistan Austria
## 2 3
## Moldova Congo Rep.
## 2 5
## Namibia Zimbabwe
## 5 5
## Malta Kenya
## 3 5
## Montenegro Greece
## 2 3
## Bolivia Ireland
## 2 3
## Croatia Kyrgyz Republic
## 1 2
## Switzerland Guatemala
## 3 2
## Qatar Mauritania
## 2 5
## Yemen Rep. Zambia
## 5 5
## Kazakhstan Serbia
## 2 2
## Paraguay Micronesia Fed. Sts.
## 2 2
## Bosnia and Herzegovina Brunei Darussalam
## 2 2
## United Kingdom Niger
## 3 4
## Armenia Bangladesh
## 2 2
## Somalia Uzbekistan
## 4 2
## Uruguay Chile
## 2 2
## Germany Liberia
## 3 4
## Denmark Rwanda
## 3 5
## Mauritius Papua New Guinea
## 2 5
## Singapore Iran Islamic Rep.
## 2 2
## Poland Italy
## 1 3
## Cuba Peru
## 2 2
## Vanuatu Cambodia
## 2 2
## Hungary Central African Republic
## 1 4
## Australia Egypt Arab Rep.
## 3 2
## Georgia Bahamas The
## 2 2
## Suriname Luxembourg
## 2 3
## Libya United States
## 2 1
## Argentina Bhutan
## 2 2
## Cyprus Cabo Verde
## 3 2
## Lao PDR Iceland
## 5 3
## Togo Estonia
## 5 1
## Philippines Romania
## 2 1
## Haiti Cote d'Ivoire
## 5 4
## Burkina Faso Ecuador
## 4 2
## Timor-Leste Fiji
## 5 2
## Mali Samoa
## 4 2
## Sri Lanka Ethiopia
## 2 5
## France Malaysia
## 3 2
## Tonga South Africa
## 2 5
## Tunisia Venezuela RB
## 2 2
## Djibouti Nicaragua
## 5 2
## Bahrain
## 2
##
## Within cluster sum of squares by cluster:
## [1] 1.0530356 46.1447351 0.7701647 30.9648324 23.4425829
## (between_SS / total_SS = 90.4 %)
##
## Available components:
##
## [1] "cluster" "centers" "totss" "withinss" "tot.withinss"
## [6] "betweenss" "size" "iter" "ifault"
Step 6
Untuk mendapatkan hasil klaster optimum, harus membandingkan nilai euclid tiap-tiap nilai k. Menggunakan function fviz_silhouette dengan data ‘datastd’ maka akan diperoleh nilai koefisien untuk k=3 adalah 0.65 dan k-5 adalah 0.5, sehingga diperoleh nilai k optimum adalah 3.
value_result1 <- silhouette(result1$cluster, dist(datastd))
fviz_silhouette(value_result1)
## cluster size ave.sil.width
## 1 1 37 0.85
## 2 2 93 0.67
## 3 3 49 0.46
value_result2 <- silhouette(result2$cluster, dist(datastd))
fviz_silhouette(value_result2)
## cluster size ave.sil.width
## 1 1 10 0.54
## 2 2 87 0.61
## 3 3 27 0.78
## 4 4 20 0.33
## 5 5 35 0.49
Step 7
Dengan hasil klasterisasi yang disimpan ke dalam ‘result1’ dengan k=3, tambahkan hasil klaster ke dalam data sebelum di standarisasi menggunakan cbind. Setelah itu buat plot dengan fviz_cluster untuk memvisualisasikan hasil klasterisasi.
data_cluster <- cbind(data, cluster = result1$cluster)
data_cluster
## Infant_deaths Under_five_deaths Adult_mortality
## Turkiye 11.1 13.0 105.8240
## Spain 2.7 3.3 57.9025
## Russian Federation 6.6 8.2 223.0000
## Cameroon 57.0 88.0 340.1265
## Gambia The 39.7 59.8 261.7065
## Algeria 21.6 25.2 95.8155
## Oman 9.6 11.2 89.1875
## Madagascar 41.3 59.0 218.4575
## Norway 2.2 2.7 53.8970
## Vietnam 17.4 21.8 124.5470
## Eswatini 45.2 57.2 434.8210
## Botswana 39.6 49.7 248.5950
## Latvia 4.3 5.0 160.2370
## Nepal 29.2 35.5 154.6040
## Congo Dem. Rep. 73.0 95.9 260.0235
## Belarus 3.1 4.1 163.8720
## Angola 57.7 88.1 242.9655
## Ukraine 8.1 9.5 202.8000
## Costa Rica 7.7 9.1 85.6715
## Israel 3.2 3.9 56.3370
## New Zealand 4.6 5.5 67.0000
## Chad 75.7 128.7 363.1670
## Solomon Islands 19.3 22.7 148.0870
## Iraq 24.8 29.7 162.9175
## Guinea-Bissau 60.0 92.1 270.2885
## Honduras 16.5 19.2 146.5195
## Guinea 68.4 107.3 262.8710
## Sweden 2.3 2.9 53.4415
## Indonesia 23.2 27.7 175.5640
## Dominican Republic 28.8 35.0 161.6170
## Mexico 13.9 16.2 110.8405
## Czechia 2.5 3.2 82.5385
## Benin 63.2 97.6 242.9535
## St. Vincent and the Grenadines 15.2 16.7 154.8235
## Kiribati 44.7 57.6 198.4490
## Brazil 14.2 15.9 144.3625
## Guyana 27.4 33.1 209.3720
## China 8.3 10.7 82.2730
## Eritrea 34.0 46.3 255.7810
## Kuwait 7.5 8.8 58.1365
## Canada 4.7 5.4 65.9695
## Grenada 14.3 16.2 141.3000
## Albania 8.5 9.6 75.2050
## Lesotho 72.6 94.6 513.4755
## Saudi Arabia 7.8 9.1 87.9740
## India 34.9 43.5 177.9465
## Lithuania 4.0 4.9 165.2330
## Turkmenistan 36.4 42.2 189.9280
## Antigua and Barbuda 6.5 7.8 130.0025
## Malawi 36.7 53.3 294.8580
## Myanmar 41.0 52.2 196.0530
## Seychelles 12.7 14.8 151.6085
## Netherlands 3.4 4.0 58.0775
## Burundi 46.2 67.6 294.1870
## Mozambique 61.3 84.0 332.5830
## Bulgaria 6.9 8.2 136.0675
## Sierra Leone 95.1 140.2 397.8705
## Syrian Arab Republic 25.6 41.6 178.1065
## El Salvador 13.3 15.5 182.1595
## Panama 14.6 17.0 114.7010
## Jordan 15.0 17.5 109.9420
## Tanzania 40.9 58.1 239.6670
## Gabon 36.1 50.3 238.5125
## Azerbaijan 23.3 26.3 124.8970
## Nigeria 79.3 126.4 356.2145
## Thailand 9.3 10.8 148.8890
## Mongolia 16.6 19.4 212.8410
## Portugal 3.0 3.6 73.2830
## Maldives 8.4 9.8 68.3620
## Comoros 54.4 72.3 227.2735
## Trinidad and Tobago 17.3 19.3 168.7145
## United Arab Emirates 6.5 7.6 68.5765
## Uganda 38.9 55.7 311.7215
## Sao Tome and Principe 18.5 23.1 191.5860
## Pakistan 62.1 76.0 160.1410
## Ghana 38.8 54.6 244.6325
## Finland 2.0 2.5 68.8685
## Afghanistan 53.2 70.4 227.7350
## Belize 13.2 15.3 174.4925
## Lebanon 7.1 8.3 60.3175
## Slovenia 2.1 2.6 73.3560
## Jamaica 13.4 15.5 132.2455
## Belgium 3.3 4.1 71.7100
## Equatorial Guinea 67.6 93.2 328.8035
## Barbados 13.0 14.0 99.0935
## Japan 2.0 2.7 53.5860
## North Macedonia 10.0 11.4 97.1970
## Morocco 20.2 23.5 70.9135
## Slovak Republic 5.1 6.2 110.5150
## Senegal 34.7 49.4 190.0275
## St. Lucia 19.5 21.9 136.3935
## Colombia 13.4 15.5 140.0675
## Tajikistan 32.6 37.5 134.7150
## Austria 3.0 3.7 64.6875
## Moldova 13.6 15.8 171.2775
## Congo Rep. 37.6 52.6 262.2240
## Namibia 33.3 47.6 315.4730
## Zimbabwe 42.1 61.3 368.1410
## Malta 5.7 6.6 51.8135
## Kenya 35.2 48.9 223.6080
## Montenegro 3.3 4.0 100.8610
## Greece 3.8 4.4 70.7555
## Bolivia 25.4 31.3 184.5200
## Ireland 3.2 3.7 61.0195
## Croatia 4.2 4.9 90.2285
## Kyrgyz Republic 19.9 22.3 165.7935
## Switzerland 3.7 4.3 49.3840
## Guatemala 23.7 28.3 163.8125
## Qatar 6.7 7.8 56.2695
## Mauritania 55.0 82.7 203.5835
## Yemen Rep. 46.1 60.7 221.5195
## Zambia 46.9 67.7 302.8220
## Kazakhstan 10.6 11.9 182.1110
## Serbia 5.4 6.3 114.3280
## Paraguay 19.0 22.3 145.3510
## Micronesia Fed. Sts. 24.2 28.9 164.8710
## Bosnia and Herzegovina 5.4 6.3 96.0670
## Brunei Darussalam 8.8 10.6 88.0930
## United Kingdom 3.8 4.5 69.3335
## Niger 51.8 93.8 249.0860
## Armenia 12.6 14.1 125.4875
## Bangladesh 31.0 38.1 129.7530
## Somalia 83.6 134.3 308.3030
## Uzbekistan 17.3 19.3 140.8150
## Uruguay 7.6 8.9 105.6850
## Chile 6.7 7.9 88.8665
## Germany 3.3 3.9 71.3120
## Liberia 64.4 88.3 241.0125
## Denmark 3.5 4.1 67.1710
## Rwanda 34.6 47.7 203.5740
## Mauritius 12.8 14.5 144.3615
## Papua New Guinea 40.2 51.0 225.4075
## Singapore 2.2 2.7 50.9615
## Iran Islamic Rep. 13.3 15.5 81.8615
## Poland 4.2 4.9 113.6690
## Italy 3.0 3.5 52.9595
## Cuba 4.5 5.7 89.2395
## Peru 12.2 15.7 124.0700
## Vanuatu 23.6 28.2 131.2085
## Cambodia 27.3 31.8 175.6540
## Hungary 4.2 5.1 131.7765
## Central African Republic 87.9 123.3 411.0745
## Australia 3.3 3.9 61.7325
## Egypt Arab Rep. 19.8 23.3 149.5290
## Georgia 9.4 10.5 156.2355
## Bahamas The 12.0 14.0 158.5890
## Suriname 18.2 20.3 170.1290
## Luxembourg 2.3 2.8 54.3930
## Libya 11.4 13.3 134.7895
## United States 5.8 6.8 107.3770
## Argentina 10.2 11.5 117.1895
## Bhutan 27.4 33.1 208.0030
## Cyprus 2.4 2.9 51.6435
## Cabo Verde 16.7 19.5 125.1490
## Lao PDR 42.1 53.8 195.6250
## Iceland 1.8 2.3 55.3590
## Togo 50.8 75.9 271.7090
## Estonia 2.4 3.1 116.3510
## Philippines 23.4 29.6 199.7365
## Romania 7.7 9.2 128.6030
## Haiti 52.8 69.9 243.6235
## Cote d'Ivoire 65.7 90.8 396.4580
## Burkina Faso 59.6 100.9 257.9580
## Ecuador 13.2 15.4 123.9060
## Timor-Leste 42.8 50.2 149.7365
## Fiji 20.6 24.4 185.8580
## Mali 66.8 108.3 267.5905
## Samoa 15.9 18.6 111.9490
## Sri Lanka 7.5 8.7 109.1795
## Ethiopia 43.2 62.4 224.4230
## France 3.2 4.2 75.4835
## Malaysia 6.9 8.1 121.7230
## Tonga 10.7 12.4 134.0135
## South Africa 28.6 36.3 351.3925
## Tunisia 14.8 17.2 91.3430
## Venezuela RB 16.4 19.1 141.5540
## Djibouti 54.6 65.8 251.7490
## Nicaragua 16.6 19.4 148.4805
## Bahrain 6.5 7.6 66.6840
## Economy_status_Developed
## Turkiye 0
## Spain 1
## Russian Federation 0
## Cameroon 0
## Gambia The 0
## Algeria 0
## Oman 0
## Madagascar 0
## Norway 1
## Vietnam 0
## Eswatini 0
## Botswana 0
## Latvia 1
## Nepal 0
## Congo Dem. Rep. 0
## Belarus 0
## Angola 0
## Ukraine 0
## Costa Rica 0
## Israel 1
## New Zealand 1
## Chad 0
## Solomon Islands 0
## Iraq 0
## Guinea-Bissau 0
## Honduras 0
## Guinea 0
## Sweden 1
## Indonesia 0
## Dominican Republic 0
## Mexico 0
## Czechia 1
## Benin 0
## St. Vincent and the Grenadines 0
## Kiribati 0
## Brazil 0
## Guyana 0
## China 0
## Eritrea 0
## Kuwait 0
## Canada 1
## Grenada 0
## Albania 0
## Lesotho 0
## Saudi Arabia 0
## India 0
## Lithuania 1
## Turkmenistan 0
## Antigua and Barbuda 0
## Malawi 0
## Myanmar 0
## Seychelles 0
## Netherlands 1
## Burundi 0
## Mozambique 0
## Bulgaria 1
## Sierra Leone 0
## Syrian Arab Republic 0
## El Salvador 0
## Panama 0
## Jordan 0
## Tanzania 0
## Gabon 0
## Azerbaijan 0
## Nigeria 0
## Thailand 0
## Mongolia 0
## Portugal 1
## Maldives 0
## Comoros 0
## Trinidad and Tobago 0
## United Arab Emirates 0
## Uganda 0
## Sao Tome and Principe 0
## Pakistan 0
## Ghana 0
## Finland 1
## Afghanistan 0
## Belize 0
## Lebanon 0
## Slovenia 1
## Jamaica 0
## Belgium 1
## Equatorial Guinea 0
## Barbados 0
## Japan 1
## North Macedonia 0
## Morocco 0
## Slovak Republic 1
## Senegal 0
## St. Lucia 0
## Colombia 0
## Tajikistan 0
## Austria 1
## Moldova 0
## Congo Rep. 0
## Namibia 0
## Zimbabwe 0
## Malta 1
## Kenya 0
## Montenegro 0
## Greece 1
## Bolivia 0
## Ireland 1
## Croatia 1
## Kyrgyz Republic 0
## Switzerland 1
## Guatemala 0
## Qatar 0
## Mauritania 0
## Yemen Rep. 0
## Zambia 0
## Kazakhstan 0
## Serbia 0
## Paraguay 0
## Micronesia Fed. Sts. 0
## Bosnia and Herzegovina 0
## Brunei Darussalam 0
## United Kingdom 1
## Niger 0
## Armenia 0
## Bangladesh 0
## Somalia 0
## Uzbekistan 0
## Uruguay 0
## Chile 0
## Germany 1
## Liberia 0
## Denmark 1
## Rwanda 0
## Mauritius 0
## Papua New Guinea 0
## Singapore 0
## Iran Islamic Rep. 0
## Poland 1
## Italy 1
## Cuba 0
## Peru 0
## Vanuatu 0
## Cambodia 0
## Hungary 1
## Central African Republic 0
## Australia 1
## Egypt Arab Rep. 0
## Georgia 0
## Bahamas The 0
## Suriname 0
## Luxembourg 1
## Libya 0
## United States 1
## Argentina 0
## Bhutan 0
## Cyprus 1
## Cabo Verde 0
## Lao PDR 0
## Iceland 1
## Togo 0
## Estonia 1
## Philippines 0
## Romania 1
## Haiti 0
## Cote d'Ivoire 0
## Burkina Faso 0
## Ecuador 0
## Timor-Leste 0
## Fiji 0
## Mali 0
## Samoa 0
## Sri Lanka 0
## Ethiopia 0
## France 1
## Malaysia 0
## Tonga 0
## South Africa 0
## Tunisia 0
## Venezuela RB 0
## Djibouti 0
## Nicaragua 0
## Bahrain 0
## Economy_status_Developing Life_expectancy
## Turkiye 1 76.5
## Spain 0 82.8
## Russian Federation 1 71.2
## Cameroon 1 57.6
## Gambia The 1 60.9
## Algeria 1 76.1
## Oman 1 76.9
## Madagascar 1 65.5
## Norway 0 82.3
## Vietnam 1 75.1
## Eswatini 1 55.4
## Botswana 1 67.3
## Latvia 0 74.5
## Nepal 1 69.5
## Congo Dem. Rep. 1 59.3
## Belarus 1 73.6
## Angola 1 59.4
## Ukraine 1 71.2
## Costa Rica 1 79.6
## Israel 0 82.1
## New Zealand 0 81.5
## Chad 1 53.1
## Solomon Islands 1 72.2
## Iraq 1 69.9
## Guinea-Bissau 1 57.0
## Honduras 1 74.5
## Guinea 1 59.6
## Sweden 0 82.2
## Indonesia 1 70.8
## Dominican Republic 1 73.2
## Mexico 1 74.9
## Czechia 0 78.6
## Benin 1 60.6
## St. Vincent and the Grenadines 1 72.1
## Kiribati 1 67.3
## Brazil 1 75.0
## Guyana 1 69.3
## China 1 75.9
## Eritrea 1 64.7
## Kuwait 1 75.1
## Canada 0 81.9
## Grenada 1 72.4
## Albania 1 78.0
## Lesotho 1 51.0
## Saudi Arabia 1 74.7
## India 1 68.6
## Lithuania 0 74.3
## Turkmenistan 1 67.7
## Antigua and Barbuda 1 76.5
## Malawi 1 62.0
## Myanmar 1 65.8
## Seychelles 1 74.3
## Netherlands 0 81.5
## Burundi 1 60.1
## Mozambique 1 57.2
## Bulgaria 0 74.6
## Sierra Leone 1 52.9
## Syrian Arab Republic 1 69.9
## El Salvador 1 72.4
## Panama 1 77.8
## Jordan 1 74.1
## Tanzania 1 63.1
## Gabon 1 64.9
## Azerbaijan 1 72.3
## Nigeria 1 53.1
## Thailand 1 76.1
## Mongolia 1 69.1
## Portugal 0 81.1
## Maldives 1 77.7
## Comoros 1 63.5
## Trinidad and Tobago 1 72.9
## United Arab Emirates 1 77.3
## Uganda 1 61.4
## Sao Tome and Principe 1 69.4
## Pakistan 1 66.6
## Ghana 1 62.8
## Finland 0 81.5
## Afghanistan 1 63.4
## Belize 1 74.0
## Lebanon 1 78.8
## Slovenia 0 80.8
## Jamaica 1 74.1
## Belgium 0 81.0
## Equatorial Guinea 1 57.4
## Barbados 1 78.8
## Japan 0 83.8
## North Macedonia 1 75.4
## Morocco 1 75.7
## Slovak Republic 0 76.6
## Senegal 1 66.7
## St. Lucia 1 75.6
## Colombia 1 76.5
## Tajikistan 1 70.1
## Austria 0 81.2
## Moldova 1 71.5
## Congo Rep. 1 63.1
## Namibia 1 62.1
## Zimbabwe 1 59.5
## Malta 0 81.9
## Kenya 1 64.8
## Montenegro 1 76.4
## Greece 0 81.0
## Bolivia 1 70.3
## Ireland 0 81.5
## Croatia 0 77.3
## Kyrgyz Republic 1 70.7
## Switzerland 0 82.9
## Guatemala 1 73.3
## Qatar 1 79.8
## Mauritania 1 63.9
## Yemen Rep. 1 66.1
## Zambia 1 61.7
## Kazakhstan 1 72.0
## Serbia 1 75.3
## Paraguay 1 73.7
## Micronesia Fed. Sts. 1 67.3
## Bosnia and Herzegovina 1 76.9
## Brunei Darussalam 1 75.3
## United Kingdom 0 81.0
## Niger 1 60.6
## Armenia 1 74.5
## Bangladesh 1 71.5
## Somalia 1 55.9
## Uzbekistan 1 70.9
## Uruguay 1 77.4
## Chile 1 79.6
## Germany 0 80.6
## Liberia 1 62.3
## Denmark 0 80.7
## Rwanda 1 67.5
## Mauritius 1 74.4
## Papua New Guinea 1 63.5
## Singapore 1 82.7
## Iran Islamic Rep. 1 75.8
## Poland 0 77.5
## Italy 0 82.5
## Cuba 1 78.6
## Peru 1 75.8
## Vanuatu 1 69.9
## Cambodia 1 68.6
## Hungary 0 75.6
## Central African Republic 1 50.9
## Australia 0 82.4
## Egypt Arab Rep. 1 71.3
## Georgia 1 73.0
## Bahamas The 1 73.1
## Suriname 1 71.2
## Luxembourg 0 82.3
## Libya 1 72.1
## United States 0 78.7
## Argentina 1 76.1
## Bhutan 1 70.4
## Cyprus 0 80.4
## Cabo Verde 1 72.1
## Lao PDR 1 66.5
## Iceland 0 82.5
## Togo 1 59.9
## Estonia 0 77.6
## Philippines 1 70.6
## Romania 0 74.9
## Haiti 1 62.5
## Cote d'Ivoire 1 56.1
## Burkina Faso 1 59.9
## Ecuador 1 76.1
## Timor-Leste 1 68.5
## Fiji 1 67.1
## Mali 1 57.5
## Samoa 1 72.7
## Sri Lanka 1 76.3
## Ethiopia 1 65.0
## France 0 82.3
## Malaysia 1 75.5
## Tonga 1 70.5
## South Africa 1 62.6
## Tunisia 1 75.9
## Venezuela RB 1 72.6
## Djibouti 1 64.1
## Nicaragua 1 73.6
## Bahrain 1 76.8
## cluster
## Turkiye 2
## Spain 1
## Russian Federation 2
## Cameroon 3
## Gambia The 3
## Algeria 2
## Oman 2
## Madagascar 3
## Norway 1
## Vietnam 2
## Eswatini 3
## Botswana 3
## Latvia 1
## Nepal 2
## Congo Dem. Rep. 3
## Belarus 2
## Angola 3
## Ukraine 2
## Costa Rica 2
## Israel 1
## New Zealand 1
## Chad 3
## Solomon Islands 2
## Iraq 2
## Guinea-Bissau 3
## Honduras 2
## Guinea 3
## Sweden 1
## Indonesia 2
## Dominican Republic 2
## Mexico 2
## Czechia 1
## Benin 3
## St. Vincent and the Grenadines 2
## Kiribati 3
## Brazil 2
## Guyana 2
## China 2
## Eritrea 3
## Kuwait 2
## Canada 1
## Grenada 2
## Albania 2
## Lesotho 3
## Saudi Arabia 2
## India 2
## Lithuania 1
## Turkmenistan 2
## Antigua and Barbuda 2
## Malawi 3
## Myanmar 3
## Seychelles 2
## Netherlands 1
## Burundi 3
## Mozambique 3
## Bulgaria 1
## Sierra Leone 3
## Syrian Arab Republic 2
## El Salvador 2
## Panama 2
## Jordan 2
## Tanzania 3
## Gabon 3
## Azerbaijan 2
## Nigeria 3
## Thailand 2
## Mongolia 2
## Portugal 1
## Maldives 2
## Comoros 3
## Trinidad and Tobago 2
## United Arab Emirates 2
## Uganda 3
## Sao Tome and Principe 2
## Pakistan 3
## Ghana 3
## Finland 1
## Afghanistan 3
## Belize 2
## Lebanon 2
## Slovenia 1
## Jamaica 2
## Belgium 1
## Equatorial Guinea 3
## Barbados 2
## Japan 1
## North Macedonia 2
## Morocco 2
## Slovak Republic 1
## Senegal 2
## St. Lucia 2
## Colombia 2
## Tajikistan 2
## Austria 1
## Moldova 2
## Congo Rep. 3
## Namibia 3
## Zimbabwe 3
## Malta 1
## Kenya 3
## Montenegro 2
## Greece 1
## Bolivia 2
## Ireland 1
## Croatia 1
## Kyrgyz Republic 2
## Switzerland 1
## Guatemala 2
## Qatar 2
## Mauritania 3
## Yemen Rep. 3
## Zambia 3
## Kazakhstan 2
## Serbia 2
## Paraguay 2
## Micronesia Fed. Sts. 2
## Bosnia and Herzegovina 2
## Brunei Darussalam 2
## United Kingdom 1
## Niger 3
## Armenia 2
## Bangladesh 2
## Somalia 3
## Uzbekistan 2
## Uruguay 2
## Chile 2
## Germany 1
## Liberia 3
## Denmark 1
## Rwanda 2
## Mauritius 2
## Papua New Guinea 3
## Singapore 2
## Iran Islamic Rep. 2
## Poland 1
## Italy 1
## Cuba 2
## Peru 2
## Vanuatu 2
## Cambodia 2
## Hungary 1
## Central African Republic 3
## Australia 1
## Egypt Arab Rep. 2
## Georgia 2
## Bahamas The 2
## Suriname 2
## Luxembourg 1
## Libya 2
## United States 1
## Argentina 2
## Bhutan 2
## Cyprus 1
## Cabo Verde 2
## Lao PDR 3
## Iceland 1
## Togo 3
## Estonia 1
## Philippines 2
## Romania 1
## Haiti 3
## Cote d'Ivoire 3
## Burkina Faso 3
## Ecuador 2
## Timor-Leste 2
## Fiji 2
## Mali 3
## Samoa 2
## Sri Lanka 2
## Ethiopia 3
## France 1
## Malaysia 2
## Tonga 2
## South Africa 3
## Tunisia 2
## Venezuela RB 2
## Djibouti 3
## Nicaragua 2
## Bahrain 2
fviz_cluster(result1, data = data_cluster,
main = 'Cluster Plot',
xlab = 'x',
ylab = 'y',
geom = "point",
ggtheme = theme_bw())
Step 8
Dengan menggunakan package dplyr gunakan pipe operator %>% untuk melakukan manipulasi data agar menghasilkan statistika deskriptif yang diinginkan. Gunakan function group_by dan summarize_all untuk menghasilkan rata-rata, nilai minimum, dan nilai maximum.
summary <- data_cluster %>%
group_by(cluster) %>%
summarize_all(list(mean = mean, min = min, max = max))
summary
## # A tibble: 3 x 19
## cluster Infant_deaths_mean Under_five_deaths_mean Adult_mortality_mean
## <int> <dbl> <dbl> <dbl>
## 1 1 3.59 4.31 79.8
## 2 2 15.8 18.9 135.
## 3 3 53.3 76.6 281.
## # i 15 more variables: Economy_status_Developed_mean <dbl>,
## # Economy_status_Developing_mean <dbl>, Life_expectancy_mean <dbl>,
## # Infant_deaths_min <dbl>, Under_five_deaths_min <dbl>,
## # Adult_mortality_min <dbl>, Economy_status_Developed_min <dbl>,
## # Economy_status_Developing_min <dbl>, Life_expectancy_min <dbl>,
## # Infant_deaths_max <dbl>, Under_five_deaths_max <dbl>,
## # Adult_mortality_max <dbl>, Economy_status_Developed_max <dbl>, ...
Step 9
Dari hasil analisis di atas, diperoleh bahwa terdapat 3 klaster, dimana klaster 1 berisi negara-negara dengan angka harapan hidup yang tinggi mempunyai rata-rata life_expetancy sebesar 80.15946, klster 2 berisi negara-negara dengan angka harapan hidup yang sedang mempunyai rata-rata life_expetancy sebesar 73.59892, dan klaster 3 berisi negara-negara dengan agka harapan hidup yang rendah mempunyai rata-rata life_expetancy sebesar 60.84490. Pengklasterisasian ini dapat dilihat dari rata-rata life_expetancy pada tabel di atas. Tinggi redahnya life_expetancy dipengaruhi oleh nilai kematian bayi, kematian balita, tingkat mortalitas orang dewasa, dan status sebuah negara apakah termasuk negara berkembang atau negara maju sehingga semua variabel ini menjadi satu kesatuan yang tak terpisahkan untuk menghasilkan nilai angka harapan hidup setiap negara.