Proyecto para Análisis Estadístico. Se utilizó la base de datos de deslizamientos a nivel mundial brindada por el profesor, mediante la cual ejecutamos las gráficas en RStudio mostradas a continuación

Primeras 6 filas de la tabla de datos importada

id date time continent_code country_name country_code state.province population city.town distance location_description latitude longitude geolocation hazard_type landslide_type landslide_size trigger storm_name injuries fatalities source_name source_link
34 3/2/07 Night NA United States US Virginia 16000 Cherry Hill 3.40765 Unknown 38.6009 -77.2682 (38.600900000000003, -77.268199999999993) Landslide Landslide Small Rain NA NA NBC 4 news http://www.nbc4.com/news/11186871/detail.html
42 3/22/07 NA United States US Ohio 17288 New Philadelphia 3.33522 40.5175 -81.4305 (40.517499999999998, -81.430499999999995) Landslide Landslide Small Rain NA NA Canton Rep.com http://www.cantonrep.com/index.php?ID=345054&Category=9&subCategoryID=0
56 4/6/07 NA United States US Pennsylvania 15930 Wilkinsburg 2.91977 Urban area 40.4377 -79.9160 (40.4377, -79.915999999999997) Landslide Landslide Small Rain NA NA The Pittsburgh Channel.com https://web.archive.org/web/20080423132842/http://www.thepittsburghchannel.com/news/11846833/detail.html
59 4/14/07 NA Canada CA Quebec 42786 Châteauguay 2.98682 Above river 45.3226 -73.7771 (45.322600000000001, -73.777100000000004) Landslide Riverbank collapse Small Rain NA NA Le Soleil http://www.hebdos.net/lsc/edition162007/articles.asp?article_id=166976
61 4/15/07 NA United States US Kentucky 6903 Pikeville 5.66542 Below road 37.4325 -82.4931 (37.432499999999997, -82.493099999999998) Landslide Landslide Small Downpour NA 0 Matthew Crawford (KGS)
64 4/20/07 NA United States US Kentucky 6903 Pikeville 0.23715 37.4814 -82.5186 (37.481400000000001, -82.518600000000006) Landslide Landslide Small Rain NA NA Applalachain news-express http://www.news-expressky.com/articles/2007/04/19/top_story/01mudslide.txt

Primero se efectupo una correción en la base de datos el valor nulo NA reemplazandolo por NA.

Luego fue necesario corregir valores repetidos para tipo de deslizamientos ya que había variaciones de caracteres en cuanto a mayúsculas y minúsculas, lo cual afectaba la homogeneidad de los gráficos realizados.

id date time continent_code country_name country_code state.province population city.town distance location_description latitude longitude geolocation hazard_type landslide_type landslide_size trigger storm_name injuries fatalities source_name source_link
34 3/2/07 Night NA United States US Virginia 16000 Cherry Hill 3.40765 Unknown 38.6009 -77.2682 (38.600900000000003, -77.268199999999993) Landslide Landslide Small Rain NA NA NBC 4 news http://www.nbc4.com/news/11186871/detail.html
42 3/22/07 NA United States US Ohio 17288 New Philadelphia 3.33522 40.5175 -81.4305 (40.517499999999998, -81.430499999999995) Landslide Landslide Small Rain NA NA Canton Rep.com http://www.cantonrep.com/index.php?ID=345054&Category=9&subCategoryID=0
56 4/6/07 NA United States US Pennsylvania 15930 Wilkinsburg 2.91977 Urban area 40.4377 -79.9160 (40.4377, -79.915999999999997) Landslide Landslide Small Rain NA NA The Pittsburgh Channel.com https://web.archive.org/web/20080423132842/http://www.thepittsburghchannel.com/news/11846833/detail.html
59 4/14/07 NA Canada CA Quebec 42786 Châteauguay 2.98682 Above river 45.3226 -73.7771 (45.322600000000001, -73.777100000000004) Landslide Riverbank collapse Small Rain NA NA Le Soleil http://www.hebdos.net/lsc/edition162007/articles.asp?article_id=166976
61 4/15/07 NA United States US Kentucky 6903 Pikeville 5.66542 Below road 37.4325 -82.4931 (37.432499999999997, -82.493099999999998) Landslide Landslide Small Downpour NA 0 Matthew Crawford (KGS)
64 4/20/07 NA United States US Kentucky 6903 Pikeville 0.23715 37.4814 -82.5186 (37.481400000000001, -82.518600000000006) Landslide Landslide Small Rain NA NA Applalachain news-express http://www.news-expressky.com/articles/2007/04/19/top_story/01mudslide.txt

Gráfico apilado para población de paises de Norte América y Sur América

En este gráfico se puede observar que los valores para la población perteneciente a Norte América es mayor que la perteneciente a Sur América

Histograma tipos de deslizamientos vs Cantidad para toda la base de datos

En este gráfico se puede observar que la mayoría de deslizamientos ocurridos son del tipo deslizamiento de lodo y deslizamiento de arena

Histograma con tipos de deslizamientos por país para Norte América

Primeras 6 filas de la base filtrada por Norte América

id date time continent_code country_name country_code state.province population city.town distance location_description latitude longitude geolocation hazard_type landslide_type landslide_size trigger storm_name injuries fatalities source_name source_link
34 3/2/07 Night NA United States US Virginia 16000 Cherry Hill 3.40765 Unknown 38.6009 -77.2682 (38.600900000000003, -77.268199999999993) Landslide Landslide Small Rain NA NA NBC 4 news http://www.nbc4.com/news/11186871/detail.html
42 3/22/07 NA United States US Ohio 17288 New Philadelphia 3.33522 40.5175 -81.4305 (40.517499999999998, -81.430499999999995) Landslide Landslide Small Rain NA NA Canton Rep.com http://www.cantonrep.com/index.php?ID=345054&Category=9&subCategoryID=0
56 4/6/07 NA United States US Pennsylvania 15930 Wilkinsburg 2.91977 Urban area 40.4377 -79.9160 (40.4377, -79.915999999999997) Landslide Landslide Small Rain NA NA The Pittsburgh Channel.com https://web.archive.org/web/20080423132842/http://www.thepittsburghchannel.com/news/11846833/detail.html
59 4/14/07 NA Canada CA Quebec 42786 Châteauguay 2.98682 Above river 45.3226 -73.7771 (45.322600000000001, -73.777100000000004) Landslide Riverbank collapse Small Rain NA NA Le Soleil http://www.hebdos.net/lsc/edition162007/articles.asp?article_id=166976
61 4/15/07 NA United States US Kentucky 6903 Pikeville 5.66542 Below road 37.4325 -82.4931 (37.432499999999997, -82.493099999999998) Landslide Landslide Small Downpour NA 0 Matthew Crawford (KGS)
64 4/20/07 NA United States US Kentucky 6903 Pikeville 0.23715 37.4814 -82.5186 (37.481400000000001, -82.518600000000006) Landslide Landslide Small Rain NA NA Applalachain news-express http://www.news-expressky.com/articles/2007/04/19/top_story/01mudslide.txt
## Warning: Removed 1 rows containing missing values (geom_bar).

Este Gráfico muestra los tipos de deslizamientos ocurridos en cada país y en el eje y observamos la distancia de dichos eventos

Histograma con tipos de deslizamientos por país para Norte América

SAMERICA <- subset( data, continent_code == "SA")
knitr::kable(head(SAMERICA))
id date time continent_code country_name country_code state.province population city.town distance location_description latitude longitude geolocation hazard_type landslide_type landslide_size trigger storm_name injuries fatalities source_name source_link
8 77 5/21/07 SA Colombia CO Risaralda 440118 Pereira 0.62022 4.8081 -75.6941 (4.8080999999999996, -75.694100000000006) Landslide Mudslide Large Rain NA 13 Reuters - AlertNet.org http://www.reuters.com/news/video/videoStory?videoId=53594&feedType=RSS&rpc=23
9 105 6/27/07 SA Ecuador EC Zamora-Chinchipe 15276 Zamora 0.47714 -4.0650 -78.9510 (-4.0650000000000004, -78.950999999999993) Landslide Landslide Medium Downpour NA NA Red Cross - Field reports https://www-secure.ifrc.org/dmis/prepare/view_report.asp?ReportID=2908
10 106 6/27/07 SA Ecuador EC Loja 117796 Loja 0.35649 -3.9900 -79.2050 (-3.99, -79.204999999999998) Landslide Landslide Medium Downpour NA NA Red Cross - Field reports https://www-secure.ifrc.org/dmis/prepare/view_report.asp?ReportID=2908
11 107 6/27/07 SA Ecuador EC Pichincha 5114 Sangolquí 33.94603 -0.3560 -78.1480 (-0.35599999999999998, -78.147999999999996) Landslide Landslide Medium Downpour NA NA Red Cross - Field reports https://www-secure.ifrc.org/dmis/prepare/view_report.asp?ReportID=2908
49 307 10/13/07 SA Colombia CO Cauca 9985 Suárez 8.46579 2.9437 -76.7719 (2.9437000000000002, -76.771900000000002) Landslide Mudslide Large Continuous rain NA 24 Reuters - AlertNet.org http://www.reuters.com/article/newsOne/idUSN1329387220071013
70 397 12/19/07 SA Colombia CO Tolima 4892 Ambalema 6.96130 4.8470 -74.7631 (4.8470000000000004, -74.763099999999994) Landslide Landslide Large Rain NA NA Indiamuslims.info http://www.indiamuslims.info/news/2007/dec/20/eight_people_rescued_colombian_landslide.html
ggplot(SAMERICA, aes(fill=landslide_type, y=distance, x=country_code)) + geom_bar(position="dodge", stat="identity")

knitr::kable(head(SAMERICA))
id date time continent_code country_name country_code state.province population city.town distance location_description latitude longitude geolocation hazard_type landslide_type landslide_size trigger storm_name injuries fatalities source_name source_link
8 77 5/21/07 SA Colombia CO Risaralda 440118 Pereira 0.62022 4.8081 -75.6941 (4.8080999999999996, -75.694100000000006) Landslide Mudslide Large Rain NA 13 Reuters - AlertNet.org http://www.reuters.com/news/video/videoStory?videoId=53594&feedType=RSS&rpc=23
9 105 6/27/07 SA Ecuador EC Zamora-Chinchipe 15276 Zamora 0.47714 -4.0650 -78.9510 (-4.0650000000000004, -78.950999999999993) Landslide Landslide Medium Downpour NA NA Red Cross - Field reports https://www-secure.ifrc.org/dmis/prepare/view_report.asp?ReportID=2908
10 106 6/27/07 SA Ecuador EC Loja 117796 Loja 0.35649 -3.9900 -79.2050 (-3.99, -79.204999999999998) Landslide Landslide Medium Downpour NA NA Red Cross - Field reports https://www-secure.ifrc.org/dmis/prepare/view_report.asp?ReportID=2908
11 107 6/27/07 SA Ecuador EC Pichincha 5114 Sangolquí 33.94603 -0.3560 -78.1480 (-0.35599999999999998, -78.147999999999996) Landslide Landslide Medium Downpour NA NA Red Cross - Field reports https://www-secure.ifrc.org/dmis/prepare/view_report.asp?ReportID=2908
49 307 10/13/07 SA Colombia CO Cauca 9985 Suárez 8.46579 2.9437 -76.7719 (2.9437000000000002, -76.771900000000002) Landslide Mudslide Large Continuous rain NA 24 Reuters - AlertNet.org http://www.reuters.com/article/newsOne/idUSN1329387220071013
70 397 12/19/07 SA Colombia CO Tolima 4892 Ambalema 6.96130 4.8470 -74.7631 (4.8470000000000004, -74.763099999999994) Landslide Landslide Large Rain NA NA Indiamuslims.info http://www.indiamuslims.info/news/2007/dec/20/eight_people_rescued_colombian_landslide.html

Gráfico circular deslizamientos América del Norte vs América del sur

En este gráfico podemos comparar la proporción de eventos registrados de deslizamientos que contiene la base de datos para Norte América y Sur América

Diagrama de pareto - tipo de deslizamientos vs cantidad de eventos presentados

Este gráfico permite observar como la mayoría de eventos registrados corresponden a deslizamientos de arena y deslizamientos de lodo. Posterior se encuentran caídas de rocas, las cuales disminuyen notablemente en su aparición en el registro frente a las 2 primeras mencionadas

Diagrama de tallo y hojas para muertes por deslizamientos en Colombia

6 primeras filas de la base de datos filtrada para Colombia

id date time continent_code country_name country_code state.province population city.town distance location_description latitude longitude geolocation hazard_type landslide_type landslide_size trigger storm_name injuries fatalities source_name source_link
8 77 5/21/07 SA Colombia CO Risaralda 440118 Pereira 0.62022 4.8081 -75.6941 (4.8080999999999996, -75.694100000000006) Landslide Mudslide Large Rain NA 13 Reuters - AlertNet.org http://www.reuters.com/news/video/videoStory?videoId=53594&feedType=RSS&rpc=23
49 307 10/13/07 SA Colombia CO Cauca 9985 Suárez 8.46579 2.9437 -76.7719 (2.9437000000000002, -76.771900000000002) Landslide Mudslide Large Continuous rain NA 24 Reuters - AlertNet.org http://www.reuters.com/article/newsOne/idUSN1329387220071013
70 397 12/19/07 SA Colombia CO Tolima 4892 Ambalema 6.96130 4.8470 -74.7631 (4.8470000000000004, -74.763099999999994) Landslide Landslide Large Rain NA NA Indiamuslims.info http://www.indiamuslims.info/news/2007/dec/20/eight_people_rescued_colombian_landslide.html
103 562 5/31/08 SA Colombia CO Antioquia 1999979 Medellín 5.12170 6.2746 -75.6039 (6.2746000000000004, -75.603899999999996) Landslide Complex Large Downpour NA 27 http://english.people.com.cn/90001/90777/90852/6422291.html
110 605 6/24/08 SA Colombia CO Norte de Santander 1502 Hacarí 0.38844 8.3200 -73.1500 (8.32, -73.150000000000006) Landslide Landslide Medium Downpour NA 10 http://news.xinhuanet.com/english/2008-06/25/content_8434589.htm
117 644 7/14/08 SA Colombia CO Cundinamarca 1374 Quetame 8.58891 4.4100 -73.8600 (4.41, -73.86) Landslide Landslide Medium Downpour NA 4 http://news.xinhuanet.com/english/2008-07/15/content_8548107.htm
## 
##   The decimal point is 1 digit(s) to the right of the |
## 
##   0 | 00000000000000000000000000001111112222222222223333344444444455555555
##   1 | 000133
##   2 | 047
##   3 | 
##   4 | 8
##   5 | 
##   6 | 
##   7 | 
##   8 | 
##   9 | 12
weight Time Chick Diet
42 0 1 1
51 2 1 1
59 4 1 1
64 6 1 1
76 8 1 1
93 10 1 1
## 
##   The decimal point is 1 digit(s) to the right of the |
## 
##    2 | 599999999
##    4 | 00000111111111111111111112222222222222223333456678888888899999999999+38
##    6 | 00111111122222222333334444455555666677777888888900111111222222333334+8
##    8 | 00112223344444455555566777788999990001223333566666788888889
##   10 | 0000111122233333334566667778889901122223445555667789
##   12 | 00002223333344445555667788890113444555566788889
##   14 | 11123444455556666677788890011234444555666777777789
##   16 | 00002233334444466788990000134445555789
##   18 | 12244444555677782225677778889999
##   20 | 0123444555557900245578
##   22 | 0012357701123344556788
##   24 | 08001699
##   26 | 12344569259
##   28 | 01780145
##   30 | 355798
##   32 | 12712
##   34 | 1
##   36 | 13
## 
##   The decimal point is 1 digit(s) to the right of the |
## 
##    3 | 599999999
##    4 | 00000111111111111111111112222222222222223333456678888888899999999999
##    5 | 000000111111112222333334445555566667778888899999
##    6 | 001111111222222223333344444555556666777778888889
##    7 | 0011111122222233333444444446667778889999
##    8 | 0011222334444445555556677778899999
##    9 | 0001223333566666788888889
##   10 | 00001111222333333345666677788899
##   11 | 01122223445555667789
##   12 | 0000222333334444555566778889
##   13 | 0113444555566788889
##   14 | 1112344445555666667778889
##   15 | 0011234444555666777777789
##   16 | 0000223333444446678899
##   17 | 0000134445555789
##   18 | 1224444455567778
##   19 | 2225677778889999
##   20 | 01234445555579
##   21 | 00245578
##   22 | 00123577
##   23 | 01123344556788
##   24 | 08
##   25 | 001699
##   26 | 12344569
##   27 | 259
##   28 | 0178
##   29 | 0145
##   30 | 35579
##   31 | 8
##   32 | 127
##   33 | 12
##   34 | 1
##   35 | 
##   36 | 1
##   37 | 3

Este diagrama permite observar como se distribuyen la cantidad de muertes en las cifras

Gráfico series de tiempo para deslizamientos en Colombia

##    Year Muertes
## 1  2007      13
## 2  2007      24
## 3  2007      27
## 4  2008      10
## 5  2008       4
## 6  2008       8
## 7  2008       8
## 8  2008      10
## 9  2008       5
## 10 2008       1

Tablas de frecuencia para muertes por deslizamientos en Colombia

## Warning: package 'questionr' was built under R version 4.1.1
n % val% %cum val%cum
0 28 30.8 30.8 30.8 30.8
2 12 13.2 13.2 44.0 44.0
4 9 9.9 9.9 53.8 53.8
5 8 8.8 8.8 62.6 62.6
1 6 6.6 6.6 69.2 69.2
3 5 5.5 5.5 74.7 74.7
6 5 5.5 5.5 80.2 80.2
9 3 3.3 3.3 83.5 83.5
10 3 3.3 3.3 86.8 86.8
8 2 2.2 2.2 89.0 89.0
13 2 2.2 2.2 91.2 91.2
7 1 1.1 1.1 92.3 92.3
11 1 1.1 1.1 93.4 93.4
20 1 1.1 1.1 94.5 94.5
24 1 1.1 1.1 95.6 95.6
27 1 1.1 1.1 96.7 96.7
48 1 1.1 1.1 97.8 97.8
91 1 1.1 1.1 98.9 98.9
92 1 1.1 1.1 100.0 100.0
Total 91 100.0 100.0 100.0 100.0
## Classes 'freqtab' and 'data.frame':  20 obs. of  5 variables:
##  $ n      : num  28 12 9 8 6 5 5 3 3 2 ...
##  $ %      : num  30.8 13.2 9.9 8.8 6.6 5.5 5.5 3.3 3.3 2.2 ...
##  $ val%   : num  30.8 13.2 9.9 8.8 6.6 5.5 5.5 3.3 3.3 2.2 ...
##  $ %cum   : num  30.8 44 53.8 62.6 69.2 74.7 80.2 83.5 86.8 89 ...
##  $ val%cum: num  30.8 44 53.8 62.6 69.2 74.7 80.2 83.5 86.8 89 ...

Tabla de frecuencias agrupada para años en que se presentaron deslizamientos en Colombia según la base de datos

## [1]  7  9 11 13 15 17
A Freq Rel_Freq Cum_Freq
(7,9] 11 0.1250000 11
(9,11] 57 0.6477273 68
(11,13] 7 0.0795455 75
(13,15] 12 0.1363636 87
(15,17] 1 0.0113636 88
## 'data.frame':    5 obs. of  4 variables:
##  $ A       : Factor w/ 5 levels "(7,9]","(9,11]",..: 1 2 3 4 5
##  $ Freq    : int  11 57 7 12 1
##  $ Rel_Freq: num  0.125 0.6477 0.0795 0.1364 0.0114
##  $ Cum_Freq: int  11 68 75 87 88
x y
(7,9] 11
(9,11] 57
(11,13] 7
(13,15] 12
(15,17] 1

Estadísticos para heridos y muertos a causa de deslizamientos en Sur América

##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max.    NA's 
##   0.000   0.000   0.000   3.571   3.500  40.000    1658
## Warning: package 'pastecs' was built under R version 4.1.1
##           x          y
## nbr.val  NA   5.000000
## nbr.null NA   0.000000
## nbr.na   NA   0.000000
## min      NA   1.000000
## max      NA  57.000000
## range    NA  56.000000
## sum      NA  88.000000
## median   NA  11.000000
## mean     NA  17.600000
## SE.mean  NA  10.037928
## CI.mean  NA  27.869756
## var      NA 503.800000
## std.dev  NA  22.445490
## coef.var NA   1.275312

Esta gráfica permite observar

Gráfico de caja y bigotes

Este gráfico permite analizar la cantidad de muertes por eventos de deslizamientos en Colombia, así mismo se pueden observar valores aberrantes o atípicos de muertes, donde el número de fallecidos excede enormemente el usual según el histórico reportado.

Conclusión

En conclusión, pudimos organizar de diferentes formas información contenida en la base de datos para analizar y mostrar de una forma más eficiente partes que consideramos importantes. El uso de RStudio y RMarckdown nos permitió realizar dichas gráficas y adquirir experiencia en el uso de base de datos importadas desde un csv, lo cual nos brinda más herramientas a la hora de realizar nuestros futuros trabajos.