J. Kavanagh
2022-07-22
In this series of slides we will be exploring the Lisbon dataset. Use the following command to import the libson.csv file
First examine the dataset, note the number of rows & variables and the class types.
## Rows: 250
## Columns: 79
## $ Date <chr> "1704-10-06", "1711-10-06", "1712-03-01",…
## $ Date_Partial <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, N…
## $ First.Name.EN <chr> "Mary", NA, NA, NA, NA, "Sarah", "Frances…
## $ Surname.EN <chr> "Sed", NA, NA, NA, NA, " Coghron", "Hughe…
## $ Name.PG <chr> "Maria Cedly", "Anna Wolfo", "Margaret Fa…
## $ DOB <lgl> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, N…
## $ Age <int> 21, 26, 22, 21, 31, 30, 21, 30, 34, NA, 2…
## $ Age_Partial <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, N…
## $ Place.of.Birth <chr> NA, "Ambur(?)", "Bratefort(?)", "Carbery"…
## $ Nationality <chr> "Irish", "Irish", "Irish", "Irish", "Iris…
## $ First.Religion <chr> "Calvinism", "Protestantism", "Protestant…
## $ Second.Religion <chr> "Catholicism", "Catholicism", "Catholicis…
## $ Reason.for.Religious.Change <chr> "She was baptised as a Calvinist but afte…
## $ Occupation <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, N…
## $ Local.Address <chr> "Benfica, Lisbon", "Lisbon", "Lisbon", "L…
## $ Irish.Address.Type <chr> NA, "Ambur(?)", "Brateford(?)", "Carbery"…
## $ Province <chr> NA, NA, NA, "Munster", NA, "HibŽrnia", NA…
## $ County <chr> NA, NA, NA, "Cork", "Linster", NA, "Cork"…
## $ City <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, N…
## $ Town <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, N…
## $ Village <chr> NA, NA, NA, NA, "Grobeg ", NA, NA, NA, NA…
## $ Marital.Status <chr> "Married", "Married", "Married", "Married…
## $ Spouse.Type <chr> "Husband", "Husband", "Husband", "Husband…
## $ No.of.Dependents <int> NA, NA, NA, NA, NA, NA, NA, NA, NA, 1, NA…
## $ Mother.First.Name.EN <chr> "Catherine", NA, NA, NA, "Maria ", "Anna"…
## $ Mother.Surname.EN <chr> NA, NA, NA, NA, "Wall", "Farmour", "Bel "…
## $ Mother.Name.PG <chr> "Catarina Boulen", "Izabella Ulssen", "Iz…
## $ Mother.Occupation <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, N…
## $ Mother.Religion <chr> "Calvinist", "Protestantism", "Protestant…
## $ Mother.Local.Address <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, N…
## $ Mother.Irish.Address.Type <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, N…
## $ Mother.Province <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, N…
## $ Mother.County <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, N…
## $ Mother.City <lgl> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, N…
## $ Mother.Town <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, N…
## $ Mother.Village <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, N…
## $ Father.First.Name.EN <chr> "John", NA, NA, NA, "Robert ", "Thomas ",…
## $ Father.Surname.EN <chr> "Cedly", NA, NA, NA, "Wall", "Fuitor", NA…
## $ Father.Name.PG <chr> "Jo‹o Cedly", "Antonio Volfo ", "Sim‹o Fe…
## $ Father.Occupation <chr> "Farmer", NA, NA, NA, NA, NA, NA, NA, NA,…
## $ Father.Religion <chr> "Calvinist", "Protestantism", "Protestant…
## $ Father.Local.Address <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, N…
## $ Father.Irish.Address.Type <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, N…
## $ Father.Province <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, N…
## $ Father.County <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, N…
## $ Father.City <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, N…
## $ Father.Town <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, N…
## $ Father.Village <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, N…
## $ Subject.Spouse.First.Name.EN <chr> "Henry", NA, NA, "Robert", NA, NA, NA, "J…
## $ Subject.Spouse.Surname.EN <chr> NA, NA, NA, "Purcell", NA, NA, NA, "Garnd…
## $ Subject.Spouse.Name.PG <chr> "Henrique Maur’cio", "Jo‹o Dilon", "Henri…
## $ Spouse.Occupation <chr> "Infant Soldier", NA, "Carpenter", "Soldi…
## $ Spouse.Religion <chr> NA, "Catholicism", NA, NA, NA, NA, NA, NA…
## $ Spouse.Local.Address <chr> "Benfica, Lisbon", "Lisbon", "Lisbon", NA…
## $ Spouse.Irish.Address.Type <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, N…
## $ Spouse.Province <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, N…
## $ Spouse.County <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, N…
## $ Spouse.City <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, N…
## $ Spouse.Town <lgl> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, N…
## $ Spouse.Village <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, N…
## $ Inquisitor.1.First.Name.PG <chr> "Paulo Afonso", "Manuel", "Jo‹o", "Manuel…
## $ Inquistior.1.Surname.PG <chr> "de Albuquerque", "da Cunha Pinheiro", "d…
## $ Inquisitor.1.Org <chr> "Third House of the Inquistion Hearings",…
## $ Inquisitor.2.First.Name.PG <chr> NA, NA, NA, "Lu’s Barata", NA, NA, NA, "L…
## $ Inquisitor.2.Surname.PG <chr> NA, NA, NA, "de Lima", NA, NA, NA, "de Li…
## $ Inquisitor.2.Org <chr> NA, NA, NA, "House of the Dispatch of the…
## $ Commissioner.First.Name.PG <chr> NA, NA, NA, NA, NA, "Father Doctor Maunel…
## $ Commissioner.Surname.PG <chr> NA, NA, NA, NA, NA, "Rodrigues", NA, NA, …
## $ Commissioner.Org <chr> NA, NA, NA, NA, NA, "the mother church of…
## $ Curator.First.Name.PG <chr> "Father Crist—v‹o", NA, "Father Friar Tho…
## $ Curator.Surname.PG <chr> "Bruno", NA, "of Jesus Mary", "de Santa M…
## $ Interpreter.First.Name.PG <chr> "Father Crist—v‹o", "Father Friar Thomas …
## $ Interpreter.Surname.PG <chr> "Bruno", "of Jesus Mary", "of Jesus Mary"…
## $ Interpreter.Occupation <chr> NA, NA, NA, NA, "Priest, Vice-President "…
## $ Interpreter.Org <chr> "St. Peter's Habit", "Order of St. Domini…
## $ Evidence.of.Signature <chr> "Yes ", "No", "No", "No", "No ", "Yes", "…
## $ Literacy.Status <chr> "Yes", "No", "No", "No", "No ", "Yes", "Y…
## $ Additional.Notes <chr> "She was recommended to complete her inst…
## $ Source <chr> "Tribunal do Santo Of’cio, Inquisition of…
Use the ymd() command and verify the results with the class() command
## Warning: 3 failed to parse.
## [1] "Date"
# This creates a new variable, however, you will need to rename the column names
lisbon %>% count(Date) -> lisbon_date
lisbon_date## # A tibble: 210 × 2
## Date n
## <date> <int>
## 1 1657-06-09 1
## 2 1658-01-07 1
## 3 1658-03-12 1
## 4 1658-08-01 1
## 5 1660-11-20 1
## 6 1664-10-07 1
## 7 1665-03-29 1
## 8 1672-01-25 1
## 9 1686-02-06 1
## 10 1686-07-05 1
## # … with 200 more rows
This is vital as you will create multiple smaller dataframes and need to individualise the column names. This will prevent future errors.
# Rename the columns
colnames(lisbon_date) <- c("Date", "Converts")
# Check your results
lisbon_date## # A tibble: 210 × 2
## Date Converts
## <date> <int>
## 1 1657-06-09 1
## 2 1658-01-07 1
## 3 1658-03-12 1
## 4 1658-08-01 1
## 5 1660-11-20 1
## 6 1664-10-07 1
## 7 1665-03-29 1
## 8 1672-01-25 1
## 9 1686-02-06 1
## 10 1686-07-05 1
## # … with 200 more rows
Group nomination dates into years using the floor_date() command from the ‘lubridate’ package. Its fairly intelligent and can reorganise dates into days, months, years etc.
lisbon_date %>% group_by(year=floor_date(Date, "year")) %>% summarize(Converts=sum(Converts)) -> lisbon_yearly
lisbon_yearly## # A tibble: 43 × 2
## year Converts
## <date> <int>
## 1 1657-01-01 1
## 2 1658-01-01 3
## 3 1660-01-01 1
## 4 1664-01-01 1
## 5 1665-01-01 1
## 6 1672-01-01 1
## 7 1686-01-01 2
## 8 1690-01-01 1
## 9 1691-01-01 1
## 10 1697-01-01 1
## # … with 33 more rows
Group nomination dates into years using the floor_date() command from the ‘lubridate’ package. Its fairly intelligent and can reorganise dates into days, months, years etc.
lisbon_date %>% group_by(year=floor_date(Date, "month")) %>% summarize(Converts=sum(Converts)) -> lisbon_monthly
lisbon_monthly## # A tibble: 157 × 2
## year Converts
## <date> <int>
## 1 1657-06-01 1
## 2 1658-01-01 1
## 3 1658-03-01 1
## 4 1658-08-01 1
## 5 1660-11-01 1
## 6 1664-10-01 1
## 7 1665-03-01 1
## 8 1672-01-01 1
## 9 1686-02-01 1
## 10 1686-07-01 1
## # … with 147 more rows
Using ggplot we can display the findings
## Warning: Removed 1 row(s) containing missing values (geom_path).
## [1] NA
## [2] NA
## [3] NA
## [4] NA
## [5] NA
## [6] NA
## [7] NA
## [8] NA
## [9] NA
## [10] NA
## [11] NA
## [12] NA
## [13] "Assistant "
## [14] NA
## [15] NA
## [16] NA
## [17] NA
## [18] NA
## [19] "Sailor"
## [20] NA
## [21] "Sailor"
## [22] "Sailor "
## [23] NA
## [24] "Sailor "
## [25] NA
## [26] NA
## [27] NA
## [28] "Tailor"
## [29] "Confectioner"
## [30] "Sailor"
## [31] "Caretaker "
## [32] "Sailor"
## [33] NA
## [34] "Master at sea, Assistant "
## [35] "Sailor"
## [36] "Sailor, assistant at the 35-year-old Catechumens College"
## [37] NA
## [38] NA
## [39] NA
## [40] NA
## [41] NA
## [42] NA
## [43] NA
## [44] NA
## [45] "Craft of Springardeiro, Seaman "
## [46] "Carpenter"
## [47] NA
## [48] "Apothecary"
## [49] "Carpenter"
## [50] "Farmer "
## [51] "Fisherman "
## [52] "Assistant "
## [53] "Sailor "
## [54] "Horse Lieutenant"
## [55] NA
## [56] NA
## [57] "Sailor"
## [58] NA
## [59] "Sailor"
## [60] NA
## [61] NA
## [62] NA
## [63] "Sailor"
## [64] NA
## [65] "soldier, assistant "
## [66] NA
## [67] NA
## [68] "Sailor"
## [69] NA
## [70] "Clerk"
## [71] "soldier"
## [72] "Tailor"
## [73] "Soldier"
## [74] NA
## [75] "Sailor"
## [76] NA
## [77] NA
## [78] NA
## [79] NA
## [80] NA
## [81] NA
## [82] NA
## [83] "Shoemaker"
## [84] NA
## [85] NA
## [86] "Sailor"
## [87] NA
## [88] "Sailor"
## [89] NA
## [90] "Sailor"
## [91] "Hairdresser"
## [92] NA
## [93] NA
## [94] "Sailor"
## [95] "Sailor"
## [96] NA
## [97] "Sailor"
## [98] NA
## [99] NA
## [100] NA
## [101] NA
## [102] NA
## [103] "Sailor"
## [104] "Sailor"
## [105] "Locksmith, Assistant, Soldier"
## [106] "Sailor"
## [107] NA
## [108] NA
## [109] "Sailor"
## [110] NA
## [111] "Soldier"
## [112] NA
## [113] NA
## [114] NA
## [115] NA
## [116] NA
## [117] NA
## [118] NA
## [119] NA
## [120] "No trade"
## [121] NA
## [122] "Infant Soldier "
## [123] "shoemaker and sailor, assistant at the so-called Convent of the Holy Body"
## [124] NA
## [125] "Carpenter officer "
## [126] NA
## [127] "Consul appointed to the Island of Faial"
## [128] "Soldier"
## [129] NA
## [130] NA
## [131] "Assistant "
## [132] NA
## [133] NA
## [134] "without a trade"
## [135] "Tanner, Bacehlor"
## [136] NA
## [137] "Soldier "
## [138] "Tailor "
## [139] NA
## [140] "Cardigan, Assistant "
## [141] NA
## [142] "Sailor, Assistant "
## [143] "Sailor "
## [144] "carpenter, bachelor, and an assistant in this Court in the parish of S‹o Paulo"
## [145] NA
## [146] "Seaman "
## [147] NA
## [148] NA
## [149] "Clerk, Sailor "
## [150] "Serving Girl "
## [151] "Carpenter "
## [152] NA
## [153] NA
## [154] NA
## [155] NA
## [156] NA
## [157] NA
## [158] "Trumpet of the Regiment of Colonel Goncalo Pires Bandeira "
## [159] "Sailor, Assistant "
## [160] "Seaman "
## [161] "Seaman "
## [162] NA
## [163] NA
## [164] NA
## [165] NA
## [166] NA
## [167] NA
## [168] "Shoemaker "
## [169] "Sailor "
## [170] "Hunter"
## [171] NA
## [172] NA
## [173] NA
## [174] NA
## [175] NA
## [176] NA
## [177] NA
## [178] NA
## [179] "Assistant "
## [180] "Shoemaker"
## [181] "Sailor"
## [182] NA
## [183] "Weaver"
## [184] "Surgeon"
## [185] "Pilot"
## [186] "Sailor "
## [187] "Sock Weaver, Assistant at the College of Catechumens"
## [188] "Tailor, Assistant at the 44-year-old College of Catechumens"
## [189] NA
## [190] NA
## [191] NA
## [192] "Soldier"
## [193] "Assistant at the Court of Lisbon"
## [194] "Soldier"
## [195] NA
## [196] NA
## [197] NA
## [198] NA
## [199] NA
## [200] NA
## [201] "Assistant"
## [202] NA
## [203] NA
## [204] NA
## [205] NA
## [206] "Sailor"
## [207] "Sailor"
## [208] "Sailor"
## [209] NA
## [210] NA
## [211] "Soldier"
## [212] NA
## [213] NA
## [214] "infantry lieutenant"
## [215] "a despenseiro "
## [216] "Assistant "
## [217] NA
## [218] "Soldier "
## [219] "Sailor"
## [220] NA
## [221] NA
## [222] "Soldier"
## [223] "Carpenter "
## [224] NA
## [225] NA
## [226] NA
## [227] NA
## [228] NA
## [229] NA
## [230] "Businessman "
## [231] NA
## [232] NA
## [233] "Sailor"
## [234] "Pilot"
## [235] NA
## [236] "Sailor, Assistant "
## [237] "Sailor, Assistant "
## [238] NA
## [239] NA
## [240] "Sailor navigator, Assistant "
## [241] "Seaman "
## [242] NA
## [243] NA
## [244] NA
## [245] NA
## [246] NA
## [247] NA
## [248] NA
## [249] NA
## [250] NA
## # A tibble: 12 × 2
## First.Religion n
## <chr> <int>
## 1 Anabaptism 1
## 2 Anglican 4
## 3 Apostolic 1
## 4 Calvinism 11
## 5 Catholicism 36
## 6 Lutheran 15
## 7 Presbyterian 14
## 8 Presbyterian Calvinist 1
## 9 Protestantism 155
## 10 Quaker 1
## 11 Quakers/Trembling 1
## 12 <NA> 10