Diana Piskareva and Daria Rukosueva contributed equally to this work. The project was carried out jointly, so individual contributions are difficult to separate.

RQ: In Greece, are people's satisfaction with the national government, happiness, and trust in the legal system related to their perceived ability to influence politics?

library(rio)  # provides import(), which reads the SPSS .sav file
df <- import("/Users/DP/OneDrive/Рабочий стол/ESS10.sav")
str(df)

## 'data.frame':    33351 obs. of  586 variables:
##  $ name     : chr  "ESS10e02_2" "ESS10e02_2" "ESS10e02_2" "ESS10e02_2" ...
##   ..- attr(*, "label")= chr "Title of dataset"
##   ..- attr(*, "format.spss")= chr "A10"
##  $ essround : num  10 10 10 10 10 10 10 10 10 10 ...
##   ..- attr(*, "label")= chr "ESS round"
##   ..- attr(*, "format.spss")= chr "F2.0"
##  $ edition  : chr  "2.2" "2.2" "2.2" "2.2" ...
##   ..- attr(*, "label")= chr "Edition"
##   ..- attr(*, "format.spss")= chr "A3"
##  $ proddate : chr  "21.12.2022" "21.12.2022" "21.12.2022" "21.12.2022" ...
##   ..- attr(*, "label")= chr "Production date"
##   ..- attr(*, "format.spss")= chr "A10"
##  $ idno     : num  10002 10006 10009 10024 10027 ...
##   ..- attr(*, "label")= chr "Respondent's identification number"
##   ..- attr(*, "format.spss")= chr "F5.0"
##  $ cntry    : chr  "BG" "BG" "BG" "BG" ...
##   ..- attr(*, "label")= chr "Country"
##   ..- attr(*, "format.spss")= chr "A2"
##   ..- attr(*, "labels")= Named chr [1:40] "AL" "AT" "BE" "BG" ...
##   .. ..- attr(*, "names")= chr [1:40] "Albania" "Austria" "Belgium" "Bulgaria" ...
##  $ dweight  : num  1.939 1.652 0.315 0.673 0.395 ...
##   ..- attr(*, "label")= chr "Design weight"
##   ..- attr(*, "format.spss")= chr "F3.2"
##  $ pspwght  : num  1.291 1.431 0.113 1.436 0.585 ...
##   ..- attr(*, "label")= chr "Post-stratification weight including design weight"
##   ..- attr(*, "format.spss")= chr "F3.2"
##  $ pweight  : num  0.218 0.218 0.218 0.218 0.218 ...
##   ..- attr(*, "label")= chr "Population size weight (must be combined with dweight or pspwght)"
##   ..- attr(*, "format.spss")= chr "F3.2"
##  $ anweight : num  0.281 0.3115 0.0246 0.3127 0.1273 ...
##   ..- attr(*, "label")= chr "Analysis weight"
##   ..- attr(*, "format.spss")= chr "F3.2"
##  $ prob     : num  0.000314 0.000368 0.001932 0.000904 0.00154 ...
##   ..- attr(*, "label")= chr "Sampling probability"
##   ..- attr(*, "format.spss")= chr "F3.2"
##  $ stratum  : num  185 186 175 148 138 182 157 168 156 135 ...
##   ..- attr(*, "label")= chr "Sampling stratum"
##   ..- attr(*, "format.spss")= chr "F4.0"
##  $ psu      : num  2429 2387 2256 2105 2065 ...
##   ..- attr(*, "label")= chr "Primary sampling unit"
##   ..- attr(*, "format.spss")= chr "F5.0"
##  $ nwspol   : num  80 63 390 60 120 60 30 70 60 60 ...
##   ..- attr(*, "label")= chr "News about politics and current affairs, watching, reading or listening, in minutes"
##   ..- attr(*, "format.spss")= chr "F4.0"
##   ..- attr(*, "labels")= Named num [1:3] 7777 8888 9999
##   .. ..- attr(*, "names")= chr [1:3] "Refusal" "Don't know" "No answer"
##  $ netusoft : num  1 5 5 5 5 5 5 5 1 1 ...
##   ..- attr(*, "label")= chr "Internet use, how often"
##   ..- attr(*, "format.spss")= chr "F1.0"
##   ..- attr(*, "labels")= Named num [1:8] 1 2 3 4 5 7 8 9
##   .. ..- attr(*, "names")= chr [1:8] "Never" "Only occasionally" "A few times a week" "Most days" ...
##  $ netustm  : num  NA 180 405 80 120 60 120 260 NA NA ...
##   ..- attr(*, "label")= chr "Internet use, how much time on typical day, in minutes"
##   ..- attr(*, "format.spss")= chr "F4.0"
##   ..- attr(*, "labels")= Named num [1:4] 6666 7777 8888 9999
##   .. ..- attr(*, "names")= chr [1:4] "Not applicable" "Refusal" "Don't know" "No answer"
##  $ ppltrst  : num  5 0 5 5 4 3 3 4 5 0 ...
##   ..- attr(*, "label")= chr "Most people can be trusted or you can't be too careful"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:14] 0 1 2 3 4 5 6 7 8 9 ...
##   .. ..- attr(*, "names")= chr [1:14] "You can't be too careful" "1" "2" "3" ...
##  $ pplfair  : num  5 6 3 5 4 3 5 4 7 1 ...
##   ..- attr(*, "label")= chr "Most people try to take advantage of you, or try to be fair"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:14] 0 1 2 3 4 5 6 7 8 9 ...
##   .. ..- attr(*, "names")= chr [1:14] "Most people try to take advantage of me" "1" "2" "3" ...
##  $ pplhlp   : num  1 2 4 3 2 3 5 6 4 2 ...
##   ..- attr(*, "label")= chr "Most of the time people helpful or mostly looking out for themselves"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:14] 0 1 2 3 4 5 6 7 8 9 ...
##   .. ..- attr(*, "names")= chr [1:14] "People mostly look out for themselves" "1" "2" "3" ...
##  $ polintr  : num  4 1 3 4 1 1 3 3 3 3 ...
##   ..- attr(*, "label")= chr "How interested in politics"
##   ..- attr(*, "format.spss")= chr "F1.0"
##   ..- attr(*, "labels")= Named num [1:7] 1 2 3 4 7 8 9
##   .. ..- attr(*, "names")= chr [1:7] "Very interested" "Quite interested" "Hardly interested" "Not at all interested" ...
##  $ psppsgva : num  1 4 3 1 3 1 2 3 4 1 ...
##   ..- attr(*, "label")= chr "Political system allows people to have a say in what government does"
##   ..- attr(*, "format.spss")= chr "F1.0"
##   ..- attr(*, "labels")= Named num [1:8] 1 2 3 4 5 7 8 9
##   .. ..- attr(*, "names")= chr [1:8] "Not at all" "Very little" "Some" "A lot" ...
##  $ actrolga : num  2 4 2 1 1 1 2 2 1 1 ...
##   ..- attr(*, "label")= chr "Able to take active role in political group"
##   ..- attr(*, "format.spss")= chr "F1.0"
##   ..- attr(*, "labels")= Named num [1:8] 1 2 3 4 5 7 8 9
##   .. ..- attr(*, "names")= chr [1:8] "Not at all able" "A little able" "Quite able" "Very able" ...
##  $ psppipla : num  2 4 2 1 1 1 2 2 2 1 ...
##   ..- attr(*, "label")= chr "Political system allows people to have influence on politics"
##   ..- attr(*, "format.spss")= chr "F1.0"
##   ..- attr(*, "labels")= Named num [1:8] 1 2 3 4 5 7 8 9
##   .. ..- attr(*, "names")= chr [1:8] "Not at all" "Very little" "Some" "A lot" ...
##  $ cptppola : num  2 4 2 NA 1 1 1 2 NA 1 ...
##   ..- attr(*, "label")= chr "Confident in own ability to participate in politics"
##   ..- attr(*, "format.spss")= chr "F1.0"
##   ..- attr(*, "labels")= Named num [1:8] 1 2 3 4 5 7 8 9
##   .. ..- attr(*, "names")= chr [1:8] "Not at all confident" "A little confident" "Quite confident" "Very confident" ...
##  $ trstprl  : num  3 5 3 2 0 0 5 2 2 0 ...
##   ..- attr(*, "label")= chr "Trust in country's parliament"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:14] 0 1 2 3 4 5 6 7 8 9 ...
##   .. ..- attr(*, "names")= chr [1:14] "No trust at all" "1" "2" "3" ...
##  $ trstlgl  : num  2 8 3 2 0 0 4 2 3 0 ...
##   ..- attr(*, "label")= chr "Trust in the legal system"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:14] 0 1 2 3 4 5 6 7 8 9 ...
##   .. ..- attr(*, "names")= chr [1:14] "No trust at all" "1" "2" "3" ...
##  $ trstplc  : num  3 9 3 3 0 0 7 4 7 2 ...
##   ..- attr(*, "label")= chr "Trust in the police"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:14] 0 1 2 3 4 5 6 7 8 9 ...
##   .. ..- attr(*, "names")= chr [1:14] "No trust at all" "1" "2" "3" ...
##  $ trstplt  : num  3 6 3 0 0 0 5 1 2 0 ...
##   ..- attr(*, "label")= chr "Trust in politicians"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:14] 0 1 2 3 4 5 6 7 8 9 ...
##   .. ..- attr(*, "names")= chr [1:14] "No trust at all" "1" "2" "3" ...
##  $ trstprt  : num  3 7 2 0 0 0 3 1 2 0 ...
##   ..- attr(*, "label")= chr "Trust in political parties"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:14] 0 1 2 3 4 5 6 7 8 9 ...
##   .. ..- attr(*, "names")= chr [1:14] "No trust at all" "1" "2" "3" ...
##  $ trstep   : num  4 8 6 3 0 5 8 2 5 0 ...
##   ..- attr(*, "label")= chr "Trust in the European Parliament"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:14] 0 1 2 3 4 5 6 7 8 9 ...
##   .. ..- attr(*, "names")= chr [1:14] "No trust at all" "1" "2" "3" ...
##  $ trstun   : num  4 8 5 3 0 3 8 2 7 0 ...
##   ..- attr(*, "label")= chr "Trust in the United Nations"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:14] 0 1 2 3 4 5 6 7 8 9 ...
##   .. ..- attr(*, "names")= chr [1:14] "No trust at all" "1" "2" "3" ...
##  $ trstsci  : num  6 10 6 3 3 5 6 5 8 9 ...
##   ..- attr(*, "label")= chr "Trust in scientists"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:14] 0 1 2 3 4 5 6 7 8 9 ...
##   .. ..- attr(*, "names")= chr [1:14] "No trust at all" "1" "2" "3" ...
##  $ vote     : num  2 1 1 2 1 2 2 2 1 1 ...
##   ..- attr(*, "label")= chr "Voted last national election"
##   ..- attr(*, "format.spss")= chr "F1.0"
##   ..- attr(*, "labels")= Named num [1:6] 1 2 3 7 8 9
##   .. ..- attr(*, "names")= chr [1:6] "Yes" "No" "Not eligible to vote" "Refusal" ...
##  $ prtvtebg : num  NA 1 NA NA 2 NA NA NA 4 3 ...
##   ..- attr(*, "label")= chr "Party voted for in last national election, Bulgaria"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:17] 1 2 3 4 5 6 7 8 9 10 ...
##   .. ..- attr(*, "names")= chr [1:17] "Grazhdani za evropeĭsko razvitie na Bulgariya (GERB)" "Balgarska sotsialisticheska partiya (BSP)" "Dvizhenie za prava i svobodi (DPS)" "Demokratichna Balgariya" ...
##  $ prtvthch : num  NA NA NA NA NA NA NA NA NA NA ...
##   ..- attr(*, "label")= chr "Party voted for in last national election, Switzerland"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:23] 1 2 3 4 5 6 7 8 9 10 ...
##   .. ..- attr(*, "names")= chr [1:23] "Swiss People's Party" "Social Democratic Party / Socialist Party" "FDP. The Liberals" "Green Party" ...
##  $ prtvtbhr : num  NA NA NA NA NA NA NA NA NA NA ...
##   ..- attr(*, "label")= chr "Party voted for in last national election, Croatia"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:14] 1 2 3 4 5 6 7 8 9 10 ...
##   .. ..- attr(*, "names")= chr [1:14] "HDZ, HSLS" "SDP,  HSS, HSU" "DP, HS, Blok za Hrvatsku, HKS, Hrast" "Most" ...
##  $ prtvtecz : num  NA NA NA NA NA NA NA NA NA NA ...
##   ..- attr(*, "label")= chr "Party voted for in last national election, Czechia"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:14] 1 2 3 4 5 6 7 8 9 10 ...
##   .. ..- attr(*, "names")= chr [1:14] "KSČM" "ČSSD" "TOP 09" "ANO 2011" ...
##  $ prtvthee : num  NA NA NA NA NA NA NA NA NA NA ...
##   ..- attr(*, "label")= chr "Party voted for in last national election, Estonia"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:16] 1 2 3 4 5 6 10 11 14 15 ...
##   .. ..- attr(*, "names")= chr [1:16] "Eesti Reformierakond" "Eesti Keskerakond" "Isamaa Erakond" "Sotsiaaldemokraatlik Erakond" ...
##  $ prtvtefi : num  NA NA NA NA NA NA NA NA NA NA ...
##   ..- attr(*, "label")= chr "Party voted for in last national election, Finland"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:26] 1 2 3 4 5 6 7 8 9 10 ...
##   .. ..- attr(*, "names")= chr [1:26] "The National Coalition Party" "The Swedish People's Party (SPP)" "The Centre Party" "The Seven-Star Movement" ...
##  $ prtvtefr : num  NA NA NA NA NA NA NA NA NA NA ...
##   ..- attr(*, "label")= chr "Party voted for in last national election, France (ballot 1)"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:18] 1 2 3 4 5 6 7 8 9 10 ...
##   .. ..- attr(*, "names")= chr [1:18] "LO (Lutte Ouvrière)" "NPA (Nouveau Parti Anti-Capitaliste)" "PCF (Parti Communiste Français)" "FI (La France Insoumise)" ...
##  $ prtvtdgr : num  NA NA NA NA NA NA NA NA NA NA ...
##   ..- attr(*, "label")= chr "Party voted for in last national election, Greece"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:14] 1 2 3 4 5 6 7 8 9 31 ...
##   .. ..- attr(*, "names")= chr [1:14] "ΝΔ" "ΣΥΡΙΖΑ" "ΚΙΝ.ΑΛ." "ΚΚΕ" ...
##  $ prtvtghu : num  NA NA NA NA NA NA NA NA NA NA ...
##   ..- attr(*, "label")= chr "Party voted for in last national election, Hungary"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:15] 1 2 3 4 5 6 7 8 9 10 ...
##   .. ..- attr(*, "names")= chr [1:15] "DK (Demokratikus Koalíció)" "Együtt2014 Mozgalom" "Fidesz (Fidesz Magyar Polgári Párt)" "Jobbik (Jobbik Magyarországért Mozgalom)" ...
##  $ prtvtdis : num  NA NA NA NA NA NA NA NA NA NA ...
##   ..- attr(*, "label")= chr "Party voted for in last national election, Iceland"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:19] 1 2 3 4 5 6 7 8 9 10 ...
##   .. ..- attr(*, "names")= chr [1:19] "Alþýðufylkinguna" "Bjarta framtíð" "Dögun" "Flokk fólksins" ...
##  $ prtvtdit : num  NA NA NA NA NA NA NA NA NA NA ...
##   ..- attr(*, "label")= chr "Party voted for in last national election, Italy"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:20] 1 2 3 4 5 6 7 8 9 10 ...
##   .. ..- attr(*, "names")= chr [1:20] "Movimento 5 Stelle" "Partido Democratico (PD)" "Lega" "Forza Italia" ...
##  $ prtvclt1 : num  NA NA NA NA NA NA NA NA NA NA ...
##   ..- attr(*, "label")= chr "Party voted for in last national election 1, Lithuania (first vote, party)"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:23] 1 2 3 4 5 6 7 8 9 10 ...
##   .. ..- attr(*, "names")= chr [1:23] "Political Party 'The Way of Courage' (DK)" "Party 'Freedom and Justice' (LT)" "Freedom Party (LP)" "Lithuanian People's Party (LLP)" ...
##  $ prtvclt2 : num  NA NA NA NA NA NA NA NA NA NA ...
##   ..- attr(*, "label")= chr "Party voted for in last national election 2, Lithuania (second vote, party)"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:23] 1 2 3 4 5 6 7 8 9 10 ...
##   .. ..- attr(*, "names")= chr [1:23] "Political Party 'The Way of Courage' (DK)" "Party 'Freedom and Justice' (LT)" "Freedom Party (LP)" "Lithuanian People's Party (LLP)" ...
##  $ prtvclt3 : num  NA NA NA NA NA NA NA NA NA NA ...
##   ..- attr(*, "label")= chr "Party voted for in last national election 3, Lithuania (third vote, party)"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:23] 1 2 3 4 5 6 7 8 9 10 ...
##   .. ..- attr(*, "names")= chr [1:23] "Political Party 'The Way of Courage' (DK)" "Party 'Freedom and Justice' (LT)" "Freedom Party (LP)" "Lithuanian People's Party (LLP)" ...
##  $ prtvtame : num  NA NA NA NA NA NA NA NA NA NA ...
##   ..- attr(*, "label")= chr "Party voted for in last national election, Montenegro"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:15] 1 2 3 4 5 6 7 8 9 10 ...
##   .. ..- attr(*, "names")= chr [1:15] "Socijaldemokrate - SD" "Bošnjačka stranka - BS" "Hrvatska građanska inicijativa - HGI" "Socijaldemokratska partija - SDP" ...
##  $ prtvthnl : num  NA NA NA NA NA NA NA NA NA NA ...
##   ..- attr(*, "label")= chr "Party voted for in last national election, Netherlands"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:23] 1 2 3 4 5 6 7 8 9 10 ...
##   .. ..- attr(*, "names")= chr [1:23] "People's Party for Freedom and Democracy" "Labour Party" "Party for Freedom" "Socialist Party" ...
##  $ prtvtmk  : num  NA NA NA NA NA NA NA NA NA NA ...
##   ..- attr(*, "label")= chr "Party voted for in last national election, North Macedonia"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:20] 1 2 3 4 5 6 7 8 9 10 ...
##   .. ..- attr(*, "names")= chr [1:20] "Socijaldemokratski sojuz na Makedonija (SDSM) i Koalicija „Možeme“" "Vnatrešna makedonska revolucionerna organizacija - Demokratska partija za makedonsko nacionalno edinstvo (VMRO-DPMNE) i" "Demokratska unija za integracija (DUI)" "Alijansa za Albancite i Alternativa" ...
##  $ prtvtbno : num  NA NA NA NA NA NA NA NA NA NA ...
##   ..- attr(*, "label")= chr "Party voted for in last national election, Norway"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:15] 1 2 3 4 5 6 7 8 9 10 ...
##   .. ..- attr(*, "names")= chr [1:15] "Rødt" "Sosialistisk Venstreparti" "Arbeiderpartiet" "Venstre" ...
##  $ prtvtdpt : num  NA NA NA NA NA NA NA NA NA NA ...
##   ..- attr(*, "label")= chr "Party voted for in last national election, Portugal"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:27] 1 2 3 4 5 6 7 8 9 10 ...
##   .. ..- attr(*, "names")= chr [1:27] "A - Aliança" "B.E. - Bloco de Esquerda" "CDS-PP - CDS-Partido Popular" "CHEGA" ...
##  $ prtvtfsi : num  NA NA NA NA NA NA NA NA NA NA ...
##   ..- attr(*, "label")= chr "Party voted for in last national election, Slovenia"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:16] 1 2 3 4 5 6 7 8 9 10 ...
##   .. ..- attr(*, "names")= chr [1:16] "DESUS - Demokraticna stranka upokojencev Slovenije" "L - Levica" "LMŠ - Lista Marjana Šarca" "NSI - Nova Slovenija – Kršcanski demokrati" ...
##  $ prtvtesk : num  NA NA NA NA NA NA NA NA NA NA ...
##   ..- attr(*, "label")= chr "Party voted for in last national election, Slovakia"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:12] 1 2 3 4 5 6 7 8 66 77 ...
##   .. ..- attr(*, "names")= chr [1:12] "Obyčajní Ľudia a nezávislé osobnosti" "Smer – SD" "SME Rodina" "ĽS Naše Slovensko" ...
##  $ contplt  : num  2 1 2 2 1 2 2 2 2 2 ...
##   ..- attr(*, "label")= chr "Contacted politician or government official last 12 months"
##   ..- attr(*, "format.spss")= chr "F1.0"
##   ..- attr(*, "labels")= Named num [1:5] 1 2 7 8 9
##   .. ..- attr(*, "names")= chr [1:5] "Yes" "No" "Refusal" "Don't know" ...
##  $ donprty  : num  2 2 2 2 2 2 2 2 2 2 ...
##   ..- attr(*, "label")= chr "Donated to or participated in political party or pressure group last 12 months"
##   ..- attr(*, "format.spss")= chr "F1.0"
##   ..- attr(*, "labels")= Named num [1:5] 1 2 7 8 9
##   .. ..- attr(*, "names")= chr [1:5] "Yes" "No" "Refusal" "Don't know" ...
##  $ badge    : num  2 2 2 2 1 2 2 2 2 2 ...
##   ..- attr(*, "label")= chr "Worn or displayed campaign badge/sticker last 12 months"
##   ..- attr(*, "format.spss")= chr "F1.0"
##   ..- attr(*, "labels")= Named num [1:5] 1 2 7 8 9
##   .. ..- attr(*, "names")= chr [1:5] "Yes" "No" "Refusal" "Don't know" ...
##  $ sgnptit  : num  2 2 2 2 2 2 2 2 2 2 ...
##   ..- attr(*, "label")= chr "Signed petition last 12 months"
##   ..- attr(*, "format.spss")= chr "F1.0"
##   ..- attr(*, "labels")= Named num [1:5] 1 2 7 8 9
##   .. ..- attr(*, "names")= chr [1:5] "Yes" "No" "Refusal" "Don't know" ...
##  $ pbldmna  : num  2 2 2 2 1 2 2 2 2 2 ...
##   ..- attr(*, "label")= chr "Taken part in public demonstration last 12 months"
##   ..- attr(*, "format.spss")= chr "F1.0"
##   ..- attr(*, "labels")= Named num [1:5] 1 2 7 8 9
##   .. ..- attr(*, "names")= chr [1:5] "Yes" "No" "Refusal" "Don't know" ...
##  $ bctprd   : num  2 1 2 2 1 2 2 2 2 2 ...
##   ..- attr(*, "label")= chr "Boycotted certain products last 12 months"
##   ..- attr(*, "format.spss")= chr "F1.0"
##   ..- attr(*, "labels")= Named num [1:5] 1 2 7 8 9
##   .. ..- attr(*, "names")= chr [1:5] "Yes" "No" "Refusal" "Don't know" ...
##  $ pstplonl : num  2 2 NA NA 1 2 2 2 2 2 ...
##   ..- attr(*, "label")= chr "Posted or shared anything about politics online last 12 months"
##   ..- attr(*, "format.spss")= chr "F1.0"
##   ..- attr(*, "labels")= Named num [1:5] 1 2 7 8 9
##   .. ..- attr(*, "names")= chr [1:5] "Yes" "No" "Refusal" "Don't know" ...
##  $ volunfp  : num  2 2 2 2 2 2 2 2 2 2 ...
##   ..- attr(*, "label")= chr "Volunteered for not-for-profit or charitable organisation"
##   ..- attr(*, "format.spss")= chr "F1.0"
##   ..- attr(*, "labels")= Named num [1:5] 1 2 7 8 9
##   .. ..- attr(*, "names")= chr [1:5] "Yes" "No" "Refusal" "Don't know" ...
##  $ clsprty  : num  2 1 1 2 1 2 2 1 2 1 ...
##   ..- attr(*, "label")= chr "Feel closer to a particular party than all other parties"
##   ..- attr(*, "format.spss")= chr "F1.0"
##   ..- attr(*, "labels")= Named num [1:5] 1 2 7 8 9
##   .. ..- attr(*, "names")= chr [1:5] "Yes" "No" "Refusal" "Don't know" ...
##  $ prtclebg : num  NA 1 NA NA 9 NA NA 2 NA 3 ...
##   ..- attr(*, "label")= chr "Which party feel closer to, Bulgaria"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:16] 1 2 3 4 5 6 7 8 9 10 ...
##   .. ..- attr(*, "names")= chr [1:16] "Grazhdani za evropeĭsko razvitie na Bulgariya (GERB)" "Balgarska sotsialisticheska partiya (BSP)" "Dvizhenie za prava i svobodi (DPS)" "Demokratichna Balgariya" ...
##  $ prtclhch : num  NA NA NA NA NA NA NA NA NA NA ...
##   ..- attr(*, "label")= chr "Which party feel closer to, Switzerland"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:20] 1 2 3 4 5 6 7 8 9 10 ...
##   .. ..- attr(*, "names")= chr [1:20] "Swiss People's Party" "Social Democratic Party / Socialist Party" "FDP. The Liberals" "Green Party" ...
##  $ prtclbhr : num  NA NA NA NA NA NA NA NA NA NA ...
##   ..- attr(*, "label")= chr "Which party feel closer to, Croatia"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:24] 1 2 3 4 5 6 7 8 9 10 ...
##   .. ..- attr(*, "names")= chr [1:24] "Centar" "Domovinski pokret" "Fokus" "GLAS - Građansko-liberalni savez" ...
##  $ prtclecz : num  NA NA NA NA NA NA NA NA NA NA ...
##   ..- attr(*, "label")= chr "Which party feel closer to, Czechia"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:14] 1 2 3 4 5 6 7 8 9 10 ...
##   .. ..- attr(*, "names")= chr [1:14] "KSČM" "ČSSD" "TOP 09" "ANO 2011" ...
##  $ prtclhee : num  NA NA NA NA NA NA NA NA NA NA ...
##   ..- attr(*, "label")= chr "Which party feel closer to, Estonia"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:16] 1 2 3 4 5 6 10 11 14 15 ...
##   .. ..- attr(*, "names")= chr [1:16] "Eesti Reformierakond" "Eesti Keskerakond" "Isamaa Erakond" "Sotsiaaldemokraatlik Erakond" ...
##  $ prtclffi : num  NA NA NA NA NA NA NA NA NA NA ...
##   ..- attr(*, "label")= chr "Which party feel closer to, Finland"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:28] 1 2 3 4 5 6 7 8 9 10 ...
##   .. ..- attr(*, "names")= chr [1:28] "The National Coalition Party" "The Swedish People's Party (SPP)" "The Centre Party" "The Seven-Star Movement" ...
##  $ prtclffr : num  NA NA NA NA NA NA NA NA NA NA ...
##   ..- attr(*, "label")= chr "Which party feel closer to, France"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:16] 1 2 3 4 5 6 7 8 9 10 ...
##   .. ..- attr(*, "names")= chr [1:16] "LO (Lutte Ouvrière)" "NPA (Nouveau Parti Anti-Capitaliste)" "PCF (Parti Communiste Français)" "FI (La France Insoumise)" ...
##  $ prtcldgr : num  NA NA NA NA NA NA NA NA NA NA ...
##   ..- attr(*, "label")= chr "Which party feel closer to, Greece"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:14] 1 2 3 4 5 6 7 8 9 31 ...
##   .. ..- attr(*, "names")= chr [1:14] "ΝΔ" "ΣΥΡΙΖΑ" "ΚΙΝ.ΑΛ." "ΚΚΕ" ...
##  $ prtclhhu : num  NA NA NA NA NA NA NA NA NA NA ...
##   ..- attr(*, "label")= chr "Which party feel closer to, Hungary"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:16] 1 2 3 4 5 6 7 8 9 10 ...
##   .. ..- attr(*, "names")= chr [1:16] "DK (Demokratikus Koalíció)" "Párbeszéd (Párbeszéd Magyarországért Párt)" "Fidesz (Fidesz Magyar Polgári Párt)" "Jobbik (Jobbik Magyarországért Mozgalom)" ...
##  $ prtcldis : num  NA NA NA NA NA NA NA NA NA NA ...
##   ..- attr(*, "label")= chr "Which party feel closer to, Iceland"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:19] 1 2 3 4 5 6 7 8 9 10 ...
##   .. ..- attr(*, "names")= chr [1:19] "Alþýðufylkinguna" "Bjarta framtíð" "Dögun" "Flokk fólksins" ...
##  $ prtcleit : num  NA NA NA NA NA NA NA NA NA NA ...
##   ..- attr(*, "label")= chr "Which party feel closer to, Italy"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:27] 1 2 3 4 5 6 7 8 9 10 ...
##   .. ..- attr(*, "names")= chr [1:27] "Movimento 5 Stelle" "Partido Democratico" "Lega" "Forza Italia" ...
##  $ prtclclt : num  NA NA NA NA NA NA NA NA NA NA ...
##   ..- attr(*, "label")= chr "Which party feel closer to, Lithuania"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:22] 1 2 3 4 5 6 7 8 9 10 ...
##   .. ..- attr(*, "names")= chr [1:22] "Homeland Union - Lithuanian Christian Democrats (TS-LKD)" "Lithuanian Peasant and Greens Union (LVZS)" "Labour Party (DP)" "Lithuanian Social Democratic Party (LSDP)" ...
##  $ prtclame : num  NA NA NA NA NA NA NA NA NA NA ...
##   ..- attr(*, "label")= chr "Which party feel closer to, Montenegro"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:22] 1 2 3 4 5 6 7 8 9 10 ...
##   .. ..- attr(*, "names")= chr [1:22] "Demokratska partija socijalista (DPS)" "Socijaldemokratska partija (SDP)" "Socijaldemokrate Crne Gore (SD)" "Socijalistička narodna partija (SNP)" ...
##  $ prtclgnl : num  NA NA NA NA NA NA NA NA NA NA ...
##   ..- attr(*, "label")= chr "Which party feel closer to, Netherlands"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:24] 1 2 3 4 5 6 7 8 9 10 ...
##   .. ..- attr(*, "names")= chr [1:24] "People's Party for Freedom and Democracy" "Labour Party" "Party for Freedom" "Socialist Party" ...
##  $ prtclmk  : num  NA NA NA NA NA NA NA NA NA NA ...
##   ..- attr(*, "label")= chr "Which party feel closer to, North Macedonia"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:29] 1 2 3 4 5 6 7 8 9 10 ...
##   .. ..- attr(*, "names")= chr [1:29] "Vnatrešna makedonska revolucionerna organizacija - Demokratska partija za makedonsko nacionalno edinstvo (VMRO-DPMNE)" "Socijaldemokratski sojuz na Makedonija (SDSM)" "Demokratska unija za integracija (DUI)" "Alijansa za Albancite (AA)" ...
##  $ prtclbno : num  NA NA NA NA NA NA NA NA NA NA ...
##   ..- attr(*, "label")= chr "Which party feel closer to, Norway"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:15] 1 2 3 4 5 6 7 8 9 10 ...
##   .. ..- attr(*, "names")= chr [1:15] "Rødt" "Sosialistisk Venstreparti" "Arbeiderpartiet" "Venstre" ...
##  $ prtclfpt : num  NA NA NA NA NA NA NA NA NA NA ...
##   ..- attr(*, "label")= chr "Which party feel closer to, Portugal"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:30] 1 2 3 4 5 6 7 8 9 10 ...
##   .. ..- attr(*, "names")= chr [1:30] "A - Aliança" "B.E. - Bloco de Esquerda" "CDS-PP - CDS-Partido Popular" "CHEGA" ...
##  $ prtclfsi : num  NA NA NA NA NA NA NA NA NA NA ...
##   ..- attr(*, "label")= chr "Which party feel closer to, Slovenia"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:16] 1 2 3 4 5 6 7 8 9 10 ...
##   .. ..- attr(*, "names")= chr [1:16] "DESUS - Demokraticna stranka upokojencev Slovenije" "L - Levica" "LMŠ - Lista Marjana Šarca" "NSI - Nova Slovenija – Kršcanski demokrati" ...
##  $ prtclesk : num  NA NA NA NA NA NA NA NA NA NA ...
##   ..- attr(*, "label")= chr "Which party feel closer to, Slovakia"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:12] 1 2 3 4 5 6 7 8 66 77 ...
##   .. ..- attr(*, "names")= chr [1:12] "Obyčajní Ľudia a nezávislé osobnosti" "Smer – SD" "SME Rodina" "ĽS Naše Slovensko" ...
##  $ prtdgcl  : num  NA 1 NA NA 1 NA NA 2 NA 2 ...
##   ..- attr(*, "label")= chr "How close to party"
##   ..- attr(*, "format.spss")= chr "F1.0"
##   ..- attr(*, "labels")= Named num [1:8] 1 2 3 4 6 7 8 9
##   .. ..- attr(*, "names")= chr [1:8] "Very close" "Quite close" "Not close" "Not at all close" ...
##  $ lrscale  : num  NA 10 4 NA 4 5 5 1 6 7 ...
##   ..- attr(*, "label")= chr "Placement on left right scale"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:14] 0 1 2 3 4 5 6 7 8 9 ...
##   .. ..- attr(*, "names")= chr [1:14] "Left" "1" "2" "3" ...
##  $ stflife  : num  8 10 5 4 3 5 6 5 5 5 ...
##   ..- attr(*, "label")= chr "How satisfied with life as a whole"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:14] 0 1 2 3 4 5 6 7 8 9 ...
##   .. ..- attr(*, "names")= chr [1:14] "Extremely dissatisfied" "1" "2" "3" ...
##  $ stfeco   : num  3 3 5 2 2 0 4 2 2 0 ...
##   ..- attr(*, "label")= chr "How satisfied with present state of economy in country"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:14] 0 1 2 3 4 5 6 7 8 9 ...
##   .. ..- attr(*, "names")= chr [1:14] "Extremely dissatisfied" "1" "2" "3" ...
##  $ stfgov   : num  4 3 2 2 9 0 3 1 2 0 ...
##   ..- attr(*, "label")= chr "How satisfied with the national government"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:14] 0 1 2 3 4 5 6 7 8 9 ...
##   .. ..- attr(*, "names")= chr [1:14] "Extremely dissatisfied" "1" "2" "3" ...
##  $ stfdem   : num  4 8 2 0 3 0 6 1 5 0 ...
##   ..- attr(*, "label")= chr "How satisfied with the way democracy works in country"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:14] 0 1 2 3 4 5 6 7 8 9 ...
##   .. ..- attr(*, "names")= chr [1:14] "Extremely dissatisfied" "1" "2" "3" ...
##  $ stfedu   : num  3 8 3 1 2 5 6 5 6 NA ...
##   ..- attr(*, "label")= chr "State of education in country nowadays"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:14] 0 1 2 3 4 5 6 7 8 9 ...
##   .. ..- attr(*, "names")= chr [1:14] "Extremely bad" "1" "2" "3" ...
##  $ stfhlth  : num  4 3 2 1 1 3 7 3 7 3 ...
##   ..- attr(*, "label")= chr "State of health services in country nowadays"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:14] 0 1 2 3 4 5 6 7 8 9 ...
##   .. ..- attr(*, "names")= chr [1:14] "Extremely bad" "1" "2" "3" ...
##  $ gincdif  : num  4 5 4 2 1 1 1 2 1 1 ...
##   ..- attr(*, "label")= chr "Government should reduce differences in income levels"
##   ..- attr(*, "format.spss")= chr "F1.0"
##   ..- attr(*, "labels")= Named num [1:8] 1 2 3 4 5 7 8 9
##   .. ..- attr(*, "names")= chr [1:8] "Agree strongly" "Agree" "Neither agree nor disagree" "Disagree" ...
##  $ freehms  : num  3 3 4 3 4 5 2 3 2 5 ...
##   ..- attr(*, "label")= chr "Gays and lesbians free to live life as they wish"
##   ..- attr(*, "format.spss")= chr "F1.0"
##   ..- attr(*, "labels")= Named num [1:8] 1 2 3 4 5 7 8 9
##   .. ..- attr(*, "names")= chr [1:8] "Agree strongly" "Agree" "Neither agree nor disagree" "Disagree" ...
##  $ hmsfmlsh : num  3 2 3 2 2 1 4 3 NA 1 ...
##   ..- attr(*, "label")= chr "Ashamed if close family member gay or lesbian"
##   ..- attr(*, "format.spss")= chr "F1.0"
##   ..- attr(*, "labels")= Named num [1:8] 1 2 3 4 5 7 8 9
##   .. ..- attr(*, "names")= chr [1:8] "Agree strongly" "Agree" "Neither agree nor disagree" "Disagree" ...
##  $ hmsacld  : num  3 4 3 4 5 5 4 3 5 5 ...
##   ..- attr(*, "label")= chr "Gay and lesbian couples right to adopt children"
##   ..- attr(*, "format.spss")= chr "F1.0"
##   ..- attr(*, "labels")= Named num [1:8] 1 2 3 4 5 7 8 9
##   .. ..- attr(*, "names")= chr [1:8] "Agree strongly" "Agree" "Neither agree nor disagree" "Disagree" ...
##  $ euftf    : num  2 10 6 4 8 5 7 4 4 0 ...
##   ..- attr(*, "label")= chr "European Union: European unification go further or gone too far"
##   ..- attr(*, "format.spss")= chr "F2.0"
##   ..- attr(*, "labels")= Named num [1:14] 0 1 2 3 4 5 6 7 8 9 ...
##   .. ..- attr(*, "names")= chr [1:14] "Unification already gone too far" "1" "2" "3" ...
##  $ lrnobed  : num  3 4 1 3 3 1 2 1 1 1 ...
##   ..- attr(*, "label")= chr "Obedience and respect for authority most important virtues children should learn"
##   ..- attr(*, "format.spss")= chr "F1.0"
##   ..- attr(*, "labels")= Named num [1:8] 1 2 3 4 5 7 8 9
##   .. ..- attr(*, "names")= chr [1:8] "Agree strongly" "Agree" "Neither agree nor disagree" "Disagree" ...
##  $ loylead  : num  3 2 2 3 5 2 3 2 2 1 ...
##   ..- attr(*, "label")= chr "Country needs most loyalty towards its leaders"
##   ..- attr(*, "format.spss")= chr "F1.0"
##   ..- attr(*, "labels")= Named num [1:8] 1 2 3 4 5 7 8 9
##   .. ..- attr(*, "names")= chr [1:8] "Agree strongly" "Agree" "Neither agree nor disagree" "Disagree" ...
##  $ imsmetn  : num  3 1 1 1 1 1 2 2 1 4 ...
##   ..- attr(*, "label")= chr "Allow many/few immigrants of same race/ethnic group as majority"
##   ..- attr(*, "format.spss")= chr "F1.0"
##   ..- attr(*, "labels")= Named num [1:7] 1 2 3 4 7 8 9
##   .. ..- attr(*, "names")= chr [1:7] "Allow many to come and live here" "Allow some" "Allow a few" "Allow none" ...
##  $ imdfetn  : num  3 2 3 3 3 2 3 2 2 4 ...
##   ..- attr(*, "label")= chr "Allow many/few immigrants of different race/ethnic group from majority"
##   ..- attr(*, "format.spss")= chr "F1.0"
##   ..- attr(*, "labels")= Named num [1:7] 1 2 3 4 7 8 9
##   .. ..- attr(*, "names")= chr [1:7] "Allow many to come and live here" "Allow some" "Allow a few" "Allow none" ...
##   [list output truncated]

View(head(df))

# To make the data easier to work with, we create a separate dataset containing only the Greek respondents

attributes(df$cntry)

## $label
## [1] "Country"
## 
## $format.spss
## [1] "A2"
## 
## $labels
##            Albania            Austria            Belgium           Bulgaria 
##               "AL"               "AT"               "BE"               "BG" 
##        Switzerland             Cyprus            Czechia            Germany 
##               "CH"               "CY"               "CZ"               "DE" 
##            Denmark            Estonia              Spain            Finland 
##               "DK"               "EE"               "ES"               "FI" 
##             France     United Kingdom            Georgia             Greece 
##               "FR"               "GB"               "GE"               "GR" 
##            Croatia            Hungary            Ireland            Iceland 
##               "HR"               "HU"               "IE"               "IS" 
##             Israel              Italy          Lithuania         Luxembourg 
##               "IL"               "IT"               "LT"               "LU" 
##             Latvia         Montenegro    North Macedonia        Netherlands 
##               "LV"               "ME"               "MK"               "NL" 
##             Norway             Poland           Portugal            Romania 
##               "NO"               "PL"               "PT"               "RO" 
##             Serbia Russian Federation             Sweden           Slovenia 
##               "RS"               "RU"               "SE"               "SI" 
##           Slovakia             Turkey            Ukraine             Kosovo 
##               "SK"               "TR"               "UA"               "XK"

greece <- filter(df, cntry=="GR")
dim(greece)

## [1] 2799  586

Let’s choose 3 continuous variables and 1 categorical variable

continuous

happy - How happy are you. C1 Taking all things together, how happy would you say you are?

stfgov - How satisfied with the national government

trstlgl - Trust in the legal system B6-12a Using this card, please tell me on a score of 0-10 how much you personally trust each of the institutions I read out. 0 means you do not trust an institution at all, and 10 means you have complete trust. Firstly… …the legal system?

categorical

psppipla - Political system allows people to have influence on politics. And how much would you say that the political system in [country] allows people like you to have an influence on politics?

greece <- select(greece, c("stfgov", "happy", "trstlgl", "psppipla"))

skim(greece)

Data summary
Name	greece
Number of rows	2799
Number of columns	4
_______________________
Column type frequency:
numeric	4
________________________
Group variables	None

Variable type: numeric

skim_variable	n_missing	complete_rate	mean	sd	p0	p25	p50	p75	p100	hist
stfgov	21	0.99	4.12	2.27	0	2	4	6	10	▇▇▇▅▁
happy	5	1.00	6.58	1.54	0	6	7	8	10	▁▁▅▇▁
trstlgl	13	1.00	6.43	2.26	0	5	7	8	10	▂▃▆▇▅
psppipla	78	0.97	1.90	0.96	1	1	2	3	5	▇▅▅▁▁

summary(greece)

##      stfgov          happy          trstlgl         psppipla  
##  Min.   : 0.00   Min.   : 0.00   Min.   : 0.00   Min.   :1.0  
##  1st Qu.: 2.00   1st Qu.: 6.00   1st Qu.: 5.00   1st Qu.:1.0  
##  Median : 4.00   Median : 7.00   Median : 7.00   Median :2.0  
##  Mean   : 4.12   Mean   : 6.58   Mean   : 6.43   Mean   :1.9  
##  3rd Qu.: 6.00   3rd Qu.: 8.00   3rd Qu.: 8.00   3rd Qu.:3.0  
##  Max.   :10.00   Max.   :10.00   Max.   :10.00   Max.   :5.0  
##  NA's   :21      NA's   :5       NA's   :13      NA's   :78

Using the summary function, we see that missing values are coded correctly as NA, so we can remove the incomplete rows from the dataset so that they do not distort the results. Listwise deletion leaves 2685 of the 2799 observations.

greece <- greece[complete.cases(greece),]
summary(greece)

##      stfgov          happy          trstlgl         psppipla  
##  Min.   : 0.00   Min.   : 0.00   Min.   : 0.00   Min.   :1.0  
##  1st Qu.: 2.00   1st Qu.: 6.00   1st Qu.: 5.00   1st Qu.:1.0  
##  Median : 4.00   Median : 7.00   Median : 7.00   Median :2.0  
##  Mean   : 4.11   Mean   : 6.59   Mean   : 6.43   Mean   :1.9  
##  3rd Qu.: 6.00   3rd Qu.: 8.00   3rd Qu.: 8.00   3rd Qu.:3.0  
##  Max.   :10.00   Max.   :10.00   Max.   :10.00   Max.   :5.0
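
The listwise deletion used above can be illustrated on a small made-up data frame. The values below are hypothetical and are not taken from ESS; this is only a sketch of what `complete.cases()` does.

```r
# Toy data frame with hypothetical values (NOT ESS data)
toy <- data.frame(stfgov = c(4, NA, 6, 2),
                  happy  = c(7, 8, NA, 5))

# Count missing values per column
n_missing <- colSums(is.na(toy))

# Keep only the rows without any NA (listwise deletion)
toy_complete <- toy[complete.cases(toy), ]

print(n_missing)
print(nrow(toy_complete))
```

A row is dropped as soon as one of its variables is missing, which is why the final sample can be smaller than the count of complete answers for any single variable.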

EDA

descriptive statistics

continuous variables

We get general information about variables using the describe function

greece %>% 
  dplyr::select(-4) %>% 
  describe()

##         vars    n mean   sd median trimmed  mad min max range  skew kurtosis
## stfgov     1 2685 4.11 2.28      4    4.08 2.97   0  10    10  0.10    -0.75
## happy      2 2685 6.59 1.53      7    6.70 1.48   0  10    10 -0.73     0.96
## trstlgl    3 2685 6.43 2.26      7    6.65 1.48   0  10    10 -0.72    -0.13
##           se
## stfgov  0.04
## happy   0.03
## trstlgl 0.04

greece %>% 
  dplyr::select(-4) %>% 
  sjmisc::descr(show = c('n', "mean","sd", "md", "range")) %>% 
  rename("variable" = "var",
         "Number of obs." = "n",
         "Mean" = "mean",
         "SD" = "sd",
         "Median" = "md",
         "Range" = "range")

## 
## ## Basic descriptive statistics
## 
##  variable Number of obs. Mean   SD Median     Range
##    stfgov           2685 4.11 2.28      4 10 (0-10)
##     happy           2685 6.59 1.53      7 10 (0-10)
##   trstlgl           2685 6.43 2.26      7 10 (0-10)

greece %>% 
  pivot_longer(c(stfgov, happy, trstlgl),
               names_to = 'Var', values_to = 'Score') %>% 
  ggplot(aes(y=Score)) + 
  geom_boxplot() +
  ggtitle("Distribution of scores") +
  xlab("Variable") + 
  ylab("Score") +
  theme_bw()+
  theme(legend.position="none") +
  facet_wrap(~Var)

The boxplots show several outliers at the low end of happy and trstlgl; stfgov has none.
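
Which scores a boxplot draws as outliers follows from the usual 1.5 * IQR rule. A minimal sketch, assuming the quartiles printed by `summary(greece)` after removing missing values (the variable names `q1`, `q3` and the fence names are introduced here for illustration):

```r
# Quartiles as reported by summary(greece) after listwise deletion
q1 <- c(stfgov = 2, happy = 6, trstlgl = 5)
q3 <- c(stfgov = 6, happy = 8, trstlgl = 8)

iqr <- q3 - q1
lower_fence <- q1 - 1.5 * iqr
upper_fence <- q3 + 1.5 * iqr

# Scores outside [lower_fence, upper_fence] are drawn as separate points
print(rbind(lower_fence, upper_fence))
```

Because all three scales run from 0 to 10, only scores below the lower fence can show up as outliers here: happy scores below 3 and trstlgl scores of 0.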

greece %>% 
  pivot_longer(c(stfgov, happy, trstlgl),
               names_to = 'Var', values_to = 'Score') %>% 
  ggplot(aes(x=Score, fill=Var)) + 
  geom_histogram(aes(y=after_stat(density)), bins = 10) +
  geom_density(alpha = .5, color="blue")+
  ggtitle("Distribution of scores") +
  xlab("Score") + 
  ylab("Density") +
  theme_bw()+
  theme(legend.position="none") +
  facet_wrap(~Var)

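As the histograms show, satisfaction with government (stfgov) and happiness (happy) are roughly bell-shaped, although happy is skewed to the left (skew = -0.73). Trust in the legal system (trstlgl) is not normally distributed: it is also left-skewed (skew = -0.72), with most answers between 5 and 8.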

categorical

Let’s look at the categorical variable

table(greece$psppipla)

## 
##    1    2    3    4    5 
## 1205  710  614  146   10

#Category 5 ('A great deal') contains only 10 observations. This may affect the estimation of the coefficients. Therefore, let’s combine categories 4 and 5

greece$psppipla <- car::recode(greece$psppipla, "1 = 1;
                                      2 = 2;
                                      3 = 3;
                                      4 = 4;
                                      5 = 4")

greece %>% 
  group_by(psppipla) %>% 
  count()

## # A tibble: 4 × 2
## # Groups:   psppipla [4]
##   psppipla     n
##      <dbl> <int>
## 1        1  1205
## 2        2   710
## 3        3   614
## 4        4   156
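
The same merge of a sparse category can be sketched in base R. The vector below is rebuilt from the frequencies printed by `table(greece$psppipla)`; the names `psppipla`, `collapsed` and `counts` are used only for this illustration.

```r
# Frequencies of psppipla as printed by table(greece$psppipla) above
psppipla <- rep(1:5, times = c(1205, 710, 614, 146, 10))

# Merge the sparse category 5 into category 4
collapsed <- ifelse(psppipla == 5, 4, psppipla)

counts <- table(collapsed)
print(counts)
```

The merged category holds 146 + 10 = 156 observations, which agrees with the tibble above.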

greece$psppipla = factor(greece$psppipla)

greece %>% 
  ggplot(aes(x = psppipla)) +
  geom_bar(fill = "lightblue", color = "blue") +
  xlab("Category") +
  ylab("Frequency") +
  theme_bw()

scatter plot

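# Note: par(mfrow = ...) only affects base R graphics, so each ggplot below is drawn as a separate figure
par(mfrow = c(1, 3))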

greece %>% 
  ggplot(aes(x=stfgov, y=happy)) +
  geom_point(size=2) +
  geom_smooth(method=lm)

## `geom_smooth()` using formula = 'y ~ x'

greece %>% 
  ggplot(aes(x=stfgov, y=trstlgl)) +
  geom_point(size=2) +
  geom_smooth(method=lm)

## `geom_smooth()` using formula = 'y ~ x'

greece %>% 
  ggplot(aes(x=trstlgl, y=happy)) +
  geom_point(size=2) +
  geom_smooth(method=lm)

## `geom_smooth()` using formula = 'y ~ x'

There is a positive correlation between happy and satisfaction with government (stfgov). There is a positive correlation between trust in the legal system (trstlgl) and satisfaction with government. There is also a positive correlation between happy and trust in the legal system.

Correlations

chart.Correlation(greece[,c('stfgov', 'happy', 'trstlgl')],
                  histogram = TRUE) # by default Pearson

## Warning in par(usr): argument 1 does not name a graphical parameter

## Warning in par(usr): argument 1 does not name a graphical parameter

## Warning in par(usr): argument 1 does not name a graphical parameter

chart.Correlation(greece[,c('stfgov', 'happy', 'trstlgl')],
                  histogram = TRUE,
                  method = "spearman") # Spearman's method

## Warning in cor.test.default(as.numeric(x), as.numeric(y), method = method):
## Cannot compute exact p-value with ties

## Warning in cor.test.default(as.numeric(x), as.numeric(y), method = method):
## argument 1 does not name a graphical parameter

## Warning in cor.test.default(as.numeric(x), as.numeric(y), method = method):
## Cannot compute exact p-value with ties

## Warning in par(usr): argument 1 does not name a graphical parameter

## Warning in cor.test.default(as.numeric(x), as.numeric(y), method = method):
## Cannot compute exact p-value with ties

## Warning in par(usr): argument 1 does not name a graphical parameter

chart.Correlation(greece[,c('stfgov', 'happy', 'trstlgl')],
                  histogram = TRUE,
                  method = "kendall") # Kendall's method

## Warning in par(usr): argument 1 does not name a graphical parameter

## Warning in par(usr): argument 1 does not name a graphical parameter

## Warning in par(usr): argument 1 does not name a graphical parameter

heatmap

heatmaply_cor(
  cor(greece[,c('stfgov', 'happy', 'trstlgl')], method = "spearman"),
  Colv=NA, Rowv=NA)

Correlation matrix

cor.test(greece$happy, greece$stfgov)

## 
##  Pearson's product-moment correlation
## 
## data:  greece$happy and greece$stfgov
## t = 12, df = 2683, p-value <2e-16
## alternative hypothesis: true correlation is not equal to 0
## 95 percent confidence interval:
##  0.195 0.267
## sample estimates:
##   cor 
## 0.231

cor.test(greece$happy, greece$stfgov,method="spearman")

## Warning in cor.test.default(greece$happy, greece$stfgov, method = "spearman"):
## Cannot compute exact p-value with ties

## 
##  Spearman's rank correlation rho
## 
## data:  greece$happy and greece$stfgov
## S = 3e+09, p-value <2e-16
## alternative hypothesis: true rho is not equal to 0
## sample estimates:
##  rho 
## 0.21

cor_matrix <- cor(greece[,c('stfgov', 'happy', 'trstlgl')], method = "spearman")

stargazer(cor_matrix, title="Correlation Matrix", type = "latex")

## 
## % Table created by stargazer v.5.2.3 by Marek Hlavac, Social Policy Institute. E-mail: marek.hlavac at gmail.com
## % Date and time: Fri, May 12, 2023 - 14:51:22
## \begin{table}[!htbp] \centering 
##   \caption{Correlation Matrix} 
##   \label{} 
## \begin{tabular}{@{\extracolsep{5pt}} cccc} 
## \\[-1.8ex]\hline 
## \hline \\[-1.8ex] 
##  & stfgov & happy & trstlgl \\ 
## \hline \\[-1.8ex] 
## stfgov & $1$ & $0.210$ & $0.313$ \\ 
## happy & $0.210$ & $1$ & $0.324$ \\ 
## trstlgl & $0.313$ & $0.324$ & $1$ \\ 
## \hline \\[-1.8ex] 
## \end{tabular} 
## \end{table}

sjPlot::tab_corr(greece[,c('stfgov', 'happy', 'trstlgl')],
                 corr.method = "spearman")

	stfgov	happy	trstlgl
stfgov		0.210***	0.313***
happy	0.210***		0.324***
trstlgl	0.313***	0.324***
Computed correlation used spearman-method with listwise-deletion.

cor.test(greece$happy, greece$stfgov,method="kendall")

## 
##  Kendall's rank correlation tau
## 
## data:  greece$happy and greece$stfgov
## z = 11, p-value <2e-16
## alternative hypothesis: true tau is not equal to 0
## sample estimates:
##   tau 
## 0.168

cor_matrix <- cor(greece[,c('stfgov', 'happy', 'trstlgl')], method = "kendall")

stargazer(cor_matrix, title="Correlation Matrix", type = "latex")

## 
## % Table created by stargazer v.5.2.3 by Marek Hlavac, Social Policy Institute. E-mail: marek.hlavac at gmail.com
## % Date and time: Fri, May 12, 2023 - 14:51:23
## \begin{table}[!htbp] \centering 
##   \caption{Correlation Matrix} 
##   \label{} 
## \begin{tabular}{@{\extracolsep{5pt}} cccc} 
## \\[-1.8ex]\hline 
## \hline \\[-1.8ex] 
##  & stfgov & happy & trstlgl \\ 
## \hline \\[-1.8ex] 
## stfgov & $1$ & $0.168$ & $0.238$ \\ 
## happy & $0.168$ & $1$ & $0.256$ \\ 
## trstlgl & $0.238$ & $0.256$ & $1$ \\ 
## \hline \\[-1.8ex] 
## \end{tabular} 
## \end{table}

sjPlot::tab_corr(greece[,c('stfgov', 'happy', 'trstlgl')],
                 corr.method = "kendall")

	stfgov	happy	trstlgl
stfgov		0.168***	0.238***
happy	0.168***		0.256***
trstlgl	0.238***	0.256***
Computed correlation used kendall-method with listwise-deletion.

rcorr(as.matrix(greece[,c('stfgov', 'happy', 'trstlgl')]), type = "spearman")

##         stfgov happy trstlgl
## stfgov    1.00  0.21    0.31
## happy     0.21  1.00    0.32
## trstlgl   0.31  0.32    1.00
## 
## n= 2685 
## 
## 
## P
##         stfgov happy trstlgl
## stfgov          0     0     
## happy    0            0     
## trstlgl  0      0

cor_mat <- greece[,-4] %>% 
  rstatix::cor_mat()

cor_mat %>% 
  rstatix::cor_get_pval()

## # A tibble: 3 × 4
##   rowname   stfgov    happy  trstlgl
##   <chr>      <dbl>    <dbl>    <dbl>
## 1 stfgov  0        7.09e-34 1.03e-68
## 2 happy   7.09e-34 0        2.03e-63
## 3 trstlgl 1.03e-68 2.03e-63 0

cor_mat %>% 
  rstatix::cor_gather()

## # A tibble: 9 × 4
##   var1    var2      cor        p
##   <chr>   <chr>   <dbl>    <dbl>
## 1 stfgov  stfgov   1    0       
## 2 happy   stfgov   0.23 7.09e-34
## 3 trstlgl stfgov   0.33 1.03e-68
## 4 stfgov  happy    0.23 7.09e-34
## 5 happy   happy    1    0       
## 6 trstlgl happy    0.32 2.03e-63
## 7 stfgov  trstlgl  0.33 1.03e-68
## 8 happy   trstlgl  0.32 2.03e-63
## 9 trstlgl trstlgl  1    0

greece[,-4] %>% 
  apa.cor.table(filename = "cor_matrix_Greece.doc")

## 
## 
## Means, standard deviations, and correlations with confidence intervals
##  
## 
##   Variable   M    SD   1          2         
##   1. stfgov  4.11 2.28                      
##                                             
##   2. happy   6.59 1.53 .23**                
##                        [.19, .27]           
##                                             
##   3. trstlgl 6.43 2.26 .33**      .32**     
##                        [.29, .36] [.28, .35]
##                                             
## 
## Note. M and SD are used to represent mean and standard deviation, respectively.
## Values in square brackets indicate the 95% confidence interval.
## The confidence interval is a plausible range of population correlations 
## that could have caused the sample correlation (Cumming, 2014).
##  * indicates p < .05. ** indicates p < .01.
##

greece[,-4] %>% 
  sjPlot::sjp.corr()

## Warning: 'sjp.corr' is deprecated. Please use 'correlation::correlation()' and
## its related plot()-method.

## Computing correlation using pearson-method with listwise-deletion...

## Warning: Removed 6 rows containing missing values (`geom_text()`).

All relationships between our variables are positive and weak to moderate in strength. The highest Pearson coefficient is between trstlgl and stfgov (r = .33), closely followed by trstlgl and happy (r = .32); with Spearman's method the order of these two is reversed (0.324 vs. 0.313). The values are consistent with the scatterplots. All correlations are highly significant (p < 0.001).
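
To see how the three correlation methods behave on integer 0-10 scales with many ties, here is a minimal sketch on simulated data. Everything in it is an assumption made for illustration: the seed, the sample size, the helper `to_scale` and the latent-factor setup. None of it is ESS data.

```r
set.seed(42)  # arbitrary seed, for reproducibility only
n <- 500

# Simulated (hypothetical) 0-10 scores that share a common latent factor
latent <- rnorm(n)
to_scale <- function(z) pmin(pmax(round(5 + 2 * z), 0), 10)
x <- to_scale(latent + rnorm(n, sd = 0.5))
y <- to_scale(latent + rnorm(n, sd = 0.5))

# Same pair of variables, three correlation methods
methods <- c("pearson", "spearman", "kendall")
coefs <- sapply(methods, function(m) cor(x, y, method = m))
print(round(coefs, 2))
```

Rank-based methods are often preferred for ordinal survey scales. Kendall's tau is typically smaller in absolute value than Spearman's rho for the same data, as in the tables above (0.168 vs. 0.210 for happy and stfgov).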

a boxplot for the categorical predictor and the outcome

greece %>% 
  ggplot(aes(x = factor (psppipla),
             y = happy,
             fill = factor (psppipla))) + 
  geom_boxplot() +
  ggtitle("Distribution of happy level") +
  xlab("Category") + 
  ylab("Happy level") +
  theme_bw()+
  theme(legend.position="none")

Regardless of the level of psppipla, the distribution of happiness looks approximately the same.

Linear regression

model1 = lm(happy ~ stfgov, data = greece)
sjPlot::tab_model(model1)

	happy
Predictors	Estimates	CI	p
(Intercept)	5.96	5.84 – 6.07	<0.001
stfgov	0.15	0.13 – 0.18	<0.001
Observations	2685
R² / R² adjusted	0.053 / 0.053

We standardize the variables so that we can compare the coefficients with each other and interpret them as effect sizes.

greece <- greece %>% 
  mutate(Zhappy = scale(happy)[,1],
         Zstfgov = scale(stfgov)[,1],
         Ztrstlgl = scale(trstlgl)[,1])

model1_std = lm(Zhappy ~ Zstfgov, data = greece)
sjPlot::tab_model(model1_std)

	Zhappy
Predictors	Estimates	CI	p
(Intercept)	0.00	-0.04 – 0.04	1.000
Zstfgov	0.23	0.19 – 0.27	<0.001
Observations	2685
R² / R² adjusted	0.053 / 0.053
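
With a single predictor, the standardized slope is the raw slope rescaled by sd(x) / sd(y), and it equals Pearson's r. In the output above this gives roughly 0.155 * 2.28 / 1.53 ≈ 0.23, the same value as the Pearson correlation between happy and stfgov. A minimal sketch on simulated data (the seed and the data-generating values are assumptions, not ESS data):

```r
set.seed(1)  # arbitrary seed
# Simulated (hypothetical) data, not ESS
x <- rnorm(200, mean = 4, sd = 2)
y <- 6 + 0.15 * x + rnorm(200, sd = 1.5)

fit_raw <- lm(y ~ x)
fit_std <- lm(scale(y)[, 1] ~ scale(x)[, 1])

slope_raw <- unname(coef(fit_raw)[2])
slope_std <- unname(coef(fit_std)[2])

# The three printed numbers should coincide:
# standardized slope, rescaled raw slope, Pearson's r
print(c(slope_std, slope_raw * sd(x) / sd(y), cor(x, y)))
```

This identity holds only for simple regression; with several predictors the standardized coefficients are no longer equal to the pairwise correlations.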

# let's add a categorical variable
model2 = lm(happy ~ stfgov + psppipla, data = greece)
sjPlot::tab_model(model2)

	happy
Predictors	Estimates	CI	p
(Intercept)	5.99	5.86 – 6.11	<0.001
stfgov	0.15	0.13 – 0.18	<0.001
psppipla [2]	-0.05	-0.19 – 0.09	0.497
psppipla [3]	-0.09	-0.24 – 0.05	0.206
psppipla [4]	0.18	-0.07 – 0.43	0.161
Observations	2685
R² / R² adjusted	0.055 / 0.054

# with standardized coefficients
model2_std = lm(Zhappy ~ Zstfgov + psppipla, data = greece)
sjPlot::tab_model(model2_std)

	Zhappy
Predictors	Estimates	CI	p
(Intercept)	0.02	-0.04 – 0.07	0.582
Zstfgov	0.23	0.19 – 0.27	<0.001
psppipla [2]	-0.03	-0.12 – 0.06	0.497
psppipla [3]	-0.06	-0.16 – 0.03	0.206
psppipla [4]	0.12	-0.05 – 0.28	0.161
Observations	2685
R² / R² adjusted	0.055 / 0.054

# comparison of models
anova(model1_std, model2_std)

## Analysis of Variance Table
## 
## Model 1: Zhappy ~ Zstfgov
## Model 2: Zhappy ~ Zstfgov + psppipla
##   Res.Df  RSS Df Sum of Sq    F Pr(>F)
## 1   2683 2541                         
## 2   2680 2536  3      4.48 1.58   0.19

Conclusion: adding psppipla does not significantly improve the fit (F = 1.58, p = 0.19 > 0.05), so the simpler model 1 is preferred over model 2.
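
The F statistic in the anova() table can be recomputed by hand from the printed sums of squares. This sketch uses the rounded values from the table, so it should reproduce the reported F and p only approximately; the variable names are introduced here for illustration.

```r
# Values taken from the anova(model1_std, model2_std) table above
sum_sq  <- 4.48   # reduction in RSS when psppipla is added
rss2    <- 2536   # residual sum of squares of model 2
df_diff <- 3      # number of added parameters
df_res2 <- 2680   # residual degrees of freedom of model 2

# F = (improvement per added parameter) / (residual variance of the larger model)
f_stat  <- (sum_sq / df_diff) / (rss2 / df_res2)
p_value <- pf(f_stat, df_diff, df_res2, lower.tail = FALSE)

print(round(f_stat, 2))
print(round(p_value, 2))
```

A p-value above 0.05 means the three extra dummy coefficients do not reduce the residual variance by more than chance would.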

summary(model1)

## 
## Call:
## lm(formula = happy ~ stfgov, data = greece)
## 
## Residuals:
##    Min     1Q Median     3Q    Max 
## -6.732 -0.887  0.113  0.958  4.043 
## 
## Coefficients:
##             Estimate Std. Error t value Pr(>|t|)    
## (Intercept)   5.9571     0.0591   100.7   <2e-16 ***
## stfgov        0.1550     0.0126    12.3   <2e-16 ***
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## 
## Residual standard error: 1.49 on 2683 degrees of freedom
## Multiple R-squared:  0.0534, Adjusted R-squared:  0.053 
## F-statistic:  151 on 1 and 2683 DF,  p-value: <2e-16

For a model with non-standardized coefficients

The model's p-value (< 2e-16) is below 0.05, so the model is statistically significant. Adjusted R-squared is 0.053, i.e. the model explains 5.3% of the variance in the dependent variable (happy) with the independent variable (stfgov). The coefficient is 0.15 (p < 0.001): with each one-unit increase in stfgov, the predicted happiness level increases by 0.15. The intercept is 5.96: this is the predicted value of happy when stfgov is 0. The regression equation looks like this: happy = 5.96 + 0.15*stfgov
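
The regression equation can be turned into a small prediction function. This is a sketch using the rounded coefficients from the equation; `predict_happy` is a hypothetical helper, not part of the original analysis.

```r
# Coefficients rounded as in the regression equation (intercept 5.96, slope 0.15)
predict_happy <- function(stfgov) 5.96 + 0.15 * stfgov

stfgov_values <- c(0, 5, 10)
predicted <- predict_happy(stfgov_values)
print(data.frame(stfgov = stfgov_values, predicted_happy = predicted))
```

Across the whole 0-10 range of stfgov the predicted happiness changes by only 1.5 points, which matches the low R-squared of the model.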

Linear regression model with 2 continuous predictors

Now we add another predictor to our model.

# Let's add another continuous variable to the model
model3 = lm(happy ~ stfgov + trstlgl, data = greece)
sjPlot::tab_model(model3)

	happy
Predictors	Estimates	CI	p
(Intercept)	5.03	4.86 – 5.20	<0.001
stfgov	0.10	0.07 – 0.12	<0.001
trstlgl	0.18	0.16 – 0.21	<0.001
Observations	2685
R² / R² adjusted	0.118 / 0.117

model3_std = lm(Zhappy ~ Zstfgov + Ztrstlgl, data = greece)
sjPlot::tab_model(model3_std)

	Zhappy
Predictors	Estimates	CI	p
(Intercept)	0.00	-0.04 – 0.04	1.000
Zstfgov	0.14	0.10 – 0.18	<0.001
Ztrstlgl	0.27	0.23 – 0.31	<0.001
Observations	2685
R² / R² adjusted	0.118 / 0.117

# model comparison
anova(model1_std, model3_std)

## Analysis of Variance Table
## 
## Model 1: Zhappy ~ Zstfgov
## Model 2: Zhappy ~ Zstfgov + Ztrstlgl
##   Res.Df  RSS Df Sum of Sq   F Pr(>F)    
## 1   2683 2541                            
## 2   2682 2367  1       174 197 <2e-16 ***
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

Conclusion: Model 3 fits the data statistically significantly better than model 1 (F = 197, p < 0.05)

summary(model3_std)

## 
## Call:
## lm(formula = Zhappy ~ Zstfgov + Ztrstlgl, data = greece)
## 
## Residuals:
##    Min     1Q Median     3Q    Max 
## -4.365 -0.519  0.034  0.619  3.252 
## 
## Coefficients:
##             Estimate Std. Error t value Pr(>|t|)    
## (Intercept) 1.50e-15   1.81e-02    0.00        1    
## Zstfgov     1.43e-01   1.92e-02    7.42  1.5e-13 ***
## Ztrstlgl    2.69e-01   1.92e-02   14.03  < 2e-16 ***
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## 
## Residual standard error: 0.939 on 2682 degrees of freedom
## Multiple R-squared:  0.118,  Adjusted R-squared:  0.117 
## F-statistic:  180 on 2 and 2682 DF,  p-value: <2e-16

For a model with standardized coefficients

The model's p-value (< 2e-16) is below 0.05, so the model is statistically significant. Adjusted R-squared is 0.117, i.e. the model explains about 11.7% of the variance in the dependent variable (happy) with the independent variables (stfgov and trstlgl); this is modest, but more than twice the share explained by model 1. The standardized coefficients are 0.14 (Zstfgov) and 0.27 (Ztrstlgl), both with p < 0.001, so trust in the legal system has the stronger effect. The intercept is 0.00. The regression equation is: Zhappy = 0.14*Zstfgov + 0.27*Ztrstlgl
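
Adjusted R-squared can be recomputed from the multiple R-squared, the sample size and the number of predictors. The sketch below uses the rounded R-squared from the summary output, so the result is approximate.

```r
r2 <- 0.118   # Multiple R-squared from summary(model3_std)
n  <- 2685    # number of observations
p  <- 2       # number of predictors (Zstfgov, Ztrstlgl)

# Penalize R-squared for the number of predictors
adj_r2 <- 1 - (1 - r2) * (n - 1) / (n - p - 1)
print(round(adj_r2, 3))
```

With a sample this large and only two predictors the penalty is tiny, which is why the two R-squared values in the output are almost identical.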

summary(model3)

## 
## Call:
## lm(formula = happy ~ stfgov + trstlgl, data = greece)
## 
## Residuals:
##    Min     1Q Median     3Q    Max 
## -6.670 -0.792  0.053  0.945  4.969 
## 
## Coefficients:
##             Estimate Std. Error t value Pr(>|t|)    
## (Intercept)   5.0305     0.0873   57.62  < 2e-16 ***
## stfgov        0.0956     0.0129    7.42  1.5e-13 ***
## trstlgl       0.1821     0.0130   14.03  < 2e-16 ***
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## 
## Residual standard error: 1.44 on 2682 degrees of freedom
## Multiple R-squared:  0.118,  Adjusted R-squared:  0.117 
## F-statistic:  180 on 2 and 2682 DF,  p-value: <2e-16

For a model with non-standardized coefficients

The model's p-value (< 2e-16) is below 0.05, so the model is statistically significant. Adjusted R-squared is 0.117, i.e. the model explains about 11.7% of the variance in the dependent variable (happy) with the independent variables (stfgov and trstlgl). The coefficients are 0.10 (stfgov) and 0.18 (trstlgl), both with p < 0.001. The intercept is 5.03: this is the predicted value of happy when both stfgov and trstlgl are 0.

The regression equation looks like this: happy = 5.03 + 0.10*stfgov + 0.18*trstlgl

With each one-unit increase in stfgov, happy rises by 0.10, holding trstlgl constant. With each one-unit increase in trstlgl, happy rises by 0.18, holding stfgov constant.

Let’s check the assumptions of linear regression

Checking Linear Regression Assumptions

Linear regression makes several assumptions about the data, such as:

Linearity of the data
Normality of residuals
Homogeneity of residual variance (homoscedasticity)
Independence of residual error terms
Absence of multicollinearity

autoplot(model3_std)

Let’s check in a little more detail whether the assumptions of linear regression are fulfilled

#1) normality of the residual distribution

res <- resid(model3_std)
hist(res, breaks = 20, col = 'lightblue', freq = FALSE)
lines(density(res), col = 'red', lwd = 2)

shapiro.test(res) # the residuals are NOT normally distributed

## 
##  Shapiro-Wilk normality test
## 
## data:  res
## W = 1, p-value <2e-16

#QQ-plot
par(mfrow = c(1, 1))
qqnorm(res)
qqline(res)

car::qqPlot(model3_std)

## 1402 1484 
## 1346 1424

#The histogram, the Shapiro-Wilk test and the QQ-plots all indicate that the residuals are NOT normally distributed

# homoscedasticity
plot(fitted(model3_std), res)
abline(0,0)

ggplot(data = model3_std, aes(x = .fitted, y = .stdresid)) + 
  geom_point() + 
  geom_hline(yintercept = 0)

bptest(model3_std) # Breusch-Pagan test

## 
##  studentized Breusch-Pagan test
## 
## data:  model3_std
## BP = 112, df = 2, p-value <2e-16

# we can say that homoscedasticity does NOT hold
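
The idea behind the studentized Breusch-Pagan test can be sketched in base R: regress the squared residuals on the predictors and use n * R-squared of that auxiliary model as a chi-squared statistic. The data below are simulated with an error spread that grows with x; the seed, sample size and all values are assumptions for illustration, not ESS data.

```r
set.seed(7)  # arbitrary seed
n <- 500
# Simulated (hypothetical) data whose error spread grows with x
x <- runif(n, 0, 3)
y <- 1 + 2 * x + rnorm(n, sd = 0.5 + x)

fit <- lm(y ~ x)

# Auxiliary regression of squared residuals on the predictor
aux <- lm(resid(fit)^2 ~ x)

# Studentized Breusch-Pagan statistic: n * R^2 of the auxiliary model,
# compared with a chi-squared distribution (df = number of predictors)
bp_stat <- n * summary(aux)$r.squared
p_value <- pchisq(bp_stat, df = 1, lower.tail = FALSE)

print(c(BP = bp_stat, p = p_value))
```

A small p-value means the residual variance depends on the predictors, i.e. the homoscedasticity assumption is violated, as it is for model3_std.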

# let's check multicollinearity
car::vif(model3_std)

##  Zstfgov Ztrstlgl 
##     1.12     1.12

# there is NO multicollinearity
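
With exactly two predictors, the VIF depends only on their mutual correlation: VIF = 1 / (1 - r^2). A minimal check using the rounded Pearson correlation between stfgov and trstlgl reported earlier (r = .33):

```r
r <- 0.33                 # Pearson correlation between stfgov and trstlgl
vif <- 1 / (1 - r^2)      # variance inflation factor for either predictor
print(round(vif, 2))
```

The result agrees with the car::vif() output of 1.12; standardizing the variables does not change it, because correlation is unaffected by rescaling.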

Linearity assumption: the Residuals vs. Fitted plot shows a roughly horizontal line without distinct patterns, which suggests that the relationship is linear. Normality: the histogram, the Shapiro-Wilk test and the QQ-plots do NOT show a normal distribution of residuals. Homoscedasticity: the Scale-Location and Residuals vs. Leverage plots do NOT show a horizontal line with equally spread points, which, together with the Breusch-Pagan test, indicates that homoscedasticity does not hold. Multicollinearity: the VIF values (1.12) are close to 1, so there is no multicollinearity.

Project_3

Diana Piskareva

2023-05-09
