chunk_opts = list(echo = TRUE, fig.align = 'center', fig.height = 8, fig.width = 8)Multiple Correspondence Analysis - Faith Survey
1. Routine immunization
1.1 Data summary
summary(regVaxData) age sex geoLoc role
18-24:2552 f :3646 rural:4888 relLead:2452
25-34:3515 sex_nr: 623 urban:4783 member :3888
35-44:2242 m :5402 notPart:3331
45+ :1362
religion leadOpCVax
christian :3898 leadOpCVax_matters :4364
muslim :3771 leadOpCVax_ns :2361
hindu : 624 leadOpCVax_dntMatter:2742
other : 369 NA's : 204
traditional religions: 286
buddhist : 229
(Other) : 494
mostLeadEndorseCV vaxed influencerCVax
noLeadEndorseCVax :1331 unvaxed:2579 infCvax_relLead : 791
notSureLeadEndorseCVax:2218 vaxed :7092 infCvax_hWorker :3802
mostLeadEndorseCVax :6122 infCvax_other :1844
infCvax_peerFriend: 750
infCvax_FamMember :2484
factorCVax importanceRVax influencerRVax
fClearInfo :3109 regVaxVeryImp :5656 infRVax_famMember :2636
fLeaderTakingVax:2639 regVaxSwhatImp:2067 infRVax_hWorker :4489
fRecommendations: 894 regVaxnotImp :1230 infRVax_other :1412
fDoMyPart : 840 regVaxnotSure : 718 infRVax_peerFriend: 464
fOther : 839 infRVax_relLeader : 670
fMedia : 572
(Other) : 778
continuedRvax leadSupportRvax
continuedRVax :7352 noLeadSupportRvax : 921
notcontinuedRVax:2319 notSureLeadSupportRvax:1710
someLeadSupportRvax :1681
mostLeadSupportRvax :5359
howReligionInfRVax contact_type country
relDiscouragesRVax: 931 Religions For Peace:2292 Guatemala:1472
relDsntAffectRVax :1332 RDD / 3-2-1 :7379 Mali :1196
relNSRVax :1449 Iraq :1139
relEncouragesRVax :5959 Zambia :1069
Kenya : 957
Nigeria : 953
(Other) :2885
This dataset contains 9671 individuals and 48 variables, 7 qualitative variables are considered as illustrative.
1.2 Run the MCA
regVax.MCA<-MCA(regVaxData,ncp=8,quali.sup=c(1,2,3,4,5,16,17),graph=FALSE)1.3 Inertia distribution (auto text)
The inertia of the first dimensions shows if there are strong relationships between variables and suggests the number of dimensions that should be studied.
The first two dimensions of analyse express 17.39% of the total dataset inertia ; that means that 17.39% of the individuals (or variables) cloud total variability is explained by the plane. This is a very small percentage and the first plane represents a small part of the data variability. This value is greater than the reference value that equals 7.1%, the variability explained by this plane is thus significant (the reference value is the 0.95-quantile of the inertia percentages distribution obtained by simulating 532 data tables of equivalent size on the basis of a uniform distribution).
From these observations, it is important to also interpret the dimensions greater or equal to the third one.
An estimation of the right number of axis to interpret suggests to restrict the analysis to the description of the first 8 axis. These axis present an amount of inertia greater than those obtained by the 0.95-quantile of random distributions (42.06% against 27.69%). This observation suggests that only these axis are carrying a real information. As a consequence, the description will stand to these axis.
1.4 Description of the plane 1:2
Analyzing individuals
Figure 4.1 - Individuals factor map (MCA)
Analyzing specific individuals
regVaxData %>% tibble::rownames_to_column("rowname") %>%
filter(rowname %in% c(1683, 2175, 2908, 1562, 5027, 1839, 2539, 2377, 3128, 4961, 1376, 7069)) %>%
mutate(quadrant = c(3, 4, 1, 1, 1, 4, 1, 4, 4, 3, 3, 3), .before = 1) %>% arrange(quadrant) %>% kable()| quadrant | rowname | age | sex | geoLoc | role | religion | leadOpCVax | mostLeadEndorseCV | vaxed | influencerCVax | factorCVax | importanceRVax | influencerRVax | continuedRvax | leadSupportRvax | howReligionInfRVax | contact_type | country |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 1683 | 18-24 | m | urban | member | muslim | leadOpCVax_dntMatter | noLeadEndorseCVax | unvaxed | infCvax_FamMember | fRecommendations | regVaxSwhatImp | infRVax_famMember | notcontinuedRVax | someLeadSupportRvax | relDiscouragesRVax | RDD / 3-2-1 | Guatemala |
| 1 | 1839 | 18-24 | f | rural | member | muslim | leadOpCVax_dntMatter | noLeadEndorseCVax | unvaxed | infCvax_FamMember | fClearInfo | regVaxSwhatImp | infRVax_relLeader | notcontinuedRVax | someLeadSupportRvax | relDiscouragesRVax | RDD / 3-2-1 | Guatemala |
| 1 | 2175 | 18-24 | m | urban | relLead | christian | leadOpCVax_dntMatter | noLeadEndorseCVax | unvaxed | infCvax_FamMember | fRecommendations | regVaxSwhatImp | infRVax_relLeader | notcontinuedRVax | someLeadSupportRvax | relDiscouragesRVax | RDD / 3-2-1 | Guatemala |
| 1 | 2539 | 35-44 | f | rural | notPart | traditional religions | leadOpCVax_dntMatter | noLeadEndorseCVax | unvaxed | infCvax_FamMember | fDoMyPart | regVaxSwhatImp | infRVax_famMember | notcontinuedRVax | someLeadSupportRvax | relDiscouragesRVax | RDD / 3-2-1 | Guatemala |
| 3 | 1376 | 45+ | sex_nr | rural | notPart | traditional religions | leadOpCVax_matters | mostLeadEndorseCVax | vaxed | infCvax_hWorker | fRecommendations | regVaxnotImp | infRVax_hWorker | continuedRVax | mostLeadSupportRvax | relEncouragesRVax | RDD / 3-2-1 | Guatemala |
| 3 | 4961 | 35-44 | m | urban | relLead | muslim | leadOpCVax_matters | mostLeadEndorseCVax | vaxed | infCvax_hWorker | fLeaderTakingVax | regVaxnotImp | infRVax_peerFriend | continuedRVax | mostLeadSupportRvax | relEncouragesRVax | RDD / 3-2-1 | Mali |
| 3 | 5027 | 25-34 | m | urban | relLead | muslim | leadOpCVax_matters | mostLeadEndorseCVax | vaxed | infCvax_hWorker | fGovGuidance | regVaxnotImp | infRVax_hWorker | continuedRVax | mostLeadSupportRvax | relEncouragesRVax | Religions For Peace | Mali |
| 3 | 7069 | 45+ | m | urban | notPart | muslim | leadOpCVax_matters | mostLeadEndorseCVax | vaxed | infCvax_hWorker | fOther | regVaxVeryImp | infRVax_hWorker | continuedRVax | mostLeadSupportRvax | relEncouragesRVax | RDD / 3-2-1 | Nigeria |
| 4 | 1562 | 18-24 | f | rural | member | not religious | NA | notSureLeadEndorseCVax | unvaxed | infCvax_peerFriend | fOther | regVaxnotSure | infRVax_other | notcontinuedRVax | notSureLeadSupportRvax | relNSRVax | RDD / 3-2-1 | Guatemala |
| 4 | 2377 | 45+ | sex_nr | rural | notPart | other | leadOpCVax_ns | notSureLeadEndorseCVax | unvaxed | infCvax_other | fDoMyPart | regVaxnotSure | infRVax_other | continuedRVax | notSureLeadSupportRvax | relNSRVax | RDD / 3-2-1 | Guatemala |
| 4 | 2908 | 25-34 | sex_nr | rural | notPart | buddhist | leadOpCVax_ns | notSureLeadEndorseCVax | vaxed | infCvax_peerFriend | fOther | regVaxnotSure | infRVax_other | continuedRVax | notSureLeadSupportRvax | relNSRVax | RDD / 3-2-1 | Iraq |
| 4 | 3128 | 45+ | sex_nr | urban | notPart | muslim | leadOpCVax_ns | notSureLeadEndorseCVax | unvaxed | infCvax_other | fOther | regVaxnotSure | infRVax_hWorker | continuedRVax | notSureLeadSupportRvax | relNSRVax | RDD / 3-2-1 | Iraq |
Analyzing categories
plot.MCA(regVax.MCA, invisible= 'ind',col.quali.sup='#006400',label =c('var'))Warning: ggrepel: 10 unlabeled data points (too many overlaps). Consider
increasing max.overlaps
Figure 4.2 - Categories factor map (MCA)
Analyzing variables
plot.MCA(regVax.MCA, choix='var',col.quali.sup='#006400')Warning: ggrepel: 2 unlabeled data points (too many overlaps). Consider
increasing max.overlaps
Figure 4.3 - Variable representation (MCA)
The square correlation of variables with axes are not too high, but considering the large sample, those with a correlation closer to 0.5 are considered relevant: For dimension 1:
- leadSupportRvax, howReligionInfRVax, factorCVax, mostLeadEndorseCV, leadOpCVax, influencerCVax, influencerRVax
For dimension 2:
- leadSupportRvax, howReligionInfRVax.
Analyzing categories by variable
plot.MCA(regVax.MCA,invisible= c('ind','quali.sup'),col.var=c(1,1,1,1,1,2,2,2,3,3,4,4,4,4,4,5,5,5,5,5,5,5,5,6,6,6,6,7,7,7,7,7,8,8,9,9,9,9,10,10,10,10),col.quali.sup=c(11,11,11,11,12,12,12,13,13,14,14,14,15,15,15,15,15,15,15,15,15,15,16,16,17,17,17,17,17,17,17,17,17,17),cex=0.8,cex.main=0.8,cex.axis=0.8,label =c('var'))Warning: ggrepel: 7 unlabeled data points (too many overlaps). Consider
increasing max.overlaps
Figure 4.5 - Category factors, by variable (MCA)
The positive diagonal seems to distinguish pro-vaccine individuals (vaccination, its willingness) and individuals who believe their leaders or religious beliefs are in favour of vaccination (in the third quadrant) from those against vaccination (unvaccinated, those who did not continue routine immunization) and those who believe their religion or their leaders discourage vaccination (in the first quadrant). This diagonal show differences on influencers of vaccination (both regular and covid), having health workers close to those who are pro-vaccine, and family close to those who are against vaccination. Those believing religious leaders support vaccination are also close to those who feel influenced by their leaders taking the vaccine, whereas those who are not sure about religious support are close to those who feel most influenced by recommendations of influential people, public statements or media.
Duboious categories (those answering “not sure”, “other” or “prefer not to answer”) in several questions seem to be grouped in the fourth quadrant.
Analyzing categories and supporting variables
plot.MCA(regVax.MCA,invisible= 'ind',col.quali.sup='#006400',label =c('var','quali.sup')) %>% ggplotlyFigure 4.5 - Category factors, with supporting variable categories (MCA)
Looking at supportive variables, we find respondents from RFP, and those from Lenya, Zambia, Mali (and also Nigeria), as well as those who declared themselves as religious leaders, close to those who believe religious leaders support vaccination, in the third quadrant.
Jewish, and traditional religions seem to group in the first quadrant, close to those who believe religious leaders discourage vaccination.
1.5 Clustering
regVax.MCA$eig #%>% head() eigenvalue percentage of variance cumulative percentage of variance
dim 1 0.36619820 11.812845 11.81285
dim 2 0.17289135 5.577140 17.38999
dim 3 0.14953521 4.823716 22.21370
dim 4 0.13691707 4.416680 26.63038
dim 5 0.13137543 4.237917 30.86830
dim 6 0.12615782 4.069607 34.93791
dim 7 0.11372916 3.668683 38.60659
dim 8 0.10703568 3.452764 42.05935
dim 9 0.10288402 3.318839 45.37819
dim 10 0.10056265 3.243956 48.62215
dim 11 0.10033287 3.236544 51.85869
dim 12 0.09687487 3.124996 54.98369
dim 13 0.09263062 2.988085 57.97177
dim 14 0.09164328 2.956235 60.92801
dim 15 0.09014075 2.907766 63.83577
dim 16 0.08832636 2.849237 66.68501
dim 17 0.08713409 2.810777 69.49579
dim 18 0.08302324 2.678169 72.17396
dim 19 0.08161335 2.632689 74.80665
dim 20 0.07893656 2.546341 77.35299
dim 21 0.07745531 2.498558 79.85154
dim 22 0.07429881 2.396736 82.24828
dim 23 0.07273790 2.346384 84.59466
dim 24 0.07199724 2.322492 86.91716
dim 25 0.06883658 2.220535 89.13769
dim 26 0.06549124 2.112621 91.25031
dim 27 0.06120324 1.974298 93.22461
dim 28 0.05668930 1.828687 95.05330
dim 29 0.05417428 1.747557 96.80085
dim 30 0.05100437 1.645302 98.44616
dim 31 0.04816916 1.553844 100.00000
Choosing 22 dimension to get >80% of variance
Performing clustering
# Run this in console to choose the number of clusters
#regVax.hcpc <- HCPC(regVax.MCA)
#Leave this on chunk for rendering
regVax.hcpc <- HCPC(regVax.MCA, nb.clust = 4)This plot made with 4 clusters suggested by dendogram
# plot.HCPC(res.HCPC,choice='map',draw.tree=FALSE,title='Factor map', label = "none")
plot(regVax.hcpc,choice="map",ind.names=FALSE)round(regVax.hcpc$call$t$inert.gain,3) %>% head()[1] 0.178 0.100 0.079 0.071 0.058 0.045
round(regVax.MCA$eig[,1],3) %>% head()dim 1 dim 2 dim 3 dim 4 dim 5 dim 6
0.366 0.173 0.150 0.137 0.131 0.126
Defining the clusters
regVax.hcpc$desc.var$test p.value df
leadOpCVax 0.000000e+00 9
mostLeadEndorseCV 0.000000e+00 6
influencerCVax 0.000000e+00 12
factorCVax 0.000000e+00 21
importanceRVax 0.000000e+00 9
influencerRVax 0.000000e+00 12
leadSupportRvax 0.000000e+00 9
howReligionInfRVax 0.000000e+00 9
country 0.000000e+00 27
vaxed 5.549269e-311 3
continuedRvax 6.781057e-310 3
religion 1.974150e-229 27
role 2.782092e-198 6
sex 2.305549e-167 6
contact_type 2.867685e-134 3
age 1.034460e-17 9
geoLoc 5.436906e-05 3
regVax.hcpc$desc.var$category %>% lapply(head)$`1`
Cla/Mod Mod/Cla Global p.value
howReligionInfRVax=relEncouragesRVax 61.97349 90.44820 61.61721 0.000000e+00
leadSupportRvax=mostLeadSupportRvax 65.42265 85.86823 55.41309 0.000000e+00
influencerCVax=infCvax_hWorker 66.54392 61.96424 39.31341 0.000000e+00
mostLeadEndorseCV=mostLeadEndorseCVax 60.07841 90.08082 63.30266 0.000000e+00
leadOpCVax=leadOpCVax_matters 68.21723 72.91207 45.12460 0.000000e+00
factorCVax=fLeaderTakingVax 71.84540 46.43644 27.28777 5.089468e-288
v.test
howReligionInfRVax=relEncouragesRVax Inf
leadSupportRvax=mostLeadSupportRvax Inf
influencerCVax=infCvax_hWorker Inf
mostLeadEndorseCV=mostLeadEndorseCVax Inf
leadOpCVax=leadOpCVax_matters Inf
factorCVax=fLeaderTakingVax 36.26844
$`2`
Cla/Mod Mod/Cla Global p.value
influencerRVax=infRVax_relLeader 82.83582 53.62319 6.927929 0.000000e+00
influencerCVax=infCvax_relLead 87.98989 67.24638 8.179092 0.000000e+00
role=relLead 18.39315 43.57488 25.354152 7.562078e-42
contact_type=Religions For Peace 18.71728 41.44928 23.699721 3.431460e-41
leadOpCVax=leadOpCVax_matters 14.84876 62.60870 45.124599 6.871624e-33
country=Nigeria 22.98006 21.15942 9.854203 4.037931e-31
v.test
influencerRVax=infRVax_relLeader Inf
influencerCVax=infCvax_relLead Inf
role=relLead 13.55342
contact_type=Religions For Peace 13.44198
leadOpCVax=leadOpCVax_matters 11.94528
country=Nigeria 11.60174
$`3`
Cla/Mod Mod/Cla Global p.value
leadOpCVax=leadOpCVax_dntMatter 59.40919 57.68414 28.352807 0.000000e+00
howReligionInfRVax=relDsntAffectRVax 71.62162 33.78187 13.773136 3.560404e-266
mostLeadEndorseCV=noLeadEndorseCVax 65.74005 30.98442 13.762796 6.468841e-198
leadSupportRvax=someLeadSupportRvax 55.68114 33.14448 17.381863 4.111474e-140
leadSupportRvax=noLeadSupportRvax 59.82628 19.51133 9.523317 2.077776e-92
influencerRVax=infRVax_famMember 41.72989 38.95184 27.256747 1.649112e-59
v.test
leadOpCVax=leadOpCVax_dntMatter Inf
howReligionInfRVax=relDsntAffectRVax 34.85513
mostLeadEndorseCV=noLeadEndorseCVax 30.01387
leadSupportRvax=someLeadSupportRvax 25.19897
leadSupportRvax=noLeadSupportRvax 20.38938
influencerRVax=infRVax_famMember 16.26859
$`4`
Cla/Mod Mod/Cla Global p.value
howReligionInfRVax=relNSRVax 66.18357 55.46559 14.982939 0
leadSupportRvax=notSureLeadSupportRvax 62.28070 61.59630 17.681729 0
influencerRVax=infRVax_other 60.19830 49.16136 14.600352 0
importanceRVax=regVaxnotSure 80.64067 33.48757 7.424258 0
factorCVax=fOther 75.80453 36.78427 8.675421 0
influencerCVax=infCvax_other 56.83297 60.61307 19.067315 0
v.test
howReligionInfRVax=relNSRVax Inf
leadSupportRvax=notSureLeadSupportRvax Inf
influencerRVax=infRVax_other Inf
importanceRVax=regVaxnotSure Inf
factorCVax=fOther Inf
influencerCVax=infCvax_other Inf
Cluster 1 seems to group:
- Those whose leader's opinion matters to them and who believe in their endosement or support for vaccines
- Those who feel most influenced by health workers
- Those who believe the most important factor for decisions on COVID vaccination is seeing their religious leader taking the vaccine
Cluster 2 seems to group:
- Those who declare religious leaders as their main influencers on vaccinations
- Religious leaders or RFP members
Cluster 3 seems to group:
- Those whose leader's opinion does not matter to them
- Those who believe leaders do not support vaccination, or that religious beliefs do not affect vaccination decisions
- Those who believe the most important factor for decisions on COVID vaccination is government guidance
Cluster 4 seems to group:
- Those not sure about religious leaders or religious beliefs stance/influence on vaccination
Important vars are: p.value df religion 0.000000e+00 45 Only relevant in the last clusters leadOpCVax 0.000000e+00 15 Cluster 1 has those whose leader’s opinion matters to them, Cluster 5 those who are NAs influencerCVax 0.000000e+00 20 Cluster 1 has those that feel most influenced by health workers, Cluster 2 has respondents influenced by peers and family, Cluster 6 has those influenced by other people factorCVax 0.000000e+00 35 Cluster 1 has those who believe the most important factor for decisions on COVID vaccination is seeing their religious leader taking the vaccine, for those in cluster 2 is recommendations and media, those in cluster 3 are more likely to regocnize government guidance as a factor. Those in cluster 4, in public statements by influencing people, and those in cluster 6 are likeley to believe it is because they want to do their part on vaccination influencerRVax 0.000000e+00 20 Similar to influencerCVax country 0.000000e+00 45
Cluster 1 factor leader taking vax and clear info $1 Cla/Mod Mod/Cla factorCVax=fLeaderTakingVax 86.8889731 48.20264873 factorCVax=fClearInfo 75.4261821 49.29577465 influencerCVax=infCvax_hWorker 69.0426092 55.18183729 howReligionInfRVax=relEncouragesRVax 60.9666051 76.37166281 leadOpCVax=leadOpCVax_matters 65.6049496 60.18499054 mostLeadEndorseCV=mostLeadEndorseCVax 59.8007187 76.96026908 leadSupportRvax=mostLeadSupportRvax 61.2054488 68.95101955 influencerRVax=infRVax_hWorker 63.0875473 59.53331932
$2 factor recomendations and media Cla/Mod Mod/Cla factorCVax=fRecommendations 97.0917226 41.83132530 factorCVax=fMedia 98.9510490 27.27710843 influencerCVax=infCvax_peerFriend 86.8000000 31.37349398 influencerRVax=infRVax_peerFriend 86.6379310 19.37349398 leadOpCVax=leadOpCVax_ns 28.5048708 32.43373494 leadSupportRvax=notSureLeadSupportRvax 29.3567251 24.19277108 mostLeadEndorseCV=notSureLeadEndorseCVax 27.9080252 29.83132530 howReligionInfRVax=relNSRVax 28.7094548 20.04819277 role=notPart 25.5478835 41.01204819 contact_type=RDD / 3-2-1 22.9841442 81.73493976
$3 factor government guidance Cla/Mod Mod/Cla factorCVax=fGovGuidance 98.544699 100.000000 leadSupportRvax=notSureLeadSupportRvax 9.415205 33.966245 leadOpCVax=leadOpCVax_ns 8.428632 41.983122 mostLeadEndorseCV=notSureLeadEndorseCVax 8.295762 38.818565 country=Egypt 10.911271 19.198312 role=notPart 7.205044 50.632911 howReligionInfRVax=relNSRVax 8.902692 27.215190 contact_type=RDD / 3-2-1 5.610516 87.341772 country=Pakistan 9.020902 17.299578 influencerCVax=infCvax_other 7.375271 28.691983 influencerRVax=infRVax_other 7.648725 22.784810 importanceRVax=regVaxVeryImp 5.657709 67.510549
$4 factor public statements Cla/Mod Mod/Cla factorCVax=fPubStatements 97.979798 100.000000 influencerCVax=infCvax_other 5.368764 34.020619 country=Iraq 5.882353 23.024055 vaxed=unvaxed 4.614192 40.893471 influencerRVax=infRVax_other 5.240793 25.429553 howReligionInfRVax=relNSRVax 5.037957 25.085911 leadOpCVax=leadOpCVax_ns 4.404913 35.738832 religion=buddhist 8.733624 6.872852 mostLeadEndorseCV=notSureLeadEndorseCVax 4.373309 33.333333 leadSupportRvax=notSureLeadSupportRvax 4.444444 26.116838 contact_type=RDD / 3-2-1 3.306681 83.848797 continuedRvax=notcontinuedRVax 4.010349 31.958763 mostLeadEndorseCV=noLeadEndorseCVax 4.432757 20.274914 importanceRVax=regVaxnotSure 5.013928 12.371134
$5 Cla/Mod Mod/Cla leadOpCVax=leadOpCVax.NA 100.0000000 100.0000000 religion=not religious 94.4444444 100.0000000 country=Guatemala 8.9673913 64.7058824 role=notPart 4.8934254 79.9019608 country=Brazil 9.9722992 17.6470588 factorCVax=fDoMyPart 6.4285714 26.4705882 mostLeadEndorseCV=notSureLeadEndorseCVax 4.2831380 46.5686275 sex=f 3.4009874 60.7843137 factorCVax=fOther 5.4827175 22.5490196 contact_type=RDD / 3-2-1 2.5071148 90.6862745 influencerRVax=infRVax_other 4.1784703 28.9215686 influencerCVax=infCvax_other 3.7418655 33.8235294 howReligionInfRVax=relDsntAffectRVax 3.9039039 25.4901961 howReligionInfRVax=relNSRVax 3.7267081 26.4705882 leadSupportRvax=notSureLeadSupportRvax 3.4502924 28.9215686 leadSupportRvax=someLeadSupportRvax 3.3908388 27.9411765
$6 Cla/Mod Mod/Cla factorCVax=fDoMyPart 78.2142857 35.13368984 influencerCVax=infCvax_other 57.4837310 56.68449198 influencerRVax=infRVax_other 58.5694051 44.22459893 factorCVax=fOther 66.8653159 30.00000000 leadOpCVax=leadOpCVax_ns 35.8322745 45.24064171 howReligionInfRVax=relNSRVax 40.3036577 31.22994652 leadSupportRvax=notSureLeadSupportRvax 37.8947368 34.65240642 importanceRVax=regVaxnotSure 49.8607242 19.14438503 mostLeadEndorseCV=notSureLeadEndorseCVax 34.0847610 40.42780749 country=Iraq 37.8402107 23.04812834 vaxed=unvaxed 28.9647150 39.94652406 continuedRvax=notcontinuedRVax 29.0211298 35.98930481 religion=other 47.4254743 9.35828877 sex=sex_nr 40.1284109 13.36898396 role=notPart 25.6079255 45.61497326
2. COVID immunization
2.1. Data summary
summary(covidData) age sex geoLoc role
18-24:5737 f : 7698 rural: 9051 relLead:4627
25-34:5997 sex_nr: 917 urban:10795 member :8016
35-44:4045 m :11231 notPart:7203
45+ :4067
religion leadOpCVax
christian :8287 leadOpCVax_matters :8451
muslim :7402 leadOpCVax_ns :4285
hindu :1551 leadOpCVax_dntMatter:6601
other : 592 NA's : 509
not religious : 537
traditional religions: 534
(Other) : 943
mostLeadEndorseCV vaxed influencerCVax
noLeadEndorseCVax : 2828 unvaxed: 4805 infCvax_relLead :1548
notSureLeadEndorseCVax: 4332 vaxed :15041 infCvax_hWorker :7629
mostLeadEndorseCVax :12686 infCvax_other :3930
infCvax_peerFriend:1482
infCvax_FamMember :5257
factorCVax contact_type country
fClearInfo :6972 Religions For Peace: 5586 Zambia :2125
fLeaderTakingVax:4823 RDD / 3-2-1 :14260 Kenya :2116
fRecommendations:1998 Nigeria:2087
fDoMyPart :1820 Egypt :2026
fOther :1514 Mali :2023
fMedia :1173 Nepal :2014
(Other) :1546 (Other):7455
This dataset contains 19846 individuals and 28 variables, 7 qualitative variables are considered as illustrative.
2.2 Run the MCA
dfcompleted <- missMDA::imputeMCA(covidData,ncp=14,quali.sup=c(1,2,3,4,5,11,12))
covidVax.MCA<-MCA(covidData,tab.disj = dfcompleted$tab.disj,quali.sup=c(1,2,3,4,5,11,12),graph=FALSE)2.3 Inertia distribution (autotext)
The inertia of the first dimensions shows if there are strong relationships between variables and suggests the number of dimensions that should be studied.
The first two dimensions of analyse express 21.3% of the total dataset inertia ; that means that 21.3% of the individuals (or variables) cloud total variability is explained by the plane. This is a small percentage and the first plane just represents a part of the data variability. This value is greater than the reference value that equals 13.05%, the variability explained by this plane is thus significant (the reference value is the 0.95-quantile of the inertia percentages distribution obtained by simulating 570 data tables of equivalent size on the basis of a uniform distribution).
From these observations, it is interesting to consider the next dimensions which also express a high percentage of the total inertia.
An estimation of the right number of axis to interpret suggests to restrict the analysis to the description of the first 6 axis. These axis present an amount of inertia greater than those obtained by the 0.95-quantile of random distributions (48.32% against 38.56%). This observation suggests that only these axis are carrying a real information. As a consequence, the description will stand to these axis.
2.4. Description of the plane 1:2
Analyzing individuals
Figure 2.4.1 Individuals factor map (MCA)
Horseshoe shape?
Analyzing specific individuals
regVaxData %>% tibble::rownames_to_column("rowname") %>%
filter(rowname %in% c(1683, 2175, 2908, 1562, 5027, 1839, 2539, 2377, 3128, 4961, 1376, 7069)) %>%
mutate(quadrant = c(3, 4, 1, 1, 1, 4, 1, 4, 4, 3, 3, 3), .before = 1) %>% arrange(quadrant)# A tibble: 12 × 19
quadrant rowname age sex geoLoc role religion leadOpCVax
<dbl> <chr> <fct> <fct> <fct> <fct> <fct> <fct>
1 1 1683 18-24 m urban member muslim leadOpCVa…
2 1 1839 18-24 f rural member muslim leadOpCVa…
3 1 2175 18-24 m urban relLead christian leadOpCVa…
4 1 2539 35-44 f rural notPart traditional religions leadOpCVa…
5 3 1376 45+ sex_nr rural notPart traditional religions leadOpCVa…
6 3 4961 35-44 m urban relLead muslim leadOpCVa…
7 3 5027 25-34 m urban relLead muslim leadOpCVa…
8 3 7069 45+ m urban notPart muslim leadOpCVa…
9 4 1562 18-24 f rural member not religious <NA>
10 4 2377 45+ sex_nr rural notPart other leadOpCVa…
11 4 2908 25-34 sex_nr rural notPart buddhist leadOpCVa…
12 4 3128 45+ sex_nr urban notPart muslim leadOpCVa…
# ℹ 11 more variables: mostLeadEndorseCV <fct>, vaxed <fct>,
# influencerCVax <fct>, factorCVax <fct>, importanceRVax <fct>,
# influencerRVax <fct>, continuedRvax <fct>, leadSupportRvax <fct>,
# howReligionInfRVax <fct>, contact_type <fct>, country <fct>
Analyzing categories
plot.MCA(covidVax.MCA, invisible= 'ind',col.quali.sup='#006400',label =c('var'))Figure 2.4.2 Categories factor map (MCA)
Analyzing variables
plot.MCA(covidVax.MCA, choix='var',col.quali.sup='#006400')Figure 2.4.3 - Individuals factor map (MCA)
The square correlation of variables with axes are not too high, but considering the large sample, those with a correlation closer to 0.5 are considered relevant: For dimension 1:
- factorCVax, mostLeadEndorseCV, leadOpCVax, influencerCVax, vaxed
For dimension 2:
- factorCVax, leadOpCVax, influencerCVax
Analyzing categories by variable
plot.MCA(covidVax.MCA,invisible= c('ind','quali.sup'),col.var=c(1,1,1,2,2,2,3,3,4,4,4,4,4,5,5,5,5,5,5,5,5),col.quali.sup=c(6,6,6,6,7,7,7,8,8,9,9,9,10,10,10,10,10,10,10,10,10,10,11,11,12,12,12,12,12,12,12,12,12,12),label =c('var'))Figure 2.4.4 - Category factors, by variable (MCA)
A horseshoe shape is observed for the influence of leader’s opinion on vaccination, as well as for the COVID vaccination endorsement of leaders and for the most important influencers of it. Those who belive in leader’s endorsement of vaccination are close to those who feel influenced by the opinion of their leaders, and those who are most influenced by them, locate on negative values of dimension 1 and positive values for dimension 2. Those who do not believe in leader’s endorsement are close to those who feel influenced by the opinion of their leaders locate close to the origin of dimension 1. Those uncertain about the influence of leaders locate at positive values of dimensions 1 and 2.
Dimension 1 seems to distinguish between those vaccinated and those who are not.
Analyzing categories and supporting variables
plot.MCA(covidVax.MCA,invisible= 'ind',col.quali.sup='#006400',label =c('var','quali.sup')) %>% ggplotlyFigure 2.4.5 - Category factors, with supporting variable categories (MCA)
When comparing to supporting variables, and as it happened with routine immunization, Nigeria, Zambia, Kenya and Mali, as well as those who declared themselves as religious leaders, seem closer to those who believe religious leaders support vaccination and to those who feel influenced by them.
Uncertain respondents are closer to Iraquí respondents, Shik, and those who did not respond to the gender question.
2.5 Clustering
covidVax.MCA$eig #%>% head() eigenvalue percentage of variance cumulative percentage of variance
dim 1 0.4200437 13.089489 13.08949
dim 2 0.2632400 8.203140 21.29263
dim 3 0.2327096 7.251746 28.54437
dim 4 0.2160854 6.733700 35.27807
dim 5 0.2076532 6.470933 41.74901
dim 6 0.2075218 6.466837 48.21585
dim 7 0.2004154 6.245388 54.46123
dim 8 0.1987885 6.194688 60.65592
dim 9 0.1911314 5.956075 66.61200
dim 10 0.1797189 5.600437 72.21243
dim 11 0.1777702 5.539713 77.75215
dim 12 0.1700613 5.299485 83.05163
dim 13 0.1594057 4.967434 88.01907
dim 14 0.1467724 4.573751 92.59282
dim 15 0.1222062 3.808214 96.40103
dim 16 0.1154914 3.598968 100.00000
Choosing 14 dimension to get >90% of variance
Performing clustering
# Run this in console to choose the number of clusters
# covidVax.hcpc <- HCPC(covidVax.MCA)
# Leave this on chunk for rendering
covidVax.hcpc <- HCPC(covidVax.MCA, nb.clust = 3) This plot identifies 3 clusters, suggested by dendogram
# plot.HCPC(res.HCPC,choice='map',draw.tree=FALSE,title='Factor map', label = "none")
plot(covidVax.hcpc,choice="map",ind.names=FALSE)round(covidVax.hcpc$call$t$inert.gain,3) %>% head()[1] 0.235 0.145 0.121 0.100 0.063 0.062
round(covidVax.MCA$eig[,1],3) %>% head()dim 1 dim 2 dim 3 dim 4 dim 5 dim 6
0.420 0.263 0.233 0.216 0.208 0.208
Defining the clusters
covidVax.hcpc$desc.var$test p.value df
leadOpCVax 0.000000e+00 6
mostLeadEndorseCV 0.000000e+00 4
vaxed 0.000000e+00 2
influencerCVax 0.000000e+00 8
factorCVax 0.000000e+00 14
country 0.000000e+00 18
role 1.901006e-211 4
religion 3.317285e-188 18
contact_type 2.362398e-138 2
sex 1.157525e-128 4
age 6.615220e-19 6
geoLoc 1.247335e-03 2
Many variables appear relevant to define the cluster, mostly the importance of leader’s opinion on Covid vaccination, the view on leader’s endorsement of covid vaccination, the vaccination status, the choice of an influencer on covid vaccination, the influencing factor, and the country.
Categories show
Cluster 1:
- Those who believe the most important factor for decisions on COVID vaccination is seeing their religious leader taking the vaccine
- Those who feel most influenced by health workers
- Those who are vaccinated
- Those who believe their religious leader endorse COVID vaccination
- Those who believe their leader's opinion matters to them
Cluster 2:
- Those who believe the most important factor for decisions on COVID vaccination is the media
- Those who feel most influenced by peers and family
Cluster 3:
- Those who feel most influenced by other people, and by other factors
- Those who are not vaccinated
- Those unsure of their leader's opinion on COVID vaccination and those who are not sure if that opinion matters to them
covidVax.hcpc$desc.var$category %>% lapply(head)$`1`
Cla/Mod Mod/Cla Global p.value
factorCVax=fLeaderTakingVax 89.21833 33.96480 24.30213 0.000000e+00
influencerCVax=infCvax_hWorker 84.02150 50.59594 38.44100 0.000000e+00
vaxed=vaxed 72.86085 86.50249 75.78857 0.000000e+00
mostLeadEndorseCV=mostLeadEndorseCVax 80.70314 80.81143 63.92220 0.000000e+00
leadOpCVax=leadOpCVax_matters 85.30351 56.90268 42.58289 0.000000e+00
factorCVax=fClearInfo 78.72920 43.32623 35.13050 5.247104e-237
v.test
factorCVax=fLeaderTakingVax Inf
influencerCVax=infCvax_hWorker Inf
vaxed=vaxed Inf
mostLeadEndorseCV=mostLeadEndorseCVax Inf
leadOpCVax=leadOpCVax_matters Inf
factorCVax=fClearInfo 32.87356
$`2`
Cla/Mod Mod/Cla Global
factorCVax=fMedia 100.00000 48.81398 5.910511
influencerCVax=infCvax_peerFriend 93.99460 57.96921 7.467500
leadOpCVax=leadOpCVax_ns 16.84947 30.04578 21.591253
mostLeadEndorseCV=notSureLeadEndorseCVax 16.38966 29.54640 21.828076
role=notPart 14.70221 44.06991 36.294467
contact_type=RDD / 3-2-1 13.28892 78.85976 71.853270
p.value v.test
factorCVax=fMedia 0.000000e+00 Inf
influencerCVax=infCvax_peerFriend 0.000000e+00 Inf
leadOpCVax=leadOpCVax_ns 3.080650e-25 10.379288
mostLeadEndorseCV=notSureLeadEndorseCVax 2.863931e-21 9.467583
role=notPart 6.186671e-17 8.361613
contact_type=RDD / 3-2-1 6.929390e-17 8.348229
$`3`
Cla/Mod Mod/Cla Global
factorCVax=fOther 93.65918 29.70256 7.628741
influencerCVax=infCvax_other 68.04071 56.01173 19.802479
vaxed=unvaxed 48.99063 49.30876 24.211428
mostLeadEndorseCV=notSureLeadEndorseCVax 66.25115 60.11730 21.828076
leadOpCVax=leadOpCVax_ns 61.02684 54.77587 21.591253
role=notPart 34.79106 52.49267 36.294467
p.value v.test
factorCVax=fOther 0.000000e+00 Inf
influencerCVax=infCvax_other 0.000000e+00 Inf
vaxed=unvaxed 0.000000e+00 Inf
mostLeadEndorseCV=notSureLeadEndorseCVax 0.000000e+00 Inf
leadOpCVax=leadOpCVax_ns 0.000000e+00 Inf
role=notPart 5.920643e-153 26.34459