Who is the identifiable victim? Caste and charitable giving in modern India.
Author
Dean Spears, Ashwini Deshpande (dspears@utexas.edu, ashwini.deshpande@ashoka.edu.in)
Published
December 17, 2023
Introduction
In their study, Deshpande & Spears (2016) attempt to test how caste and religious identity markers may shape how sympathetic individuals feel towards the out-group. They seek to embed this question within a previously established finding & paradigm pertaining to the identifiable victim effect: whereby researchers have found that people tend to experience more sympathy when victims are defined as specific individuals as against a group identified via broad group-based categorical labels such as “Hindus, Dalits, Whites or Blacks”(Henceforth referred to as a statistical group). Deshpande & Spears (2016) find that indeed participants are more generous to victims who are identified via generic Indian, upper caste name, and Muslim background as against their statistical counterparts. However, they find no evidence for the impact of the identifiability of victims on donations made to Dalits. Whereby Dalit recipients receive equally few donations when they are identified as individual victims or statistical groups. Researchers also find no significant discrimination between donations made to Dalits, Muslims, and upper caste groups when identified as ‘social or statistical groups’. Participants in the study were from largely upper-caste, internet-savvy, literate, high SES backgrounds.
##Motivation to replicate:
The results obtained by Deshpande & Spears (2016) differ from one of my experiments where I tested the impact of exposure to nationalistic ideas on donations made to upper-caste, Dalit and Hindu recipients. In the control group of my study (n =90) I found no significant discrimination between donations to Dalits (n =30) and Upper-caste recipients (n =30) (identified via their caste names). However, I find marginally significant discrimination towards Muslims (n = 30, p = 0.108). Upon exposure to nationalistic norms, I found that donations made to Muslims were reduced significantly as compared to donations made to upper-caste hindus & Dalits. However, donations made to Dalits remain unaffected in response to exposure to nationalistic norms, any changes are statistically insignificant in comparison to the upper caste. I have recently conducted another study to test whether these effects replicate. Replicating Deshpande & Spears (2016) gives me an additional opportunity to re-test the relative authenticity of the results obtained in the control group of my experiment. And allows me to further build on these results to aid a third replication of my nationalism experiment.
##The present study:
Since finding discrimination against Dalits, but non against Muslims (in the identifiability condition) is the most striking finding reported by Deshpande & Spears, my study will focus on replicating this finding.
##Challenges:
Bots & Turk farms: Mturk samples in India are known to be fraught with bots, and Turk farms, which have been increasingly reported to skew the quality of data collection. Based on anecdotes from researchers, I have learnt that such problems have increased over the years, and therefore I am more likely to experience such concerns than the authors of the study did in 2015. I am therefore sceptical of using a Mturk sample to replicate the study. There are other agencies that offer online panel services in India, alongside data quality assurances. Given that my replication isn’t geared towards mapping data quality concerns on Mturk, I believe I should ideally access other services that will provide a better, more robust replication test of the experiment design.
Regional skew: Anecdotal evidence, based on the experience of other researchers, points towards an increasing regional skew within the Mturk working population. Whereby participants are known to be located in Southern parts of India. Deshpande & Spears (2016) however utilize north Indian caste names to indicate caste. A South Indian participant will not be able to accurately categorize and infer the appropriate caste via the names signalled in the prime. Given that Mturk doesn’t allow us to sample the population via region If I use Mturk I am likely to recruit a larger South India population, this will in turn affect the replicability of the experiment.
Power: Deshpande & Spears (2016) recruited a sample of 400 participants, incidently their sample was a homogeneous group of upper-caste hindus. This led the researchers to obtain 50 upper caste Hindu participants per treatment group. Such homogeneous recruitment of upper case hindus is however not always possible. It is likely that in our recruitment we will not be able to pre-emptively control for demographics. If we ask participants to report their caste at the start of the survey and exclude them on the basis of caste, this will amount to discrimination, and also inadvertently prime caste/ religious identity. Therefore accounting for potential heterogeneity in the recruited sample, and the statistical procedures needed to control for the same, we would need to increase our sample to 800 to have an 80 % chance of seeing the effect size at 0.05 significance level. I assume that we will have at least 100-150 people from OBC, SC, and Muslim backgrounds in our sample (each). However, if we work with an agency we can work with a pre-defined panel of upper caste groups as our sample.
https://github.com/monishad127/dhingram-2023-.git
Methods
Power Analysis
The original effect size isnt reported, however based on the power calculation and keeping mind limitations noted above I note the following estimate re-power: I would need a sample of 400 to detect a small effect (0.2) at 80 % power, and 0.05 level of significance.
Planned Sample
I will be recruiting the sample from Mturk
Materials
Participants will be randomly assigned to read about an identified recipient from a Dalit, Muslim or upper caste Hindu background.The identification of the recipient’s category will be made only implicitly by their name, using well-known male names commonly associated with Dalit, Muslim, Upper Caste Hindu categories. Deshpande & Spears (2015) utilized 20 names, five for each group; each participant was assigned to read about an identifiable recipient and read only one of these five names, randomly presented. For the control treatment, researchers used names that are commonly found across caste levels and are not identified with a particular group. However, since the authors have established (through study 3) that each of these names signals a caste & religious category, I will utilize names with the highest signalling value (factor loading) as indicated in study 3 of the paper.
The following materials will be utilized to indicate ‘individual’ characteristics of the recipients.
Those assigned to an identifiable victim treatment read: The family of NAME is very poor. For much of each year, they cannot find work. His family frequently cannot afford enough basic food to eat. As a result, his children go without medicine if they get sick, and often go to bed hungry.
The following attention checks and instructional manipulations will be administered:
“How often have you suffered a fatal heart attack?” Only those who selected “never” were included in the analyzed sample.
“On many important issues, people have different opinions. Some people agree, and some people disagree, even very strongly. Here in this question, please select the number four in the slider below, to rule out random clicking.”
The following manipulation check will be used:
Please identify which of the following social categories does the “recepient name” belong to?
To understand how participants self-categorize themselves into a social group, participants will be requested to answer the following questions:
How much do you believe your family is like a typical family of each of the following types?” The 10 groups, as they will be written on the survey form are as follows: Brahmin, Forward/Upper Castes, OBC ½Other Backward Classes, Dalits, Dalit/SC, Adivasi/ST, Scheduled Tribe; marginalized tribal communities, Muslim, Poor, Middle class, Rural, and Urban.
Procedure
Experiment Design
Deshpande & Spears (2016) recruited an Indian Sample via Mturk to conduct an online survey experiment to test the interaction of caste and religious identity with the identifiable victim effect. First, participants are shown the experimental prompt: a few sentences of text describing an opportunity for charitable giving. The identity of the recipient receiving the charity is randomised. Participants either see recipients from Muslim, Dalit, upper-caste Hindu or general Indian backgrounds. The experiment is thus set up as 4(social identity) x 1 (identifiable victim) between group factorial design. Immediately after exposure to the stimuli, participants are asked to rate their willingness to donate. To measure this key dependent variable, participants are shown a scale ranging from Rs. 0 to 100 and are asked how much money they would be willing to donate. Following this, participants are asked a set of attention check & manipulation check questions. Finally, participants rated the similarity of their family to typical members of 10 different social groups cutting across class, caste, religious and neighbourhood characteristics. Followed by a demographic questionnaire.
Link to the questionnaire: https://lse.eu.qualtrics.com/jfe/form/SV_3lRdcGRosB4KV5c
Analysis Plan
Following Deshpande & Spears (2016), I will replicate the main effect of social identity of the recipient on donations. Utilizing linear regressionI will test the impact of social identity on donations.
Clarify key analysis of interest here
Expecting socio-economic diversity in my sample,I will add controls for income in the main regression analysis.
Differences from Original Study
I am only replicating the impact of social identity on donations. And not the interaction between identifiability and social identity.
2.I am utilizing caste & religous names with the highest factor loading, instead of randomly assigning 5 different names to indicate caste/ religion.
I collected data via Mturk platform. Despite the procedural issues, the data collection was smooth & quick. Unfortunately, the effeciacy of data collection doesn’t match up to the quality. I note below the actual sample I am able to utlize in my study.
Actual Sample
I collected a sample of 324 participants.81 Participants were from general category/ Upper caste backrgound; 3 were from EWS backgriund, 48 were OBCcreamy layer, 23 were OBC non creamy layer, 22 were from Scheduled Caste background, and 7 were from scheduled tribe backrgound. I 29excluded participants from SC & ST backrgound. I excluded 32 participants from Muslim & Christian Background (this overlapped with caste), and I removed 120 participants who failed the attention check.After removing participants who did not meet the Manipulation check, I was left with a final sample of 88 participants.
Differences from pre-data collection methods plan
Any differences from what was described as the original plan, or “none”.
Results
Data preparation
Data preparation following the analysis plan.
##reading data into R
##descriptive statistics: summary(D)
Consent Region MI MCMM MCIM
Min. :1 Min. :1.000 Min. : 10.00 Min. :1 Min. :1
1st Qu.:1 1st Qu.:2.000 1st Qu.: 50.00 1st Qu.:1 1st Qu.:1
Median :1 Median :2.000 Median : 76.50 Median :1 Median :1
Mean :1 Mean :1.801 Mean : 69.28 Mean :1 Mean :1
3rd Qu.:1 3rd Qu.:2.000 3rd Qu.: 89.00 3rd Qu.:1 3rd Qu.:1
Max. :1 Max. :3.000 Max. :100.00 Max. :1 Max. :1
NA's :1 NA's :1 NA's :152 NA's :160 NA's :178
MCDM MCHM GI MCMG MCIG
Min. :1 Min. :1 Min. : 5.00 Min. :1 Min. :1
1st Qu.:1 1st Qu.:1 1st Qu.: 51.00 1st Qu.:1 1st Qu.:1
Median :1 Median :1 Median : 62.00 Median :1 Median :1
Mean :1 Mean :1 Mean : 65.35 Mean :1 Mean :1
3rd Qu.:1 3rd Qu.:1 3rd Qu.: 83.00 3rd Qu.:1 3rd Qu.:1
Max. :1 Max. :1 Max. :100.00 Max. :1 Max. :1
NA's :191 NA's :189 NA's :149 NA's :189 NA's :168
MCDG MCHG MCBG MCFCG DI
Min. :1 Min. :1 Min. : NA Min. : NA Min. : 20.00
1st Qu.:1 1st Qu.:1 1st Qu.: NA 1st Qu.: NA 1st Qu.: 50.00
Median :1 Median :1 Median : NA Median : NA Median : 62.00
Mean :1 Mean :1 Mean :NaN Mean :NaN Mean : 66.11
3rd Qu.:1 3rd Qu.:1 3rd Qu.: NA 3rd Qu.: NA 3rd Qu.: 85.00
Max. :1 Max. :1 Max. : NA Max. : NA Max. :100.00
NA's :189 NA's :166 NA's :192 NA's :192 NA's :154
MCMD MCID MCDD MCHD MCBD
Min. :1 Min. :1 Min. :1 Min. :1 Mode:logical
1st Qu.:1 1st Qu.:1 1st Qu.:1 1st Qu.:1 NA's:192
Median :1 Median :1 Median :1 Median :1
Mean :1 Mean :1 Mean :1 Mean :1
3rd Qu.:1 3rd Qu.:1 3rd Qu.:1 3rd Qu.:1
Max. :1 Max. :1 Max. :1 Max. :1
NA's :191 NA's :173 NA's :188 NA's :165
MCFCD BI MC_B AC_1 AC_2
Min. : NA Min. : 0.00 Min. :1.000 Min. :4 Min. :4
1st Qu.: NA 1st Qu.: 51.00 1st Qu.:2.000 1st Qu.:4 1st Qu.:4
Median : NA Median : 71.00 Median :4.000 Median :4 Median :4
Mean :NaN Mean : 65.24 Mean :3.273 Mean :4 Mean :4
3rd Qu.: NA 3rd Qu.: 83.00 3rd Qu.:4.000 3rd Qu.:4 3rd Qu.:4
Max. : NA Max. :100.00 Max. :4.000 Max. :4 Max. :4
NA's :192 NA's :159 NA's :159
SFC SOBC SD SA
Min. : 0.00 Min. : 0.00 Min. : 0.00 Min. : 0.00
1st Qu.: 45.00 1st Qu.: 53.00 1st Qu.: 21.50 1st Qu.:25.00
Median : 65.00 Median : 67.00 Median : 52.00 Median :53.00
Mean : 58.91 Mean : 63.56 Mean : 49.07 Mean :49.87
3rd Qu.: 79.00 3rd Qu.: 81.50 3rd Qu.: 75.00 3rd Qu.:76.00
Max. :100.00 Max. :100.00 Max. :100.00 Max. :97.00
NA's :1 NA's :1 NA's :1 NA's :1
SSC...30 SM SP SMC
Min. : 0.00 Min. : 0.00 Min. : 0.00 Min. : 0.00
1st Qu.: 26.00 1st Qu.: 34.50 1st Qu.: 41.00 1st Qu.: 50.50
Median : 54.00 Median : 60.00 Median : 56.00 Median : 66.00
Mean : 50.29 Mean : 54.62 Mean : 56.31 Mean : 65.51
3rd Qu.: 75.50 3rd Qu.: 79.50 3rd Qu.: 78.00 3rd Qu.: 82.00
Max. :100.00 Max. :100.00 Max. :100.00 Max. :100.00
NA's :1 NA's :1 NA's :1 NA's :1
SR SBC SB SSC...37
Min. : 0.00 Min. : 0.00 Min. : 0.00 Min. : 0.00
1st Qu.: 45.00 1st Qu.: 51.50 1st Qu.: 26.50 1st Qu.: 39.50
Median : 59.00 Median : 71.00 Median : 53.00 Median : 56.00
Mean : 57.61 Mean : 66.56 Mean : 51.19 Mean : 55.84
3rd Qu.: 79.00 3rd Qu.: 83.00 3rd Qu.: 77.00 3rd Qu.: 78.00
Max. :100.00 Max. :100.00 Max. :100.00 Max. :100.00
NA's :1 NA's :1 NA's :1 NA's :1
GEP Income_1 Income_2 ED_M
Min. : 1.000 Min. :1.000 Min. : 1.000 Min. :1.000
1st Qu.: 1.000 1st Qu.:3.000 1st Qu.: 3.000 1st Qu.:2.000
Median : 1.000 Median :4.000 Median : 5.000 Median :4.000
Mean : 3.293 Mean :4.607 Mean : 5.251 Mean :4.042
3rd Qu.: 5.000 3rd Qu.:6.000 3rd Qu.: 8.000 3rd Qu.:5.000
Max. :20.000 Max. :9.000 Max. :10.000 Max. :7.000
NA's :1 NA's :1 NA's :1 NA's :1
ED_F ED_GF ED_GM ED_PP
Min. :1.000 Min. :1.000 Min. :1.000 Min. :1.000
1st Qu.:3.000 1st Qu.:2.000 1st Qu.:1.000 1st Qu.:5.000
Median :4.000 Median :3.000 Median :3.000 Median :5.000
Mean :4.251 Mean :3.209 Mean :2.806 Mean :5.152
3rd Qu.:5.000 3rd Qu.:4.000 3rd Qu.:4.000 3rd Qu.:6.000
Max. :7.000 Max. :7.000 Max. :7.000 Max. :8.000
NA's :1 NA's :1 NA's :1 NA's :1
Gender Caste_category Religion Objects_Bicycle
Min. :1.000 Min. :1.000 Min. :1.000 Min. :1.000
1st Qu.:1.000 1st Qu.:1.000 1st Qu.:1.000 1st Qu.:2.000
Median :1.000 Median :4.000 Median :1.000 Median :2.000
Mean :1.241 Mean :3.283 Mean :1.257 Mean :2.309
3rd Qu.:1.000 3rd Qu.:5.000 3rd Qu.:1.000 3rd Qu.:3.000
Max. :2.000 Max. :7.000 Max. :3.000 Max. :5.000
NA's :1 NA's :1 NA's :1 NA's :1
Objects_Motorbike Objects_Car Objects_Refrigerator Objects_washing machine
Min. :1.000 Min. :1.000 Min. :1.000 Min. :1.000
1st Qu.:2.000 1st Qu.:1.000 1st Qu.:2.000 1st Qu.:2.000
Median :2.000 Median :2.000 Median :2.000 Median :2.000
Mean :2.545 Mean :1.817 Mean :2.068 Mean :2.047
3rd Qu.:3.000 3rd Qu.:2.000 3rd Qu.:2.000 3rd Qu.:2.000
Max. :5.000 Max. :5.000 Max. :5.000 Max. :5.000
NA's :1 NA's :1 NA's :1 NA's :1
Objects_Television Objects_Air Con Objects_Air Cooler Objects_Computer
Min. :1.000 Min. :1.000 Min. :1.000 Min. :1.000
1st Qu.:2.000 1st Qu.:1.500 1st Qu.:1.000 1st Qu.:2.000
Median :2.000 Median :2.000 Median :2.000 Median :2.000
Mean :2.183 Mean :1.932 Mean :1.801 Mean :2.429
3rd Qu.:2.000 3rd Qu.:2.000 3rd Qu.:2.000 3rd Qu.:3.000
Max. :5.000 Max. :4.000 Max. :5.000 Max. :5.000
NA's :1 NA's :1 NA's :1 NA's :1
Objects_Electric Room heater Obejcts_Electric Water Heater
Min. :1.000 Min. :1.000
1st Qu.:1.000 1st Qu.:1.000
Median :2.000 Median :2.000
Mean :1.728 Mean :1.827
3rd Qu.:2.000 3rd Qu.:2.000
Max. :5.000 Max. :5.000
NA's :1 NA's :1
Objects-_Water Purifier Objects_IPAD Objects_Music Obejcts_Microwwave
Min. :1.000 Min. :1.000 Min. :1.000 Min. :1.000
1st Qu.:1.000 1st Qu.:1.000 1st Qu.:1.500 1st Qu.:1.000
Median :2.000 Median :2.000 Median :2.000 Median :2.000
Mean :1.838 Mean :1.759 Mean :1.937 Mean :1.696
3rd Qu.:2.000 3rd Qu.:2.000 3rd Qu.:2.000 3rd Qu.:2.000
Max. :4.000 Max. :5.000 Max. :5.000 Max. :5.000
NA's :1 NA's :1 NA's :1 NA's :1
Obejcts_oven Objects_Tvset Objects_Electric Stove Objects_Air Purifier
Min. :1.000 Min. :1.000 Min. :1.000 Min. :1.000
1st Qu.:1.000 1st Qu.:1.000 1st Qu.:1.000 1st Qu.:1.000
Median :2.000 Median :2.000 Median :2.000 Median :2.000
Mean :1.628 Mean :1.921 Mean :1.895 Mean :1.738
3rd Qu.:2.000 3rd Qu.:2.000 3rd Qu.:2.000 3rd Qu.:2.000
Max. :5.000 Max. :5.000 Max. :5.000 Max. :5.000
NA's :1 NA's :1 NA's :1 NA's :1
Objects_Door Bell HS_Bedroom HS_Bathroom HS_Living Room
Min. :1.000 Min. :1.000 Min. :1.000 Min. :1.000
1st Qu.:1.000 1st Qu.:2.000 1st Qu.:2.000 1st Qu.:2.000
Median :2.000 Median :3.000 Median :3.000 Median :2.000
Mean :1.817 Mean :3.147 Mean :3.047 Mean :2.482
3rd Qu.:2.000 3rd Qu.:4.000 3rd Qu.:4.000 3rd Qu.:3.000
Max. :5.000 Max. :7.000 Max. :7.000 Max. :7.000
NA's :1 NA's :1 NA's :1 NA's :1
HS_Study HS_Balcony HS_Gardens SSS RandomID
Min. :1.000 Min. :1.000 Min. :1.000 Min. :1.000 Min. :1077
1st Qu.:2.000 1st Qu.:2.000 1st Qu.:1.500 1st Qu.:2.000 1st Qu.:3515
Median :2.000 Median :2.000 Median :2.000 Median :4.000 Median :5552
Mean :2.356 Mean :2.262 Mean :2.168 Mean :3.806 Mean :5620
3rd Qu.:3.000 3rd Qu.:3.000 3rd Qu.:2.000 3rd Qu.:5.000 3rd Qu.:8133
Max. :8.000 Max. :7.000 Max. :7.000 Max. :8.000 Max. :9966
NA's :1 NA's :1 NA's :1 NA's :1
##subsetting the data##Checking the caste categroy of the participants.table(D$Caste_category)
1 3 4 5 6 7
81 3 48 23 22 14
caste_freq <-table(D$Caste_category)caste_freq
1 3 4 5 6 7
81 3 48 23 22 14
# Custom names for each bar/categorybar_names <-c("General", "EWS", " OBC-C","OBC", "SC", "ST") bp <-barplot(caste_freq,names.arg = bar_names, # Using custom namesxlab ="Caste category", ylab ="Frequency",main ="Caste distribution in the sample",col ="yellow")
##most participants belong to General caste, followed by OBC-C##The key question was to understand whether upper caste hindus are discriminatory towards Dalits & Muslims -- however the data collected has participants from different caste groups, Deshpande & Spears only had upper caste participants in their sample. Therefore I will work with two data sets: dataset 1 will have only upper caste & OBC creamy layer participants, datset 2 will have participants from heterogenous caste groups. I will control for caste while analysing dataset 2. ##checking for religion: table(D$Religion)
1 2 3
159 15 17
Religion_freq <-table(D$Religion)# Custom names for each bar/categorybar_names2 <-c("Hindu", "Muslim", "Christian") bp2 <-barplot(Religion_freq,names.arg = bar_names2, # Using custom namesxlab ="Religion", ylab ="Frequency",main ="reliogious distribution in the sample",col ="orange")
##removing participants from Muslim,Christian, Scheduled Caste, Scheduled tribe backgrounds:## subsetting the data via Manipulation checksDM <- D %>%filter(Religion ==1& Caste_category %in%c(1, 3, 4, 5) & (MCMM ==1| MCIM ==1| MCIG ==1| MCDD ==1| MCID ==1| MCHD ==1| MC_B ==4| MC_B ==2| MC_B ==5| MC_B ==6))
# A tibble: 88 × 53
Consent Region MCMM MCIM MCDM MCHM MCMG MCIG MCDG MCHG MCBG MCFCG
<dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl>
1 1 1 NA NA NA NA NA NA NA NA NA NA
2 1 1 NA NA NA NA NA NA NA NA NA NA
3 1 1 NA NA NA NA NA NA NA NA NA NA
4 1 1 1 NA NA NA NA NA NA NA NA NA
5 1 1 NA NA NA NA NA 1 NA 1 NA NA
6 1 1 1 NA NA NA NA NA NA NA NA NA
7 1 1 1 NA NA NA NA NA NA NA NA NA
8 1 1 NA NA NA NA NA NA NA NA NA NA
9 1 1 NA NA NA NA NA NA NA NA NA NA
10 1 2 1 NA NA NA NA NA NA NA NA NA
# ℹ 78 more rows
# ℹ 41 more variables: MCMD <dbl>, MCID <dbl>, MCDD <dbl>, MCHD <dbl>,
# MCBD <lgl>, MCFCD <dbl>, MC_B <dbl>, AC_1 <dbl>, AC_2 <dbl>, SFC <dbl>,
# SOBC <dbl>, SD <dbl>, SA <dbl>, SSC...30 <dbl>, SM <dbl>, SP <dbl>,
# SMC <dbl>, SR <dbl>, SBC <dbl>, SB <dbl>, SSC...37 <dbl>, GEP <dbl>,
# Income_1 <dbl>, Income_2 <dbl>, ED_M <dbl>, ED_F <dbl>, ED_GF <dbl>,
# ED_GM <dbl>, ED_PP <dbl>, Gender <dbl>, Caste_category <dbl>, …
##plotting donations across treatment conditionslibrary(ggplot2)FDlongx$treatment <-factor(FDlongx$treatment,levels =c("BI", "GI", "MI", "DI"),labels =c("Brahmin", "Generic-Indian", "Muslim", "Dalit")) ggplot(data = FDlongx, aes(x = treatment, y = donations)) +geom_bar(stat ="identity", fill ="blue") +scale_y_continuous(limits =c(0, 100)) +labs(title ="Donations Across Recipients",x ="Recipients", y ="Donations")
##main result:##Linear regression for the main effect:model1 <-lm(donations ~ MI + GI + DI + BI + Caste_category,data = dummy_dat9)summary(model1)
Call:
lm(formula = donations ~ MI + GI + DI + BI + Caste_category,
data = dummy_dat9)
Residuals:
Min 1Q Median 3Q Max
-56.063 -13.930 -0.065 20.298 36.070
Coefficients: (1 not defined because of singularities)
Estimate Std. Error t value Pr(>|t|)
(Intercept) 62.8387 6.4028 9.814 1.54e-15 ***
MI -0.4503 7.4153 -0.061 0.952
GI -2.8668 8.3798 -0.342 0.733
DI 0.7719 7.5620 0.102 0.919
BI NA NA NA NA
Caste_category 1.0914 1.5376 0.710 0.480
---
Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
Residual standard error: 23.91 on 83 degrees of freedom
Multiple R-squared: 0.008397, Adjusted R-squared: -0.03939
F-statistic: 0.1757 on 4 and 83 DF, p-value: 0.9503
stargazer(model1, type ="text",covariate.labels =c("Muslim for MI", "Brahmin for BI", "Generic Indian for GI", "Dalit for DI"),title ="Results of Model1")
Results of Model1
=================================================
Dependent variable:
---------------------------
donations
-------------------------------------------------
Muslim for MI -0.450
(7.415)
Brahmin for BI -2.867
(8.380)
Generic Indian for GI 0.772
(7.562)
Dalit for DI
Caste_category 1.091
(1.538)
Constant 62.839***
(6.403)
-------------------------------------------------
Observations 88
R2 0.008
Adjusted R2 -0.039
Residual Std. Error 23.912 (df = 83)
F Statistic 0.176 (df = 4; 83)
=================================================
Note: *p<0.1; **p<0.05; ***p<0.01
##Results: The sample is too small for me to seek any evidence of replication from the linear model presented above.
Discussion & summary:
Unfortunately, the results of the present study cannot speak to the results obtained by Deshpande & Spears (2016). The limitations and the quality of the data collected prevents me from taking any stance on whether or not the results of the original study replicate. I am now going to sequentially note down each of these limitations. Though these limitations emerge in referece to Deshpande & Spears (2016) sample, I hope that these limitations will help further attempts to collect experimental data via Mturk from Indian participants.
First, the sample of participants who took the survey from a heterogeneous Caste & Religius backgrounds. I have noted this heterogeniety above. Since this was an experimental study, asking participants to report their caste & religion at the start as a filter would have introduced a prime that can confound main results.Given that only about 130 participants belong to UC background, I relaxed the caste criteria of inclusion by adding participants from the OBC background to my data. I later controlled for caste.
Participants were from different regions across India. This stands consonent with Deshpande & Spears, who didn’t have a location based filter based on region. However Deshpande & Spears used north Indian caste names, and these names can likely be unfamiliar to sample from South India. Keeping this limitation in mind,in order to increase the chances of replication I had tried to restrict the sample to North India, however this affected the timeline of data collection, as there were fewer north Indian participants. Consequently, I relaxed this criteria. I consequently was strict with removing participants who didn’t meet the manipulation check. This reduced the sample size, and the affected my ability to come up with any conclusive inference vis-a-vis replication of these results.
Lastly, a large number of participants failed the attention check questions. I had field a very short survey, and the Attention check questions were very straight forward. However, despite this I was forced to remove more than 100 participants, nearly 40 % of the entire sample collected. This is indicative of the general engagement of the Mturk workers, and the consecuent data quality of Mturk studies fielded in India.
Overall, I’d like to thank Mike, Sarah, Veronica and Ben for allowing me to replicate this study, funding the data collection, and supporting me through the mores of replication, Rstudio, and in engaging with Mturk. I think the study was revealing, re the potential feasibility of using Mturk to collect experimental, & geographically sensitive political data.