Who is the identifiable victim? Caste and charitable giving in modern India.
Author
Dean Spears, Ashwini Deshpande (dspears@utexas.edu, ashwini.deshpande@ashoka.edu.in)
Published
November 20, 2023
Introduction
In their study, Deshpande & Spears (2016) attempt to test how caste and religious identity markers may shape how sympathetic individuals feel towards the out-group. They seek to embed this question within a previously established finding & paradigm pertaining to the identifiable victim effect: whereby researchers have found that people tend to experience more sympathy when victims are defined as specific individuals as against a group identified via broad group-based categorical labels such as “Hindus, Dalits, Whites or Blacks”(Henceforth referred to as a statistical group). Deshpande & Spears (2016) find that indeed participants are more generous to victims who are identified via generic Indian, upper caste name, and Muslim background as against their statistical counterparts. However, they find no evidence for the impact of the identifiability of victims on donations made to Dalits. Whereby Dalit recipients receive equally few donations when they are identified as individual victims or statistical groups. Researchers also find no significant discrimination between donations made to Dalits, Muslims, and upper caste groups when identified as ‘social or statistical groups’. Participants in the study were from largely upper-caste, internet-savvy, literate, high SES backgrounds.
##Motivation to replicate:
The results obtained by Deshpande & Spears (2016) differ from one of my experiments where I tested the impact of exposure to nationalistic ideas on donations made to upper-caste, Dalit and Hindu recipients. In the control group of my study (n =90) I found no significant discrimination between donations to Dalits (n =30) and Upper-caste recipients (n =30) (identified via their caste names). However, I find marginally significant discrimination towards Muslims (n = 30, p = 0.108). Upon exposure to nationalistic norms, I found that donations made to Muslims were reduced significantly as compared to donations made to upper-caste hindus & Dalits. However, donations made to Dalits remain unaffected in response to exposure to nationalistic norms, any changes are statistically insignificant in comparison to the upper caste. I have recently conducted another study to test whether these effects replicate. Replicating Deshpande & Spears (2016) gives me an additional opportunity to re-test the relative authenticity of the results obtained in the control group of my experiment. And allows me to further build on these results to aid a third replication of my nationalism experiment.
##The present study:
Since finding discrimination against Dalits, but non against Muslims (in the identifiability condition) is the most striking finding reported by Deshpande & Spears, my study will focus on replicating this finding.
##Challenges:
Bots & Turk farms: Mturk samples in India are known to be fraught with bots, and Turk farms, which have been increasingly reported to skew the quality of data collection. Based on anecdotes from researchers, I have learnt that such problems have increased over the years, and therefore I am more likely to experience such concerns than the authors of the study did in 2015. I am therefore sceptical of using a Mturk sample to replicate the study. There are other agencies that offer online panel services in India, alongside data quality assurances. Given that my replication isn’t geared towards mapping data quality concerns on Mturk, I believe I should ideally access other services that will provide a better, more robust replication test of the experiment design.
Regional skew: Anecdotal evidence, based on the experience of other researchers, points towards an increasing regional skew within the Mturk working population. Whereby participants are known to be located in Southern parts of India. Deshpande & Spears (2016) however utilize north Indian caste names to indicate caste. A South Indian participant will not be able to accurately categorize and infer the appropriate caste via the names signalled in the prime. Given that Mturk doesn’t allow us to sample the population via region If I use Mturk I am likely to recruit a larger South India population, this will in turn affect the replicability of the experiment.
Power: Deshpande & Spears (2016) recruited a sample of 400 participants, incidently their sample was a homogeneous group of upper-caste hindus. This led the researchers to obtain 50 upper caste Hindu participants per treatment group. Such homogeneous recruitment of upper case hindus is however not always possible. It is likely that in our recruitment we will not be able to pre-emptively control for demographics. If we ask participants to report their caste at the start of the survey and exclude them on the basis of caste, this will amount to discrimination, and also inadvertently prime caste/ religious identity. Therefore accounting for potential heterogeneity in the recruited sample, and the statistical procedures needed to control for the same, we would need to increase our sample to 800 to have an 80 % chance of seeing the effect size at 0.05 significance level. I assume that we will have at least 100-150 people from OBC, SC, and Muslim backgrounds in our sample (each). However, if we work with an agency we can work with a pre-defined panel of upper caste groups as our sample.
https://github.com/monishad127/dhingram-2023-.git
Methods
Power Analysis
The original effect size isnt reported, however based on the power calculation and keeping mind limitations noted above I note the following estimate re-power: I would need a sample of 400 to detect a small effect (0.2) at 80 % power, and 0.05 level of significance.
Planned Sample
I will be recruiting the sample from Mturk
Materials
Participants will be randomly assigned to read about an identified recipient from a Dalit, Muslim or upper caste Hindu background.The identification of the recipient’s category will be made only implicitly by their name, using well-known male names commonly associated with Dalit, Muslim, Upper Caste Hindu categories. Deshpande & Spears (2015) utilized 20 names, five for each group; each participant was assigned to read about an identifiable recipient and read only one of these five names, randomly presented. For the control treatment, researchers used names that are commonly found across caste levels and are not identified with a particular group. However, since the authors have established (through study 3) that each of these names signals a caste & religious category, I will utilize names with the highest signalling value (factor loading) as indicated in study 3 of the paper.
The following materials will be utilized to indicate ‘individual’ characteristics of the recipients.
Those assigned to an identifiable victim treatment read: The family of NAME is very poor. For much of each year, they cannot find work. His family frequently cannot afford enough basic food to eat. As a result, his children go without medicine if they get sick, and often go to bed hungry.
The following attention checks and instructional manipulations will be administered:
“How often have you suffered a fatal heart attack?” Only those who selected “never” were included in the analyzed sample.
“On many important issues, people have different opinions. Some people agree, and some people disagree, even very strongly. Here in this question, please select the number four in the slider below, to rule out random clicking.”
The following manipulation check will be used:
Please identify which of the following social categories does the “recepient name” belong to?
To understand how participants self-categorize themselves into a social group, participants will be requested to answer the following questions:
How much do you believe your family is like a typical family of each of the following types?” The 10 groups, as they will be written on the survey form are as follows: Brahmin, Forward/Upper Castes, OBC ½Other Backward Classes, Dalits, Dalit/SC, Adivasi/ST, Scheduled Tribe; marginalized tribal communities, Muslim, Poor, Middle class, Rural, and Urban.
Procedure
Experiment Design
Deshpande & Spears (2016) recruited an Indian Sample via Mturk to conduct an online survey experiment to test the interaction of caste and religious identity with the identifiable victim effect. First, participants are shown the experimental prompt: a few sentences of text describing an opportunity for charitable giving. The identity of the recipient receiving the charity is randomised. Participants either see recipients from Muslim, Dalit, upper-caste Hindu or general Indian backgrounds. The experiment is thus set up as 4(social identity) x 1 (identifiable victim) between group factorial design. Immediately after exposure to the stimuli, participants are asked to rate their willingness to donate. To measure this key dependent variable, participants are shown a scale ranging from Rs. 0 to 100 and are asked how much money they would be willing to donate. Following this, participants are asked a set of attention check & manipulation check questions. Finally, participants rated the similarity of their family to typical members of 10 different social groups cutting across class, caste, religious and neighbourhood characteristics. Followed by a demographic questionnaire.
Link to the questionnaire: https://lse.eu.qualtrics.com/jfe/form/SV_3lRdcGRosB4KV5c
Analysis Plan
Following Deshpande & Spears (2016), I will perform the following steps to understand and analyse the data I obtain via replication:
1)Representativeness of the sample:
I will calculate summary statistics of the participants, and track the extent to which it may be representative of a population from Uttar Pradesh & India.
Effect of self-categorization & identification with a social group on donations towards recipients from different social categories:
I will use local polynomial, kernel-weighted regressions to test whether identification with caste, religion, tribe, and SES impacts willingness to donate to recipients from different social groups. Following the researchers, I will control for sex and age in performing my analysis.
Main effect of identifiability & social identity of the recipient on donations:
Utilizing OLS techniques, I will perform the following steps:
First, I will pool the data (across treatments) to test the effect of identifiability on donations made to a recipient. Second, I will test the impact of social identity (irrespective of the identifiability) on donations. Third, I will test how identifiability interacts with social identity to impact willingness to donate, controlling for gender, education, size of city/town, and age.
Clarify key analysis of interest here
Expecting socio-economic diversity in my sample,I will add controls for income in the main regression analysis.
Differences from Original Study
I am collapsing these two treatment conditions into one, I will use the word Scheduled Caste instead of Dalit: The variation in the treatments attempt to check whether the name used to refer to ex-untouchable groups makes a difference in how they are identified and categorized.
2.I am utilizing caste & religous names with the highest factor loading, instead of randomly assigning 5 different names to indicate caste/ religion.
I am recruiting the sample via an agency called: ‘House of research’, the study will be limited to Uttar-Pradesh & Delhi.
── Conflicts ────────────────────────────────────────── tidyverse_conflicts() ──
✖ dplyr::filter() masks stats::filter()
✖ dplyr::lag() masks stats::lag()
ℹ Use the conflicted package (<http://conflicted.r-lib.org/>) to force all conflicts to become errors
MS MSI GS MI MII GI
Min. :15 Min. :25 Min. :35 Min. :45 Min. :35 Min. :45
1st Qu.:20 1st Qu.:30 1st Qu.:40 1st Qu.:50 1st Qu.:40 1st Qu.:50
Median :20 Median :30 Median :40 Median :50 Median :40 Median :50
Mean :20 Mean :30 Mean :40 Mean :50 Mean :40 Mean :50
3rd Qu.:20 3rd Qu.:30 3rd Qu.:40 3rd Qu.:50 3rd Qu.:40 3rd Qu.:50
Max. :25 Max. :35 Max. :45 Max. :55 Max. :45 Max. :55
NA's :51 NA's :51 NA's :51 NA's :51 NA's :51 NA's :51
DI HI HS DS AC1 AC2
Min. :15 Min. :60 Min. :55.0 Min. :30 Min. :4 Min. :4
1st Qu.:20 1st Qu.:60 1st Qu.:55.0 1st Qu.:30 1st Qu.:4 1st Qu.:4
Median :20 Median :65 Median :60.0 Median :35 Median :4 Median :4
Mean :20 Mean :68 Mean :62.8 Mean :40 Mean :4 Mean :4
3rd Qu.:20 3rd Qu.:75 3rd Qu.:70.0 3rd Qu.:45 3rd Qu.:4 3rd Qu.:4
Max. :25 Max. :80 Max. :74.0 Max. :60 Max. :4 Max. :4
NA's :51 NA's :51 NA's :51 NA's :51 NA's :6 NA's :6
IDUC IDOBC IDDalits IDAdivasi IDST
Min. :1.00 Min. :0.00 Min. :0.00 Min. :1.00 Min. :1.000
1st Qu.:6.00 1st Qu.:3.00 1st Qu.:0.00 1st Qu.:1.00 1st Qu.:1.000
Median :9.00 Median :3.00 Median :2.50 Median :1.00 Median :3.000
Mean :7.46 Mean :3.68 Mean :2.84 Mean :1.82 Mean :2.878
3rd Qu.:9.00 3rd Qu.:4.00 3rd Qu.:3.00 3rd Qu.:1.00 3rd Qu.:4.000
Max. :9.00 Max. :9.00 Max. :9.00 Max. :7.00 Max. :8.000
NA's :6 NA's :6 NA's :6 NA's :6 NA's :7
IDMuslim IDPoor IDMiddle Class IDRich IDUrban
Min. :1.000 Min. :1.00 Min. :1.000 Min. :1.000 Min. :1.000
1st Qu.:1.000 1st Qu.:3.00 1st Qu.:2.000 1st Qu.:2.500 1st Qu.:2.000
Median :3.000 Median :4.00 Median :4.000 Median :5.000 Median :4.000
Mean :3.408 Mean :4.32 Mean :4.224 Mean :4.383 Mean :3.787
3rd Qu.:6.000 3rd Qu.:5.75 3rd Qu.:6.000 3rd Qu.:6.000 3rd Qu.:5.500
Max. :9.000 Max. :9.00 Max. :9.000 Max. :9.000 Max. :9.000
NA's :7 NA's :6 NA's :7 NA's :9 NA's :9
DMIdeology DMincome_1 DMincome_2 DMEducation_1
Min. :1.00 Min. :1.000 Min. : 1.000 Min. : 1.000
1st Qu.:2.00 1st Qu.:2.000 1st Qu.: 3.000 1st Qu.: 3.000
Median :3.00 Median :3.000 Median : 6.000 Median : 4.000
Mean :3.37 Mean :2.979 Mean : 5.565 Mean : 4.375
3rd Qu.:5.00 3rd Qu.:4.000 3rd Qu.: 8.000 3rd Qu.: 6.000
Max. :8.00 Max. :7.000 Max. :10.000 Max. :10.000
NA's :10 NA's :8 NA's :10 NA's :8
DMEducation_2 DMEducation_3 DMEducation_4 DMEducation_5 DMGender
Min. : 1.000 Min. :1.000 Min. : 1.0 Min. :1.00 Min. :1.00
1st Qu.: 2.000 1st Qu.:2.000 1st Qu.: 2.0 1st Qu.:1.00 1st Qu.:2.00
Median : 4.000 Median :3.000 Median : 4.0 Median :2.00 Median :3.00
Mean : 3.837 Mean :4.021 Mean : 4.1 Mean :2.38 Mean :2.96
3rd Qu.: 5.000 3rd Qu.:6.000 3rd Qu.: 5.0 3rd Qu.:3.00 3rd Qu.:4.00
Max. :10.000 Max. :9.000 Max. :10.0 Max. :7.00 Max. :7.00
NA's :7 NA's :8 NA's :6 NA's :6 NA's :6
DMcaste DMreligion Objects_1 Objects_2 Objects_3
Min. :1.00 Min. :1.00 Min. :1.0 Min. :1.0 Min. :1.00
1st Qu.:2.00 1st Qu.:2.00 1st Qu.:4.0 1st Qu.:4.0 1st Qu.:3.00
Median :3.00 Median :3.00 Median :4.5 Median :4.0 Median :4.00
Mean :2.86 Mean :2.84 Mean :4.5 Mean :4.6 Mean :3.72
3rd Qu.:3.75 3rd Qu.:3.00 3rd Qu.:6.0 3rd Qu.:6.0 3rd Qu.:5.00
Max. :6.00 Max. :6.00 Max. :7.0 Max. :7.0 Max. :7.00
NA's :6 NA's :6 NA's :6 NA's :6 NA's :6
Objects_4 Objects_5 Objects_6 Objects_7 Objects_8
Min. :1.00 Min. : 1.00 Min. :1.00 Min. :1.0 Min. : 1.00
1st Qu.:3.00 1st Qu.: 2.25 1st Qu.:1.00 1st Qu.:1.0 1st Qu.: 1.25
Median :4.00 Median : 3.00 Median :2.00 Median :2.5 Median : 3.00
Mean :3.88 Mean : 3.54 Mean :2.28 Mean :2.9 Mean : 3.14
3rd Qu.:5.00 3rd Qu.: 4.00 3rd Qu.:3.00 3rd Qu.:4.0 3rd Qu.: 4.00
Max. :7.00 Max. :10.00 Max. :6.00 Max. :7.0 Max. :10.00
NA's :6 NA's :6 NA's :6 NA's :6 NA's :6
Objects_9 Objects_10 Objects_11 Objects_12 Objects_13
Min. :1.000 Min. :1.00 Min. :1.0 Min. : 1.0 Min. : 1.00
1st Qu.:3.000 1st Qu.:3.00 1st Qu.:1.0 1st Qu.: 2.0 1st Qu.: 3.00
Median :5.000 Median :4.00 Median :3.0 Median : 4.0 Median : 4.00
Mean :4.592 Mean :4.74 Mean :2.8 Mean : 4.1 Mean : 4.62
3rd Qu.:6.000 3rd Qu.:7.00 3rd Qu.:4.0 3rd Qu.: 6.0 3rd Qu.: 6.00
Max. :9.000 Max. :9.00 Max. :9.0 Max. :10.0 Max. :10.00
NA's :7 NA's :6 NA's :6 NA's :6 NA's :6
Objects_14 Objects_15 Objects_16 Objects_17
Min. :1.000 Min. : 1.000 Min. : 1.000 Min. : 1.00
1st Qu.:1.000 1st Qu.: 2.750 1st Qu.: 3.000 1st Qu.: 2.00
Median :2.000 Median : 5.000 Median : 5.000 Median : 4.00
Mean :2.271 Mean : 4.896 Mean : 5.146 Mean : 4.26
3rd Qu.:3.000 3rd Qu.: 7.000 3rd Qu.: 7.000 3rd Qu.: 6.00
Max. :8.000 Max. :10.000 Max. :10.000 Max. :10.00
NA's :8 NA's :8 NA's :8 NA's :6
Objects_18 Objects_19 Objects_20 House size_1
Min. :1.00 Min. : 1.000 Min. :1.000 Min. :1.000
1st Qu.:2.00 1st Qu.: 3.000 1st Qu.:1.000 1st Qu.:1.000
Median :3.00 Median : 4.000 Median :2.000 Median :2.000
Mean :3.42 Mean : 4.408 Mean :2.673 Mean :2.653
3rd Qu.:5.00 3rd Qu.: 6.000 3rd Qu.:4.000 3rd Qu.:4.000
Max. :7.00 Max. :10.000 Max. :7.000 Max. :6.000
NA's :6 NA's :7 NA's :7 NA's :7
House size_2 House size_3 House size_4 House size_5
Min. :1.000 Min. :1.000 Min. :1.000 Min. :1.000
1st Qu.:1.000 1st Qu.:1.750 1st Qu.:1.750 1st Qu.:2.000
Median :1.000 Median :3.000 Median :2.000 Median :3.000
Mean :2.054 Mean :2.839 Mean :2.696 Mean :2.893
3rd Qu.:3.000 3rd Qu.:4.000 3rd Qu.:4.000 3rd Qu.:4.000
Max. :7.000 Max. :7.000 Max. :9.000 Max. :6.000
House size_6 SESladderoptions
Min. :1.000 Min. :1.000
1st Qu.:2.000 1st Qu.:2.000
Median :3.000 Median :3.000
Mean :3.018 Mean :3.357
3rd Qu.:4.000 3rd Qu.:4.250
Max. :6.000 Max. :9.000
NA's :1
library(ggplot2)ggplot(data = FD_long, aes(x = treatment, y = donations)) +geom_bar(stat ="identity", fill ="blue") +labs(title ="donations across treatments", x ="Treatments", y ="donations")
##Changing treatments to dummy variables in wide form
# A tibble: 20 × 34
MS MSI MII HS DS AC1 AC2 IDUC IDOBC IDDalits IDAdivasi
<dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl>
1 NA NA NA NA NA 4 4 6 3 0 1
2 NA NA NA NA NA 4 4 9 3 0 1
3 NA NA NA NA NA 4 4 4 6 0 1
4 NA NA NA NA NA 4 4 9 3 0 1
5 NA NA NA NA NA 4 4 9 3 0 1
6 NA NA NA NA NA 4 4 9 1 0 1
7 NA NA NA NA NA 4 4 8 3 0 1
8 NA NA NA NA NA 4 4 9 3 0 1
9 NA NA NA NA NA 4 4 9 0 3 1
10 NA NA NA NA NA 4 4 6 3 3 1
11 NA NA NA NA NA 4 4 9 3 0 1
12 NA NA NA NA NA 4 4 9 3 3 1
13 NA NA NA NA NA 4 4 5 5 3 1
14 NA NA NA NA NA 4 4 4 3 3 1
15 NA NA NA NA NA 4 4 9 3 5 1
16 NA NA NA NA NA 4 4 3 6 3 1
17 NA NA NA NA NA 4 4 9 3 3 1
18 NA NA NA NA NA 4 4 3 6 6 1
19 NA NA NA NA NA 4 4 9 3 3 1
20 NA NA NA NA NA 4 4 1 3 0 1
# ℹ 23 more variables: IDST <dbl>, IDMuslim <dbl>, IDPoor <dbl>,
# `IDMiddle Class` <dbl>, IDRich <dbl>, IDUrban <dbl>, DMIdeology <dbl>,
# DMincome_1 <dbl>, DMincome_2 <dbl>, DMEducation_1 <dbl>,
# DMEducation_2 <dbl>, DMEducation_3 <dbl>, DMEducation_4 <dbl>,
# DMEducation_5 <dbl>, DMGender <dbl>, DMcaste <dbl>, DMreligion <dbl>,
# SESladderoptions <dbl>, donations <dbl>, MI <int>, GI <int>, DI <int>,
# HI <int>
##Linear regression for the main effect:model1 <-lm(donations ~ MI + GI + DI + HI, data = dummy_data)summary(model1)
Call:
lm(formula = donations ~ MI + GI + DI + HI, data = dummy_data)
Residuals:
Min 1Q Median 3Q Max
-8.00 -3.50 0.00 1.25 12.00
Coefficients: (1 not defined because of singularities)
Estimate Std. Error t value Pr(>|t|)
(Intercept) 68.000 2.449 27.761 5.80e-15 ***
MI -18.000 3.464 -5.196 8.83e-05 ***
GI -18.000 3.464 -5.196 8.83e-05 ***
DI -48.000 3.464 -13.856 2.49e-10 ***
HI NA NA NA NA
---
Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
Residual standard error: 5.477 on 16 degrees of freedom
Multiple R-squared: 0.9252, Adjusted R-squared: 0.9112
F-statistic: 66 on 3 and 16 DF, p-value: 3.15e-09
Exploratory analyses
Any follow-up analyses desired (not required).
Discussion
Summary of Replication Attempt
Open the discussion section with a paragraph summarizing the primary result from the confirmatory analysis and the assessment of whether it replicated, partially replicated, or failed to replicate the original result.
Commentary
Add open-ended commentary (if any) reflecting (a) insights from follow-up exploratory analysis, (b) assessment of the meaning of the replication (or not) - e.g., for a failure to replicate, are the differences between original and present study ones that definitely, plausibly, or are unlikely to have been moderators of the result, and (c) discussion of any objections or challenges raised by the current and original authors about the replication attempt. None of these need to be long.