Who is the identifiable victim? Caste and charitable giving in modern India.

Author

Dean Spears, Ashwini Deshpande (dspears@utexas.edu, ashwini.deshpande@ashoka.edu.in)

Published

December 17, 2023

Introduction

In their study, Deshpande & Spears (2016) attempt to test how caste and religious identity markers may shape how sympathetic individuals feel towards the out-group. They seek to embed this question within a previously established finding & paradigm pertaining to the identifiable victim effect: whereby researchers have found that people tend to experience more sympathy when victims are defined as specific individuals as against a group identified via broad group-based categorical labels such as “Hindus, Dalits, Whites or Blacks”(Henceforth referred to as a statistical group). Deshpande & Spears (2016) find that indeed participants are more generous to victims who are identified via generic Indian, upper caste name, and Muslim background as against their statistical counterparts. However, they find no evidence for the impact of the identifiability of victims on donations made to Dalits. Whereby Dalit recipients receive equally few donations when they are identified as individual victims or statistical groups. Researchers also find no significant discrimination between donations made to Dalits, Muslims, and upper caste groups when identified as ‘social or statistical groups’. Participants in the study were from largely upper-caste, internet-savvy, literate, high SES backgrounds.

##Motivation to replicate:

The results obtained by Deshpande & Spears (2016) differ from one of my experiments where I tested the impact of exposure to nationalistic ideas on donations made to upper-caste, Dalit and Hindu recipients. In the control group of my study (n =90) I found no significant discrimination between donations to Dalits (n =30) and Upper-caste recipients (n =30) (identified via their caste names). However, I find marginally significant discrimination towards Muslims (n = 30, p = 0.108). Upon exposure to nationalistic norms, I found that donations made to Muslims were reduced significantly as compared to donations made to upper-caste hindus & Dalits. However, donations made to Dalits remain unaffected in response to exposure to nationalistic norms, any changes are statistically insignificant in comparison to the upper caste. I have recently conducted another study to test whether these effects replicate. Replicating Deshpande & Spears (2016) gives me an additional opportunity to re-test the relative authenticity of the results obtained in the control group of my experiment. And allows me to further build on these results to aid a third replication of my nationalism experiment.

##The present study:

Since finding discrimination against Dalits, but non against Muslims (in the identifiability condition) is the most striking finding reported by Deshpande & Spears, my study will focus on replicating this finding.

##Challenges:

Bots & Turk farms: Mturk samples in India are known to be fraught with bots, and Turk farms, which have been increasingly reported to skew the quality of data collection. Based on anecdotes from researchers, I have learnt that such problems have increased over the years, and therefore I am more likely to experience such concerns than the authors of the study did in 2015. I am therefore sceptical of using a Mturk sample to replicate the study. There are other agencies that offer online panel services in India, alongside data quality assurances. Given that my replication isn’t geared towards mapping data quality concerns on Mturk, I believe I should ideally access other services that will provide a better, more robust replication test of the experiment design.

Regional skew: Anecdotal evidence, based on the experience of other researchers, points towards an increasing regional skew within the Mturk working population. Whereby participants are known to be located in Southern parts of India. Deshpande & Spears (2016) however utilize north Indian caste names to indicate caste. A South Indian participant will not be able to accurately categorize and infer the appropriate caste via the names signalled in the prime. Given that Mturk doesn’t allow us to sample the population via region If I use Mturk I am likely to recruit a larger South India population, this will in turn affect the replicability of the experiment.

Power: Deshpande & Spears (2016) recruited a sample of 400 participants, incidently their sample was a homogeneous group of upper-caste hindus. This led the researchers to obtain 50 upper caste Hindu participants per treatment group. Such homogeneous recruitment of upper case hindus is however not always possible. It is likely that in our recruitment we will not be able to pre-emptively control for demographics. If we ask participants to report their caste at the start of the survey and exclude them on the basis of caste, this will amount to discrimination, and also inadvertently prime caste/ religious identity. Therefore accounting for potential heterogeneity in the recruited sample, and the statistical procedures needed to control for the same, we would need to increase our sample to 800 to have an 80 % chance of seeing the effect size at 0.05 significance level. I assume that we will have at least 100-150 people from OBC, SC, and Muslim backgrounds in our sample (each). However, if we work with an agency we can work with a pre-defined panel of upper caste groups as our sample.

https://github.com/monishad127/dhingram-2023-.git

Methods

Power Analysis

The original effect size isnt reported, however based on the power calculation and keeping mind limitations noted above I note the following estimate re-power: I would need a sample of 400 to detect a small effect (0.2) at 80 % power, and 0.05 level of significance.

Planned Sample

I will be recruiting the sample from Mturk

Materials

Participants will be randomly assigned to read about an identified recipient from a Dalit, Muslim or upper caste Hindu background.The identification of the recipient’s category will be made only implicitly by their name, using well-known male names commonly associated with Dalit, Muslim, Upper Caste Hindu categories. Deshpande & Spears (2015) utilized 20 names, five for each group; each participant was assigned to read about an identifiable recipient and read only one of these five names, randomly presented. For the control treatment, researchers used names that are commonly found across caste levels and are not identified with a particular group. However, since the authors have established (through study 3) that each of these names signals a caste & religious category, I will utilize names with the highest signalling value (factor loading) as indicated in study 3 of the paper.

The following materials will be utilized to indicate ‘individual’ characteristics of the recipients.

Those assigned to an identifiable victim treatment read: The family of NAME is very poor. For much of each year, they cannot find work. His family frequently cannot afford enough basic food to eat. As a result, his children go without medicine if they get sick, and often go to bed hungry.

The following attention checks and instructional manipulations will be administered:

“How often have you suffered a fatal heart attack?” Only those who selected “never” were included in the analyzed sample.

“On many important issues, people have different opinions. Some people agree, and some people disagree, even very strongly. Here in this question, please select the number four in the slider below, to rule out random clicking.”

The following manipulation check will be used:

Please identify which of the following social categories does the “recepient name” belong to?

To understand how participants self-categorize themselves into a social group, participants will be requested to answer the following questions:

How much do you believe your family is like a typical family of each of the following types?” The 10 groups, as they will be written on the survey form are as follows: Brahmin, Forward/Upper Castes, OBC ½Other Backward Classes, Dalits, Dalit/SC, Adivasi/ST, Scheduled Tribe; marginalized tribal communities, Muslim, Poor, Middle class, Rural, and Urban.

Procedure

Experiment Design

Deshpande & Spears (2016) recruited an Indian Sample via Mturk to conduct an online survey experiment to test the interaction of caste and religious identity with the identifiable victim effect. First, participants are shown the experimental prompt: a few sentences of text describing an opportunity for charitable giving. The identity of the recipient receiving the charity is randomised. Participants either see recipients from Muslim, Dalit, upper-caste Hindu or general Indian backgrounds. The experiment is thus set up as 4(social identity) x 1 (identifiable victim) between group factorial design. Immediately after exposure to the stimuli, participants are asked to rate their willingness to donate. To measure this key dependent variable, participants are shown a scale ranging from Rs. 0 to 100 and are asked how much money they would be willing to donate. Following this, participants are asked a set of attention check & manipulation check questions. Finally, participants rated the similarity of their family to typical members of 10 different social groups cutting across class, caste, religious and neighbourhood characteristics. Followed by a demographic questionnaire.

Link to the questionnaire: https://lse.eu.qualtrics.com/jfe/form/SV_3lRdcGRosB4KV5c

Analysis Plan

Following Deshpande & Spears (2016), I will replicate the main effect of social identity of the recipient on donations. Utilizing linear regressionI will test the impact of social identity on donations.

Clarify key analysis of interest here
Expecting socio-economic diversity in my sample,I will add controls for income in the main regression analysis.

Differences from Original Study

  1. I am only replicating the impact of social identity on donations. And not the interaction between identifiability and social identity.

2.I am utilizing caste & religous names with the highest factor loading, instead of randomly assigning 5 different names to indicate caste/ religion.

Methods Addendum (Post Data Collection)

LINK to the paradigm:

https://lse.eu.qualtrics.com/jfe/form/SV_do54v6hPmmOmhTg

I collected data via Mturk platform. Despite the procedural issues, the data collection was smooth & quick. Unfortunately, the effeciacy of data collection doesn’t match up to the quality. I note below the actual sample I am able to utlize in my study.

Actual Sample

I collected a sample of 324 participants.81 Participants were from general category/ Upper caste backrgound; 3 were from EWS backgriund, 48 were OBCcreamy layer, 23 were OBC non creamy layer, 22 were from Scheduled Caste background, and 7 were from scheduled tribe backrgound. I 29excluded participants from SC & ST backrgound. I excluded 32 participants from Muslim & Christian Background (this overlapped with caste), and I removed 120 participants who failed the attention check.After removing participants who did not meet the Manipulation check, I was left with a final sample of 88 participants.

Differences from pre-data collection methods plan

Any differences from what was described as the original plan, or “none”.

Results

Data preparation

Data preparation following the analysis plan.

##reading data into R

##descriptive statistics: 

summary(D)
    Consent      Region            MI              MCMM          MCIM    
 Min.   :1   Min.   :1.000   Min.   : 10.00   Min.   :1     Min.   :1    
 1st Qu.:1   1st Qu.:2.000   1st Qu.: 50.00   1st Qu.:1     1st Qu.:1    
 Median :1   Median :2.000   Median : 76.50   Median :1     Median :1    
 Mean   :1   Mean   :1.801   Mean   : 69.28   Mean   :1     Mean   :1    
 3rd Qu.:1   3rd Qu.:2.000   3rd Qu.: 89.00   3rd Qu.:1     3rd Qu.:1    
 Max.   :1   Max.   :3.000   Max.   :100.00   Max.   :1     Max.   :1    
 NA's   :1   NA's   :1       NA's   :152      NA's   :160   NA's   :178  
      MCDM          MCHM           GI              MCMG          MCIG    
 Min.   :1     Min.   :1     Min.   :  5.00   Min.   :1     Min.   :1    
 1st Qu.:1     1st Qu.:1     1st Qu.: 51.00   1st Qu.:1     1st Qu.:1    
 Median :1     Median :1     Median : 62.00   Median :1     Median :1    
 Mean   :1     Mean   :1     Mean   : 65.35   Mean   :1     Mean   :1    
 3rd Qu.:1     3rd Qu.:1     3rd Qu.: 83.00   3rd Qu.:1     3rd Qu.:1    
 Max.   :1     Max.   :1     Max.   :100.00   Max.   :1     Max.   :1    
 NA's   :191   NA's   :189   NA's   :149      NA's   :189   NA's   :168  
      MCDG          MCHG          MCBG         MCFCG           DI        
 Min.   :1     Min.   :1     Min.   : NA   Min.   : NA   Min.   : 20.00  
 1st Qu.:1     1st Qu.:1     1st Qu.: NA   1st Qu.: NA   1st Qu.: 50.00  
 Median :1     Median :1     Median : NA   Median : NA   Median : 62.00  
 Mean   :1     Mean   :1     Mean   :NaN   Mean   :NaN   Mean   : 66.11  
 3rd Qu.:1     3rd Qu.:1     3rd Qu.: NA   3rd Qu.: NA   3rd Qu.: 85.00  
 Max.   :1     Max.   :1     Max.   : NA   Max.   : NA   Max.   :100.00  
 NA's   :189   NA's   :166   NA's   :192   NA's   :192   NA's   :154     
      MCMD          MCID          MCDD          MCHD       MCBD        
 Min.   :1     Min.   :1     Min.   :1     Min.   :1     Mode:logical  
 1st Qu.:1     1st Qu.:1     1st Qu.:1     1st Qu.:1     NA's:192      
 Median :1     Median :1     Median :1     Median :1                   
 Mean   :1     Mean   :1     Mean   :1     Mean   :1                   
 3rd Qu.:1     3rd Qu.:1     3rd Qu.:1     3rd Qu.:1                   
 Max.   :1     Max.   :1     Max.   :1     Max.   :1                   
 NA's   :191   NA's   :173   NA's   :188   NA's   :165                 
     MCFCD           BI              MC_B            AC_1        AC_2  
 Min.   : NA   Min.   :  0.00   Min.   :1.000   Min.   :4   Min.   :4  
 1st Qu.: NA   1st Qu.: 51.00   1st Qu.:2.000   1st Qu.:4   1st Qu.:4  
 Median : NA   Median : 71.00   Median :4.000   Median :4   Median :4  
 Mean   :NaN   Mean   : 65.24   Mean   :3.273   Mean   :4   Mean   :4  
 3rd Qu.: NA   3rd Qu.: 83.00   3rd Qu.:4.000   3rd Qu.:4   3rd Qu.:4  
 Max.   : NA   Max.   :100.00   Max.   :4.000   Max.   :4   Max.   :4  
 NA's   :192   NA's   :159      NA's   :159                            
      SFC              SOBC              SD               SA       
 Min.   :  0.00   Min.   :  0.00   Min.   :  0.00   Min.   : 0.00  
 1st Qu.: 45.00   1st Qu.: 53.00   1st Qu.: 21.50   1st Qu.:25.00  
 Median : 65.00   Median : 67.00   Median : 52.00   Median :53.00  
 Mean   : 58.91   Mean   : 63.56   Mean   : 49.07   Mean   :49.87  
 3rd Qu.: 79.00   3rd Qu.: 81.50   3rd Qu.: 75.00   3rd Qu.:76.00  
 Max.   :100.00   Max.   :100.00   Max.   :100.00   Max.   :97.00  
 NA's   :1        NA's   :1        NA's   :1        NA's   :1      
    SSC...30            SM               SP              SMC        
 Min.   :  0.00   Min.   :  0.00   Min.   :  0.00   Min.   :  0.00  
 1st Qu.: 26.00   1st Qu.: 34.50   1st Qu.: 41.00   1st Qu.: 50.50  
 Median : 54.00   Median : 60.00   Median : 56.00   Median : 66.00  
 Mean   : 50.29   Mean   : 54.62   Mean   : 56.31   Mean   : 65.51  
 3rd Qu.: 75.50   3rd Qu.: 79.50   3rd Qu.: 78.00   3rd Qu.: 82.00  
 Max.   :100.00   Max.   :100.00   Max.   :100.00   Max.   :100.00  
 NA's   :1        NA's   :1        NA's   :1        NA's   :1       
       SR              SBC               SB            SSC...37     
 Min.   :  0.00   Min.   :  0.00   Min.   :  0.00   Min.   :  0.00  
 1st Qu.: 45.00   1st Qu.: 51.50   1st Qu.: 26.50   1st Qu.: 39.50  
 Median : 59.00   Median : 71.00   Median : 53.00   Median : 56.00  
 Mean   : 57.61   Mean   : 66.56   Mean   : 51.19   Mean   : 55.84  
 3rd Qu.: 79.00   3rd Qu.: 83.00   3rd Qu.: 77.00   3rd Qu.: 78.00  
 Max.   :100.00   Max.   :100.00   Max.   :100.00   Max.   :100.00  
 NA's   :1        NA's   :1        NA's   :1        NA's   :1       
      GEP            Income_1        Income_2           ED_M      
 Min.   : 1.000   Min.   :1.000   Min.   : 1.000   Min.   :1.000  
 1st Qu.: 1.000   1st Qu.:3.000   1st Qu.: 3.000   1st Qu.:2.000  
 Median : 1.000   Median :4.000   Median : 5.000   Median :4.000  
 Mean   : 3.293   Mean   :4.607   Mean   : 5.251   Mean   :4.042  
 3rd Qu.: 5.000   3rd Qu.:6.000   3rd Qu.: 8.000   3rd Qu.:5.000  
 Max.   :20.000   Max.   :9.000   Max.   :10.000   Max.   :7.000  
 NA's   :1        NA's   :1       NA's   :1        NA's   :1      
      ED_F           ED_GF           ED_GM           ED_PP      
 Min.   :1.000   Min.   :1.000   Min.   :1.000   Min.   :1.000  
 1st Qu.:3.000   1st Qu.:2.000   1st Qu.:1.000   1st Qu.:5.000  
 Median :4.000   Median :3.000   Median :3.000   Median :5.000  
 Mean   :4.251   Mean   :3.209   Mean   :2.806   Mean   :5.152  
 3rd Qu.:5.000   3rd Qu.:4.000   3rd Qu.:4.000   3rd Qu.:6.000  
 Max.   :7.000   Max.   :7.000   Max.   :7.000   Max.   :8.000  
 NA's   :1       NA's   :1       NA's   :1       NA's   :1      
     Gender      Caste_category     Religion     Objects_Bicycle
 Min.   :1.000   Min.   :1.000   Min.   :1.000   Min.   :1.000  
 1st Qu.:1.000   1st Qu.:1.000   1st Qu.:1.000   1st Qu.:2.000  
 Median :1.000   Median :4.000   Median :1.000   Median :2.000  
 Mean   :1.241   Mean   :3.283   Mean   :1.257   Mean   :2.309  
 3rd Qu.:1.000   3rd Qu.:5.000   3rd Qu.:1.000   3rd Qu.:3.000  
 Max.   :2.000   Max.   :7.000   Max.   :3.000   Max.   :5.000  
 NA's   :1       NA's   :1       NA's   :1       NA's   :1      
 Objects_Motorbike  Objects_Car    Objects_Refrigerator Objects_washing machine
 Min.   :1.000     Min.   :1.000   Min.   :1.000        Min.   :1.000          
 1st Qu.:2.000     1st Qu.:1.000   1st Qu.:2.000        1st Qu.:2.000          
 Median :2.000     Median :2.000   Median :2.000        Median :2.000          
 Mean   :2.545     Mean   :1.817   Mean   :2.068        Mean   :2.047          
 3rd Qu.:3.000     3rd Qu.:2.000   3rd Qu.:2.000        3rd Qu.:2.000          
 Max.   :5.000     Max.   :5.000   Max.   :5.000        Max.   :5.000          
 NA's   :1         NA's   :1       NA's   :1            NA's   :1              
 Objects_Television Objects_Air Con Objects_Air Cooler Objects_Computer
 Min.   :1.000      Min.   :1.000   Min.   :1.000      Min.   :1.000   
 1st Qu.:2.000      1st Qu.:1.500   1st Qu.:1.000      1st Qu.:2.000   
 Median :2.000      Median :2.000   Median :2.000      Median :2.000   
 Mean   :2.183      Mean   :1.932   Mean   :1.801      Mean   :2.429   
 3rd Qu.:2.000      3rd Qu.:2.000   3rd Qu.:2.000      3rd Qu.:3.000   
 Max.   :5.000      Max.   :4.000   Max.   :5.000      Max.   :5.000   
 NA's   :1          NA's   :1       NA's   :1          NA's   :1       
 Objects_Electric Room heater Obejcts_Electric Water Heater
 Min.   :1.000                Min.   :1.000                
 1st Qu.:1.000                1st Qu.:1.000                
 Median :2.000                Median :2.000                
 Mean   :1.728                Mean   :1.827                
 3rd Qu.:2.000                3rd Qu.:2.000                
 Max.   :5.000                Max.   :5.000                
 NA's   :1                    NA's   :1                    
 Objects-_Water Purifier  Objects_IPAD   Objects_Music   Obejcts_Microwwave
 Min.   :1.000           Min.   :1.000   Min.   :1.000   Min.   :1.000     
 1st Qu.:1.000           1st Qu.:1.000   1st Qu.:1.500   1st Qu.:1.000     
 Median :2.000           Median :2.000   Median :2.000   Median :2.000     
 Mean   :1.838           Mean   :1.759   Mean   :1.937   Mean   :1.696     
 3rd Qu.:2.000           3rd Qu.:2.000   3rd Qu.:2.000   3rd Qu.:2.000     
 Max.   :4.000           Max.   :5.000   Max.   :5.000   Max.   :5.000     
 NA's   :1               NA's   :1       NA's   :1       NA's   :1         
  Obejcts_oven   Objects_Tvset   Objects_Electric Stove Objects_Air Purifier
 Min.   :1.000   Min.   :1.000   Min.   :1.000          Min.   :1.000       
 1st Qu.:1.000   1st Qu.:1.000   1st Qu.:1.000          1st Qu.:1.000       
 Median :2.000   Median :2.000   Median :2.000          Median :2.000       
 Mean   :1.628   Mean   :1.921   Mean   :1.895          Mean   :1.738       
 3rd Qu.:2.000   3rd Qu.:2.000   3rd Qu.:2.000          3rd Qu.:2.000       
 Max.   :5.000   Max.   :5.000   Max.   :5.000          Max.   :5.000       
 NA's   :1       NA's   :1       NA's   :1              NA's   :1           
 Objects_Door Bell   HS_Bedroom     HS_Bathroom    HS_Living Room 
 Min.   :1.000     Min.   :1.000   Min.   :1.000   Min.   :1.000  
 1st Qu.:1.000     1st Qu.:2.000   1st Qu.:2.000   1st Qu.:2.000  
 Median :2.000     Median :3.000   Median :3.000   Median :2.000  
 Mean   :1.817     Mean   :3.147   Mean   :3.047   Mean   :2.482  
 3rd Qu.:2.000     3rd Qu.:4.000   3rd Qu.:4.000   3rd Qu.:3.000  
 Max.   :5.000     Max.   :7.000   Max.   :7.000   Max.   :7.000  
 NA's   :1         NA's   :1       NA's   :1       NA's   :1      
    HS_Study       HS_Balcony      HS_Gardens         SSS           RandomID   
 Min.   :1.000   Min.   :1.000   Min.   :1.000   Min.   :1.000   Min.   :1077  
 1st Qu.:2.000   1st Qu.:2.000   1st Qu.:1.500   1st Qu.:2.000   1st Qu.:3515  
 Median :2.000   Median :2.000   Median :2.000   Median :4.000   Median :5552  
 Mean   :2.356   Mean   :2.262   Mean   :2.168   Mean   :3.806   Mean   :5620  
 3rd Qu.:3.000   3rd Qu.:3.000   3rd Qu.:2.000   3rd Qu.:5.000   3rd Qu.:8133  
 Max.   :8.000   Max.   :7.000   Max.   :7.000   Max.   :8.000   Max.   :9966  
 NA's   :1       NA's   :1       NA's   :1       NA's   :1                     
##subsetting the data

##Checking the caste categroy of the participants.

table(D$Caste_category)

 1  3  4  5  6  7 
81  3 48 23 22 14 
caste_freq <- table(D$Caste_category)

caste_freq

 1  3  4  5  6  7 
81  3 48 23 22 14 
# Custom names for each bar/category
bar_names <- c("General", "EWS", " OBC-C","OBC", "SC", "ST")  


bp <- barplot(caste_freq,
              names.arg = bar_names,  # Using custom names
              xlab = "Caste category", 
              ylab = "Frequency",
              main = "Caste distribution in the sample",
              col = "yellow")

##most participants belong to General caste, followed by OBC-C
##The key question was to understand whether upper caste hindus are discriminatory towards Dalits & Muslims -- however the data collected has participants from different caste groups, Deshpande & Spears only had upper caste participants in their sample. Therefore I will work with two data sets: dataset 1 will have only upper caste & OBC creamy layer participants, datset 2 will have participants from heterogenous caste groups. I will control for caste while analysing dataset 2. 

##checking for religion: 

table(D$Religion)

  1   2   3 
159  15  17 
Religion_freq <- table(D$Religion)

# Custom names for each bar/category
bar_names2 <- c("Hindu", "Muslim", "Christian")  

bp2 <- barplot(Religion_freq,
              names.arg = bar_names2,  # Using custom names
              xlab = "Religion", 
              ylab = "Frequency",
              main = "reliogious distribution in the sample",
              col = "orange")

##removing participants from Muslim,Christian, Scheduled Caste, Scheduled tribe backgrounds:

## subsetting the data via Manipulation checks

DM <- D %>%
  filter(Religion == 1 & Caste_category %in% c(1, 3, 4, 5) &
         (MCMM == 1 | MCIM == 1 | MCIG == 1 | MCDD == 1 | MCID == 1 | MCHD == 1 | 
          MC_B == 4 | MC_B == 2 | MC_B == 5 | MC_B == 6))
 ##Selecting relevant variables


DD1 <- select(DM, -contains("Objects"), -contains("HS"), -RandomID)

`



##pivoting the data

FDlongx <- DD1 %>%
  pivot_longer(
    cols = c("MI", "GI", "DI", "BI"),
    names_to = 'treatment',
    values_to = 'donations',
    values_drop_na = TRUE
  )


## converting the key variables into dummies


dummy_dat9 <- FDlongx %>% mutate(treatment = factor(treatment)) %>% pivot_wider(names_from = treatment, values_from = treatment, values_fn = length, values_fill = 0)

dummy_dat9
# A tibble: 88 × 53
   Consent Region  MCMM  MCIM  MCDM  MCHM  MCMG  MCIG  MCDG  MCHG  MCBG MCFCG
     <dbl>  <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl>
 1       1      1    NA    NA    NA    NA    NA    NA    NA    NA    NA    NA
 2       1      1    NA    NA    NA    NA    NA    NA    NA    NA    NA    NA
 3       1      1    NA    NA    NA    NA    NA    NA    NA    NA    NA    NA
 4       1      1     1    NA    NA    NA    NA    NA    NA    NA    NA    NA
 5       1      1    NA    NA    NA    NA    NA     1    NA     1    NA    NA
 6       1      1     1    NA    NA    NA    NA    NA    NA    NA    NA    NA
 7       1      1     1    NA    NA    NA    NA    NA    NA    NA    NA    NA
 8       1      1    NA    NA    NA    NA    NA    NA    NA    NA    NA    NA
 9       1      1    NA    NA    NA    NA    NA    NA    NA    NA    NA    NA
10       1      2     1    NA    NA    NA    NA    NA    NA    NA    NA    NA
# ℹ 78 more rows
# ℹ 41 more variables: MCMD <dbl>, MCID <dbl>, MCDD <dbl>, MCHD <dbl>,
#   MCBD <lgl>, MCFCD <dbl>, MC_B <dbl>, AC_1 <dbl>, AC_2 <dbl>, SFC <dbl>,
#   SOBC <dbl>, SD <dbl>, SA <dbl>, SSC...30 <dbl>, SM <dbl>, SP <dbl>,
#   SMC <dbl>, SR <dbl>, SBC <dbl>, SB <dbl>, SSC...37 <dbl>, GEP <dbl>,
#   Income_1 <dbl>, Income_2 <dbl>, ED_M <dbl>, ED_F <dbl>, ED_GF <dbl>,
#   ED_GM <dbl>, ED_PP <dbl>, Gender <dbl>, Caste_category <dbl>, …
##plotting donations across treatment conditions

library(ggplot2)


FDlongx$treatment <- factor(FDlongx$treatment,
                            levels = c("BI", "GI", "MI", "DI"),
                            labels = c("Brahmin", "Generic-Indian", "Muslim", "Dalit")) 




ggplot(data = FDlongx, aes(x = treatment, y = donations)) +
  geom_bar(stat = "identity", fill = "blue") +
  scale_y_continuous(limits = c(0, 100)) +
  labs(title = "Donations Across Recipients",
       x = "Recipients", y = "Donations")
Warning: Removed 84 rows containing missing values (`geom_bar()`).

##main result:
##Linear regression for the main effect:
  

model1 <- lm(donations ~ MI + GI + DI + BI + Caste_category,
               data = dummy_dat9)

summary(model1)

Call:
lm(formula = donations ~ MI + GI + DI + BI + Caste_category, 
    data = dummy_dat9)

Residuals:
    Min      1Q  Median      3Q     Max 
-56.063 -13.930  -0.065  20.298  36.070 

Coefficients: (1 not defined because of singularities)
               Estimate Std. Error t value Pr(>|t|)    
(Intercept)     62.8387     6.4028   9.814 1.54e-15 ***
MI              -0.4503     7.4153  -0.061    0.952    
GI              -2.8668     8.3798  -0.342    0.733    
DI               0.7719     7.5620   0.102    0.919    
BI                   NA         NA      NA       NA    
Caste_category   1.0914     1.5376   0.710    0.480    
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

Residual standard error: 23.91 on 83 degrees of freedom
Multiple R-squared:  0.008397,  Adjusted R-squared:  -0.03939 
F-statistic: 0.1757 on 4 and 83 DF,  p-value: 0.9503
stargazer(model1, type = "text",
          covariate.labels = c("Muslim for MI", "Brahmin for BI", "Generic Indian for GI", "Dalit for DI"),
          title = "Results of Model1")

Results of Model1
=================================================
                          Dependent variable:    
                      ---------------------------
                               donations         
-------------------------------------------------
Muslim for MI                   -0.450           
                                (7.415)          
                                                 
Brahmin for BI                  -2.867           
                                (8.380)          
                                                 
Generic Indian for GI            0.772           
                                (7.562)          
                                                 
Dalit for DI                                     
                                                 
                                                 
Caste_category                   1.091           
                                (1.538)          
                                                 
Constant                       62.839***         
                                (6.403)          
                                                 
-------------------------------------------------
Observations                      88             
R2                               0.008           
Adjusted R2                     -0.039           
Residual Std. Error        23.912 (df = 83)      
F Statistic               0.176 (df = 4; 83)     
=================================================
Note:                 *p<0.1; **p<0.05; ***p<0.01

##Results: The sample is too small for me to seek any evidence of replication from the linear model presented above.

Discussion & summary:

Unfortunately, the results of the present study cannot speak to the results obtained by Deshpande & Spears (2016). The limitations and the quality of the data collected prevents me from taking any stance on whether or not the results of the original study replicate. I am now going to sequentially note down each of these limitations. Though these limitations emerge in referece to Deshpande & Spears (2016) sample, I hope that these limitations will help further attempts to collect experimental data via Mturk from Indian participants.

  1. First, the sample of participants who took the survey from a heterogeneous Caste & Religius backgrounds. I have noted this heterogeniety above. Since this was an experimental study, asking participants to report their caste & religion at the start as a filter would have introduced a prime that can confound main results.Given that only about 130 participants belong to UC background, I relaxed the caste criteria of inclusion by adding participants from the OBC background to my data. I later controlled for caste.

  2. Participants were from different regions across India. This stands consonent with Deshpande & Spears, who didn’t have a location based filter based on region. However Deshpande & Spears used north Indian caste names, and these names can likely be unfamiliar to sample from South India. Keeping this limitation in mind,in order to increase the chances of replication I had tried to restrict the sample to North India, however this affected the timeline of data collection, as there were fewer north Indian participants. Consequently, I relaxed this criteria. I consequently was strict with removing participants who didn’t meet the manipulation check. This reduced the sample size, and the affected my ability to come up with any conclusive inference vis-a-vis replication of these results.

  3. Lastly, a large number of participants failed the attention check questions. I had field a very short survey, and the Attention check questions were very straight forward. However, despite this I was forced to remove more than 100 participants, nearly 40 % of the entire sample collected. This is indicative of the general engagement of the Mturk workers, and the consecuent data quality of Mturk studies fielded in India.

Overall, I’d like to thank Mike, Sarah, Veronica and Ben for allowing me to replicate this study, funding the data collection, and supporting me through the mores of replication, Rstudio, and in engaging with Mturk. I think the study was revealing, re the potential feasibility of using Mturk to collect experimental, & geographically sensitive political data.

https://github.com/psych251/dhingram-2023-/blob/main/writeup/Replication%20Report%20Template.qmd

https://github.com/psych251/dhingram-2023-