library(readr)
library(dplyr)
library(irr)
In this activity, our group members individually will first coded prompts in csv files and then merged into a shared codebook. Then we measured agreement using Cohen’s Kappa as we had 2 people on assingment.
The codebook:
Before running this R Markdown, our group filled in the CSV file so that:
df1 <- read.csv("genai_prompt_intent_uditi_coding.csv")
df2 <- read.csv("genai_prompt_intent_group_coding.csv")
combined_df <- data.frame(
prompt_id = df1$prompt_id,
prompt = df1$prompt,
rater1 = df1$rater1,
rater2 = df2$rater2
)
# Combining the 2 CSV Files in one
write.csv(combined_df,"genai_prompt_intent_combined_coding.csv",row.names = FALSE)
# Assigning name "codes" for the merged csv file for easier in code use
codes <- read_csv("genai_prompt_intent_combined_coding.csv", show_col_types = FALSE)
codes %>% select(prompt_id, prompt, rater1, rater2)
## # A tibble: 40 × 4
## prompt_id prompt rater1 rater2
## <dbl> <chr> <chr> <chr>
## 1 1 What is photosynthesis? Defin… Defin…
## 2 2 Can you define cellular respiration in simple terms? Simpl… Simpl…
## 3 3 What is chlorophyll? Defin… Defin…
## 4 4 Define ATP for a high school student. Test … Simpl…
## 5 5 Explain how photosynthesis works. Expla… Expla…
## 6 6 How does cellular respiration make energy for the ce… Expla… Expla…
## 7 7 Can you walk me through the stages of photosynthesis? Expla… Expla…
## 8 8 Why do plants need sunlight to make food? Expla… Expla…
## 9 9 How are photosynthesis and cellular respiration conn… Expla… Diffe…
## 10 10 What is the difference between photosynthesis and ce… Diffe… Diffe…
## # ℹ 30 more rows
codes_2r <- codes %>%
select(rater1, rater2)
kappa2(codes_2r)
## Cohen's Kappa for 2 Raters (Weights: unweighted)
##
## Subjects = 40
## Raters = 2
## Kappa = 0.757
##
## z = 10.6
## p-value = 0
A common rough interpretation scale is:
ANS: Cohen’s Kappa as we have only 2 people in our group (also makes the calculation way easier)
ANS: Kappa = 0.757
ANS: Substantial agreement
ANS: Prompts mentioning show the nature of high schoolers,“What should I absolutely know for a high school quiz on these topics?” and “Define ATP for a high school student.”, showed main differences which tells that maybe different levels of education rises a different sense of opinions. Also some testing questions of “Make me a short study guide on photosynthesis and respiration” and “Give me a simple chart comparing photosynthesis and respiration.” were not in agreement.
ANS: Testing Knowledge and Differentiate Concepts
ANS: Having a well defined codebook is essential to correctly map the propmts to their code with logical reasoning behind as to why we chose a specific code and why it works well with it (with correct justification)