Abstract
This paper presents an experimental analysis of the Stag Hunt and
Prisoner’s Dilemma games. The experiment was run at the Faculty of
Management, Comenius University, with the participation of 16 and 18
players, respectively. In analyzing the experiment results, we note that
the game-theoretical Nash Equilibria were achieved by the majority of
players, albeit after experiencing a few introductory rounds. We
interpret this fact as an absence of the players’ perfect rationality
and we incline in our interpretations more to the players’ bounded
rationality assumption.
Stag Hunt Game
The Stug Hunt game is a popular theoretical game that highlights the
cooperation of players in light of their risk and payoff
preferences.
Table 1. Payoff matrix of the Prisoner’s
Dilemma.
| Cooperate |
2 \ 2 |
0 \ 1 |
| Non-Cooperate |
1 \ 0 |
1 \ 1 |
The Stag Hunt is a coordination game in which two players must choose
between hunting a stag or a hare. Hunting the stag is a great way to
win, but only if both players work together. Hunting the hare is a
safer, lower-reward option that does not require cooperation. The game
reflects the tension between trust and risk on one side and high mutual
benefits on the other.
If analysing the payoff matrix (see Table. 1) and
experiment results (Fig. 1), the following is
apparent:
- There is just one Pareto-efficient solution: both hunters should
cooperate (red). This strategy profile is the only theoretical
prediction (dominant Nash equilibrium) based on Game theory maximising
payoffs of both players. The second Nash equilibrium consists of the
safe Non-cooperative strategies, i.e., both hunters are hunting the
rabbit individually.
- If playing eight rounds of the experiment, the dominant Nash
equilibrium was not played automatically by all the players; on the
contrary, the game results reflect how payoffs, trust, risk preferences,
and experience collected in previous rounds influence the players’
strategic decisions.
The majority of players converged to the Pareto-efficient Nash
equilibrium based on cooperation, while a minority of players maintain
their Nash equilibrium strategies, focusing on risk elimination. The
other non-coordination solutions remain minor and not systematic.
Evolution
Fig. 1 depicts the evolution of the Cooperation /
Non-Cooperation strategies in 8 rounds. There were 18 participants, who
were randomly matched at the start of the experiment. The created pairs
persisted during the whole experiment. The evolution of the played
strategies shows that the participants played the first round somewhat
randomly, but in later rounds, the players converged mainly to the
cooperating strategy. In the last round, five pairs of players out of
nine employed the Nash equilibrium strategy, in which both players
cooperate. Thus, we show that convergence towards a dominant Nash
equilibrium does not emerge suddenly, but rather it is the result of
trial-and-error experiments by the players. At the same time, two pairs
of players played the dominated Nash equilibrium of mutual
non-cooperation. Just two pairs of players played out of the equilibria
in the last round.
Fig. 1. Eight rounds of the strategies evolution

The blue line in Fig. 2 depicts the evolution of the
average payoffs achieved during the simulation. The solid red line fits
the experimental results, showing that the average payoffs increase as
the simulation progresses through the rounds. The dotted red line
denotes the payoff if playing the cooperative Nash equilibrium, which is
the maximum possible payoff in the game. The dashed red line denotes the
payoffs of playing the non-cooperative Nash equilibrium. The real
average payoffs evolve progressively between the two horizontal lines;
however, there is a slight chance of reaching the cooperative strategies
played by all the experiment participants.
Fig 2. Evolution of the average payoffs

Prisoner’s Dilemma Game
The Prisoner’s Dilemma is a strategic game that captures the conflict
between mutual benefits of cooperation and selfish incentives to
unilaterally break the cooperation.
Table 2. Payoff matrix of the Prisoner’s
Dilemma.
| Cooperate |
3 \ 3 |
0 \ 5 |
| Non-Cooperate (Defect) |
5 \ 0 |
2 \ 2 |
Two players each choose between cooperating and defecting.
Cooperation leads to a moderate payoff for both, while unilateral
defection offers a tempting, higher reward if the other remains
cooperative — but at the cost of severely punishing the cooperator. If
both defect, they each receive a lower payoff than if they had
cooperated. The dilemma lies in the fact that mutual defection is a Nash
Equilibrium strategy, even though cooperation payoff dominates mutual
defection, i.e., mutual cooperation would lead to a better outcome for
both (red color). The game reveals the challenges of trust and the
tempting option of unilateral defection, showing how rational
self-interest can lead to collectively suboptimal outcomes. Mutual
cooperation is Pareto efficient, while mutual defection constitutes a
Pareto inefficient Nash equilibrium.
Evolution
Upon inspecting Fig. 3, the apparent dominance in
mutually playing the defective strategies is evident from the very start
of the simulation. Five pairs of players played the Nash equilibrium in
the first round; however, no pair played the Pareto-efficient strategy
profile of mutually playing the Cooperative strategies. However, the
other strategies are tempting, causing the Nash equilibrium to be less
stable, and some players try individually to restore the Pareto-optimal
strategy profile of mutual cooperation. It was achieved only once in the
fourth round; otherwise, no pair converted to this strategy profile
throughout the entire simulation. We explain it so that there is no
cooperation incentive preventing unilateral violation of the
cooperation.
Fig.3 Eight rounds of the strategies evolution

If inspecting average payoffs, as introduced in Fig.
4, the following is apparent: the red solid line demonstrates
the general trend, which is sloping down; however, we do not consider it
to be statistically significant. During the entire experiment, the
number of defective player pairs oscillates around five. It means that
the Nash equilibrium of mutual defection (meaning the lowest average
payoff (2) for both players) is played most frequently; however, the
players always try to destroy this kind of disadvantageous equilibrium.
Attempts to disrupt this kind of equilibrium result in significant
oscillations in the average payoffs (blue curve) around the trend
line.
Fig 4. Evolution of the average payoffs

Inspecting the payoff matrix given in Table 2.
concludes that selection of the second row, or selection of the left
column is not rationale, as both strategies dominate their counterpart
(i.e., first row or the right column) as regardless the turn of the
opponent, the second raw (right column) provides higher payoff than the
first raw (left column). Fig. 5 demonstrates that the
number of these irrational strategies selected decreases as the number
of rounds passed increases. However, there is a relatively small chance
they converge to zero, as the incentives to deviate from the high
cooperation payoffs strategies are high.
Fig.5 Number of Not Best Responses - evolution

---
title: "Prisoner's Dilemma & Stag and Hunt Games"
subtitle: "Experimental controversy"
author: "Michal Páleník and Vladimír Gazda"
output: html_notebook
---


```{r setup2, include=FALSE}
rm(list=ls())
library(knitr)

knitr::opts_chunk$set(
  echo = FALSE,
  message = FALSE,
  warning = FALSE,
  error = FALSE,
  include = TRUE,

  # ✳️ GLOBAL figure settings
  fig.width = 5,      # width in inches
  fig.height = 3.75,     # height in inches
  out.width = "75%",  # relative size in HTML
  fig.align = "center",
  dpi = 400,          # sharp text and lines
  dev = "svg"         # vector graphics (no blur in HTML)
)
```

## Abstract

This paper presents an experimental analysis of the Stag Hunt and Prisoner’s Dilemma games. The experiment was run at the Faculty of Management, Comenius University, with the participation of 16 and 18 players, respectively. In analyzing the experiment results, we note that the game-theoretical Nash Equilibria were achieved by the majority of players, albeit after experiencing a few introductory rounds. We interpret this fact as an absence of the players' perfect rationality and we incline in our interpretations more to the players' bounded rationality assumption.

## Stag Hunt Game

The Stug Hunt game is a popular theoretical game that highlights the cooperation of players in light of their risk and payoff preferences. 

**Table 1.** Payoff matrix of the Prisoner’s Dilemma.

| Player R \ Player C | Cooperate | Non-Cooperate  |
|:--------------------:|:----------:|:-------:|
| Cooperate | <span style="color:red">2 \\ 2</span> | 0 \\ 1 |
| Non-Cooperate | 1 \\ 0 | <span style="color:blue">1 \\ 1</span> |


The Stag Hunt is a coordination game in which two players must choose between hunting a stag or a hare. Hunting the stag is a great way to win, but only if both players work together. Hunting the hare is a safer, lower-reward option that does not require cooperation. The game reflects the tension between trust and risk on one side and high mutual benefits on the other.  

If analysing the payoff matrix (see **Table. 1**) and experiment results (**Fig. 1**), the following is apparent:

- There is just one Pareto-efficient solution: both hunters should cooperate (red). This strategy profile is the only theoretical prediction (dominant Nash equilibrium) based on Game theory maximising payoffs of both players. The second Nash equilibrium consists of the safe Non-cooperative strategies, i.e., both hunters are hunting the rabbit individually. 
- If playing eight rounds of the experiment, the dominant Nash equilibrium was not played automatically by all the players; on the contrary,  the game results reflect how payoffs, trust, risk preferences, and experience collected in previous rounds influence the players' strategic decisions. 

The majority of players converged to the Pareto-efficient Nash equilibrium based on cooperation, while a minority of players maintain their Nash equilibrium strategies, focusing on risk elimination. The other non-coordination solutions remain minor and not systematic. 




### Evolution

**Fig. 1** depicts the evolution of the Cooperation / Non-Cooperation strategies in 8 rounds. There were 18 participants, who were randomly matched at the start of the experiment. The created pairs persisted during the whole experiment. The evolution of the played strategies shows that the participants played the first round somewhat randomly, but in later rounds, the players converged mainly to the cooperating strategy. In the last round, five pairs of players out of nine employed the Nash equilibrium strategy, in which both players cooperate. Thus, we show that convergence towards a dominant Nash equilibrium does not emerge suddenly, but rather it is the result of trial-and-error experiments by the players. At the same time, two pairs of players played the dominated Nash equilibrium of mutual non-cooperation. Just two pairs of players played out of the equilibria in the last round.

**Fig. 1.** Eight rounds of the strategies evolution

```{r}
# Load required packages
library(tidyverse)

# Load data
# udajeSH <- read_csv2("StagHuntData.csv")
udajeSH <- read_csv2("udajeSH.csv")

# Filter for Role == 1
role1 <- udajeSH %>% 
  filter(Role == 1)

# --- Compute 2x2 occurrence tables ---
round_tables <- role1 %>%
  group_by(Round, Decision, OtherDecision) %>%
  summarise(count = n(), .groups = "drop") %>%
  complete(Round, Decision = 1:2, OtherDecision = 1:2, fill = list(count = 0))

# --- Print 2x2 tables for each round ---
for (r in sort(unique(round_tables$Round))) {
  #cat("\nRound", r, "\n")
  tbl <- round_tables %>%
    filter(Round == r) %>%
    pivot_wider(names_from = OtherDecision, values_from = count)
  #print(tbl)
}
```




```{r heatmapsSH}

# --- Heatmaps across rounds (white → light blue → dark blue) ---
ggplot(round_tables, aes(x = factor(OtherDecision),
                         y = factor(Decision),
                         fill = count)) +
  geom_tile(color = "white") +
  geom_text(aes(label = count), color = "black", size = 4) +
  scale_fill_gradientn(
    colours = c("white", "#ADD8E6", "#00008B"),  # white → light blue → dark blue
    limits = c(0, 6),
    name = "Count"
  ) +
  scale_y_discrete(limits = c("2", "1")) +  # Reverse Y-axis: 1 on top
  facet_wrap(~ Round, ncol = 4) +
  labs(
    # title = "",
    x = "Column Player 1-cooperate, 2 non-cooperate",
    y = "Row Player 1-cooperate, 2 non-cooperate"
  ) +
  theme_minimal(base_size = 9) +
  theme(
    panel.grid = element_blank(),
    strip.text = element_text(face = "bold")
  )


```


The blue line in **Fig. 2** depicts the evolution of the average payoffs achieved during the simulation. The solid red line fits the experimental results, showing that the average payoffs increase as the simulation progresses through the rounds. The dotted red line denotes the payoff if playing the cooperative Nash equilibrium, which is the maximum possible payoff in the game. The dashed red line denotes the payoffs of playing the non-cooperative Nash equilibrium. The real average payoffs evolve progressively between the two horizontal lines; however, there is a slight chance of reaching the cooperative strategies played by all the experiment participants.

**Fig 2.** Evolution of the average payoffs

```{r linegraphSH}
# --- Compute average payoff per round (symmetric game) ---
avg_payoffs <- udajeSH %>%
  group_by(Round) %>%
  summarise(Average_Payoff = mean(Earnings, na.rm = TRUE)) %>%
  ungroup()

# --- Line plot of average payoffs by round ---
ggplot(avg_payoffs, aes(x = Round, y = Average_Payoff)) +
  # Evolution line (dark blue)
  geom_line(aes(color = "Evolution of average payoffs"), linewidth = 0.6) +
  geom_point(color = "darkblue", size = 2) +
  


  # Trend line (solid red)
  geom_smooth(aes(color = "Trend evolution line"),
              method = "lm", se = FALSE, linetype = "solid", linewidth = 0.5) +
  
  # Horizontal lines (dotted & dashed red)
  geom_hline(aes(yintercept = 2, color = "Avg. payoff if both cooperating (3)"),
             linetype = "dotted", linewidth = 0.5) +
  geom_hline(aes(yintercept = 1, color = "Avg. payoff if both defecting (2)"),
             linetype = "dashed", linewidth = 0.5) +
  
  # Define legend and colors
  scale_color_manual(
    name = "Legend",
    values = c(
      "Evolution of average payoffs" = "darkblue",
      "Avg. payoff if both cooperating (2)" = "red",
      "Avg. payoff if both defecting (1)" = "red",
      "Trend evolution line" = "red"
    )
  ) +
  
  # Wrap legend into two rows
  guides(color = guide_legend(nrow = 4, byrow = TRUE)) +
  
  labs(
    # title = "",
    x = "Round",
    y = "Average Payoff"
  ) +
  theme_minimal(base_size = 9) +
  theme(
    legend.position = "bottom",
    legend.title = element_text(face = "bold"),
    legend.text = element_text(size = 8),
    panel.grid.minor = element_blank()
  )


```



## Prisoner's Dilemma Game


```{r}
rm(list=ls())
```

The Prisoner's Dilemma is a strategic game that captures the conflict between mutual benefits of cooperation and selfish incentives to unilaterally break the cooperation.

**Table 2.** Payoff matrix of the Prisoner’s Dilemma.

| Player R \ Player C | Cooperate | Non-Cooperate (Defect) |
|:--------------------:|:----------:|:-------:|
| Cooperate | <span style="color:red">3 \\ 3</span> | 0 \\ 5 |
| Non-Cooperate (Defect) | 5 \\ 0 | <span style="color:blue">2 \\ 2</span> |

Two players each choose between cooperating and defecting. Cooperation leads to a moderate payoff for both, while unilateral defection offers a tempting, higher reward if the other remains cooperative — but at the cost of severely punishing the cooperator. If both defect, they each receive a lower payoff than if they had cooperated. The dilemma lies in the fact that mutual defection is a Nash Equilibrium strategy, even though cooperation payoff dominates mutual defection, i.e., mutual cooperation would lead to a better outcome for both (red color). The game reveals the challenges of trust and the tempting option of unilateral defection, showing how rational self-interest can lead to collectively suboptimal outcomes. Mutual cooperation is Pareto efficient, while mutual defection constitutes a Pareto inefficient Nash equilibrium.


### Evolution

Upon inspecting **Fig. 3**, the apparent dominance in mutually playing the defective strategies is evident from the very start of the simulation. Five pairs of players played the Nash equilibrium in the first round; however, no pair played the Pareto-efficient strategy profile of mutually playing the Cooperative strategies. However, the other strategies are tempting, causing the Nash equilibrium to be less stable, and some players try individually to restore the Pareto-optimal strategy profile of mutual cooperation. It was achieved only once in the fourth round; otherwise, no pair converted to this strategy profile throughout the entire simulation. We explain it so that there is no cooperation incentive preventing unilateral violation of the cooperation.

**Fig.3** Eight rounds of the strategies evolution

```{r heatmapPD}
# Load required packages
library(tidyverse)

# Load data
udajePD <- read_csv2("PrisonersDilemmaData.csv")

# Filter for Role == 1
role1 <- udajePD %>% 
  filter(Role == 1)

# --- Compute 2x2 occurrence tables ---
round_tables <- role1 %>%
  group_by(Round, Decision, OtherDecision) %>%
  summarise(count = n(), .groups = "drop") %>%
  complete(Round, Decision = 1:2, OtherDecision = 1:2, fill = list(count = 0))

# --- Print 2x2 tables for each round ---
for (r in sort(unique(round_tables$Round))) {
#  cat("\nRound", r, "\n")
  tbl <- round_tables %>%
    filter(Round == r) %>%
    pivot_wider(names_from = OtherDecision, values_from = count)
  #print(tbl)
}

# --- Heatmaps across rounds (white → light blue → dark blue) ---
ggplot(round_tables, aes(x = factor(OtherDecision),
                         y = factor(Decision),
                         fill = count)) +
  geom_tile(color = "white") +
  geom_text(aes(label = count), color = "black", size = 4) +
  scale_fill_gradientn(
    colours = c("white", "#ADD8E6", "#00008B"),  # white → light blue → dark blue
    limits = c(0, 7),
    name = "Count"
  ) +
  scale_y_discrete(limits = c("2", "1")) +  # Reverse Y-axis: 1 on top
  facet_wrap(~ Round, ncol = 4) +
  labs(
    # title = "",
    x = "Column Player 1-cooperate, 2 non-cooperate",
    y = "Row Player 1-cooperate, 2 non-cooperate"
  ) +
  theme_minimal(base_size = 9) +
  theme(
    panel.grid = element_blank(),
    strip.text = element_text(face = "bold")
  )


```



If inspecting average payoffs, as introduced in **Fig. 4**, the following is apparent: the red solid line demonstrates the general trend, which is sloping down; however, we do not consider it to be statistically significant. During the entire experiment, the number of defective player pairs oscillates around five. It means that the Nash equilibrium of mutual defection (meaning the lowest average payoff (2) for both players) is played most frequently; however, the players always try to destroy this kind of disadvantageous equilibrium. Attempts to disrupt this kind of equilibrium result in significant oscillations in the average payoffs (blue curve) around the trend line. 




**Fig 4.** Evolution of the average payoffs

```{r trendplotPD}
# --- Compute average payoff per round (symmetric game) ---
avg_payoffs <- udajePD %>%
  group_by(Round) %>%
  summarise(Average_Payoff = mean(Earnings, na.rm = TRUE)) %>%
  ungroup()

# --- Line plot of average payoffs by round ---
ggplot(avg_payoffs, aes(x = Round, y = Average_Payoff)) +
  # Evolution line (dark blue)
  geom_line(aes(color = "Evolution of average payoffs"), linewidth = 0.6) +
  geom_point(color = "darkblue", size = 2) +
  
  # Trend line (solid red)
  geom_smooth(aes(color = "Trend evolution line"),
              method = "lm", se = FALSE, linetype = "solid", linewidth = 0.5) +
  
  # Horizontal lines (dotted & dashed red)
  geom_hline(aes(yintercept = 3, color = "Avg. payoff if both cooperating (3)"),
             linetype = "dotted", linewidth = 0.5) +
  geom_hline(aes(yintercept = 2, color = "Avg. payoff if both defecting (2)"),
             linetype = "dashed", linewidth = 0.5) +
  
  # Define legend and colors
  scale_color_manual(
    name = "Legend",
    values = c(
      "Evolution of average payoffs" = "darkblue",
      "Avg. payoff if both cooperating (3)" = "red",
      "Avg. payoff if both defecting (2)" = "red",
      "Trend evolution line" = "red"
    )
  ) +
  
  # Wrap legend into two rows
  guides(color = guide_legend(nrow = 4, byrow = TRUE)) +
  
  labs(
    # title = "",
    x = "Round",
    y = "Average Payoff"
  ) +
  theme_minimal(base_size = 9) +
  theme(
    legend.position = "bottom",
    legend.title = element_text(face = "bold"),
    legend.text = element_text(size = 8),
    panel.grid.minor = element_blank()
  )

```
Inspecting the payoff matrix given in **Table 2.** concludes that selection of the second row, or selection of the left column is not rationale, as both strategies dominate their counterpart (i.e., first row or the right column) as regardless the turn of the opponent, the second raw (right column) provides higher payoff than the first raw (left column). **Fig. 5** demonstrates that the number of these irrational strategies selected decreases as the number of rounds passed increases. However, there is a relatively small chance they converge to zero, as the incentives to deviate from the high cooperation payoffs strategies are high.

**Fig.5** Number of Not Best Responses - evolution

```{r}
library(ggplot2)

not_best <- data.frame(
  Round = 1:8,
  Not_Best_Response = c(3, 7, 4, 2, 3, 6, 2, 3)
)

ggplot(not_best, aes(x = Round, y = Not_Best_Response)) +
  geom_col(fill = "steelblue", width = 0.5) +
  geom_text(aes(label = Not_Best_Response),
            vjust = -0.5, size = 3.5, color = "red") +
  geom_smooth(method = "lm", se = FALSE, color = "red", linewidth = 0.8) +  # regression line
  labs(
    x = "Round",
    y = "Count of Non–Best–Response Strategies"
  ) +
  theme_minimal(base_size = 9) +
  theme(
    panel.grid.minor = element_blank(),
    panel.grid.major.x = element_blank(),
    axis.text = element_text(size = 9),
    plot.title = element_text(face = "bold", hjust = 0.5)
  ) +
  ylim(0, max(not_best$Not_Best_Response) + 1)

```

