Going to try working with Statsbomb Data. First, will need to install Statsbombr which is their parsing package.

Will need to do this through Github, as this won’t be in the packages under tools. Will have to use the devtools library in order to achieve this using the code below:

(For full Statsbomb R Walkthrough, navigate through the browser here: https://statsbomb.com/wp-content/uploads/2021/11/Working-with-R.pdf)

library(tidyverse)
library(ggplot2)
library(devtools)
Warning: package ‘devtools’ was built under R version 4.2.3Loading required package: usethis
devtools::install_github("statsbomb/SDMTools") 
Skipping install of 'SDMTools' from a github remote, the SHA1 (4f193c85) has not changed since last install.
  Use `force = TRUE` to force installation
devtools::install_github("statsbomb/StatsBombR")
Skipping install of 'StatsBombR' from a github remote, the SHA1 (f0503b80) has not changed since last install.
  Use `force = TRUE` to force installation
devtools::install_github("Torvaney/ggsoccer")
Skipping install of 'ggsoccer' from a github remote, the SHA1 (3fe317f3) has not changed since last install.
  Use `force = TRUE` to force installation

Next we need to load the soccer libraries now that we’ve downloaded them

library(StatsBombR)
Loading required package: stringi
Warning: package ‘stringi’ was built under R version 4.2.2Loading required package: rvest
Warning: package ‘rvest’ was built under R version 4.2.2
Attaching package: ‘rvest’

The following object is masked from ‘package:readr’:

    guess_encoding

Loading required package: RCurl
Warning: package ‘RCurl’ was built under R version 4.2.3
Attaching package: ‘RCurl’

The following object is masked from ‘package:tidyr’:

    complete

Loading required package: doParallel
Warning: package ‘doParallel’ was built under R version 4.2.3Loading required package: foreach
Warning: package ‘foreach’ was built under R version 4.2.2
Attaching package: ‘foreach’

The following objects are masked from ‘package:purrr’:

    accumulate, when

Loading required package: iterators
Warning: package ‘iterators’ was built under R version 4.2.2Loading required package: parallel
Loading required package: httr
Warning: package ‘httr’ was built under R version 4.2.3Loading required package: jsonlite
Warning: package ‘jsonlite’ was built under R version 4.2.3
Attaching package: ‘jsonlite’

The following object is masked from ‘package:purrr’:

    flatten

Loading required package: sp
Warning: package ‘sp’ was built under R version 4.2.3Warning: replacing previous import ‘foreach::when’ by ‘purrr::when’ when loading ‘StatsBombR’Warning: replacing previous import ‘jsonlite::flatten’ by ‘purrr::flatten’ when loading ‘StatsBombR’Warning: replacing previous import ‘foreach::accumulate’ by ‘purrr::accumulate’ when loading ‘StatsBombR’
library(ggsoccer)

Filter the competition we want to look at

freecomp <- FreeCompetitions()
[1] "Whilst we are keen to share data and facilitate research, we also urge you to be responsible with the data. Please credit StatsBomb as your data source when using the data and visit https://statsbomb.com/media-pack/ to obtain our logos for public use."
competitions <- FreeCompetitions() %>%
  filter(competition_id == 43, season_id==106)
[1] "Whilst we are keen to share data and facilitate research, we also urge you to be responsible with the data. Please credit StatsBomb as your data source when using the data and visit https://statsbomb.com/media-pack/ to obtain our logos for public use."
str(competitions)
'data.frame':   1 obs. of  12 variables:
 $ competition_id           : int 43
 $ season_id                : int 106
 $ country_name             : chr "International"
 $ competition_name         : chr "FIFA World Cup"
 $ competition_gender       : chr "male"
 $ competition_youth        : logi FALSE
 $ competition_international: logi TRUE
 $ season_name              : chr "2022"
 $ match_updated            : chr "2023-11-05T04:23:26.649917"
 $ match_updated_360        : chr "2023-11-21T15:37:11.589616"
 $ match_available_360      : chr "2023-11-21T15:37:11.589616"
 $ match_available          : chr "2023-11-05T04:23:26.649917"
write.csv(freecomp, "C:\\Users\\jmo5660\\School\\Statsbomb\\competitions.csv", row.names = FALSE)

Now we want to look at matches in the season

matches <- FreeMatches(Competitions = competitions
)
[1] "Whilst we are keen to share data and facilitate research, we also urge you to be responsible with the data. Please credit StatsBomb as your data source when using the data and visit https://statsbomb.com/media-pack/ to obtain our logos for public use."

Now let’s look at the event data

non_clean_events <- free_allevents(MatchesDF = matches, Parallel = T)
[1] "Whilst we are keen to share data and facilitate research, we also urge you to be responsible with the data. Please credit StatsBomb as your data source when using the data and visit https://statsbomb.com/media-pack/ to obtain our logos for public use."
# Convert nested lists to character strings
compatible_columns <-sapply(non_clean_events, function(col) is.atomic(col) && !is.list(col))
non_clean_events2 <- non_clean_events[, compatible_columns, drop=FALSE]


write.csv(non_clean_events2, "C:\\Users\\jmo5660\\School\\Statsbomb\\all_unclean_free_event_data_2022_FifaWorldCup.csv", row.names = FALSE)

#clean the events based on the provided statsbomb function
events = allclean(non_clean_events)
Joining with `by = join_by(id)`Joining with `by = join_by(id)`Joining with `by = join_by(id)`Joining with `by = join_by(id)`Joining with `by = join_by(id)`Joining with `by = join_by(id)`Joining with `by = join_by(id)`Joining with `by = join_by(id)`Joining with `by = join_by(id)`Joining with `by = join_by(id)`Joining with `by = join_by(id)`Joining with `by = join_by(id)`Joining with `by = join_by(id)`Joining with `by = join_by(id)`Joining with `by = join_by(id)`Joining with `by = join_by(id)`Joining with `by = join_by(id)`Joining with `by = join_by(period, match_id)`
compatible_columns <-sapply(events, function(col) is.atomic(col) && !is.list(col))
subset_df <- events[, compatible_columns, drop=FALSE]


# Assuming you have loaded the StatsBomb event data into a dataframe called 'events'

# Convert timestamp column to numeric format
events$timestamp <- as.numeric(events$timestamp)
Warning: NAs introduced by coercion
# Sort the events dataframe by the timestamp column
events <- events[order(events$timestamp), ]

# Create a new column to store the duration of each possession
events$possession_duration <- NA

# Initialize variables
current_possession <- 1
possession_start_time <- events$timestamp[1]

# Iterate through the events dataframe
for (i in 2:length(events$timestamp)) {
  # Check if possession changes (based on possession_team.id)
  if (events$possession_team.id[i] != events$possession_team.id[i-1]) {
    # Calculate the duration of the previous possession
    possession_duration <- events$timestamp[i-1] - possession_start_time
    # Update the duration column for the previous possession
    events$possession_duration[events$possession_team.id == current_possession] <- possession_duration
    
    # Reset variables for the new possession
    current_possession <- events$possession_team.id[i]
    possession_start_time <- events$timestamp[i]
  }
}

# Calculate the duration of the last possession (if exists)
possession_duration <- events$timestamp[length(events$timestamp)] - possession_start_time
events$possession_duration[events$possession_team.id == current_possession] <- possession_duration



write.csv(subset_df, "C:\\Users\\jmo5660\\School\\Statsbomb\\clean_events.csv")

Player Summary Statistics (Need to do a join on all summaries)

summary <- events %>%
  group_by(events$player.name) %>%
  summarise(
    "Carries" = sum(type.name == "Carry", na.rm = TRUE),
    "Time of Possession" = sum(duration, na.rm = TRUE),
    "Shots" = sum(type.name == "Shot", na.rm = TRUE),
    "Passes" = sum(type.name == "Pass", na.rm = TRUE),
    "Pressures" = sum(type.name == "Pressure", na.rm = TRUE),
    "Fouls_Won" = sum(type.name == "Foul Won", na.rm = TRUE),
    "Interceptions" = sum(type.name == "Interception", na.rm = TRUE),
    "Dispossessed" = sum(type.name == "Dispossessed", na.rm = TRUE),
    "Blocks" = sum(type.name == "Block", na.rm = TRUE),
    "xG" = sum(shot.statsbomb_xg, na.rm=TRUE),
    "Total_Passes_Length" = sum(pass.length, na.rm=TRUE),
    "Nutmegs" = sum(dribble.nutmeg == "TRUE", na.rm=TRUE),
    "Assists" = sum(pass.shot_assist == "TRUE", na.rm = TRUE),
    "Matches_Played" = n_distinct(match_id),
    "Interceptions_Won" = sum(interception.outcome.name == "Won", na.rm=TRUE),
    "Duel_Won" = sum(duel.outcome.name == "Success In Play", na.rm=TRUE),
    "Duel_Lost" = sum(duel.outcome.name == "Lost In Play", na.rm=TRUE),
    "Complete_Dribbles" = sum(dribble.outcome.name == "Complete", na.rm=TRUE),
    "Incomplete_Dribbles" = sum(dribble.outcome.name == "Incomplete", na.rm=TRUE),
  )



summary_goal_outcome <- events %>%
  group_by(player.name) %>%
  summarise(
    Goals = sum(shot.outcome.name == "Goal", na.rm = TRUE),
    `Shots Saved to Post` = sum(shot.outcome.name == "Saved to Post", na.rm = TRUE),
    Shots_Blocked = sum(shot.outcome.name == "Blocked", na.rm = TRUE),
    `Shots Off T` = sum(shot.outcome.name == "Off T", na.rm = TRUE),
    Shots_Hit_Post = sum(shot.outcome.name == "Post", na.rm = TRUE),
    Shots_Saved = sum(shot.outcome.name == "Saved", na.rm = TRUE),
    `Shots Saved Off Target` = sum(shot.outcome.name == "Saved Off Target", na.rm = TRUE),
    Shots_Wayward = sum(shot.outcome.name == "Wayward", na.rm = TRUE)
  )


joined_player_summary <- inner_join(summary, summary_goal_outcome, by = c("events$player.name" = "player.name"))%>% 
  mutate(across(c("Dispossessed", "Carries"), as.numeric),
         dispossessed_per_carry = Dispossessed / Carries) %>% 
  mutate(interception_per_pressure = Interceptions / Pressures)%>% 
  mutate(fouls_won_per_Carry = Fouls_Won / Carries)%>% 
  mutate(blocks_per_pressure = Blocks / Pressures)%>%
  mutate(goals_per_shot = Goals / Shots) %>%
  mutate(Average_Pass_Length = Total_Passes_Length/Passes) %>%
  mutate(blocks_per_match_played = Blocks/Matches_Played) %>%
  mutate(interceptions_per_match = Interceptions/Matches_Played) %>%
  mutate(interception_won_per_pressure = Interceptions_Won / Pressures)%>% 
  mutate(interceptions_won_per_match = Interceptions_Won/Matches_Played) %>%
  mutate(G_xG_Diff = Goals - xG) %>%
  mutate(Duel_Win_Percentage = Duel_Won/(Duel_Won+Duel_Lost)) %>%
  mutate(Complete_Dribble_Percentage = Complete_Dribbles/(Complete_Dribbles+Incomplete_Dribbles))
           

write.csv(joined_player_summary, "C:\\Users\\jmo5660\\School\\Statsbomb\\joined_player_summary.csv", row.names = FALSE)

Team Stats – mimics player stats

summary4 <- events %>%
  group_by(events$team.name) %>%
  summarise("Carries" = sum(type.name=="Carry", na.rm=TRUE))


summary5 <- events %>%
  group_by(events$team.name) %>%
  summarise("Time of Possession" = sum(duration, na.rm=TRUE))

summary6 <- events %>%
  group_by(events$team.name) %>%
  summarise("Shots" = sum(type.name=="Shot", na.rm=TRUE))

summary_team_Pass <- events %>%
  group_by(events$team.name) %>%
  summarise("Passes" = sum(type.name=="Pass", na.rm=TRUE))

summary_team_Pressure <- events %>%
  group_by(events$team.name) %>%
  summarise("Pressures" = sum(type.name=="Pressure", na.rm=TRUE))

summary_foulwon <- events %>%
  group_by(events$team.name) %>%
  summarise("Fouls Won" = sum(type.name=="Foul Won", na.rm=TRUE))

summary_interception <- events %>%
  group_by(events$team.name) %>%
  summarise("Interceptions" = sum(type.name=="Interception", na.rm=TRUE))

joined_team_summary <- inner_join(summary4, summary5, by = "events$team.name") %>%
  inner_join(summary6, by = "events$team.name") %>%
  inner_join(summary_team_Pass, by = "events$team.name")  %>%
  inner_join(summary_team_Pressure, by = "events$team.name")%>%
  inner_join(summary_foulwon, by = "events$team.name")%>%
  inner_join(summary_interception, by = "events$team.name")

write.csv(joined_team_summary, "C:\\Users\\jmo5660\\School\\Statsbomb\\joined_team_summary.csv", row.names = FALSE)

Let’s look to see what correlations there are in our dataset – first for players

lm_model <- lm(Interceptions~Pressures, data=joined_player_summary)

summary(lm_model)

Call:
lm(formula = Interceptions ~ Pressures, data = joined_player_summary)

Residuals:
    Min      1Q  Median      3Q     Max 
-8.0808 -1.1029 -0.4027  0.6919 10.2713 

Coefficients:
            Estimate Std. Error t value Pr(>|t|)    
(Intercept) 0.350357   0.117399   2.984  0.00294 ** 
Pressures   0.068411   0.003535  19.352  < 2e-16 ***
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 2.088 on 679 degrees of freedom
Multiple R-squared:  0.3555,    Adjusted R-squared:  0.3545 
F-statistic: 374.5 on 1 and 679 DF,  p-value: < 2.2e-16
plot(joined_player_summary$Pressures, joined_player_summary$Interceptions, xlab="Pressures", ylab="Interceptions", main="Linear Regression for Pressures and Interceptions")
abline(lm_model, col="red")


# Obtain residuals
residuals <- residuals(lm_model)

# Plot the residuals
plot(predict(lm_model), residuals, pch = 16, xlab = "Fitted values", ylab = "Residuals",
     main = "Residuals Plot: Actual vs. Fitted")


# Plot the standardized residuals
std_residuals <- rstandard(lm_model)
plot(predict(lm_model), std_residuals, pch = 16, xlab = "Fitted values", ylab = "Standardized residuals",
     main = "Residuals Plot: Standardized")


library(car)
Warning: package ‘car’ was built under R version 4.2.3Loading required package: carData
Warning: package ‘carData’ was built under R version 4.2.3
Attaching package: ‘car’

The following object is masked from ‘package:dplyr’:

    recode

The following object is masked from ‘package:purrr’:

    some
# Normal probability plot
qqPlot(residuals, main = "Normal Probability Plot: Residuals")
[1]  77 162

Now for Teams

lm_model2 <- lm(Interceptions~Pressures, data=joined_team_summary)

summary(lm_model2)

Call:
lm(formula = Interceptions ~ Pressures, data = joined_team_summary)

Residuals:
     Min       1Q   Median       3Q      Max 
-13.8709  -5.2709  -0.1027   5.0379  20.8449 

Coefficients:
            Estimate Std. Error t value Pr(>|t|)    
(Intercept) 1.069205   3.770672   0.284    0.779    
Pressures   0.080758   0.006799  11.879 7.19e-13 ***
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 7.695 on 30 degrees of freedom
Multiple R-squared:  0.8247,    Adjusted R-squared:  0.8188 
F-statistic: 141.1 on 1 and 30 DF,  p-value: 7.189e-13
plot(joined_team_summary$Pressures, joined_team_summary$Interceptions, xlab="Pressures", ylab="Interceptions", main="Linear Regression for Pressures and Interceptions")
abline(lm_model2, col="red")

Let’s Try to Produce a Carry Map for Christian Pulisic – first let’s filter his data out of the events

library(dplyr)
library(cowplot)
Warning: package ‘cowplot’ was built under R version 4.2.3
Attaching package: ‘cowplot’

The following object is masked from ‘package:lubridate’:

    stamp
# Filter carries for "Christian Pulisic"
pulisic_carries <- events %>%
  filter(player.name == "Christian Pulisic", type.name == "Carry")

# Create a new dataframe with selected columns
pulisic_carry_data <- pulisic_carries %>%
  select(player.name, type.name, location.x, location.y, carry.end_location.x, carry.end_location.y)

Now let’s create the Christian Pulisic Carry map – will need to re-scale the ggsoccer coordinates to statsbomb


plot<-ggplot(pulisic_carry_data) +
  annotate_pitch(dimensions = pitch_statsbomb, 
                 colour = "white",
                 fill   = "springgreen4",
                 limits = FALSE) +
  geom_segment(aes(x = location.x, y = location.y, xend = carry.end_location.x, yend =  carry.end_location.y),
               colour = "navy",
               arrow = arrow(length = unit(0.15, "cm"),
                             type = "closed")) +
  theme_pitch() +
  direction_label(colour = "white")+
  theme(panel.background = element_rect(fill = "springgreen4")) +
  ggtitle("Simple Pulisic  World Cup 2022 Carry Map", 
          "Orange Analytics")
ggplot2::annotate("text", x = Inf, y = -Inf, label = "Data: StatsBomb", hjust = 1, vjust = -1, size = 3, color = "gray50")
mapping: x = ~x, y = ~y 
geom_text: na.rm = FALSE
stat_identity: na.rm = FALSE
position_identity 
logo <- ggdraw() +
  draw_image("C:\\Users\\jmo5660\\School\\Statsbomb\\StatsBomb Logo\\1. Colour positive\\SB - Icon Lockup - Colour positive.png",x = 0.01, y = 0.001, width = 0.15, height = 0.15)

# Combine the plot and the logo
final_plot <- plot + annotation_custom(ggplotGrob(logo))

# Display the final plot
final_plot

Now let’s do it again for Pulisic Passes Let’s Try to Produce a Carry Map for Christian Pulisic – first let’s filter his data out of the events


# Filter passes for "Christian Pulisic"
pulisic_passes <- events %>%
  filter(player.name == "Christian Pulisic", type.name == "Pass")

# Create a new dataframe with selected columns
pulisic_pass_data <- pulisic_passes %>%
  select(player.name, type.name, location.x, location.y, pass.end_location.x, pass.end_location.y)

Now let’s create the Christian Pulisic Carry map – will need to re-scale the ggsoccer coordinates from optima to statsbomb

plot <- ggplot(pulisic_pass_data) +
  annotate_pitch(dimensions = pitch_statsbomb, 
                 colour = "white",
                 fill   = "springgreen4",
                 limits = FALSE) +
  geom_segment(aes(x = location.x, y = location.y, xend = pass.end_location.x, yend =  pass.end_location.y),
               colour = "navy",
               arrow = arrow(length = unit(0.17, "cm"),
                             type = "closed")) +
  theme_pitch() +
  direction_label(colour = "white")+
  theme(panel.background = element_rect(fill = "springgreen4")) +
  ggtitle("Simple Pulisic  World Cup 2022 Passmap", 
          "Orange Analytics")

logo <- ggdraw() +
  draw_image("C:\\Users\\jmo5660\\School\\Statsbomb\\StatsBomb Logo\\1. Colour positive\\SB - Icon Lockup - Colour positive.png",x = 0.01, y = 0.001, width = 0.15, height = 0.15)

# Combine the plot and the logo
final_plot <- plot + annotation_custom(ggplotGrob(logo))

# Display the final plot
final_plot

Let’s try to look at pressures for one of the World Cup Darlings…Amrabat

# Filter passes for "Christian Pulisic"
amrabat_pressures <- events %>%
  filter(player.name == "Sofyan Amrabat", type.name == "Pressure")

# Create a new dataframe with selected columns
amrabat_pressure_data <- amrabat_pressures %>%
  select(player.name, type.name, location.x, location.y)

plot <- ggplot(amrabat_pressure_data) +
  annotate_pitch(dimensions = pitch_statsbomb,
                 colour = "white",
                 fill   = "springgreen4",
                 limits = TRUE) +
  geom_point(aes(x = location.x, y = location.y),
             colour = "maroon",
             size = 3) +
  theme_pitch() +
  direction_label(colour = "white")+
  theme(panel.background = element_rect(fill = "springgreen4")) +
  ggtitle("Amrabat Pressure Map",
          "Fifa World Cup 2022 Statsbomb Data")

logo <- ggdraw() +
  draw_image("C:\\Users\\jmo5660\\School\\Statsbomb\\StatsBomb Logo\\1. Colour positive\\SB - Icon Lockup - Colour positive.png",x = 0.01, y = 0.001, width = 0.15, height = 0.15)

# Combine the plot and the logo
final_plot <- plot + annotation_custom(ggplotGrob(logo))

# Display the final plot
final_plot

Ziyech

# Filter pressures for Ziyech
ziyech_pressures <- events %>%
  filter(player.name == "Hakim Ziyech", type.name == "Pressure")

# Create a new dataframe with selected columns
ziyech_pressure_data <- ziyech_pressures %>%
  select(player.name, type.name, location.x, location.y)

plot<- ggplot(ziyech_pressure_data) +
  annotate_pitch(dimensions = pitch_statsbomb,
                 colour = "white",
                 fill   = "springgreen4",
                 limits = FALSE) +
  geom_point(aes(x = location.x, y = location.y),
             colour = "maroon",
             size = 3) +
  theme_pitch() +
  direction_label(colour = "white")+
  theme(panel.background = element_rect(fill = "springgreen4")) +
  ggtitle("Ziyech Pressure Map",
          "Fifa World Cup 2022 Statsbomb Data")

logo <- ggdraw() +
  draw_image("C:\\Users\\jmo5660\\School\\Statsbomb\\StatsBomb Logo\\1. Colour positive\\SB - Icon Lockup - Colour positive.png",x = 0.01, y = 0.001, width = 0.15, height = 0.15)

# Combine the plot and the logo
final_plot <- plot + annotation_custom(ggplotGrob(logo))

# Display the final plot
final_plot

Let’s take a good look at Azzedine Ounahi

# Filter pressures for Ounahi
Ounahi_pressures <- events %>%
  filter(player.name == "Azzedine Ounahi", type.name == "Pressure")

# Create a new dataframe with selected columns
Ounahi_pressure_data <- Ounahi_pressures %>%
  select(player.name, type.name, location.x, location.y)

plot<- ggplot(Ounahi_pressure_data) +
  annotate_pitch(dimensions = pitch_statsbomb,
                 colour = "white",
                 fill   = "springgreen4",
                 limits = TRUE) +
  geom_point(aes(x = location.x, y = location.y),
             colour = "maroon",
             size = 3) +
  theme_pitch() +
  direction_label(colour = "white")+
  theme(panel.background = element_rect(fill = "springgreen4")) +
  ggtitle("Azzedine Ounahi Pressure Map",
          "Fifa World Cup 2022 Statsbomb Data")

logo <- ggdraw() +
  draw_image("C:\\Users\\jmo5660\\School\\Statsbomb\\StatsBomb Logo\\1. Colour positive\\SB - Icon Lockup - Colour positive.png",x = 0.01, y = 0.001, width = 0.15, height = 0.15)

# Combine the plot and the logo
final_plot <- plot + annotation_custom(ggplotGrob(logo))

# Display the final plot
final_plot

Azzedine Ounahi – passes

# Filter passses for Ounahi
Ounahi_passes <- events %>%
  filter(player.name == "Azzedine Ounahi", type.name == "Pass")

# Create a new dataframe with selected columns
Ounahi_pass_data <- Ounahi_passes %>%
  select(player.name, type.name, location.x, location.y, pass.end_location.x, pass.end_location.y)

plot <- ggplot(Ounahi_pass_data) +
  annotate_pitch(dimensions = pitch_statsbomb, 
                 colour = "white",
                 fill   = "springgreen4",
                 limits = TRUE) +
  geom_segment(aes(x = location.x, y = location.y, xend = pass.end_location.x, yend =  pass.end_location.y),
               colour = "maroon",
               arrow = arrow(length = unit(0.17, "cm"),
                             type = "closed")) +
  theme_pitch() +
  direction_label(colour = "white")+
  theme(panel.background = element_rect(fill = "springgreen4")) +
  ggtitle("Azzedine Ounahi World Cup 2022 Passing Map", 
          "Orange Analytics")

logo <- ggdraw() +
  draw_image("C:\\Users\\jmo5660\\School\\Statsbomb\\StatsBomb Logo\\1. Colour positive\\SB - Icon Lockup - Colour positive.png",x = 0.01, y = 0.001, width = 0.15, height = 0.15)

# Combine the plot and the logo
final_plot <- plot + annotation_custom(ggplotGrob(logo))

# Display the final plot
final_plot

Azzedine Ounahi – Carries

# Filter passses for Ounahi
Ounahi_Carries <- events %>%
  filter(player.name == "Azzedine Ounahi", type.name == "Carry")

# Create a new dataframe with selected columns
Ounahi_carry_data <- Ounahi_Carries %>%
  select(player.name, type.name, location.x, location.y, carry.end_location.x, carry.end_location.y)

plot <- ggplot(Ounahi_carry_data) +
  annotate_pitch(dimensions = pitch_statsbomb, 
                 colour = "white",
                 fill   = "springgreen4",
                 limits = FALSE) +
  geom_segment(aes(x = location.x, y = location.y, xend = carry.end_location.x, yend =  carry.end_location.y),
               colour = "maroon",
               arrow = arrow(length = unit(0.17, "cm"),
                             type = "closed")) +
  theme_pitch() +
  direction_label(colour = "white")+
  theme(panel.background = element_rect(fill = "springgreen4")) +
  ggtitle("Azzedine Ounahi World Cup 2022 Carry Map", 
          "Orange Analytics")

logo <- ggdraw() +
  draw_image("C:\\Users\\jmo5660\\School\\Statsbomb\\StatsBomb Logo\\1. Colour positive\\SB - Icon Lockup - Colour positive.png",x = 0.01, y = 0.001, width = 0.15, height = 0.15)

# Combine the plot and the logo
final_plot <- plot + annotation_custom(ggplotGrob(logo))

# Display the final plot
final_plot



# Filter carries for "Weston McKennie"
Mckkenie_carries <- events %>%
  filter(player.name == "Weston McKennie", type.name == "Carry")

# Create a new dataframe with selected columns
Mckennie_carry_data <- Mckkenie_carries %>%
  select(player.name, type.name, location.x, location.y, carry.end_location.x, carry.end_location.y)

Now let’s create the Christian Pulisic Carry map – will need to re-scale the ggsoccer coordinates to statsbomb


plot<-ggplot(Mckennie_carry_data) +
  annotate_pitch(dimensions = pitch_statsbomb, 
                 colour = "white",
                 fill   = "springgreen4",
                 limits = FALSE) +
  geom_segment(aes(x = location.x, y = location.y, xend = carry.end_location.x, yend =  carry.end_location.y),
               colour = "navy",
               arrow = arrow(length = unit(0.15, "cm"),
                             type = "closed")) +
  theme_pitch() +
  direction_label(colour = "white")+
  theme(panel.background = element_rect(fill = "springgreen4")) +
  ggtitle("Weston McKennie World Cup 2022 Carry Map", 
          "Orange Analytics")

logo <- ggdraw() +
  draw_image("C:\\Users\\jmo5660\\School\\Statsbomb\\StatsBomb Logo\\1. Colour positive\\SB - Icon Lockup - Colour positive.png",x = 0.01, y = 0.001, width = 0.15, height = 0.15)

# Combine the plot and the logo
final_plot <- plot + annotation_custom(ggplotGrob(logo))

# Display the final plot
final_plot

Now let’s do it again for Pulisic Passes Let’s Try to Produce a Carry Map for Christian Pulisic – first let’s filter his data out of the events


# Filter passes for "Weston McKennie"
mckkenie_passes <- events %>%
  filter(player.name == "Weston McKennie", type.name == "Pass")

# Create a new dataframe with selected columns
mckkenie_pass_data <- mckkenie_passes %>%
  select(player.name, type.name, location.x, location.y, pass.end_location.x, pass.end_location.y)

Now let’s create the mckkenie passing map – will need to re-scale the ggsoccer coordinates from optima to statsbomb

plot <- ggplot(mckkenie_pass_data) +
  annotate_pitch(dimensions = pitch_statsbomb, 
                 colour = "white",
                 fill   = "springgreen4",
                 limits = FALSE) +
  geom_segment(aes(x = location.x, y = location.y, xend = pass.end_location.x, yend =  pass.end_location.y),
               colour = "navy",
               arrow = arrow(length = unit(0.17, "cm"),
                             type = "closed")) +
  theme_pitch() +
  direction_label(colour = "white")+
  theme(panel.background = element_rect(fill = "springgreen4")) +
  ggtitle("Weston Mckennie 2022 World Cup Passing Map", 
          "Orange Analytics")

logo <- ggdraw() +
  draw_image("C:\\Users\\jmo5660\\School\\Statsbomb\\StatsBomb Logo\\1. Colour positive\\SB - Icon Lockup - Colour positive.png",x = 0.01, y = 0.001, width = 0.15, height = 0.15)

# Combine the plot and the logo
final_plot <- plot + annotation_custom(ggplotGrob(logo))

# Display the final plot
final_plot

# Filter passes for "Victor Nelsson"
Nelsson_pressures <- events %>%
  filter(player.name == "Victor Nelsson", type.name == "Pressure")

# Create a new dataframe with selected columns
Nelsson_pressure_data <- Nelsson_pressures %>%
  select(player.name, type.name, location.x, location.y)

plot <- ggplot(Nelsson_pressure_data) +
  annotate_pitch(dimensions = pitch_statsbomb,
                 colour = "white",
                 fill   = "springgreen4",
                 limits = TRUE) +
  geom_point(aes(x = location.x, y = location.y),
             colour = "pink",
             size = 3) +
  theme_pitch() +
  direction_label(colour = "white")+
  theme(panel.background = element_rect(fill = "springgreen4")) +
  ggtitle("Victor Nelson Pressure Map",
          "Fifa World Cup 2022 Statsbomb Data")

logo <- ggdraw() +
  draw_image("C:\\Users\\jmo5660\\School\\Statsbomb\\StatsBomb Logo\\1. Colour positive\\SB - Icon Lockup - Colour positive.png",x = 0.01, y = 0.001, width = 0.15, height = 0.15)

# Combine the plot and the logo
final_plot <- plot + annotation_custom(ggplotGrob(logo))

# Display the final plot
final_plot

# Filter passes for "Victor Nelsson"
Nelsson_passes<- events %>%
  filter(player.name == "Victor Nelsson", type.name == "Pass")

# Create a new dataframe with selected columns
Nelsson_passing_data <- Nelsson_passes %>%
  select(player.name, type.name, location.x, location.y, pass.end_location.x, pass.end_location.y)

plot <- ggplot(Nelsson_passing_data) +
  annotate_pitch(dimensions = pitch_statsbomb, 
                 colour = "white",
                 fill   = "springgreen4",
                 limits = FALSE) +
  geom_segment(aes(x = location.x, y = location.y, xend = pass.end_location.x, yend =  pass.end_location.y),
               colour = "pink",
               arrow = arrow(length = unit(0.17, "cm"),
                             type = "closed")) +
  theme_pitch() +
  direction_label(colour = "white")+
  theme(panel.background = element_rect(fill = "springgreen4")) +
  ggtitle("Victor Nelsson 2022 World Cup Passing Map", 
          "Orange Analytics")

logo <- ggdraw() +
  draw_image("C:\\Users\\jmo5660\\School\\Statsbomb\\StatsBomb Logo\\1. Colour positive\\SB - Icon Lockup - Colour positive.png",x = 0.01, y = 0.001, width = 0.15, height = 0.15)

# Combine the plot and the logo
final_plot <- plot + annotation_custom(ggplotGrob(logo))

# Display the final plot
final_plot

Let’s Look at Enzo Fernandez

# Filter passes for "Enzo Fernandez"
player_passes<- events %>%
  filter(player.name == "Enzo Fernandez", type.name == "Pass")

# Create a new dataframe with selected columns
player_passing_data <- player_passes %>%
  select(player.name, type.name, location.x, location.y, pass.end_location.x, pass.end_location.y)

plot <- ggplot(player_passing_data) +
  annotate_pitch(dimensions = pitch_statsbomb, 
                 colour = "white",
                 fill   = "springgreen4",
                 limits = FALSE) +
  geom_segment(aes(x = location.x, y = location.y, xend = pass.end_location.x, yend =  pass.end_location.y),
               colour = "light blue",
               arrow = arrow(length = unit(0.17, "cm"),
                             type = "closed")) +
  theme_pitch() +
  direction_label(colour = "white")+
  theme(panel.background = element_rect(fill = "springgreen4")) +
  ggtitle("Enzo Fernandez 2022 World Cup Passing Map", 
          "Orange Analytics")

logo <- ggdraw() +
  draw_image("C:\\Users\\jmo5660\\School\\Statsbomb\\StatsBomb Logo\\1. Colour positive\\SB - Icon Lockup - Colour positive.png",x = 0.01, y = 0.001, width = 0.15, height = 0.15)

# Combine the plot and the logo
final_plot <- plot + annotation_custom(ggplotGrob(logo))

# Display the final plot
final_plot

# Filter passes for "Enzo Fernandez"
player_carries<- events %>%
  filter(player.name == "Enzo Fernandez", type.name == "Carry")

# Create a new dataframe with selected columns
player_carry_data <- player_carries %>%
  select(player.name, type.name, location.x, location.y, carry.end_location.x, carry.end_location.y)

plot <- ggplot(player_carry_data) +
  annotate_pitch(dimensions = pitch_statsbomb, 
                 colour = "white",
                 fill   = "springgreen4",
                 limits = FALSE) +
  geom_segment(aes(x = location.x, y = location.y, xend = carry.end_location.x, yend =  carry.end_location.y),
               colour = "light blue",
               arrow = arrow(length = unit(0.17, "cm"),
                             type = "closed")) +
  theme_pitch() +
  direction_label(colour = "white")+
  theme(panel.background = element_rect(fill = "springgreen4")) +
  ggtitle("Enzo Fernandez 2022 World Cup Carry Map", 
          "Orange Analytics")

logo <- ggdraw() +
  draw_image("C:\\Users\\jmo5660\\School\\Statsbomb\\StatsBomb Logo\\1. Colour positive\\SB - Icon Lockup - Colour positive.png",x = 0.01, y = 0.001, width = 0.15, height = 0.15)

# Combine the plot and the logo
final_plot <- plot + annotation_custom(ggplotGrob(logo))

# Display the final plot
final_plot

Let’s Look at Tyler Adams

# Filter passes for "Tyler Adams"
player_passes<- events %>%
  filter(player.name == "Tyler Adams", type.name == "Pass")

# Create a new dataframe with selected columns
player_passing_data <- player_passes %>%
  select(player.name, type.name, location.x, location.y, pass.end_location.x, pass.end_location.y)

plot <- ggplot(player_passing_data) +
  annotate_pitch(dimensions = pitch_statsbomb, 
                 colour = "white",
                 fill   = "springgreen4",
                 limits = FALSE) +
  geom_segment(aes(x = location.x, y = location.y, xend = pass.end_location.x, yend =  pass.end_location.y),
               colour = "light blue",
               arrow = arrow(length = unit(0.17, "cm"),
                             type = "closed")) +
  theme_pitch() +
  direction_label(colour = "white")+
  theme(panel.background = element_rect(fill = "springgreen4")) +
  ggtitle("Tyler Adams 2022 World Cup Passing Map", 
          "Orange Analytics")

logo <- ggdraw() +
  draw_image("C:\\Users\\jmo5660\\School\\Statsbomb\\StatsBomb Logo\\1. Colour positive\\SB - Icon Lockup - Colour positive.png",x = 0.01, y = 0.001, width = 0.15, height = 0.15)

# Combine the plot and the logo
final_plot <- plot + annotation_custom(ggplotGrob(logo))

# Display the final plot
final_plot

Let’s Look at Enzo Fernandez

# Filter passes for "Tyler Adams"
player_pressures<- events %>%
  filter(player.name == "Tyler Adams", type.name == "Pressure")

# Create a new dataframe with selected columns
player_pressure_data <- player_pressures %>%
  select(player.name, type.name, location.x, location.y, pass.end_location.x, pass.end_location.y)

plot <- ggplot(player_pressure_data) +
  annotate_pitch(dimensions = pitch_statsbomb, 
                 colour = "white",
                 fill   = "springgreen4",
                 limits = FALSE) +
  geom_point(aes(x = location.x, y = location.y),
               colour = "navy") +
  theme_pitch() +
  direction_label(colour = "white")+
  theme(panel.background = element_rect(fill = "springgreen4")) +
  ggtitle("Tyler Adams 2022 World Cup Pressure Map", 
          "Orange Analytics")

logo <- ggdraw() +
  draw_image("C:\\Users\\jmo5660\\School\\Statsbomb\\StatsBomb Logo\\1. Colour positive\\SB - Icon Lockup - Colour positive.png",x = 0.01, y = 0.001, width = 0.15, height = 0.15)

# Combine the plot and the logo
final_plot <- plot + annotation_custom(ggplotGrob(logo))

# Display the final plot
final_plot


---
title: "Statsbomb Data --> Passing Map/Pressure Map visualizations"
output: html_notebook
---

Going to try working with Statsbomb Data. First, will need to install Statsbombr which is their parsing package.

Will need to do this through Github, as this won't be in the packages under tools. Will have to use the devtools library in order to achieve this using the code below:

(For full Statsbomb R Walkthrough, navigate through the browser here: https://statsbomb.com/wp-content/uploads/2021/11/Working-with-R.pdf)

```{r}
library(tidyverse)
library(ggplot2)
library(devtools)

devtools::install_github("statsbomb/SDMTools") 
devtools::install_github("statsbomb/StatsBombR")
devtools::install_github("Torvaney/ggsoccer")
```

Next we need to load the soccer libraries now that we've downloaded them
```{r}
library(StatsBombR)
library(ggsoccer)
```

Filter the competition we want to look at 
```{r}
freecomp <- FreeCompetitions()


competitions <- FreeCompetitions() %>%
  filter(competition_id == 43, season_id==106)

str(competitions)

write.csv(freecomp, "C:\\Users\\jmo5660\\School\\Statsbomb\\competitions.csv", row.names = FALSE)
```
Now we want to look at matches in the season
```{r}
matches <- FreeMatches(Competitions = competitions
)

```
Now let's look at the event data
```{r}
non_clean_events <- free_allevents(MatchesDF = matches, Parallel = T)

# Convert nested lists to character strings
compatible_columns <-sapply(non_clean_events, function(col) is.atomic(col) && !is.list(col))
non_clean_events2 <- non_clean_events[, compatible_columns, drop=FALSE]


write.csv(non_clean_events2, "C:\\Users\\jmo5660\\School\\Statsbomb\\all_unclean_free_event_data_2022_FifaWorldCup.csv", row.names = FALSE)

#clean the events based on the provided statsbomb function
events = allclean(non_clean_events)
compatible_columns <-sapply(events, function(col) is.atomic(col) && !is.list(col))
subset_df <- events[, compatible_columns, drop=FALSE]


# Assuming you have loaded the StatsBomb event data into a dataframe called 'events'

# Convert timestamp column to numeric format
events$timestamp <- as.numeric(events$timestamp)

# Sort the events dataframe by the timestamp column
events <- events[order(events$timestamp), ]

# Create a new column to store the duration of each possession
events$possession_duration <- NA

# Initialize variables
current_possession <- 1
possession_start_time <- events$timestamp[1]

# Iterate through the events dataframe
for (i in 2:length(events$timestamp)) {
  # Check if possession changes (based on possession_team.id)
  if (events$possession_team.id[i] != events$possession_team.id[i-1]) {
    # Calculate the duration of the previous possession
    possession_duration <- events$timestamp[i-1] - possession_start_time
    # Update the duration column for the previous possession
    events$possession_duration[events$possession_team.id == current_possession] <- possession_duration
    
    # Reset variables for the new possession
    current_possession <- events$possession_team.id[i]
    possession_start_time <- events$timestamp[i]
  }
}

# Calculate the duration of the last possession (if exists)
possession_duration <- events$timestamp[length(events$timestamp)] - possession_start_time
events$possession_duration[events$possession_team.id == current_possession] <- possession_duration



write.csv(subset_df, "C:\\Users\\jmo5660\\School\\Statsbomb\\clean_events.csv")
```
Player Summary Statistics (Need to do a join on all summaries)
```{r}
summary <- events %>%
  group_by(events$player.name) %>%
  summarise(
    "Carries" = sum(type.name == "Carry", na.rm = TRUE),
    "Time of Possession" = sum(duration, na.rm = TRUE),
    "Shots" = sum(type.name == "Shot", na.rm = TRUE),
    "Passes" = sum(type.name == "Pass", na.rm = TRUE),
    "Pressures" = sum(type.name == "Pressure", na.rm = TRUE),
    "Fouls_Won" = sum(type.name == "Foul Won", na.rm = TRUE),
    "Interceptions" = sum(type.name == "Interception", na.rm = TRUE),
    "Dispossessed" = sum(type.name == "Dispossessed", na.rm = TRUE),
    "Blocks" = sum(type.name == "Block", na.rm = TRUE),
    "xG" = sum(shot.statsbomb_xg, na.rm=TRUE),
    "Total_Passes_Length" = sum(pass.length, na.rm=TRUE),
    "Nutmegs" = sum(dribble.nutmeg == "TRUE", na.rm=TRUE),
    "Assists" = sum(pass.shot_assist == "TRUE", na.rm = TRUE),
    "Matches_Played" = n_distinct(match_id),
    "Interceptions_Won" = sum(interception.outcome.name == "Won", na.rm=TRUE),
    "Duel_Won" = sum(duel.outcome.name == "Success In Play", na.rm=TRUE),
    "Duel_Lost" = sum(duel.outcome.name == "Lost In Play", na.rm=TRUE),
    "Complete_Dribbles" = sum(dribble.outcome.name == "Complete", na.rm=TRUE),
    "Incomplete_Dribbles" = sum(dribble.outcome.name == "Incomplete", na.rm=TRUE),
  )



summary_goal_outcome <- events %>%
  group_by(player.name) %>%
  summarise(
    Goals = sum(shot.outcome.name == "Goal", na.rm = TRUE),
    `Shots Saved to Post` = sum(shot.outcome.name == "Saved to Post", na.rm = TRUE),
    Shots_Blocked = sum(shot.outcome.name == "Blocked", na.rm = TRUE),
    `Shots Off T` = sum(shot.outcome.name == "Off T", na.rm = TRUE),
    Shots_Hit_Post = sum(shot.outcome.name == "Post", na.rm = TRUE),
    Shots_Saved = sum(shot.outcome.name == "Saved", na.rm = TRUE),
    `Shots Saved Off Target` = sum(shot.outcome.name == "Saved Off Target", na.rm = TRUE),
    Shots_Wayward = sum(shot.outcome.name == "Wayward", na.rm = TRUE)
  )


joined_player_summary <- inner_join(summary, summary_goal_outcome, by = c("events$player.name" = "player.name"))%>% 
  mutate(across(c("Dispossessed", "Carries"), as.numeric),
         dispossessed_per_carry = Dispossessed / Carries) %>% 
  mutate(interception_per_pressure = Interceptions / Pressures)%>% 
  mutate(fouls_won_per_Carry = Fouls_Won / Carries)%>% 
  mutate(blocks_per_pressure = Blocks / Pressures)%>%
  mutate(goals_per_shot = Goals / Shots) %>%
  mutate(Average_Pass_Length = Total_Passes_Length/Passes) %>%
  mutate(blocks_per_match_played = Blocks/Matches_Played) %>%
  mutate(interceptions_per_match = Interceptions/Matches_Played) %>%
  mutate(interception_won_per_pressure = Interceptions_Won / Pressures)%>% 
  mutate(interceptions_won_per_match = Interceptions_Won/Matches_Played) %>%
  mutate(G_xG_Diff = Goals - xG) %>%
  mutate(Duel_Win_Percentage = Duel_Won/(Duel_Won+Duel_Lost)) %>%
  mutate(Complete_Dribble_Percentage = Complete_Dribbles/(Complete_Dribbles+Incomplete_Dribbles))
           

write.csv(joined_player_summary, "C:\\Users\\jmo5660\\School\\Statsbomb\\joined_player_summary.csv", row.names = FALSE)


```

Team Stats -- mimics player stats
```{r}
summary4 <- events %>%
  group_by(events$team.name) %>%
  summarise("Carries" = sum(type.name=="Carry", na.rm=TRUE))


summary5 <- events %>%
  group_by(events$team.name) %>%
  summarise("Time of Possession" = sum(duration, na.rm=TRUE))

summary6 <- events %>%
  group_by(events$team.name) %>%
  summarise("Shots" = sum(type.name=="Shot", na.rm=TRUE))

summary_team_Pass <- events %>%
  group_by(events$team.name) %>%
  summarise("Passes" = sum(type.name=="Pass", na.rm=TRUE))

summary_team_Pressure <- events %>%
  group_by(events$team.name) %>%
  summarise("Pressures" = sum(type.name=="Pressure", na.rm=TRUE))

summary_foulwon <- events %>%
  group_by(events$team.name) %>%
  summarise("Fouls Won" = sum(type.name=="Foul Won", na.rm=TRUE))

summary_interception <- events %>%
  group_by(events$team.name) %>%
  summarise("Interceptions" = sum(type.name=="Interception", na.rm=TRUE))

joined_team_summary <- inner_join(summary4, summary5, by = "events$team.name") %>%
  inner_join(summary6, by = "events$team.name") %>%
  inner_join(summary_team_Pass, by = "events$team.name")  %>%
  inner_join(summary_team_Pressure, by = "events$team.name")%>%
  inner_join(summary_foulwon, by = "events$team.name")%>%
  inner_join(summary_interception, by = "events$team.name")

write.csv(joined_team_summary, "C:\\Users\\jmo5660\\School\\Statsbomb\\joined_team_summary.csv", row.names = FALSE)
```

Let's look to see what correlations there are in our dataset -- first for players
```{r}
lm_model <- lm(Interceptions~Pressures, data=joined_player_summary)

summary(lm_model)

plot(joined_player_summary$Pressures, joined_player_summary$Interceptions, xlab="Pressures", ylab="Interceptions", main="Linear Regression for Pressures and Interceptions")
abline(lm_model, col="red")

# Obtain residuals
residuals <- residuals(lm_model)

# Plot the residuals
plot(predict(lm_model), residuals, pch = 16, xlab = "Fitted values", ylab = "Residuals",
     main = "Residuals Plot: Actual vs. Fitted")

# Plot the standardized residuals
std_residuals <- rstandard(lm_model)
plot(predict(lm_model), std_residuals, pch = 16, xlab = "Fitted values", ylab = "Standardized residuals",
     main = "Residuals Plot: Standardized")

library(car)
# Normal probability plot
qqPlot(residuals, main = "Normal Probability Plot: Residuals")
```
Now for Teams
```{r}
lm_model2 <- lm(Interceptions~Pressures, data=joined_team_summary)

summary(lm_model2)

plot(joined_team_summary$Pressures, joined_team_summary$Interceptions, xlab="Pressures", ylab="Interceptions", main="Linear Regression for Pressures and Interceptions")
abline(lm_model2, col="red")

```

Let's Try to Produce a Carry Map for Christian Pulisic -- first let's filter his data out of the events
```{r}
library(dplyr)
library(cowplot)

# Filter carries for "Christian Pulisic"
pulisic_carries <- events %>%
  filter(player.name == "Christian Pulisic", type.name == "Carry")

# Create a new dataframe with selected columns
pulisic_carry_data <- pulisic_carries %>%
  select(player.name, type.name, location.x, location.y, carry.end_location.x, carry.end_location.y)

```

Now let's create the Christian Pulisic Carry map -- will need to re-scale the ggsoccer coordinates to statsbomb
```{r}

plot<-ggplot(pulisic_carry_data) +
  annotate_pitch(dimensions = pitch_statsbomb, 
                 colour = "white",
                 fill   = "springgreen4",
                 limits = FALSE) +
  geom_segment(aes(x = location.x, y = location.y, xend = carry.end_location.x, yend =  carry.end_location.y),
               colour = "navy",
               arrow = arrow(length = unit(0.15, "cm"),
                             type = "closed")) +
  theme_pitch() +
  direction_label(colour = "white")+
  theme(panel.background = element_rect(fill = "springgreen4")) +
  ggtitle("Simple Pulisic  World Cup 2022 Carry Map", 
          "Orange Analytics")
ggplot2::annotate("text", x = Inf, y = -Inf, label = "Data: StatsBomb", hjust = 1, vjust = -1, size = 3, color = "gray50")

logo <- ggdraw() +
  draw_image("C:\\Users\\jmo5660\\School\\Statsbomb\\StatsBomb Logo\\1. Colour positive\\SB - Icon Lockup - Colour positive.png",x = 0.01, y = 0.001, width = 0.15, height = 0.15)

# Combine the plot and the logo
final_plot <- plot + annotation_custom(ggplotGrob(logo))

# Display the final plot
final_plot
```


Now let's do it again for Pulisic Passes
Let's Try to Produce a Carry Map for Christian Pulisic -- first let's filter his data out of the events
```{r}

# Filter passes for "Christian Pulisic"
pulisic_passes <- events %>%
  filter(player.name == "Christian Pulisic", type.name == "Pass")

# Create a new dataframe with selected columns
pulisic_pass_data <- pulisic_passes %>%
  select(player.name, type.name, location.x, location.y, pass.end_location.x, pass.end_location.y)

```

Now let's create the Christian Pulisic Carry map -- will need to re-scale the ggsoccer coordinates from optima to statsbomb
```{r}
plot <- ggplot(pulisic_pass_data) +
  annotate_pitch(dimensions = pitch_statsbomb, 
                 colour = "white",
                 fill   = "springgreen4",
                 limits = FALSE) +
  geom_segment(aes(x = location.x, y = location.y, xend = pass.end_location.x, yend =  pass.end_location.y),
               colour = "navy",
               arrow = arrow(length = unit(0.17, "cm"),
                             type = "closed")) +
  theme_pitch() +
  direction_label(colour = "white")+
  theme(panel.background = element_rect(fill = "springgreen4")) +
  ggtitle("Simple Pulisic  World Cup 2022 Passmap", 
          "Orange Analytics")

logo <- ggdraw() +
  draw_image("C:\\Users\\jmo5660\\School\\Statsbomb\\StatsBomb Logo\\1. Colour positive\\SB - Icon Lockup - Colour positive.png",x = 0.01, y = 0.001, width = 0.15, height = 0.15)

# Combine the plot and the logo
final_plot <- plot + annotation_custom(ggplotGrob(logo))

# Display the final plot
final_plot
```

Let's try to look at pressures for one of the World Cup Darlings...Amrabat
```{r}
# Filter passes for "Christian Pulisic"
amrabat_pressures <- events %>%
  filter(player.name == "Sofyan Amrabat", type.name == "Pressure")

# Create a new dataframe with selected columns
amrabat_pressure_data <- amrabat_pressures %>%
  select(player.name, type.name, location.x, location.y)
```

```{r}

plot <- ggplot(amrabat_pressure_data) +
  annotate_pitch(dimensions = pitch_statsbomb,
                 colour = "white",
                 fill   = "springgreen4",
                 limits = TRUE) +
  geom_point(aes(x = location.x, y = location.y),
             colour = "maroon",
             size = 3) +
  theme_pitch() +
  direction_label(colour = "white")+
  theme(panel.background = element_rect(fill = "springgreen4")) +
  ggtitle("Amrabat Pressure Map",
          "Fifa World Cup 2022 Statsbomb Data")

logo <- ggdraw() +
  draw_image("C:\\Users\\jmo5660\\School\\Statsbomb\\StatsBomb Logo\\1. Colour positive\\SB - Icon Lockup - Colour positive.png",x = 0.01, y = 0.001, width = 0.15, height = 0.15)

# Combine the plot and the logo
final_plot <- plot + annotation_custom(ggplotGrob(logo))

# Display the final plot
final_plot

```

Ziyech
```{r}
# Filter pressures for Ziyech
ziyech_pressures <- events %>%
  filter(player.name == "Hakim Ziyech", type.name == "Pressure")

# Create a new dataframe with selected columns
ziyech_pressure_data <- ziyech_pressures %>%
  select(player.name, type.name, location.x, location.y)
```

```{r}

plot<- ggplot(ziyech_pressure_data) +
  annotate_pitch(dimensions = pitch_statsbomb,
                 colour = "white",
                 fill   = "springgreen4",
                 limits = FALSE) +
  geom_point(aes(x = location.x, y = location.y),
             colour = "maroon",
             size = 3) +
  theme_pitch() +
  direction_label(colour = "white")+
  theme(panel.background = element_rect(fill = "springgreen4")) +
  ggtitle("Ziyech Pressure Map",
          "Fifa World Cup 2022 Statsbomb Data")

logo <- ggdraw() +
  draw_image("C:\\Users\\jmo5660\\School\\Statsbomb\\StatsBomb Logo\\1. Colour positive\\SB - Icon Lockup - Colour positive.png",x = 0.01, y = 0.001, width = 0.15, height = 0.15)

# Combine the plot and the logo
final_plot <- plot + annotation_custom(ggplotGrob(logo))

# Display the final plot
final_plot

```

Let's take a good look at 
Azzedine Ounahi
```{r}
# Filter pressures for Ounahi
Ounahi_pressures <- events %>%
  filter(player.name == "Azzedine Ounahi", type.name == "Pressure")

# Create a new dataframe with selected columns
Ounahi_pressure_data <- Ounahi_pressures %>%
  select(player.name, type.name, location.x, location.y)
```

```{r}

plot<- ggplot(Ounahi_pressure_data) +
  annotate_pitch(dimensions = pitch_statsbomb,
                 colour = "white",
                 fill   = "springgreen4",
                 limits = TRUE) +
  geom_point(aes(x = location.x, y = location.y),
             colour = "maroon",
             size = 3) +
  theme_pitch() +
  direction_label(colour = "white")+
  theme(panel.background = element_rect(fill = "springgreen4")) +
  ggtitle("Azzedine Ounahi Pressure Map",
          "Fifa World Cup 2022 Statsbomb Data")

logo <- ggdraw() +
  draw_image("C:\\Users\\jmo5660\\School\\Statsbomb\\StatsBomb Logo\\1. Colour positive\\SB - Icon Lockup - Colour positive.png",x = 0.01, y = 0.001, width = 0.15, height = 0.15)

# Combine the plot and the logo
final_plot <- plot + annotation_custom(ggplotGrob(logo))

# Display the final plot
final_plot

```
Azzedine Ounahi -- passes
```{r}
# Filter passses for Ounahi
Ounahi_passes <- events %>%
  filter(player.name == "Azzedine Ounahi", type.name == "Pass")

# Create a new dataframe with selected columns
Ounahi_pass_data <- Ounahi_passes %>%
  select(player.name, type.name, location.x, location.y, pass.end_location.x, pass.end_location.y)
```

```{r}

plot <- ggplot(Ounahi_pass_data) +
  annotate_pitch(dimensions = pitch_statsbomb, 
                 colour = "white",
                 fill   = "springgreen4",
                 limits = TRUE) +
  geom_segment(aes(x = location.x, y = location.y, xend = pass.end_location.x, yend =  pass.end_location.y),
               colour = "maroon",
               arrow = arrow(length = unit(0.17, "cm"),
                             type = "closed")) +
  theme_pitch() +
  direction_label(colour = "white")+
  theme(panel.background = element_rect(fill = "springgreen4")) +
  ggtitle("Azzedine Ounahi World Cup 2022 Passing Map", 
          "Orange Analytics")

logo <- ggdraw() +
  draw_image("C:\\Users\\jmo5660\\School\\Statsbomb\\StatsBomb Logo\\1. Colour positive\\SB - Icon Lockup - Colour positive.png",x = 0.01, y = 0.001, width = 0.15, height = 0.15)

# Combine the plot and the logo
final_plot <- plot + annotation_custom(ggplotGrob(logo))

# Display the final plot
final_plot

```


Azzedine Ounahi -- Carries
```{r}
# Filter passses for Ounahi
Ounahi_Carries <- events %>%
  filter(player.name == "Azzedine Ounahi", type.name == "Carry")

# Create a new dataframe with selected columns
Ounahi_carry_data <- Ounahi_Carries %>%
  select(player.name, type.name, location.x, location.y, carry.end_location.x, carry.end_location.y)
```

```{r}

plot <- ggplot(Ounahi_carry_data) +
  annotate_pitch(dimensions = pitch_statsbomb, 
                 colour = "white",
                 fill   = "springgreen4",
                 limits = FALSE) +
  geom_segment(aes(x = location.x, y = location.y, xend = carry.end_location.x, yend =  carry.end_location.y),
               colour = "maroon",
               arrow = arrow(length = unit(0.17, "cm"),
                             type = "closed")) +
  theme_pitch() +
  direction_label(colour = "white")+
  theme(panel.background = element_rect(fill = "springgreen4")) +
  ggtitle("Azzedine Ounahi World Cup 2022 Carry Map", 
          "Orange Analytics")

logo <- ggdraw() +
  draw_image("C:\\Users\\jmo5660\\School\\Statsbomb\\StatsBomb Logo\\1. Colour positive\\SB - Icon Lockup - Colour positive.png",x = 0.01, y = 0.001, width = 0.15, height = 0.15)

# Combine the plot and the logo
final_plot <- plot + annotation_custom(ggplotGrob(logo))

# Display the final plot
final_plot

```


```{r}


# Filter carries for "Weston McKennie"
Mckkenie_carries <- events %>%
  filter(player.name == "Weston McKennie", type.name == "Carry")

# Create a new dataframe with selected columns
Mckennie_carry_data <- Mckkenie_carries %>%
  select(player.name, type.name, location.x, location.y, carry.end_location.x, carry.end_location.y)

```

Now let's create the Christian Pulisic Carry map -- will need to re-scale the ggsoccer coordinates to statsbomb
```{r}

plot<-ggplot(Mckennie_carry_data) +
  annotate_pitch(dimensions = pitch_statsbomb, 
                 colour = "white",
                 fill   = "springgreen4",
                 limits = FALSE) +
  geom_segment(aes(x = location.x, y = location.y, xend = carry.end_location.x, yend =  carry.end_location.y),
               colour = "navy",
               arrow = arrow(length = unit(0.15, "cm"),
                             type = "closed")) +
  theme_pitch() +
  direction_label(colour = "white")+
  theme(panel.background = element_rect(fill = "springgreen4")) +
  ggtitle("Weston McKennie World Cup 2022 Carry Map", 
          "Orange Analytics")

logo <- ggdraw() +
  draw_image("C:\\Users\\jmo5660\\School\\Statsbomb\\StatsBomb Logo\\1. Colour positive\\SB - Icon Lockup - Colour positive.png",x = 0.01, y = 0.001, width = 0.15, height = 0.15)

# Combine the plot and the logo
final_plot <- plot + annotation_custom(ggplotGrob(logo))

# Display the final plot
final_plot
```


Now let's do it again for Pulisic Passes
Let's Try to Produce a Carry Map for Christian Pulisic -- first let's filter his data out of the events
```{r}

# Filter passes for "Weston McKennie"
mckkenie_passes <- events %>%
  filter(player.name == "Weston McKennie", type.name == "Pass")

# Create a new dataframe with selected columns
mckkenie_pass_data <- mckkenie_passes %>%
  select(player.name, type.name, location.x, location.y, pass.end_location.x, pass.end_location.y)

```

Now let's create the mckkenie passing map -- will need to re-scale the ggsoccer coordinates from optima to statsbomb
```{r}
plot <- ggplot(mckkenie_pass_data) +
  annotate_pitch(dimensions = pitch_statsbomb, 
                 colour = "white",
                 fill   = "springgreen4",
                 limits = FALSE) +
  geom_segment(aes(x = location.x, y = location.y, xend = pass.end_location.x, yend =  pass.end_location.y),
               colour = "navy",
               arrow = arrow(length = unit(0.17, "cm"),
                             type = "closed")) +
  theme_pitch() +
  direction_label(colour = "white")+
  theme(panel.background = element_rect(fill = "springgreen4")) +
  ggtitle("Weston Mckennie 2022 World Cup Passing Map", 
          "Orange Analytics")

logo <- ggdraw() +
  draw_image("C:\\Users\\jmo5660\\School\\Statsbomb\\StatsBomb Logo\\1. Colour positive\\SB - Icon Lockup - Colour positive.png",x = 0.01, y = 0.001, width = 0.15, height = 0.15)

# Combine the plot and the logo
final_plot <- plot + annotation_custom(ggplotGrob(logo))

# Display the final plot
final_plot
```
```{r}
# Filter passes for "Victor Nelsson"
Nelsson_pressures <- events %>%
  filter(player.name == "Victor Nelsson", type.name == "Pressure")

# Create a new dataframe with selected columns
Nelsson_pressure_data <- Nelsson_pressures %>%
  select(player.name, type.name, location.x, location.y)
```

```{r}

plot <- ggplot(Nelsson_pressure_data) +
  annotate_pitch(dimensions = pitch_statsbomb,
                 colour = "white",
                 fill   = "springgreen4",
                 limits = TRUE) +
  geom_point(aes(x = location.x, y = location.y),
             colour = "pink",
             size = 3) +
  theme_pitch() +
  direction_label(colour = "white")+
  theme(panel.background = element_rect(fill = "springgreen4")) +
  ggtitle("Victor Nelson Pressure Map",
          "Fifa World Cup 2022 Statsbomb Data")

logo <- ggdraw() +
  draw_image("C:\\Users\\jmo5660\\School\\Statsbomb\\StatsBomb Logo\\1. Colour positive\\SB - Icon Lockup - Colour positive.png",x = 0.01, y = 0.001, width = 0.15, height = 0.15)

# Combine the plot and the logo
final_plot <- plot + annotation_custom(ggplotGrob(logo))

# Display the final plot
final_plot
```
```{r}
# Filter passes for "Victor Nelsson"
Nelsson_passes<- events %>%
  filter(player.name == "Victor Nelsson", type.name == "Pass")

# Create a new dataframe with selected columns
Nelsson_passing_data <- Nelsson_passes %>%
  select(player.name, type.name, location.x, location.y, pass.end_location.x, pass.end_location.y)
```

```{r}

plot <- ggplot(Nelsson_passing_data) +
  annotate_pitch(dimensions = pitch_statsbomb, 
                 colour = "white",
                 fill   = "springgreen4",
                 limits = FALSE) +
  geom_segment(aes(x = location.x, y = location.y, xend = pass.end_location.x, yend =  pass.end_location.y),
               colour = "pink",
               arrow = arrow(length = unit(0.17, "cm"),
                             type = "closed")) +
  theme_pitch() +
  direction_label(colour = "white")+
  theme(panel.background = element_rect(fill = "springgreen4")) +
  ggtitle("Victor Nelsson 2022 World Cup Passing Map", 
          "Orange Analytics")

logo <- ggdraw() +
  draw_image("C:\\Users\\jmo5660\\School\\Statsbomb\\StatsBomb Logo\\1. Colour positive\\SB - Icon Lockup - Colour positive.png",x = 0.01, y = 0.001, width = 0.15, height = 0.15)

# Combine the plot and the logo
final_plot <- plot + annotation_custom(ggplotGrob(logo))

# Display the final plot
final_plot
```
Let's Look at Enzo Fernandez
```{r}
# Filter passes for "Enzo Fernandez"
player_passes<- events %>%
  filter(player.name == "Enzo Fernandez", type.name == "Pass")

# Create a new dataframe with selected columns
player_passing_data <- player_passes %>%
  select(player.name, type.name, location.x, location.y, pass.end_location.x, pass.end_location.y)
```

```{r}

plot <- ggplot(player_passing_data) +
  annotate_pitch(dimensions = pitch_statsbomb, 
                 colour = "white",
                 fill   = "springgreen4",
                 limits = FALSE) +
  geom_segment(aes(x = location.x, y = location.y, xend = pass.end_location.x, yend =  pass.end_location.y),
               colour = "light blue",
               arrow = arrow(length = unit(0.17, "cm"),
                             type = "closed")) +
  theme_pitch() +
  direction_label(colour = "white")+
  theme(panel.background = element_rect(fill = "springgreen4")) +
  ggtitle("Enzo Fernandez 2022 World Cup Passing Map", 
          "Orange Analytics")

logo <- ggdraw() +
  draw_image("C:\\Users\\jmo5660\\School\\Statsbomb\\StatsBomb Logo\\1. Colour positive\\SB - Icon Lockup - Colour positive.png",x = 0.01, y = 0.001, width = 0.15, height = 0.15)

# Combine the plot and the logo
final_plot <- plot + annotation_custom(ggplotGrob(logo))

# Display the final plot
final_plot
```


```{r}
# Filter passes for "Enzo Fernandez"
player_carries<- events %>%
  filter(player.name == "Enzo Fernandez", type.name == "Carry")

# Create a new dataframe with selected columns
player_carry_data <- player_carries %>%
  select(player.name, type.name, location.x, location.y, carry.end_location.x, carry.end_location.y)
```

```{r}

plot <- ggplot(player_carry_data) +
  annotate_pitch(dimensions = pitch_statsbomb, 
                 colour = "white",
                 fill   = "springgreen4",
                 limits = FALSE) +
  geom_segment(aes(x = location.x, y = location.y, xend = carry.end_location.x, yend =  carry.end_location.y),
               colour = "light blue",
               arrow = arrow(length = unit(0.17, "cm"),
                             type = "closed")) +
  theme_pitch() +
  direction_label(colour = "white")+
  theme(panel.background = element_rect(fill = "springgreen4")) +
  ggtitle("Enzo Fernandez 2022 World Cup Carry Map", 
          "Orange Analytics")

logo <- ggdraw() +
  draw_image("C:\\Users\\jmo5660\\School\\Statsbomb\\StatsBomb Logo\\1. Colour positive\\SB - Icon Lockup - Colour positive.png",x = 0.01, y = 0.001, width = 0.15, height = 0.15)

# Combine the plot and the logo
final_plot <- plot + annotation_custom(ggplotGrob(logo))

# Display the final plot
final_plot

```

Let's Look at Tyler Adams
```{r}
# Filter passes for "Tyler Adams"
player_passes<- events %>%
  filter(player.name == "Tyler Adams", type.name == "Pass")

# Create a new dataframe with selected columns
player_passing_data <- player_passes %>%
  select(player.name, type.name, location.x, location.y, pass.end_location.x, pass.end_location.y)
```

```{r}

plot <- ggplot(player_passing_data) +
  annotate_pitch(dimensions = pitch_statsbomb, 
                 colour = "white",
                 fill   = "springgreen4",
                 limits = FALSE) +
  geom_segment(aes(x = location.x, y = location.y, xend = pass.end_location.x, yend =  pass.end_location.y),
               colour = "light blue",
               arrow = arrow(length = unit(0.17, "cm"),
                             type = "closed")) +
  theme_pitch() +
  direction_label(colour = "white")+
  theme(panel.background = element_rect(fill = "springgreen4")) +
  ggtitle("Tyler Adams 2022 World Cup Passing Map", 
          "Orange Analytics")

logo <- ggdraw() +
  draw_image("C:\\Users\\jmo5660\\School\\Statsbomb\\StatsBomb Logo\\1. Colour positive\\SB - Icon Lockup - Colour positive.png",x = 0.01, y = 0.001, width = 0.15, height = 0.15)

# Combine the plot and the logo
final_plot <- plot + annotation_custom(ggplotGrob(logo))

# Display the final plot
final_plot
```


Let's Look at Enzo Fernandez
```{r}
# Filter passes for "Tyler Adams"
player_pressures<- events %>%
  filter(player.name == "Tyler Adams", type.name == "Pressure")

# Create a new dataframe with selected columns
player_pressure_data <- player_pressures %>%
  select(player.name, type.name, location.x, location.y, pass.end_location.x, pass.end_location.y)
```

```{r}

plot <- ggplot(player_pressure_data) +
  annotate_pitch(dimensions = pitch_statsbomb, 
                 colour = "white",
                 fill   = "springgreen4",
                 limits = FALSE) +
  geom_point(aes(x = location.x, y = location.y),
               colour = "navy") +
  theme_pitch() +
  direction_label(colour = "white")+
  theme(panel.background = element_rect(fill = "springgreen4")) +
  ggtitle("Tyler Adams 2022 World Cup Pressure Map", 
          "Orange Analytics")

logo <- ggdraw() +
  draw_image("C:\\Users\\jmo5660\\School\\Statsbomb\\StatsBomb Logo\\1. Colour positive\\SB - Icon Lockup - Colour positive.png",x = 0.01, y = 0.001, width = 0.15, height = 0.15)

# Combine the plot and the logo
final_plot <- plot + annotation_custom(ggplotGrob(logo))

# Display the final plot
final_plot
```