Student’s Name : EI THIRI LWIN
Student’s Number : S3866360
The purpose of this evaluation is to present a compelling story using data visualisation based on open data.
The dataset “International football results from 1872 to 2022” is used from https://www.kaggle.com/datasets/martj42/international-football-results-from-1872-to-2017
This dataset is offered for academic use by Mart Jürisoo, Data analyst, Tallinn, Harju County, Estonia.
The dataset comprises 44,060 international football match results from 1972 to 2022, and the “results.csv” file contains the following columns:
date - date of the match
home_team - the name of the home
away_team - the name of the away team
home_score - full-time home team score including extra time, not including penalty-shootouts
away_score - full-time away team score including extra time, not including penalty-shootouts
tournament - the name of the tournament
city - the name of the city/town/administrative unit where the match was played
country - the name of the country where the match was played
neutral - TRUE/FALSE column indicating whether the match was played at a neutral venue
##
## Attaching package: 'dplyr'
## The following objects are masked from 'package:stats':
##
## filter, lag
## The following objects are masked from 'package:base':
##
## intersect, setdiff, setequal, union
##
## Attaching package: 'lubridate'
## The following objects are masked from 'package:base':
##
## date, intersect, setdiff, union
## Rows: 44060 Columns: 9
## ── Column specification ────────────────────────────────────────────────────────
## Delimiter: ","
## chr (6): date, home_team, away_team, tournament, city, country
## dbl (2): home_score, away_score
## lgl (1): neutral
##
## ℹ Use `spec()` to retrieve the full column specification for this data.
## ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
## date home_team away_team home_score away_score tournament city
## 0 0 0 0 0 0 0
## country neutral
## 0 0
## # A tibble: 6 × 15
## date home_…¹ away_…² home_…³ away_…⁴ tourn…⁵ city country neutral i_date
## <chr> <chr> <chr> <dbl> <dbl> <chr> <chr> <chr> <lgl> <date>
## 1 9/27… Albania Iceland 1 1 UEFA N… Tira… Albania FALSE 2022-09-27
## 2 9/27… Norway Serbia 0 2 UEFA N… Oslo Norway FALSE 2022-09-27
## 3 9/27… Sweden Sloven… 1 1 UEFA N… Stoc… Sweden FALSE 2022-09-27
## 4 9/27… Kosovo Cyprus 5 1 UEFA N… Pris… Kosovo FALSE 2022-09-27
## 5 9/27… Greece Northe… 3 1 UEFA N… Athe… Greece FALSE 2022-09-27
## 6 9/30… Fiji Solomo… 0 0 MSG Pr… Luga… Vanuatu TRUE 2022-09-30
## # … with 5 more variables: months <dbl>, years <dbl>, outcome <chr>,
## # winnerteam <lgl>, looserteam <lgl>, and abbreviated variable names
## # ¹home_team, ²away_team, ³home_score, ⁴away_score, ⁵tournament
## # ℹ Use `colnames()` to see all variable names
## `summarise()` has grouped output by 'tournament'. You can override using the
## `.groups` argument.
## `summarise()` has grouped output by 'years'. You can override using the
## `.groups` argument.