Adista Nursani, Aniket Mawlankar, Purva Desai, Sneha Jadhav
April 29, 2021
Football is one of the most popular sports worldwide, winning a football match has become one of the most crucial aspect of football clubs.In this project, we will be analyzing the matches across different competitions (leagues & cups) through the API.
In recent decades, football clubs have been publicly traded and has become an investment target by many. Predicting match outcomes have become very important to generate a positive return on investment. The data provides the pre-match odds of winning a game (whether the home team or away team will win, or draw). It also provides team names, halfway scores, and fulltime scores. We are also including FIFA’s team ratings in our dataset. Using this data, we are trying to predict the outcome of the matches.
Primary Source of our data: Football API https://www.football-data.org/documentation/quickstart
Secondary Sources:
Analytics Plan:
Analytics plan:
Many teams suggested us to:
The feedbacks provided by some of the teams allowed us to think beyond our limits. Suggestions Incorporated:
This is the data post unnesting the raw data and selecting relevant columns
Since we had limited data available from the API, we included external data from (http://football-data.co.uk/data.php) to enhance our dataset
We dropped the NA values from the combined dataset and came up with 20K+ rows
For our analysis, We are also using performance ratings of the teams from (https://www.fifaindex.com/teams/) in order to enrich data.
Finally we have a dataset of matches containing pre-match odds, half time and full time goals as well as performance ratings of the team