Dataset “athelete_events.csv” has been selected for this detailed analysis on various athletic events. Raw dataset contains 271116 datapoints with 15 variables. Variables included are: ID, Name, Sex, Age, Height, Weight, Team, NOC, Games, Year, Season, City, Sport, Event, Medal.
The follwoing table outlines the total instances for each year of the olympics included. Years range from 1896 through 2016
medals_per_year<- atheletes %>% group_by(Year) %>% count(Year, sort=T, name="Total_Medals")
medals_per_year
## # A tibble: 35 x 2
## # Groups: Year [35]
## Year Total_Medals
## <fct> <int>
## 1 1992 16413
## 2 1988 14676
## 3 2000 13821
## 4 1996 13780
## 5 2016 13688
## 6 2008 13602
## 7 2004 13443
## 8 2012 12920
## 9 1972 11959
## 10 1984 11588
## # ... with 25 more rows
The highest number of medals won in a given year was in 1992 with a total count of 16413 medals. The least medals per year was 380 medals.
All entries that did not result in a medal were removed from dataset reducing from 271116 datapoints to 39783 datapoints. This results in a total of 231333 datapoints with “NA” for Medal variable.
The following table depicts an example of data for Team United States Medal Winners.
## ID Name Sex Age Height Weight Team NOC
## 38 84 Stephen Anthony Abas M 26 165 55 United States USA
## 57 145 Jeremy Abbott M 28 175 70 United States USA
## 58 150 Margaret Ives Abbott (-Dunne) F 23 NA NA United States USA
## 59 153 Monica Cecilia Abbott F 23 191 88 United States USA
## 60 165 Nia Nicole Abdallah F 20 175 56 United States USA
## 69 351 Julius Shareef Abdur-Rahim M 23 202 104 United States USA
## Games Year Season City Sport
## 38 2004 Summer 2004 Summer Athina Wrestling
## 57 2014 Winter 2014 Winter Sochi Figure Skating
## 58 1900 Summer 1900 Summer Paris Golf
## 59 2008 Summer 2008 Summer Beijing Softball
## 60 2004 Summer 2004 Summer Athina Taekwondo
## 69 2000 Summer 2000 Summer Sydney Basketball
## Event Medal
## 38 Wrestling Men's Featherweight, Freestyle Silver
## 57 Figure Skating Mixed Team Bronze
## 58 Golf Women's Individual Gold
## 59 Softball Women's Softball Silver
## 60 Taekwondo Women's Featherweight Silver
## 69 Basketball Men's Basketball Gold
Of the 39783 total medal winners, a total of 5371 have won medals for the United States. This equates to 13.5% of all total medals won.
The following table outlines the total number of medals won by the USA team summarized by each sport.
## # A tibble: 49 x 2
## # Groups: Sport [49]
## Sport `Total Medals`
## <chr> <int>
## 1 Athletics 1071
## 2 Swimming 1066
## 3 Basketball 341
## 4 Rowing 333
## 5 Ice Hockey 276
## 6 Shooting 193
## 7 Gymnastics 177
## 8 Diving 140
## 9 Equestrianism 132
## 10 Water Polo 129
## # ... with 39 more rows
Out of all medals won by USA team the vast majority fall into two Sports - Athletics and Swimming. Out of 5371 medals, 2137 have been won in these two categories, accounting for 39.79 percent of all medals won by USA team.
The following table depicts Olympic medals won by Team USA in each city and year ranked descending based on number of medals.
## # A tibble: 50 x 3
## # Groups: City, Year [50]
## City Year `Medals per City and Year`
## <chr> <fct> <int>
## 1 Los Angeles 1984 352
## 2 Beijing 2008 317
## 3 Rio de Janeiro 2016 264
## 4 Athina 2004 263
## 5 Atlanta 1996 259
## 6 London 2012 248
## 7 Sydney 2000 242
## 8 Barcelona 1992 224
## 9 Seoul 1988 207
## 10 St. Louis 1904 203
## # ... with 40 more rows
The results of evaluating the number of medals won by USA team per City above indicate that the most medals won was in 352 in the city of Los Angeles in the year 1984.
The following graph depicts the total medals won by the USA team in swimming per year.
## [1] "2016-11-12"
The data indicates that team USA over last 35 Olympics had a maximum medal win of 83medals in 1968 with a decline through the late 80’s. A steady resurgence of medals through the 90’s has positioned the USA team with a steady increase to year 2016-11-12 with a total win of 71
The following table depicts the total number of medals won per country for the top 10 countries.
## # A tibble: 10 x 2
## Team medal
## <chr> <int>
## 1 United States 5371
## 2 Soviet Union 2501
## 3 Germany 2083
## 4 Great Britain 1735
## 5 Italy 1577
## 6 France 1576
## 7 Sweden 1440
## 8 Australia 1310
## 9 Canada 1281
## 10 Russia 1158
The top ten medal winning countries have won a total of 20032 medals through the data set timeframe, comprising a total of 50.35 percent of all medals won.
The following chart depicts graphically the top ten country medal winnings and totals.
Graphically illustrated here it is clear to see that Team United States has won the most medals and the total number of medals won by United States is 2.15 times more than that of the next highest winning country, the Soviet Union.
## # A tibble: 3,530 x 4
## # Groups: Team, Year [1,728]
## Team Year Medal medal
## <chr> <fct> <chr> <int>
## 1 Soviet Union 1980 Gold 205
## 2 United States 1984 Gold 190
## 3 Soviet Union 1988 Gold 174
## 4 United States 1996 Gold 159
## 5 Soviet Union 1980 Silver 158
## 6 Soviet Union 1976 Gold 152
## 7 United States 2012 Gold 145
## 8 Soviet Union 1972 Gold 143
## 9 United States 2016 Gold 139
## 10 Soviet Union 1980 Bronze 133
## # ... with 3,520 more rows
The above table depicts the total medals won by country per year in descending order from most medals one. The most medals won in a given year were by the Soviet Union in year 1980 with a total of 205 medals. A total of 755 instances occur where only 1 medal was won by a Team.
## # A tibble: 12 x 3
## # Groups: Team, Medal [12]
## Team Medal Total_Medals
## <chr> <chr> <int>
## 1 United States Gold 2529
## 2 United States Silver 1559
## 3 United States Bronze 1283
## 4 Germany Gold 721
## 5 Germany Bronze 711
## 6 Germany Silver 651
## 7 Great Britain Silver 602
## 8 France Bronze 592
## 9 Great Britain Bronze 590
## 10 Great Britain Gold 543
## 11 France Silver 525
## 12 France Gold 459
The total ratio of gold medals won by the united states over all medals is 0.47 while the total ratio of gold medals won by Germany equates to 0.35.
The total ratio of gold medals won by Great Britain over all medals is 0.31 while the total ratio of gold medals won by France equates to 0.29. Based off of the ratio of gold medals to total medals across the top four countries, gold medal percentage of total medals decreasingly trends as total medal winnings decrease.
The above graphic depicts the distribution of medals based on type of medal (Gold, Silver, Bronze) for Teams in descending order from maximum number of medals to total of all medals greater than 250. As total number of medals decrease through the top medal placement, the number of medals, Gold, Bronze, Silver tend to consolidate below the top 5 Teams.
The above graphic depicts the distribution of medals based on type of medal (Gold, Silver, Bronze) for Teams in descending order from total of all medals of 250 to 15 medals over the duration of data coverage. Data shows that in the intermediate range of medals the variation of type of medal is independent of Team or Total.
The above graphic depicts the distribution of medals based on type of medal (Gold, Silver, Bronze) for Teams in descending order from total of all medals of 15 to 3 medals over the duration of data coverage. This view clearly indicates that as Teams reduce in number of medals the distribution between type of medal varies. View of data begins to indicate a stepped alignment on total number.
The above graphic depicts the distribution of medals based on type of medal (Gold, Silver, Bronze) for Teams in descending order from total of all medals of less than or equal to 5 medals over the duration of data coverage. The graphic clearly visualizes a stepped record from one to two, three, etc. through 5. This indicates a number of countries that have won a single medal, independent of medal type or other.
ggplot(data=medal_type_per_country_UFGG, aes(x=reorder(Team, -Total_Medals), y=Total_Medals)) +geom_col() +labs(x = "Country", y = "Medals", title = "Medal Type Per Total Winning for Four Countries") + theme(axis.text.x = element_text(angle = 0, vjust = -3, hjust=1))+geom_col(color="blue", fill="blue")
The Team United States is the Country which produced the highest number of medals throughout the perior of Olyimic data. The next highest signifigance falls with Team Germany.
?element_text
medals_by_team
View(medals_by_team)
```