Summary of Data

Dataset “athelete_events.csv” has been selected for this detailed analysis on various athletic events. Raw dataset contains 271116 datapoints with 15 variables. Variables included are: ID, Name, Sex, Age, Height, Weight, Team, NOC, Games, Year, Season, City, Sport, Event, Medal.

Data Preparation

The follwoing table outlines the total instances for each year of the olympics included. Years range from 1896 through 2016

medals_per_year<- atheletes %>% group_by(Year) %>% count(Year, sort=T, name="Total_Medals")
medals_per_year
## # A tibble: 35 x 2
## # Groups:   Year [35]
##    Year  Total_Medals
##    <fct>        <int>
##  1 1992         16413
##  2 1988         14676
##  3 2000         13821
##  4 1996         13780
##  5 2016         13688
##  6 2008         13602
##  7 2004         13443
##  8 2012         12920
##  9 1972         11959
## 10 1984         11588
## # ... with 25 more rows

The highest number of medals won in a given year was in 1992 with a total count of 16413 medals. The least medals per year was 380 medals.

Data Evaluation

Entries resulting in a Medal

All entries that did not result in a medal were removed from dataset reducing from 271116 datapoints to 39783 datapoints. This results in a total of 231333 datapoints with “NA” for Medal variable.

Medals Won by Team USA

The following table depicts an example of data for Team United States Medal Winners.

##     ID                          Name Sex Age Height Weight          Team NOC
## 38  84          Stephen Anthony Abas   M  26    165     55 United States USA
## 57 145                 Jeremy Abbott   M  28    175     70 United States USA
## 58 150 Margaret Ives Abbott (-Dunne)   F  23     NA     NA United States USA
## 59 153         Monica Cecilia Abbott   F  23    191     88 United States USA
## 60 165           Nia Nicole Abdallah   F  20    175     56 United States USA
## 69 351    Julius Shareef Abdur-Rahim   M  23    202    104 United States USA
##          Games Year Season    City          Sport
## 38 2004 Summer 2004 Summer  Athina      Wrestling
## 57 2014 Winter 2014 Winter   Sochi Figure Skating
## 58 1900 Summer 1900 Summer   Paris           Golf
## 59 2008 Summer 2008 Summer Beijing       Softball
## 60 2004 Summer 2004 Summer  Athina      Taekwondo
## 69 2000 Summer 2000 Summer  Sydney     Basketball
##                                       Event  Medal
## 38 Wrestling Men's Featherweight, Freestyle Silver
## 57                Figure Skating Mixed Team Bronze
## 58                  Golf Women's Individual   Gold
## 59                Softball Women's Softball Silver
## 60          Taekwondo Women's Featherweight Silver
## 69              Basketball Men's Basketball   Gold

Of the 39783 total medal winners, a total of 5371 have won medals for the United States. This equates to 13.5% of all total medals won.

Total Number of Medals Won by Team USA in each Sport

The following table outlines the total number of medals won by the USA team summarized by each sport.

## # A tibble: 49 x 2
## # Groups:   Sport [49]
##    Sport         `Total Medals`
##    <chr>                  <int>
##  1 Athletics               1071
##  2 Swimming                1066
##  3 Basketball               341
##  4 Rowing                   333
##  5 Ice Hockey               276
##  6 Shooting                 193
##  7 Gymnastics               177
##  8 Diving                   140
##  9 Equestrianism            132
## 10 Water Polo               129
## # ... with 39 more rows

Out of all medals won by USA team the vast majority fall into two Sports - Athletics and Swimming. Out of 5371 medals, 2137 have been won in these two categories, accounting for 39.79 percent of all medals won by USA team.

Total Number of Medals Won by Team USA per City and Year

The following table depicts Olympic medals won by Team USA in each city and year ranked descending based on number of medals.

## # A tibble: 50 x 3
## # Groups:   City, Year [50]
##    City           Year  `Medals per City and Year`
##    <chr>          <fct>                      <int>
##  1 Los Angeles    1984                         352
##  2 Beijing        2008                         317
##  3 Rio de Janeiro 2016                         264
##  4 Athina         2004                         263
##  5 Atlanta        1996                         259
##  6 London         2012                         248
##  7 Sydney         2000                         242
##  8 Barcelona      1992                         224
##  9 Seoul          1988                         207
## 10 St. Louis      1904                         203
## # ... with 40 more rows

The results of evaluating the number of medals won by USA team per City above indicate that the most medals won was in 352 in the city of Los Angeles in the year 1984.

Total Number of Medals Won by Team USA in Swimming per Year

The following graph depicts the total medals won by the USA team in swimming per year.

## [1] "2016-11-12"

The data indicates that team USA over last 35 Olympics had a maximum medal win of 83medals in 1968 with a decline through the late 80’s. A steady resurgence of medals through the 90’s has positioned the USA team with a steady increase to year 2016-11-12 with a total win of 71

Total Number of Medals Won by Country

The following table depicts the total number of medals won per country for the top 10 countries.

## # A tibble: 10 x 2
##    Team          medal
##    <chr>         <int>
##  1 United States  5371
##  2 Soviet Union   2501
##  3 Germany        2083
##  4 Great Britain  1735
##  5 Italy          1577
##  6 France         1576
##  7 Sweden         1440
##  8 Australia      1310
##  9 Canada         1281
## 10 Russia         1158

The top ten medal winning countries have won a total of 20032 medals through the data set timeframe, comprising a total of 50.35 percent of all medals won.

The following chart depicts graphically the top ten country medal winnings and totals.

Graphically illustrated here it is clear to see that Team United States has won the most medals and the total number of medals won by United States is 2.15 times more than that of the next highest winning country, the Soviet Union.

Total Number of Medals Won by Country per Year

## # A tibble: 3,530 x 4
## # Groups:   Team, Year [1,728]
##    Team          Year  Medal  medal
##    <chr>         <fct> <chr>  <int>
##  1 Soviet Union  1980  Gold     205
##  2 United States 1984  Gold     190
##  3 Soviet Union  1988  Gold     174
##  4 United States 1996  Gold     159
##  5 Soviet Union  1980  Silver   158
##  6 Soviet Union  1976  Gold     152
##  7 United States 2012  Gold     145
##  8 Soviet Union  1972  Gold     143
##  9 United States 2016  Gold     139
## 10 Soviet Union  1980  Bronze   133
## # ... with 3,520 more rows

The above table depicts the total medals won by country per year in descending order from most medals one. The most medals won in a given year were by the Soviet Union in year 1980 with a total of 205 medals. A total of 755 instances occur where only 1 medal was won by a Team.

Medal Count by Medal Type

## # A tibble: 12 x 3
## # Groups:   Team, Medal [12]
##    Team          Medal  Total_Medals
##    <chr>         <chr>         <int>
##  1 United States Gold           2529
##  2 United States Silver         1559
##  3 United States Bronze         1283
##  4 Germany       Gold            721
##  5 Germany       Bronze          711
##  6 Germany       Silver          651
##  7 Great Britain Silver          602
##  8 France        Bronze          592
##  9 Great Britain Bronze          590
## 10 Great Britain Gold            543
## 11 France        Silver          525
## 12 France        Gold            459

The total ratio of gold medals won by the united states over all medals is 0.47 while the total ratio of gold medals won by Germany equates to 0.35.

The total ratio of gold medals won by Great Britain over all medals is 0.31 while the total ratio of gold medals won by France equates to 0.29. Based off of the ratio of gold medals to total medals across the top four countries, gold medal percentage of total medals decreasingly trends as total medal winnings decrease.

Medal Count by Type of Medal and Team

The above graphic depicts the distribution of medals based on type of medal (Gold, Silver, Bronze) for Teams in descending order from maximum number of medals to total of all medals greater than 250. As total number of medals decrease through the top medal placement, the number of medals, Gold, Bronze, Silver tend to consolidate below the top 5 Teams.

The above graphic depicts the distribution of medals based on type of medal (Gold, Silver, Bronze) for Teams in descending order from total of all medals of 250 to 15 medals over the duration of data coverage. Data shows that in the intermediate range of medals the variation of type of medal is independent of Team or Total.

The above graphic depicts the distribution of medals based on type of medal (Gold, Silver, Bronze) for Teams in descending order from total of all medals of 15 to 3 medals over the duration of data coverage. This view clearly indicates that as Teams reduce in number of medals the distribution between type of medal varies. View of data begins to indicate a stepped alignment on total number.

The above graphic depicts the distribution of medals based on type of medal (Gold, Silver, Bronze) for Teams in descending order from total of all medals of less than or equal to 5 medals over the duration of data coverage. The graphic clearly visualizes a stepped record from one to two, three, etc. through 5. This indicates a number of countries that have won a single medal, independent of medal type or other.

ggplot(data=medal_type_per_country_UFGG, aes(x=reorder(Team, -Total_Medals), y=Total_Medals)) +geom_col() +labs(x = "Country", y = "Medals", title = "Medal Type Per Total Winning for Four Countries") + theme(axis.text.x = element_text(angle = 0, vjust = -3, hjust=1))+geom_col(color="blue", fill="blue") 

The Team United States is the Country which produced the highest number of medals throughout the perior of Olyimic data. The next highest signifigance falls with Team Germany.

?element_text

medals_by_team

View(medals_by_team)

```