The data analysis for this memo only required filtering, and I ultimately determined it was more efficient to filter in Excel rather than R.
In Excel:
- Highlighted all data and inserted a table so I could easily filter information
- Hid all unnecessary rows until I was left with Name, Sex, Team, Year, Sport, Event, and Medal
- Filtered Team so I only viewed “United States”
- Filtered Sport so I only viewed “Gymnastics”
- Filtered Sex so I only viewed “F”
- Filtered Medal so I only viewed “Gold,” “Silver” and “Bronze”
- Filtered Year in ascending order
- Discovered that there have been 94 medals won by women from the U.S. since they first won a medal in 1948.
- Removed filter on Medal to find that women first started competing in this sport in 1936
- Filtered for only “Gold,” “Silver” and “Bronze” separately
Found 28 golds, 35 silvers and 31 bronzes
When looking at Gold, I found that Mary Lou Retton was the first gymnast from the U.S. to win a gold medal in 1984. She won for the individual all-around event
Made note of the years each medal was won and discovered that the most medals won in the events was in 2016, with four golds, four silvers and one bronze
- To look at how many events have been competed in total, I removed all filters from Medals and filtered Sex to only “F”
- Found that there have been 572 total events competed by female gymnasts
- To eliminate repeated names in the data and find out how many female athletes have competed, I clicked “Remove Duplicates” and selected only Name
- There have been 94 female athletes — coincidentally the same number of medals won
- To look at how many medals, including what kind, were won in each event, I filtered the Event column and selected each event individually
- For vault: Found one gold and three silvers won. For uneven bars: Found one gold, four silvers and three bronzes won. For balance beam: Found two golds, four silvers and four bronzes won. For floor exercise: Found two golds, three silvers and four bronzes won