The data analysis for this memo only required filtering, and I ultimately determined it was more efficient to filter in Excel rather than R.

In Excel:

  1. Highlighted all data and inserted a table so I could easily filter information
  2. Hid all unnecessary rows until I was left with Name, Sex, Team, Year, Sport, Event, and Medal
  3. Filtered Team so I only viewed “United States”
  4. Filtered Sport so I only viewed “Gymnastics”
  5. Filtered Sex so I only viewed “F”
  6. Filtered Medal so I only viewed “Gold,” “Silver” and “Bronze”
  7. Filtered Year in ascending order
  1. Removed filter on Medal to find that women first started competing in this sport in 1936
  2. Filtered for only “Gold,” “Silver” and “Bronze” separately
  1. To look at how many events have been competed in total, I removed all filters from Medals and filtered Sex to only “F”
  1. To eliminate repeated names in the data and find out how many female athletes have competed, I clicked “Remove Duplicates” and selected only Name
  1. To look at how many medals, including what kind, were won in each event, I filtered the Event column and selected each event individually