0.1 Introduction

The ancient olympic game was held at Olympia, Greece, from 776 BC through 393 AD. It returned after 1503 years. The first modern olympic was held in Athenes, Greece in 1896. The ‘modern Olympics’ comprises all the Games from Athens 1896 to Rio 2016. Baron Pierre de Coubertin presented the idea in 1894.

There are two long periods without any Games between 1912-1920 and 1936-1948, corresponding to WWI and WWII.

Perhaps, the most significant benefit of visual analytics is to ease the understanding of complex data, while representing it in correct, concise, and appropriate way. This manuscript proposes a handy visualization analysis of Olympic games between 1896 to 2016, which comprises in four levels of design system. These four levels are;

This application utilizes concrete analysis examples and claim to provide efficient, effective, functional, and convenient model for users. Discussion, conclusion, future work are summarized along with requesting recommendation for improvement of the limitations, where it was surveyed in evaluation as another section.

The primary goal of the dashboard is to explain how the user may benefit from the developed system experiencing visual representations. User interfaces are pretty significant, simple and clear to use, while informing the individuals about Olympics. The evaluation should at least consider whether the product meets the specific requirements, efficient and effectiveness.

Goal

Data Characteristics

0.2 Domain Problem Charactarization

0.3 Data/operation abstraction design

1. Athelete Events data

## Observations: 271,116
## Variables: 15
## $ ID     <chr> "1", "2", "3", "4", "5", "5", "5", "5", "5", "5", "6", "6…
## $ Name   <chr> "A Dijiang", "A Lamusi", "Gunnar Nielsen Aaby", "Edgar Li…
## $ Sex    <fct> M, M, M, M, F, F, F, F, F, F, M, M, M, M, M, M, M, M, M, …
## $ Age    <int> 24, 23, 24, 34, 21, 21, 25, 25, 27, 27, 31, 31, 31, 31, 3…
## $ Height <dbl> 180, 170, NA, NA, 185, 185, 185, 185, 185, 185, 188, 188,…
## $ Weight <dbl> 80, 60, NA, NA, 82, 82, 82, 82, 82, 82, 75, 75, 75, 75, 7…
## $ Team   <chr> "China", "China", "Denmark", "Denmark/Sweden", "Netherlan…
## $ NOC    <chr> "CHN", "CHN", "DEN", "DEN", "NED", "NED", "NED", "NED", "…
## $ Games  <chr> "1992 Summer", "2012 Summer", "1920 Summer", "1900 Summer…
## $ Year   <int> 1992, 2012, 1920, 1900, 1988, 1988, 1992, 1992, 1994, 199…
## $ Season <fct> Summer, Summer, Summer, Summer, Winter, Winter, Winter, W…
## $ City   <chr> "Barcelona", "London", "Antwerpen", "Paris", "Calgary", "…
## $ Sport  <chr> "Basketball", "Judo", "Football", "Tug-Of-War", "Speed Sk…
## $ Event  <chr> "Basketball Men's Basketball", "Judo Men's Extra-Lightwei…
## $ Medal  <fct> NA, NA, NA, Gold, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA,…
## # A tibble: 6 x 15
##   ID    Name  Sex     Age Height Weight Team  NOC   Games  Year Season
##   <chr> <chr> <fct> <int>  <dbl>  <dbl> <chr> <chr> <chr> <int> <fct> 
## 1 1     A Di… M        24    180     80 China CHN   1992…  1992 Summer
## 2 2     A La… M        23    170     60 China CHN   2012…  2012 Summer
## 3 3     Gunn… M        24     NA     NA Denm… DEN   1920…  1920 Summer
## 4 4     Edga… M        34     NA     NA Denm… DEN   1900…  1900 Summer
## 5 5     Chri… F        21    185     82 Neth… NED   1988…  1988 Winter
## 6 5     Chri… F        21    185     82 Neth… NED   1988…  1988 Winter
## # … with 4 more variables: City <chr>, Sport <chr>, Event <chr>,
## #   Medal <fct>

2. NOC Regions data

## Observations: 230
## Variables: 3
## $ NOC    <chr> "AFG", "AHO", "ALB", "ALG", "AND", "ANG", "ANT", "ANZ", "…
## $ region <chr> "Afghanistan", "Curacao", "Albania", "Algeria", "Andorra"…
## $ notes  <chr> NA, "Netherlands Antilles", NA, NA, NA, NA, "Antigua and …
## # A tibble: 6 x 3
##   NOC   region      notes               
##   <chr> <chr>       <chr>               
## 1 AFG   Afghanistan <NA>                
## 2 AHO   Curacao     Netherlands Antilles
## 3 ALB   Albania     <NA>                
## 4 ALG   Algeria     <NA>                
## 5 AND   Andorra     <NA>                
## 6 ANG   Angola      <NA>

0.4 Encoding/Interaction design

1. Has the number of athletes, nations, and events changed over time?

# Load athletes_events data 
data <- read_csv("Data/athlete_events.csv",
                 col_types = cols(
                   ID = col_character(),
                   Name = col_character(),
                   Sex = col_factor(levels = c("M","F")),
                   Age =  col_integer(),
                   Height = col_double(),
                   Weight = col_double(),
                   Team = col_character(),
                   NOC = col_character(),
                   Games = col_character(),
                   Year = col_integer(),
                   Season = col_factor(levels = c("Summer","Winter")),
                   City = col_character(),
                   Sport = col_character(),
                   Event = col_character(),
                   Medal = col_factor(levels = c("Gold","Silver","Bronze"))
                 )
)

# count number of athletes, nations, & events, excluding the Art Competitions
counts <- data %>%
  group_by(Year, Season) %>%
  summarize(
    Athletes = length(unique(ID)),
    Nations = length(unique(NOC)),
    Events = length(unique(Event))
  )
counts <- counts %>%
  mutate(gap= if(Year<1920) 1 else if(Year>=1920 & Year<=1936) 2 else 3)
p1 <- ggplot(counts, aes(x=Year, y=Athletes, group=interaction(Season,gap), color=Season)) +
  geom_point(size=2) +
  geom_line() +
  scale_color_manual(values=c("darkorange","darkblue")) +
  xlab("") +  
  
  annotate("text",x=c(1916,1942),y=c(10000,10000),
           label=c("WWI","WWII"), size=4, color="red") +
  geom_segment(mapping=aes(x=1914,y=8000,xend=1918,yend=8000),color="red", size=2) +
  geom_segment(mapping=aes(x=1939,y=8000,xend=1945,yend=8000),color="red", size=2)
p2 <- ggplot(counts, aes(x=Year, y=Nations, group=interaction(Season,gap), color=Season)) +
  geom_point(size=2) +
  geom_line() +
  scale_color_manual(values=c("darkorange","darkblue")) +
  xlab("") 
p3 <- ggplot(counts, aes(x=Year, y=Events, group=interaction(Season,gap), color=Season)) +
  geom_point(size=2) +
  geom_line() +
  scale_color_manual(values=c("darkorange","darkblue"))
grid.arrange(p1, p2, p3, ncol=1)

2. Which countries won the most medals (TOP 10)?

3.Which countries won the most medals- Map view

4.Number of female and male over time

Primary and Secondry Question Findings

0.5 Algorithmic design

https://rstudio.github.io/shinyloadtest/articles/limitations-of-shinyloadtest.html

0.6 User evaluation

0.7 Future work

0.8 Appendix

Limitations

Links

https://anjali-bapat.shinyapps.io/Final_Project/

https://docs.google.com/forms/d/e/1FAIpQLSfTCvwJWXp9CgdG6mQ7kgn06chfwXzZqHUJaobJdWCZMR1keQ/viewform

https://elifdemirblog.netlify.com/post/120-years-of-olympics/

https://github.com/anjbapat/Olympics.github.io/

http://rpubs.com/kabitapaul11/olympic_presentation

0.9 References