Setup
Loading Packages
packages = c('igraph', 'tidygraph', 'ggraph', 'visNetwork', 'lubridate', 'tidyverse', 'RColorBrewer')
for(p in packages){library
if(!require(p, character.only = T)){
install.packages(p)
}
library(p, character.only = T)
}
Importing Data
GAStech_nodes <- read_csv("data/GAStech_email_node.csv")
GAStech_edges <- read_csv("data/GAStech_email_edge-v2.csv")
Manupilating Data
GAStech_edges$SentDate = dmy(GAStech_edges$SentDate)
GAStech_edges$Weekday = wday(GAStech_edges$SentDate, label = TRUE, abbr = FALSE)
GAStech_edges_aggregated <- GAStech_edges %>%
filter(MainSubject == "Work related") %>%
group_by(source, target, Weekday) %>%
summarise(Weight = n()) %>%
filter(source!=target) %>%
filter(Weight > 1) %>%
ungroup()
Creation of Graph
GAStech_graph <- tbl_graph(nodes = GAStech_nodes, edges = GAStech_edges_aggregated, directed = TRUE)
Original graph code
g <- GAStech_graph %>%
mutate(betweenness_centrality = centrality_betweenness()) %>%
mutate(closeness_centrality = centrality_closeness()) %>%
ggraph(layout = "nicely") +
geom_edge_link(aes()) +
geom_node_point(aes(colour = closeness_centrality, size=betweenness_centrality))
g + theme_graph()

Task 1
Improve the code chunk used to create the organisation network graph by using the latest functions provided in ggraph2.0
set_graph_style(family = "sans")
# new function that replaces theme_graph by fixing the graph style; I used it to fix the font error in windows
ggraph(GAStech_graph, layout = "nicely") +
geom_edge_link() +
geom_node_point(aes(size=centrality_betweenness(),colour=centrality_closeness()))

# new function that allows for self referencing
Identify three aspects of the graph visualisation in Section 6.1 that can be improved.
The original graph does not have any indication of what each node represents, rendering the visualization semi useless due to a lack of information. I propose labeling all nodes with the names of the employees.
The black edges make the graph look extremely dense and cluterred, as well as making the visualization unclear due to black being used in our colour scale for centrality_closeness(). I propose changing the colour to something more neutral, like grey.
For an organizational graph, you would most likely want to see the interactions within/outside each department. I propose displaying the type of orgnisation with the shape of the node.
Provide the sketch of your alternative design.
Using appropriate ggraph functions, plot the alternative design.
g <- ggraph(GAStech_graph, layout = "stress") +
geom_edge_link(colour = 'grey') +
geom_node_point( aes(size=centrality_betweenness(),colour=centrality_closeness(), shape = Department)) +
geom_node_text(aes(label=label), repel=TRUE, size = 2)
g

Task 2
Setup
GAStech_edges_aggregated <- GAStech_edges %>%
left_join(GAStech_nodes, by = c("sourceLabel" = "label")) %>%
rename(from = id) %>%
left_join(GAStech_nodes, by = c("targetLabel" = "label")) %>%
rename(to = id) %>%
filter(MainSubject == "Work related") %>%
group_by(from, to) %>%
summarise(weight = n()) %>%
filter(from!=to) %>%
filter(weight > 1) %>%
ungroup()
GAStech_nodes <- GAStech_nodes %>%
rename(group = Department)
visNetwork(GAStech_nodes, GAStech_edges_aggregated) %>%
visIgraphLayout(layout = "layout_with_fr") %>%
visOptions(highlightNearest = TRUE, nodesIdSelection = TRUE)
Improve the design of the graph by incorporating the following interactivity:
*When a name is selected from the drop-down list, the corresponding node will not only be highlighted but also will be labelled. Furthermore, all the linked nodes of the selected node will also be labelled too.
*When a node of the interactive graph is selected, the node will not only be highlighted but also will be labelled. Furthermore, all the linked nodes of the selected node will be labelled as well.
visNetwork(GAStech_nodes, GAStech_edges_aggregated) %>%
visIgraphLayout(layout = "layout_with_fr") %>%
visOptions(highlightNearest = list(enabled= TRUE, algorithm = 'all', labelOnly = FALSE), nodesIdSelection = TRUE)
Identify three aspects of the graph visualisation in Section 7.4 that can be improved.
Although the visualization is colour coded by department, a fresh user who has not seen the code wouldn’t understand the colour coding. I propose a legend to identify the departments
When a node has alot of edges, sometimes the label gets too cluttered. I propose placing the labels inside the nodes, viewing the documentation mentions “The types with the label inside of it are: ellipse, circle, datab”
Currently, although we have a column representing weights, it is not plotted out on the graph. I propose actually using the weights in the visualization, represented by the thickness of each individual edge.
Provide the sketch of your alternative design.
Using appropriate visNetwork functions, plot the alternative design.
making a new column called “value”, vizNetwork takes this in as the weights. Reference: https://stackoverflow.com/questions/46664645/r-visnetwork-igraph-weighted-network-visualization-with-visedges
GAStech_edges_aggregated_2 = GAStech_edges_aggregated %>% mutate(value = weight)
visNetwork(GAStech_nodes, GAStech_edges_aggregated_2, width = "100%") %>%
visNodes(shape = 'datab') %>%
visEdges() %>%
visIgraphLayout(layout = "layout_with_fr") %>%
visOptions(highlightNearest = list(enabled= TRUE, algorithm = 'all', labelOnly = FALSE), nodesIdSelection = TRUE) %>%
visLegend()
---
title: "DataViz Makeover 2, Linus Cheng Xin Wei"
output: html_notebook
---

```{r global_options, include=FALSE}
knitr::opts_chunk$set(message = FALSE, warning = FALSE)
```

# Setup

### Loading Packages
```{r}
packages = c('igraph', 'tidygraph', 'ggraph', 'visNetwork', 'lubridate', 'tidyverse', 'RColorBrewer')

for(p in packages){library
  if(!require(p, character.only = T)){
    install.packages(p)
  }
  library(p, character.only = T)
}
```
### Importing Data
```{r}
GAStech_nodes <- read_csv("data/GAStech_email_node.csv")
GAStech_edges <- read_csv("data/GAStech_email_edge-v2.csv")
```
### Manupilating Data
```{r}
GAStech_edges$SentDate  = dmy(GAStech_edges$SentDate)
GAStech_edges$Weekday = wday(GAStech_edges$SentDate, label = TRUE, abbr = FALSE)

GAStech_edges_aggregated <- GAStech_edges %>%
  filter(MainSubject == "Work related") %>%
  group_by(source, target, Weekday) %>%
    summarise(Weight = n()) %>%
  filter(source!=target) %>%
  filter(Weight > 1) %>%
  ungroup()
```
### Creation of Graph
```{r}
GAStech_graph <- tbl_graph(nodes = GAStech_nodes, edges = GAStech_edges_aggregated, directed = TRUE)
```
### Original graph code
```{r}
g <- GAStech_graph %>%
  mutate(betweenness_centrality = centrality_betweenness()) %>%
  mutate(closeness_centrality = centrality_closeness()) %>%
  ggraph(layout = "nicely") + 
  geom_edge_link(aes()) +
  geom_node_point(aes(colour = closeness_centrality, size=betweenness_centrality))

g + theme_graph()  
```
# Task 1

### Improve the code chunk used to create the organisation network graph by using the latest functions provided in ggraph2.0
```{r echo = TRUE}

# new function that replaces theme_graph by fixing the graph style; I used it to fix the font error in windows
set_graph_style(family = "sans") 

# new object modification that allows for self referencing
ggraph(GAStech_graph, layout = "nicely") + 
  geom_edge_link() +
  geom_node_point(aes(size=centrality_betweenness(),colour=centrality_closeness())) 

```
### Identify three aspects of the graph visualisation in Section 6.1 that can be improved.

1. The original graph does not have any indication of what each node represents, rendering the visualization semi useless due to a lack of information. I propose labeling all nodes with the names of the employees.  

2. The black edges make the graph look extremely dense and cluterred, as well as making the visualization unclear due to black being used in our colour scale for centrality_closeness(). I propose changing the colour to something more neutral, like grey.

3. For an organizational graph, you would most likely want to see the interactions within/outside each department. I propose displaying the type of orgnisation with the shape of the node.

### Provide the sketch of your alternative design.

![Sketch 1](sketch_1.jpg)


### Using appropriate ggraph functions, plot the alternative design.
```{r}
g <- ggraph(GAStech_graph, layout = "stress") + 
  geom_edge_link(colour = 'grey') +
  geom_node_point( aes(size=centrality_betweenness(),colour=centrality_closeness(), shape = Department)) +
   geom_node_text(aes(label=label),  repel=TRUE, size = 2)
g
```

# Task 2

### Setup
```{r}
GAStech_edges_aggregated <- GAStech_edges %>%
  left_join(GAStech_nodes, by = c("sourceLabel" = "label")) %>%
  rename(from = id) %>%
  left_join(GAStech_nodes, by = c("targetLabel" = "label")) %>%
  rename(to = id) %>%
  filter(MainSubject == "Work related") %>%
  group_by(from, to) %>%
    summarise(weight = n()) %>%
  filter(from!=to) %>%
  filter(weight > 1) %>%
  ungroup()
GAStech_nodes <- GAStech_nodes %>%
  rename(group = Department)
```

```{r}
visNetwork(GAStech_nodes, GAStech_edges_aggregated) %>%
  visIgraphLayout(layout = "layout_with_fr") %>%
  visOptions(highlightNearest = TRUE, nodesIdSelection = TRUE)
```

### Improve the design of the graph by incorporating the following interactivity:

*When a name is selected from the drop-down list, the corresponding node will not only be highlighted but also will be labelled. Furthermore, all the linked nodes of the selected node will also be labelled too.

*When a node of the interactive graph is selected, the node will not only be highlighted but also will be labelled. Furthermore, all the linked nodes of the selected node will be labelled as well.

```{r}
visNetwork(GAStech_nodes, GAStech_edges_aggregated) %>%
  visIgraphLayout(layout = "layout_with_fr") %>%
  visOptions(highlightNearest = list(enabled= TRUE, algorithm = 'all', labelOnly = FALSE), nodesIdSelection = TRUE)
```

### Identify three aspects of the graph visualisation in Section 7.4 that can be improved.

1. Although the visualization is colour coded by department, a fresh user who has not seen the code wouldn't understand the colour coding. I propose a legend to identify the departments

2. When a node has alot of edges, sometimes the label gets too cluttered. I propose placing the labels inside the nodes, viewing the documentation mentions "The types with the label inside of it are: ellipse, circle, datab"

3. Currently, although we have a column representing weights, it is not plotted out on the graph. I propose actually using the weights in the visualization, represented by the thickness of each individual edge.

### Provide the sketch of your alternative design.
![Sketch 2](sketch_2.jpg)

### Using appropriate visNetwork functions, plot the alternative design.

making a new column called "value", vizNetwork takes this in as the weights. Reference: https://stackoverflow.com/questions/46664645/r-visnetwork-igraph-weighted-network-visualization-with-visedges

```{r echo = TRUE}

GAStech_edges_aggregated_2 = GAStech_edges_aggregated %>% mutate(value = weight)

```

```{r}
visNetwork(GAStech_nodes, GAStech_edges_aggregated_2, width = "100%") %>%
  visNodes(shape = 'datab') %>%
  visEdges() %>%
  visIgraphLayout(layout = "layout_with_fr") %>%
  visOptions(highlightNearest = list(enabled= TRUE, algorithm = 'all', labelOnly = FALSE), nodesIdSelection = TRUE) %>%
  visLegend()
```





