I conducted an analysis of public Facebook pages and groups that support US Presidential Candidates Donald Trump and Joe Biden to determine what pages are be most influential, may be coordinating to amplify content, and what topics they are discussing.

To identify pages and groups that support the candidates, I used the CrowdTangle API to search for pages and groups that have shared posts from the official Facebook pages of the candidates any time in the past year (October 2018 and October 2019). Below is the number of pages that were found which comprise the support “ecosystems” of pro-Trump and pro-Biden used in this analysis:

To identify which pages may be most influential among those identified, I used the CrowdTangle API to extract “Leaderboard” data from lists of pages and groups. Leaderboard data includes the number users that follow or are members of pages or groups and the number of interactions (likes, comments, and shares) that posts have received.

INFLUENCE ANALYSIS: PRO-TRUMP PAGES

In the pro-Trump Pages ecosystem, the most prominent pages ranked by number of interactions (likes, comments and shares) were “Pro-Trump” and “Pro-Republican” judging by the page names. Excluding the official page of Donald Trump, eight of the top 10 pages had over 2,000,000 interactions.

COORDINATION ANALYSIS: PRO-TRUMP PAGES

A social network analysis shows a large cluster of Pro-Trump pages that seem to engage in relatively loose coordination and share the same content. Two other small clusters also clearly emerged: one comprised of four influential pages “Trump2020”, “Conservative Renaissance”, “Red Symposium”, and “QCN”, and another comprised of three less influential pages from Illinois, North Dakota and South Dakota “Against the Illegal Alien Invasion.” - these smaller clusters clearly engage in coordinated efforts to amplify content.

TOPIC ANALYSIS: PRO-TRUMP PAGES

Democrats, borders, and the House of Representatives featured prominently. Mentions of Democrats were associated with “lies” and “political narratives”, borders with crisis and wall, and the House of Representatives with Nancy Pelosi and the Judiciary Committee. Fake news was also a notable topic and there was quite a bit of discussion of Hollywood (presumably anti).

COORDINATED CLUSTER TOPIC ANALYSIS: PRO-TRUMP PAGES

A closer look at the topics of posts in the coordination cluster comprised of “Red Symposium”, “QCN”, “Trump 2020”, “Conservative Renaissance” highlights strongly divisive anti-Democrat and anti-Liberal messaging. Mentions of “Democrats” or “Democratic Party” are associated with words such as “lies”, “extreme politics”, “identity politics”, and “America hating” while the term “Liberals” is associated with “violent.” There is also an association of “liberal”, “media”, and “hypocrisy” as well as a small, but strong clustering of the terms “god”, “bless”, “president” and “trump”.

A closer look at the topics of posts in the coordination cluster comprised of“Stand With Illinois Against The Illegal Alien Invasion”, “Stand With South Dakota Against The Illegal Alien Invasion”, “Stand With North Dakota Against The Illegal Alien Invasion” is a bit more narrowly focused on the topic of border security (this is not terribly surprising given the page names in the cluster).

INFLUENCE ANALYSIS: PRO-BIDEN PAGES

In the pro-Biden Pages ecosystem, the most prominent pages ranked by number of interactions (likes, comments and shares of posts) include a mix of pages that appear to be anti-Trump, pro-Obama, and pro-Democrat in general. The first page that mentions Joe Biden in its name (beyond the official page) is “Joe Biden for President 2020” which ranks 14th and has 4,519 followers. By contrast, there are four anti-Trump pages ranked in the top 10.

COORDINATION ANALYSIS: PRO-BIDEN PAGES

A social network analysis indicates that a cluster of lesser popular and distinctly location or identity-specific pro-Biden Pages (e.g. “Utah for Biden”, “Milwaukee for Joe Biden”, “Women For Biden”) are seemingly coordinating to amplify the same content. There appeared to be otherwise little coordination among other pages, such as anti-Trump or pro-Obama pages.

TOPIC ANALYSIS: PRO-BIDEN PAGES

A social network analysis of posts in pro-Biden Pages highlights health care and the Democratic primary race as prominent topics of conversation. A word association of those topics indicates that mentions of the Affordable Care Act and pre-existing conditions were referenced in posts about health care. Mentions of polls Elizabeth Warren, Bernie Sanders and Kamala Harris were associated with posts about the Democratic primary race. Donald Trump is frequently referenced along with mentions of Ukraine. Barack Obama and the House of Representatives were also frequent topics and there was some discussion of white supremacy (presumably anti).

INFLUENCE ANALYSIS: PRO-TRUMP GROUPS

In the pro-Trump Groups ecosystem, the most prominent groups were predominantly “Pro-Trump” and “Pro-Conservative” groups. Several groups had more than 5,000,000 interactions.

TOPIC ANALYSIS: PRO-TRUMP GROUPS

The Bidens, impeachment and the House of Representatives featured prominently. There was quite a bit of inter-related discussion on these topics with Donald Trump and impeachment at the center. Mentions of the Bidens were associated with Hunter and prosecutor, and the House of Representatives with the Intelligence Committee. Mainstream and fake news as well as god were also notable topics of conversation.

INFLUENCE ANALYSIS: PRO-BIDEN GROUPS

In the pro-Biden Groups ecosystem, the most prominent groups were predominantly “Anti-Trump” and “Pro-Democrat” groups. The most popular groups were “anti-Trump”, two of which - “FU Trump” and “A SKUNK NAMED TRUMP” receiving more than 5,000,000 interactions.

TOPIC ANALYSIS: PRO-BIDEN GROUPS

Donald Trump, Ukraine and Impeachment featured prominently. There was quite a bit of inter-related discussion on these topics with Donald Trump at the center of it. Mentions of Ukraine were associated with Giuliani and Zelensky, and impeachment with inquiry and Nancy Pelosi. Elizabeth Warren was mentioned, but other candidates did not feature prominently. Fake news was also a notable topic of conversation (presumably anti).

---
title: "Coordination & Topic Analysis in pro-Trump & pro-Biden Facebook Ecosystems"
output: html_notebook
---

I conducted an analysis of public Facebook pages and groups that support US Presidential Candidates Donald Trump and Joe Biden to determine what pages are be most influential, may be coordinating to amplify content, and what topics they are discussing. 

To identify pages and groups that support the candidates, I used the CrowdTangle API to search for pages and groups that have shared posts from the official Facebook pages of the candidates any time in the past year (October 2018 and October 2019). Below is the number of pages that were found which comprise the support "ecosystems" of pro-Trump and pro-Biden used in this analysis:

* 100 pro-Trump Pages (299,996 Posts)
* 216 pro-Trump Groups (299,974 Posts)
* 48 pro-Biden Pages (86,387 Posts)
* 77 pro-Biden Groups (299,995 Posts)

To identify which pages may be most influential among those identified, I used the CrowdTangle API to extract "Leaderboard" data from lists of pages and groups. Leaderboard data includes the number users that follow or are members of pages or groups and the number of interactions (likes, comments, and shares) that posts have received. 

```{r include=FALSE}
#Load Required Packages
require("dplyr")
require("stringr")
require("ggplot2")
require("tidytext")
require("tidyr")
require("textmineR")
require("widyr")
require("topicmodels")
require("drlib")
require("igraph")
require("ggraph")
require("drlib")
require("scales")
require("janitor")
require("DT")
```

```{r, include=FALSE}
#Read in Data for topic analysis
jbpages <- readRDS(file="jbpages_oct18-oct19.rda")
dtpages <- readRDS(file="dtpages_oct18-oct19.rda")
jbgroups <- readRDS(file="jbgroups_oct18-oct19.rda")
dtgroups <- readRDS(file="dtgroups_oct18-oct19.rda")
```

```{r include=FALSE}
#Read In Data for Leader Analysis 
jbpages_leaders <- readRDS("jbpages_leaders.rda")
dtpages_leaders <- readRDS("dtpages_leaders.rda")
jbgroups_leaders <- readRDS("jbgroups_leaders.rda")
dtgroups_leaders <- readRDS("dtgroups_leaders.rda")
```

```{r include=FALSE}
#Run Functions for Cleaning, Tokenizing & Correlating Data
text_clean <- function(data, text) {
  require("dplyr")
  text <- enquo(text)
  data <- data %>% select(!!text) %>% filter(!!text != "") #remove empty messages
  data <- data %>% mutate(text=as.character(!!text))
  data <- data %>% mutate(text=gsub("http[^[:space:]]*", '', text))
  data <- data %>% mutate(text=gsub("#\\w+ *", '', text))
  data <- data %>% mutate(text=gsub("@\\w+ *", '', text))
  data <- data %>% mutate(text=gsub("RT\\w+ *:", '', text)) 
  data <- data %>% mutate(text=gsub("[[:digit:]]+", '', text))
  data <- data %>% mutate(text=gsub("[[:punct:]]+", '', text)) 
  data <- data %>% mutate(text=gsub("[^[:alnum:]]",' ', text))
  data <- data %>% filter(text != "") %>% mutate(id = row_number()) %>% transmute(id=id, text=text)  
  return(data)
}

text_tokenize <- function(data, text, stopwords=NULL, n=NULL) {
  require("dplyr")
  require("tidyr")
  require("tidytext")
  text=enquo(text)
  if(is.null(stopwords)) { stopwords <- tidytext::stop_words }
  if(is.null(n)) { n <- 1 }
  tokens <- data %>% unnest_tokens(word, !!text)
  tokens <- tokens %>% filter(!(nchar(word)==1)) %>% anti_join(stopwords) 
  tokens <- tokens %>% mutate(ind=row_number())
  tokens <- tokens %>% group_by(id) %>% mutate(ind=row_number()) %>% spread(key=ind, value=word) 
  tokens[is.na(tokens)] <- ""
  tokens <- unite(tokens, text, -id, sep=" ")
  tokens$text <- trimws(tokens$text)
  tokens <- tokens %>% unnest_tokens(ngram, text, token="ngrams", n=n)
  return(tokens)
}

text_correlation <- function(data, ngram, top=NULL, mentions=NULL) {
  require("dplyr")
  require("widyr")
  ngram <- enquo(ngram)
  if(is.null(top)) { top <- 20 }
  if(is.null(mentions)) { mentions <- 1 }
  top_ngrams <- data %>% group_by(!!ngram) %>% count() %>% arrange(desc(n)) %>% head(top)
  top_ngrams <- top_ngrams$ngram
  correlation <- data %>% group_by(!!ngram) %>% filter(n() >= mentions) %>% pairwise_cor(ngram, id, sort=T)
  top_topics <- correlation %>% filter(item1 %in% top_ngrams) %>% transmute(topic=item1, mention=item2, correlation=correlation)
  return(top_topics)
}

topic_reduce <- function(data, topic, mentions=NULL) {
  require("dplyr")
  topic <- enquo(topic)
  #sort <- enquo(sort)
  #if(is.null(sort)) { sort <- correlation }
  if(is.null(mentions)) { mentions <- 10 }
  clean <- data %>% group_by(!!topic) %>% top_n(mentions, wt=correlation) %>% ungroup() %>% arrange(!!topic, correlation) %>% mutate(.r = row_number())
  return(clean)
}

```
#### INFLUENCE ANALYSIS: PRO-TRUMP PAGES
In the **pro-Trump Pages ecosystem**, the most prominent pages ranked by number of interactions (likes, comments and shares) were “Pro-Trump” and “Pro-Republican" judging by the page names. Excluding the official page of Donald Trump, eight of the top 10 pages had over 2,000,000 interactions.
<br/>
<br/>
```{r, echo=FALSE, warning=FALSE, eval=TRUE}
dtpages_int <- dtpages_leaders %>% rename(Name=Page) 

datatable(dtpages_int, options=list(order=list(list(3, 'desc'))))
```

#### COORDINATION ANALYSIS: PRO-TRUMP PAGES
A social network analysis shows a large cluster of Pro-Trump pages that seem to engage in relatively loose coordination and share the same content. Two other small clusters also clearly emerged: one comprised of four influential pages “Trump2020”, “Conservative Renaissance”, “Red Symposium”, and “QCN”, and another comprised of three less influential pages from Illinois, North Dakota and South Dakota “Against the Illegal Alien Invasion.” - these smaller clusters clearly engage in coordinated efforts to amplify content. 
<br/>
<br/>
```{r, include=FALSE}
dtpages_facebook <- dtpages %>% filter(str_detect(Link, "https://www.facebook.com")) %>% mutate(Shared.Page = str_match(Link, "://(.*?)/(.*?)/")[,3])
#head(dtpages_facebook2)

dtpages_facebook_network <- dtpages_facebook %>% filter(Page.Name != Shared.Page) %>% select(Shared.Page, Page.Name)
dtpages_counts <- dtpages_facebook_network %>% count(Page.Name) %>% arrange(desc(n))
dtpages_page_correlations <- dtpages_facebook_network %>% semi_join(dtpages_counts) %>% pairwise_cor(Page.Name, Shared.Page, sort=T, upper= F)
```

```{r, echo=FALSE, warning=FALSE, eval=TRUE}
set.seed(2019)

dtp_coordination_sna <- dtpages_page_correlations %>%
  filter(correlation > .3) %>%
  graph_from_data_frame(directed = T) %>%
  ggraph(layout = "fr") +
  geom_edge_link(aes(edge_alpha = correlation), show.legend = FALSE) +
  geom_node_point(color = "red", size = 3) +
  geom_node_text(aes(label = name), repel = TRUE) +
  theme_void()

dtp_coordination_sna
```

#### TOPIC ANALYSIS: PRO-TRUMP PAGES
Democrats, borders, and the House of Representatives featured prominently. Mentions of Democrats were associated with “lies” and “political narratives”, borders with crisis and wall, and the House of Representatives with Nancy Pelosi and the Judiciary Committee. Fake news was also a notable topic and there was quite a bit of discussion of Hollywood (presumably anti).

```{r, include=FALSE}
dtpages2 <- dtpages %>% ungroup()
dtpages_text <- text_clean(data=dtpages2, text=Message)
dtpages_tokens <- text_tokenize(data=dtpages_text, text=text, stopwords=NULL, n=1)  
dtpages_correlation <- text_correlation(data=dtpages_tokens, ngram=ngram, top=20, mentions=20)

#Clean Text for Words That Are Not Relevant
dtpages_correlation %>% group_by(topic) %>% top_n(n=10, wt=correlation) %>% arrange(topic)
#Inspect one-by-one to make sure you're not missing something
dtpages_topics_clean <- dtpages_correlation %>% filter(!topic %in% c("warrior", "angel", "rebel", "rouser", "time", "dont", "american", "donald"))
#Reduce Topics & Prep For Plotting
dtpages_topics <- topic_reduce(dtpages_topics_clean, topic=topic, mentions=10)
#dtpages_topics 
```

```{r, echo=FALSE, warning=FALSE, eval=TRUE}
dtp_topics_bar <- dtpages_topics %>% ggplot(aes(x=.r, y=correlation)) +
  geom_bar(stat = "identity", color='red',fill='firebrick') +
  facet_wrap(~ topic, scales = "free") +
  coord_flip() +
  scale_x_continuous(breaks = dtpages_topics$.r,
                     labels = dtpages_topics$mention)

set.seed(2019)
dtp_topics_sna <- dtpages_topics %>%
  filter(correlation > .15) %>%
  graph_from_data_frame() %>%
  ggraph(layout = "fr") +
  geom_edge_link(aes(edge_alpha = correlation), show.legend = FALSE) +
  geom_node_point(color = "red", size = 3) +
  geom_node_text(aes(label = name), repel = TRUE) +
  theme_void()

dtp_topics_sna
dtp_topics_bar

```

###COORDINATED CLUSTER TOPIC ANALYSIS: PRO-TRUMP PAGES
A closer look at the topics of posts in the coordination cluster comprised of "Red Symposium", "QCN", "Trump 2020", "Conservative Renaissance" highlights strongly divisive anti-Democrat and anti-Liberal messaging. Mentions of "Democrats" or "Democratic Party" are associated with words such as "lies", "extreme politics", "identity politics", and "America hating" while the term "Liberals" is associated with "violent." There is also an association of "liberal", "media", and "hypocrisy" as well as a small, but strong clustering of the terms "god", "bless", "president" and "trump".

```{r, include=FALSE}
cluster_4 <- c("Red Symposium", "QCN", "Trump 2020", "Conservative Renaissance")
dtpages3 <- dtpages2 %>% filter(Page.Name %in% cluster_4)
dtpages_text_cluster_4 <- text_clean(data=dtpages3, text=Message)
dtpages_tokens_cluster_4 <- text_tokenize(data=dtpages_text_cluster_4, text=text, stopwords=NULL, n=1)  
dtpages_correlation_cluster_4 <- text_correlation(data=dtpages_tokens_cluster_4, ngram=ngram, top=20, mentions=20)

#Clean Text for Words That Are Not Relevant
dtpages_correlation_cluster_4 %>% group_by(topic) %>% top_n(n=10, wt=correlation) %>% arrange(topic)
#Inspect one-by-one to make sure you're not missing something
#dtpages_topics_clean <- dtpages_correlation %>% filter(!topic %in% c("warrior", "angel", "rebel", "rouser", "time", "dont", "american", "donald"))
#Reduce Topics & Prep For Plotting
dtpages_topics_cluster_4 <- topic_reduce(dtpages_correlation_cluster_4, topic=topic, mentions=10)
#dtpages_topics 
```

```{r, echo=FALSE, warning=FALSE, eval=TRUE}
#dtp_topics_bar <- dtpages_topics %>% ggplot(aes(x=.r, y=correlation)) +
#  geom_bar(stat = "identity", color='red',fill='firebrick') +
#  facet_wrap(~ topic, scales = "free") +
#  coord_flip() +
#  scale_x_continuous(breaks = dtpages_topics$.r,
#                     labels = dtpages_topics$mention)

set.seed(2019)
dtp_topics_sna_cluster_4 <- dtpages_topics_cluster_4 %>%
  filter(correlation > .15) %>%
  graph_from_data_frame() %>%
  ggraph(layout = "fr") +
  geom_edge_link(aes(edge_alpha = correlation), show.legend = FALSE) +
  geom_node_point(color = "red", size = 3) +
  geom_node_text(aes(label = name), repel = TRUE) +
  theme_void()

dtp_topics_sna_cluster_4
#dtp_topics_bar
```

A closer look at the topics of posts in the coordination cluster comprised of"Stand With Illinois Against The Illegal Alien Invasion", "Stand With South Dakota Against The Illegal Alien Invasion", "Stand With North Dakota Against The Illegal Alien Invasion" is a bit more narrowly focused on the topic of border security (this is not terribly surprising given the page names in the cluster).   

```{r, include=FALSE}
cluster_3 <- c("Stand With Illinois Against The Illegal Alien Invasion", "Stand With South Dakota Against The Illegal Alien Invasion", "Stand With North Dakota Against The Illegal Alien Invasion")
dtpages4 <- dtpages2 %>% filter(Page.Name %in% cluster_3)
dtpages_text_cluster_3 <- text_clean(data=dtpages4, text=Message)
dtpages_tokens_cluster_3 <- text_tokenize(data=dtpages_text_cluster_3, text=text, stopwords=NULL, n=1)  
dtpages_correlation_cluster_3 <- text_correlation(data=dtpages_tokens_cluster_3, ngram=ngram, top=20, mentions=20)

#Clean Text for Words That Are Not Relevant
dtpages_correlation_cluster_3 %>% group_by(topic) %>% top_n(n=10, wt=correlation) %>% arrange(topic)
#Inspect one-by-one to make sure you're not missing something
dtpages_topics_clean_cluster_3 <- dtpages_correlation_cluster_3 %>% filter(!topic %in% c("ruthie"))
#Reduce Topics & Prep For Plotting
dtpages_topics_cluster_3 <- topic_reduce(dtpages_correlation_cluster_3, topic=topic, mentions=10)
#dtpages_topics 
```

```{r, echo=FALSE, warning=FALSE, eval=TRUE}
#dtp_topics_bar <- dtpages_topics %>% ggplot(aes(x=.r, y=correlation)) +
#  geom_bar(stat = "identity", color='red',fill='firebrick') +
#  facet_wrap(~ topic, scales = "free") +
#  coord_flip() +
#  scale_x_continuous(breaks = dtpages_topics$.r,
#                     labels = dtpages_topics$mention)

set.seed(2019)
dtp_topics_sna_cluster_3 <- dtpages_topics_cluster_3 %>%
  filter(correlation > .15) %>%
  graph_from_data_frame() %>%
  ggraph(layout = "fr") +
  geom_edge_link(aes(edge_alpha = correlation), show.legend = FALSE) +
  geom_node_point(color = "red", size = 3) +
  geom_node_text(aes(label = name), repel = TRUE) +
  theme_void()

dtp_topics_sna_cluster_3
#dtp_topics_bar
```

#### INFLUENCE ANALYSIS: PRO-BIDEN PAGES
In the **pro-Biden Pages ecosystem**, the most prominent pages ranked by number of interactions (likes, comments and shares of posts) include a mix of pages that appear to be anti-Trump, pro-Obama, and pro-Democrat in general. The first page that mentions Joe Biden in its name (beyond the official page) is "Joe Biden for President 2020" which ranks 14th and has 4,519 followers. By contrast, there are four anti-Trump pages ranked in the top 10.
<br/>
<br/>
```{r, echo=FALSE, warning=FALSE, eval=TRUE}
jbpages_int <- jbpages_leaders %>% rename(Name=Page) %>% filter(!Name %in% c("Klaus Iohannis - Presedintele Romaniei", "Ψ | МИ ПАТРІОТИ УКРАЇНИ | Ψ", "We Are Q", "Dacian Ciolos"))

datatable(jbpages_int, options=list(order=list(list(3, 'desc'))))
```

#### COORDINATION ANALYSIS: PRO-BIDEN PAGES
```{r, include=FALSE}
jbpages_facebook <- jbpages %>% filter(str_detect(Link, "https://www.facebook.com")) %>% mutate(Shared.Page = str_match(Link, "://(.*?)/(.*?)/")[,3])
#head(jbpages_facebook)

jbpages_facebook_network <- jbpages_facebook %>% filter(Page.Name != Shared.Page) %>% select(Shared.Page, Page.Name)
jbpages_counts <- jbpages_facebook_network %>% count(Page.Name) %>% arrange(desc(n))
jbpages_page_correlations <- jbpages_facebook_network %>% semi_join(jbpages_counts) %>% pairwise_cor(Page.Name, Shared.Page, sort=T, upper= F)
```

A social network analysis indicates that a cluster of lesser popular and distinctly location or identity-specific pro-Biden Pages (e.g. "Utah for Biden", "Milwaukee for Joe Biden", "Women For Biden") are seemingly coordinating to amplify the same content. There appeared to be otherwise little coordination among other pages, such as anti-Trump or pro-Obama pages.
<br/>
<br/>

```{r, echo=FALSE, warning=FALSE, eval=TRUE}
set.seed(2019)

jbp_coordination_sna <- jbpages_page_correlations %>%
  filter(correlation > .3) %>%
  graph_from_data_frame(directed = T) %>%
  ggraph(layout = "fr") +
  geom_edge_link(aes(edge_alpha = correlation), show.legend = FALSE) +
  geom_node_point(color = "blue", size = 3) +
  geom_node_text(aes(label = name), repel = TRUE) +
  theme_void()

#dtpages_page_correlations %>% datatable(class = 'cell-border stripe', caption = 'Top shared Facebook Pages, Groups, or profiles by sharer')
jbp_coordination_sna
```

#### TOPIC ANALYSIS: PRO-BIDEN PAGES
A social network analysis of posts in pro-Biden Pages highlights health care and the Democratic primary race as prominent topics of conversation. A word association of those topics indicates that mentions of the Affordable Care Act and pre-existing conditions were referenced in posts about health care. Mentions of polls Elizabeth Warren, Bernie Sanders and Kamala Harris were associated with posts about the Democratic primary race. Donald Trump is frequently referenced along with mentions of Ukraine. Barack Obama and the House of Representatives were also frequent topics and there was some discussion of white supremacy (presumably anti).  

```{r include=FALSE}
#Prep Data for Topic Analysis
jbpages2 <- jbpages %>% ungroup()
jbpages_text <- text_clean(data=jbpages2, text=Message)
jbpages_tokens <- text_tokenize(data=jbpages_text, text=text, stopwords=NULL, n=1)  
jbpages_correlation <- text_correlation(data=jbpages_tokens, ngram=ngram, top=20, mentions=10)

#Clean Text for Words That Are Not Relevant
#jbpages_correlation %>% group_by(topic) %>% top_n(n=10, wt=correlation) %>% arrange(topic)
jbpages_topics_clean <- jbpages_correlation %>% filter(!topic %in% c("pfm", "edegbe", "bank", "support", "time", "day", "happy", "people")) 

#Reduce Topics & Prep For Plotting
jbpages_topics <- topic_reduce(jbpages_topics_clean, topic=topic, mentions=10)
#jbpages_topics %>% group_by(topic) 
```

```{r, echo=FALSE, warning=FALSE, eval=TRUE}
jbp_topics_bar <- jbpages_topics %>% ggplot(aes(x=.r, y=correlation)) +
  geom_bar(stat = "identity", color='skyblue',fill='steelblue') +
  facet_wrap(~ topic, scales = "free") +
  coord_flip() +
  xlab("Words Associated with Topic") + ylab("Correlation") +
  scale_x_continuous(breaks = jbpages_topics$.r,
                     labels = jbpages_topics$mention)

set.seed(2019)
jbp_topics_sna <- jbpages_topics %>%
  filter(correlation > .15) %>%
  graph_from_data_frame() %>%
  ggraph(layout = "fr") +
  geom_edge_link(aes(edge_alpha = correlation), show.legend = FALSE) +
  geom_node_point(color = "lightblue", size = 3) +
  geom_node_text(aes(label = name), repel = TRUE) +
  theme_void()

jbp_topics_sna
jbp_topics_bar
```

#### INFLUENCE ANALYSIS: PRO-TRUMP GROUPS
In the **pro-Trump Groups ecosystem**, the most prominent groups were predominantly “Pro-Trump” and “Pro-Conservative” groups. Several groups had more than 5,000,000 interactions. 
<br/>
<br/>

```{r, echo=FALSE, warning=FALSE, eval=TRUE}
dtgroups_int <- dtgroups_leaders %>% rename(Name=Group) 

datatable(dtgroups_int, options=list(order=list(list(3, 'desc'))))
```

#### TOPIC ANALYSIS: PRO-TRUMP GROUPS
The Bidens, impeachment and the House of Representatives featured prominently. There was quite a bit of inter-related discussion on these topics with Donald Trump and impeachment at the center. Mentions of the Bidens were associated with Hunter and prosecutor, and the House of Representatives with the Intelligence Committee. Mainstream and fake news as well as god were also notable topics of conversation. 
<br/>
<br/>

```{r, include=FALSE}
dtgroups2 <- dtgroups %>% ungroup()
dtgroups_text <- text_clean(data=dtgroups2, text=Message)
dtgroups_tokens <- text_tokenize(data=dtgroups_text, text=text, stopwords=NULL, n=1)  
dtgroups_correlation <- text_correlation(data=dtgroups_tokens, ngram=ngram, top=20, mentions=20)

#Clean Text for Words That Are Not Relevant
dtgroups_correlation %>% group_by(topic) %>% top_n(n=10, wt=correlation) %>% arrange(topic)
#Inspect one-by-one to make sure you're not missing something
dtgroups_topics_clean <- dtgroups_correlation %>% filter(!topic %in% c("time", "dont", "people", "donald", "american", "country", "news", "president")) 
#Reduce Topics & Prep For Plotting
dtgroups_topics <- topic_reduce(dtgroups_topics_clean, topic=topic, mentions=10)
```

```{r, echo=FALSE, warning=FALSE, eval=TRUE}
dtg_topics_bar <- dtgroups_topics %>% ggplot(aes(x=.r, y=correlation)) +
  geom_bar(stat = "identity", color='red',fill='firebrick') +
  facet_wrap(~ topic, scales = "free") +
  coord_flip() +
  scale_x_continuous(breaks = dtpages_topics$.r,
                     labels = dtpages_topics$mention)

set.seed(2019)
dtg_topics_sna <- dtgroups_topics %>%
  filter(correlation > .15) %>%
  graph_from_data_frame() %>%
  ggraph(layout = "fr") +
  geom_edge_link(aes(edge_alpha = correlation), show.legend = FALSE) +
  geom_node_point(color = "red", size = 3) +
  geom_node_text(aes(label = name), repel = TRUE) +
  theme_void()

dtg_topics_sna
dtg_topics_bar
```

#### INFLUENCE ANALYSIS: PRO-BIDEN GROUPS
In the **pro-Biden Groups ecosystem**, the most prominent groups were predominantly “Anti-Trump” and “Pro-Democrat” groups. The most popular groups were “anti-Trump”, two of which - “FU Trump” and “A SKUNK NAMED TRUMP” receiving more than 5,000,000 interactions. 
<br/>
<br/>
```{r, echo=FALSE, warning=FALSE, eval=TRUE}
jbgroups_int <- jbgroups_leaders %>% rename(Name=Group) %>% filter(!Name %in% c("Klaus Iohannis - Presedintele Romaniei", "Ψ | МИ ПАТРІОТИ УКРАЇНИ | Ψ", "We Are Q", "Dacian Ciolos")) 

datatable(jbgroups_int, options=list(order=list(list(3, 'desc'))))
```

#### TOPIC ANALYSIS: PRO-BIDEN GROUPS
Donald Trump, Ukraine and Impeachment featured prominently. There was quite a bit of inter-related discussion on these topics with Donald Trump at the center of it. Mentions of Ukraine were associated with Giuliani and Zelensky, and impeachment with inquiry and Nancy Pelosi. Elizabeth Warren was mentioned, but other candidates did not feature prominently. Fake news was also a notable topic of conversation (presumably anti).
<br/>
<br/>

```{r, include=FALSE}
jbgroups2 <- jbgroups %>% ungroup()
jbgroups_text <- text_clean(data=jbgroups2, text=Message)
jbgroups_tokens <- text_tokenize(data=jbgroups_text, text=text, stopwords=NULL, n=1)  
jbgroup_correlation <- text_correlation(data=jbgroups_tokens, ngram=ngram, top=20, mentions=20)

#Clean Text for Words That Are Not Relevant
jbgroup_correlation %>% group_by(topic) %>% top_n(n=10, wt=correlation) %>% arrange(topic)
jbgroups_topics_clean <- jbgroup_correlation %>% filter(!topic %in% c("de", "time", "dont", "la", "hes", "не", "на", "care"))

#Reduce Topics & Prep For Plotting
jbgroups_topics <- topic_reduce(jbgroups_topics_clean, topic=topic, mentions=10)
#jbgroups_topics %>% saveRDS(file="app/jbg_topics.rda")
```

```{r, eval=TRUE, echo=FALSE, warning=FALSE}
jbg_topics_bar <- jbgroups_topics %>% ggplot(aes(x=.r, y=correlation)) +
  geom_bar(stat = "identity", color='skyblue',fill='steelblue') +
  facet_wrap(~ topic, scales = "free") +
  coord_flip() +
  scale_x_continuous(breaks = jbgroups_topics$.r,
                     labels = jbgroups_topics$mention)

set.seed(2019)
jbg_topics_sna <- jbgroups_topics %>%
  filter(correlation > .15) %>%
  graph_from_data_frame() %>%
  ggraph(layout = "fr") +
  geom_edge_link(aes(edge_alpha = correlation), show.legend = FALSE) +
  geom_node_point(color = "lightblue", size = 3) +
  geom_node_text(aes(label = name), repel = TRUE) +
  theme_void()

jbg_topics_sna
jbg_topics_bar

```

```{r}

```