Social Network Analysis with TERGM was conducted on a professional network of 28 health professionals located in South Dakota. Each of these actors has a possibility of five connection types–how often they contact others (contact), the strength of their relationship (know), how often they receive information from others (receive), how often they send information to others (send), and how much they trust them (trust). The connections of interest in our analysis are receive and trust.
The attributes for this network include the following:
Workgroup: 1 = Stanford model of Chronic Disease Self-Management Program, 2 = PN/CHW
Attendees: 1 = absent, 2 = present
Sex: 1 = male, 2 = female
Race/Ethnicity: 1 = white, 2 = Black/African American, 3 = AI/AN, 5 = Other, 9 = Prefer not to answer
Specialization: 1 = Cancer, 2 = CD, 3 = neither
Roles: Health Educator, CHW, Clinician, Lawmaker, Public/Government Employee, Researcher/academic, NGO/NPO prof, Private Sector, Insurer, Health administrator, Public Health Prof, and other.
Topic: Cancer; Nutrition, PA, obesity; Diabetes; Heart Disease; Tobacco & alcohol; Reproductive Health; Population Health; Access to health services; Mental & emotional well-being; Survivorship; Other
The types of questions than can be answered about this network using SNA and TERGM are:
Do researchers interact only with those within the same role?
Does a researcher’s sex contribute to the formation of relationships?
Are relationships consistent from one year to the next?
What influences whether a researcher acquires, maintains, or drops a relationship over time?
Homophily had a singificant impact on the probability of tie formation across both networks and years. The largest predictor of tie formation for both years in the trust network is conference attendance–if both actors attended the conference, the probability of a trusting relationship between them increased significantly. Reciprocality of ties was the largest predictor of tie formation among the receive networks. According to the tergm, a tie from one year to the next was more likely to form if the tie resulted in tie reciprocity. From one year to the next, the trust network communities became tighter and the receive communities expanded.The most trusted cancer specialist and two most trusted CD specialists are SD10 and SD1 and SD9, respectively.
To better understand this network, descriptive analysis was conducted on the attributes data to see what it can tell us about the network, such as if professionals with the same roles have the same attributes. Next, we calculated network-level metrics such as density to identify how many connections there actually are. Node level analysis and subgroup analysis was used to identify any particular actors of interest and if there are communities present. Lastly, ergm/tergm analysis was conducted to identify what variables, if any, impacted tie formation.
The SD Trust table for 2016 contained 28 vertices, but the SD trust table for 2015 contained only 21 vertices, so the missing vertices were filled in to make a symmetric data matrix to keep the two time series data in the same scale. The SD attributes table contains attributes for SD 1-21 for 2015 and SD 1-28 for 2016. Any missing values were replaced with 0 and attached to the graph object as needed. Additionally, attributes variables were updated so they could be used in the tergm analysis.
There are two social networks of interest among this community: receive and trust. In the receive network, respondents identified how often they received information from another, from a range of 1 = never to 6 = every day. For the sake of our analysis, those connections that were 3 or greater–received information once a quarter and more–were considered true connections while those 2 and under were not. The resulting receive networks for 2015 and 2016 can be seen below:
It is interesting to see how the network changed from one year to the next. In 2015, respondents appeared to have more receive connections, and overall there seems to be one main group who sends and receives information among themselves. This group predominately specializes in communicable diseases, with one actor whose specialization is unknown. The receive network of 2016 is less connected and clear communities are not present. There is also less clear of a distinction between specialties.
The second social network of interest is the trust network. Respondents identified which other actors they trusted, from a range of 0 = no trust to 5 = fully trust. The same method of binarization was used.
Overall, the trust network is much more connected than the receive network. For 2015, the general community among the Communicable Disease (CD) actors is the predominant group, but there are more connections among the isolates that were present in the receive network. Additionally, in 2016 the trust network was even more connecting, showing that trust may increase with time. SD10 appeared as the most trusted of the cancer specialists and SD1 and SD9 the most trusted of the CD specialists.
This network consists of a total of 28 professionals, 21 from 2015 and 22 from 2016. There were 15 individuals who attended both years. Of the total attendees, the majority were white females who specialized in CD. They are evenly split between the Stanford Model of Chronic Disease Self-Management Program and PN/CWH working groups.
The actors were predominantly health educators, clinicians, or involved with NGO/NPO.
The professionals who attended both years were predominately health educators or involved with NGO/NPO or Public/Government.
| Network | Year | Vertices | Edges | Network Density | Transitivity |
|---|---|---|---|---|---|
| Trust | 2015 | 28 | 82 | 0.1045918 | 0.6217617 |
| Trust | 2016 | 28 | 93 | 0.1186224 | 0.4773960 |
| Receive | 2015 | 28 | 71 | 0.0905612 | 0.7200000 |
| Receive | 2016 | 28 | 41 | 0.0522959 | 0.4528302 |
The local clustering coefficient of a node in a graph quantifies how close its neighbours are to being a clique. When the value equals to 1, as is of node SD2 in trust 2015, every neighbour connected to it is also connected to every other vertex within the neighbourhood.
| Node | Network | Year | Clustering Coefficient | _______ | Node | Network | Year | Clustering Coefficient |
|---|---|---|---|---|---|---|---|---|
| SD2 | Trust | 2015 | 1.0000000 | _______ | SD4 | Receive | 2015 | 1.00 |
| SD4 | Trust | 2016 | 1.0000000 | _______ | SD1 | Receive | 2016 | 1.00 |
| SD1 | Trust | 2015 | 0.8055556 | _______ | SD23 | Receive | 2016 | 1.00 |
| SD4 | Trust | 2015 | 0.8055556 | _______ | SD4 | Receive | 2016 | 1.00 |
| SD5 | Trust | 2015 | 0.7857143 | _______ | SD19 | Receive | 2015 | 0.75 |
| SD14 | Trust | 2015 | 0.7857143 | _______ | SD24 | Receive | 2016 | 0.50 |
We choose to only focused on degree centrality this time, which measures the total connections a node has.
| Node | Network | Year | Degree | _______ | Node | Network | Year | Degree |
|---|---|---|---|---|---|---|---|---|
| SD9 | Trust | 2015 | 20 | _______ | SD9 | Receive | 2015 | 17 |
| SD15 | Trust | 2016 | 18 | _______ | SD1 | Receive | 2015 | 16 |
| SD1 | Trust | 2015 | 15 | _______ | SD16 | Receive | 2015 | 15 |
| SD3 | Trust | 2015 | 15 | _______ | SD2 | Receive | 2015 | 14 |
| SD4 | Trust | 2015 | 15 | _______ | SD3 | Receive | 2015 | 14 |
| SD5 | Trust | 2015 | 15 | _______ | SD5 | Receive | 2015 | 14 |
The walktrap community detection algorithm utilizes random walks along edges in the graph to identify communities. Since only a few edges will lead outside of a community, these random walks can identify cliques within a network. The process is iterated until individual groups are merged into the final communities and a specified modularity is reached.
This community detection method found two main communities across all networks and years. After the first year’s conference, a third community was established in the receive network. This could have been a result of meeting and exchanging information at the initial conference. The trust network saw a third smaller community from 2015 welcomed into one of the larger communities. This is likely related to greater trust being established as the number of times met increased.
Exponential Random Graph Models were utilized to identify which attributes, if any, had a statistically significant influence on the probability of tie formation within the trust and receive networks, and if those same attributes had a significant impact across both years.
##
## ==========================
## Summary of model fit
## ==========================
##
## Formula: tr15net ~ edges + mutual + nodematch("Workgroup") + nodematch("Attendees") +
## nodematch("Sex") + nodematch("Race") + nodematch("Cancer")
##
## Iterations: 2 out of 20
##
## Monte Carlo MLE Results:
## Estimate Std. Error MCMC % z value Pr(>|z|)
## edges -6.1608 1.0299 0 -5.982 < 1e-04 ***
## mutual 2.0599 0.4650 0 4.430 < 1e-04 ***
## nodematch.Workgroup 0.4129 0.2461 0 1.678 0.093397 .
## nodematch.Attendees 2.9466 1.0553 0 2.792 0.005235 **
## nodematch.Sex -0.3853 0.2560 0 -1.505 0.132350
## nodematch.Race 1.1388 0.3333 0 3.417 0.000634 ***
## nodematch.Cancer 1.1772 0.2667 0 4.413 < 1e-04 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Null Deviance: 1048.0 on 756 degrees of freedom
## Residual Deviance: 298.1 on 749 degrees of freedom
##
## AIC: 312.1 BIC: 344.5 (Smaller is better.)
The change in the number of edges, mutuality, attendees, race, and cancer are significant predictors of tie formation, as the p-value is far less than the conventional significance cut-off of 0.05. The column for “MCMC%” is left as null, because this model didn’t require Markov Chain Monte Carlo estimation.
The coefficients, -6.16, 2.0599, 2.9466, 1.1388, and 1.1772, are expressed as conditional log-odds, such that:
\[logodds(trust_{2015}) = -6.16*edges + 2.06*mutual + 2.95*attendees + 1.14*sex + 1.18*cancer\]
The log-odds that these heterophilous ties will form in the network equals -6.16, equaling a probability of exp(-6.16)/(1+exp(-6.16)), or 0.002. Likewise, the log-odds of a homophilous relationship forming equals (-6.16+2.06+2.95+1.14+1.18+1.17), corresponding to a probability of 0.76, or exp(1.17)/(1+exp(1.17)). The greatest indicator of a tie forming was homophily of attendees–whether they attended the conference together.
##
## ==========================
## Summary of model fit
## ==========================
##
## Formula: tr16net ~ edges + mutual + nodematch("Workgroup") + nodematch("Attendees") +
## nodematch("Sex") + nodematch("Race") + nodematch("Cancer")
##
## Iterations: 2 out of 20
##
## Monte Carlo MLE Results:
## Estimate Std. Error MCMC % z value Pr(>|z|)
## edges -3.63503 0.28860 0 -12.595 < 1e-04 ***
## mutual 1.19859 0.40398 0 2.967 0.00301 **
## nodematch.Workgroup 0.08826 0.23886 0 0.370 0.71175
## nodematch.Attendees 1.50072 0.33594 0 4.467 < 1e-04 ***
## nodematch.Sex -0.29476 0.26539 0 -1.111 0.26671
## nodematch.Race 0.74782 0.25756 0 2.904 0.00369 **
## nodematch.Cancer 0.83494 0.27986 0 2.983 0.00285 **
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Null Deviance: 1048.0 on 756 degrees of freedom
## Residual Deviance: 452.8 on 749 degrees of freedom
##
## AIC: 466.8 BIC: 499.2 (Smaller is better.)
The change in the number of edges, mutial, attendees, race, and cancer are significant predictors of tie formation, as the p-value is far less than the conventional significance .05 cut off. The column for “MCMC%” is null, because this model didn’t require Markov chain Monte Carlo estimation.
The coefficients, -3.64, 1.20, 1.5, 0.75, and 0.83 , are expressed as conditional log-odds, such that:
\[logodds(trust_{2016})=-3.64*edges+1.20*mutual+1.50*attendees+0.75*race+0.83*cancer\]
The log-odds that these heterophilous ties will form in the network equals -3.64, equaling a probability of exp(-3.64)/(1+exp(-3.64)), or 0.025. Likewise, the log-odds of a homophilous relationship forming equals (-3.64+1.20+1.50+0.75+0.83)= 0.64, corresponding to a probability of 0.65, or exp(.64)/(1+exp(.64)).
##
## ==========================
## Summary of model fit
## ==========================
##
## Formula: re15net ~ edges + mutual + nodematch("Workgroup") + nodematch("Attendees") +
## nodematch("Sex") + nodematch("Race") + nodematch("Cancer")
##
## Iterations: 2 out of 20
##
## Monte Carlo MLE Results:
## Estimate Std. Error MCMC % z value Pr(>|z|)
## edges -6.2324 0.9455 0 -6.592 < 1e-04 ***
## mutual 3.0986 0.5418 0 5.719 < 1e-04 ***
## nodematch.Workgroup 0.3225 0.2542 0 1.269 0.20453
## nodematch.Attendees 2.6414 0.9673 0 2.731 0.00632 **
## nodematch.Sex -0.4799 0.2572 0 -1.865 0.06211 .
## nodematch.Race 0.8015 0.3305 0 2.425 0.01529 *
## nodematch.Cancer 1.4052 0.2936 0 4.786 < 1e-04 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Null Deviance: 1048.0 on 756 degrees of freedom
## Residual Deviance: 245.6 on 749 degrees of freedom
##
## AIC: 259.6 BIC: 292 (Smaller is better.)
The change in the number of edges, mutual, attendees, race, and cancer are significant predictors of tie formation, as the p-value is far less than the conventional significance cut off of 0.05. The column for “MCMC%” is null.
The coefficients, -6.23, 3.10, 2.64, 0.80, and 1.41 , are expressed as conditional log-odds, such that:
\[logodds(receive_{2015})=-6.23*edges+3.10*mutual+2.64*attendees+0.80*race+1.41*cancer\] The log-odds that these heterophilous ties will form in the network equals -6.23, equaling a probability of exp(-.6.23)/(1+exp(-6.23)), or 0.002. Likewise, the log-odds of a homophilous relationship forming equals (-6.23+3.10+2.64+0.80+1.41)= 1.72, corresponding to a probability of 0.85 , or exp(1.72)/(1+exp(1.72).
##
## ==========================
## Summary of model fit
## ==========================
##
## Formula: re16net ~ edges + mutual + nodematch("Workgroup") + nodematch("Attendees") +
## nodematch("Sex") + nodematch("Race") + nodematch("Cancer")
##
## Iterations: 2 out of 20
##
## Monte Carlo MLE Results:
## Estimate Std. Error MCMC % z value Pr(>|z|)
## edges -4.7615 0.4344 0 -10.960 < 1e-04 ***
## mutual 2.8543 0.5952 0 4.795 < 1e-04 ***
## nodematch.Workgroup 0.4490 0.3056 0 1.469 0.141798
## nodematch.Attendees 1.2016 0.4834 0 2.486 0.012920 *
## nodematch.Sex -0.6385 0.3570 0 -1.788 0.073704 .
## nodematch.Race 0.5887 0.3412 0 1.725 0.084448 .
## nodematch.Cancer 1.2547 0.3643 0 3.445 0.000572 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Null Deviance: 1048.0 on 756 degrees of freedom
## Residual Deviance: 229.1 on 749 degrees of freedom
##
## AIC: 243.1 BIC: 275.5 (Smaller is better.)
The change in the number of edges, mutual, attendees, and cancer are significant predictors of tie formation, as the p-value is far less than the conventional significance cutoff of 0.05. The column for “MCMC%” is null.
The coefficients, -4.76, 2.85, 1.2, and 1.26, are expressed as conditional log-odds, such that:
\[logodds(receive_{2016})=-4.76*edges+2.85*mutual+1.20*attendees+1.26*cancer\]
The log-odds that these heterophilous ties will form in the network equals -3.64, equaling a probability of exp(-3.64)/(1+exp(-3.64)), or 0.025. Likewise, the log-odds of a homosexual relationship forming equals (-4.76+2.85+1.2+1.26)= .55, corresponding to a probability of 0.63 , or exp(.55)/(1+exp(.55)).
##
## ==============================
## Summary of formation model fit
## ==============================
##
## Formula: ~edges + mutual + transitiveties + cyclicalties + nodematch("Sex") +
## nodematch("Workgroup") + nodematch("Race") + nodematch("Cancer")
##
## Iterations: 2 out of 20
##
## Monte Carlo MLE Results:
## Estimate Std. Error MCMC % z value Pr(>|z|)
## edges -4.3510 0.3589 0 -12.123 < 1e-04 ***
## mutual 0.7128 0.3731 0 1.911 0.056044 .
## transitiveties 1.1679 0.3414 0 3.421 0.000623 ***
## cyclicalties 0.4374 0.2565 0 1.705 0.088115 .
## nodematch.Sex 0.5192 0.2615 0 1.986 0.047082 *
## nodematch.Workgroup 0.1780 0.2642 0 0.674 0.500586
## nodematch.Race 0.8431 0.3115 0 2.707 0.006789 **
## nodematch.Cancer -0.2837 0.3144 0 -0.902 0.366921
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Null Deviance: 934.4 on 674 degrees of freedom
## Residual Deviance: 377.6 on 666 degrees of freedom
##
## AIC: 393.6 BIC: 429.7 (Smaller is better.)
##
## ================================
## Summary of dissolution model fit
## ================================
##
## Formula: ~edges + mutual + transitiveties + cyclicalties
##
## Iterations: 2 out of 20
##
## Monte Carlo MLE Results:
## Estimate Std. Error MCMC % z value Pr(>|z|)
## edges -2.4944 0.5459 0 -4.570 < 1e-04 ***
## mutual 2.6614 1.2426 0 2.142 0.032208 *
## transitiveties 1.9284 0.5506 0 3.502 0.000462 ***
## cyclicalties -Inf 0.0000 0 -Inf < 1e-04 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Null Deviance: 113.7 on 82 degrees of freedom
## Residual Deviance: NaN on 78 degrees of freedom
##
## AIC: NaN BIC: NaN (Smaller is better.)
With all other variables held constant, a relationship is much more likely to form if it will close a mutual pair. The conditional log-odds in this case are increased by 0.7, which translates to an increase in the conditional odds of 2.01. Note that the change statistic can equal only 0 or 1 in this case.
The reciprocity effect on persistence is also positive, but less strong and not statistically signifiant. The point estimate is an increase of 2.7 in the conditional log-odds of persistence, a bit more than double the conditional odds.
Transitive triads are positive and borderline significant. Cyclical triads have a statistically significant negative influence. Overall, the results suggest that hierarchy plays a role in tie formation and persistence in this model, and there is a consistent anti-egalitarianism dynamic.
##
## ==============================
## Summary of formation model fit
## ==============================
##
## Formula: ~edges + mutual + transitiveties + cyclicalties + nodematch("Sex") +
## nodematch("Workgroup") + nodematch("Race") + nodematch("Cancer")
##
## Iterations: 2 out of 20
##
## Monte Carlo MLE Results:
## Estimate Std. Error MCMC % z value Pr(>|z|)
## edges -5.1140 0.4491 0 -11.387 < 1e-04 ***
## mutual 2.6331 0.7214 0 3.650 0.000262 ***
## transitiveties 1.9412 0.4374 0 4.438 < 1e-04 ***
## cyclicalties -0.4982 0.3056 0 -1.630 0.103045
## nodematch.Sex 0.7273 0.4127 0 1.762 0.078000 .
## nodematch.Workgroup -0.7159 0.4724 0 -1.516 0.129635
## nodematch.Race 0.6707 0.3937 0 1.704 0.088430 .
## nodematch.Cancer -0.4568 0.5253 0 -0.870 0.384488
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Null Deviance: 949.6 on 685 degrees of freedom
## Residual Deviance: 173.1 on 677 degrees of freedom
##
## AIC: 189.1 BIC: 225.4 (Smaller is better.)
##
## ================================
## Summary of dissolution model fit
## ================================
##
## Formula: ~edges + mutual + transitiveties + cyclicalties
##
## Iterations: 2 out of 20
##
## Monte Carlo MLE Results:
## Estimate Std. Error MCMC % z value Pr(>|z|)
## edges -3.2251 0.6059 0 -5.323 <1e-04 ***
## mutual 1.6814 1.6493 0 1.019 0.3080
## transitiveties 1.8584 0.7273 0 2.555 0.0106 *
## cyclicalties -0.3321 0.7480 0 -0.444 0.6571
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Null Deviance: 98.43 on 71 degrees of freedom
## Residual Deviance: 43.48 on 67 degrees of freedom
##
## AIC: 51.48 BIC: 60.53 (Smaller is better.)
## [1] 2.6
## [1] 13.46374
## [1] 1.7
With all other variables held constant, a relationship is much more likely to form if it will close a mutual pair. The conditional log-odds are increased by 2.6 which translates to an increase in the conditional odds by 13.46374. Note that the change statistic can equal only 0 or 1 in this case.
The reciprocity effect on persistence is also positive, but less strong and not statistically signifiant. The point estimate is an increase of 1.7 in the conditional log-odds of persistence, a bit more than double the conditional odds.
Transitive triads are positive and significant, but cyclical triads, while negative, are not significant. Overall, the results suggest that hierarchy plays a role in tie formation and persistence in this model, and there is a weak though consistent anti-egalitarianism dynamic.
The following animation simulates how the trust network formed based on our model.
## caching 5 properties for slice 0
## caching 5 properties for slice 1
## caching 5 properties for slice 2
## loading ndtv-d3 animation widget...
The following animation simulates how the receive network formed based on our model.
## caching 5 properties for slice 0
## caching 5 properties for slice 1
## caching 5 properties for slice 2
## loading ndtv-d3 animation widget...