Causes of Complete Networks
Introduction
As stated in the very beginning of this course, social networks play a crucial role within the social sciences. Social networks are interesting social phenomena to explain (causes of social networks) and social networks play important roles in many so-called social problems (consequences of social networks). If social scientists seek explanations for why we observe specific social networks the explanations generally refer to three aspects, or ‘theoretical dimensions’, of social networks, namely:
- structure (the relations in the network)
- composition (characteristics of the nodes in the network)
- evolution (change in structure and/or composition)
- tie evolution: structure –> structure
- node evolution: node attributes –> node attributes
- influence: structure –> node attributes
- selection: node attributes –> structure
The structure of the network refers to the relations we observe between persons. For example, we may seek explanations for why networks are more dense than others, why people tend to reciprocate positive relations and why we observe clusters in the network. There are many different types of structures (involving one, two, three or more nodes) and some of them will be discussed below. The composition of the network refers to characteristics of the persons in the network. We may seek explanations for who joins the Twitter network and when, or why we observe ethnic segregation between soccer teams even if they are part of the same club. These questions generally refer to selective selection into or out of networks, about joiners and leavers. Explanations for network size may also be categorized under the label of composition-explanations. Evolution process refers to change of social networks. Let us first consider tie evolution. An important realization is that the ties we currently observe in the network are in part the consequence of the ties that were present at a previous time point. Consider the simple example of reciprocity. That we observe many reciprocated relations in a network is not only because people tend to reciprocate relations but also because people tend to make relations in the first place (we can only reciprocate a relation when there is a relation). With node evolution I mean that node attributes may change over time as a result of other node attributes. This sounds a little complicated but it simple means that persons develop and they may do say at different rates. For example, people who start to smoke will generally smoke more and more. Or, grades of girls in elementary school improve at a faster rate than the grades of boys. In these questions a social network perspective is not necessary. Influence processes are by now familiar to us. Node attributes may change because of the ties this node has to specific other nodes. Selection refers to the opposite process; the ties I have to other nodes may change as a result of the node attributes. Both influence and selection processes explain why specific structures (e.g. reciprocated dyads) more often have a specific composition (e.g. ego-alter similarity) than other structures (e.g. asymmetric dyad).
Network structures
Node-level
Although one node is not a network (the smallest network is a dyad), I would like to start with the node-level.
degree
Some nodes, or people in the network will have a higher outdegree than others. See Figure 1 below.
Figure 1. Outdegree
We could use the function degree
in igraph
to find the outdegree of each node. See below for an example.
hist(igraph::degree(G1, mode="out", loops=FALSE), xlab="outdegree", main="histogram of outdegree atmention network Dutch MPs")
We observe that most MPs atmention a little and some atmention a lot. An explanation could be that some MPs have the ‘task’ (e.g. communication officer) to tweet a lot.
Similarly as outdegree, some nodes, or people, in the network will have a higher indegree than others. See Figure 2 below.
Figure 2. Indegree
We could use the function degree
in igraph
to find the indegree of each node. See below for an example.
hist(igraph::degree(G1, mode="in", loops=FALSE), xlab="indegree", main="histogram of indegree atmention network Dutch MPs")
We observe that most MPs are atmentioned seldom and some are atmentioned often. An explanation could be that some MPs are more important and it may help your own career to atmention these MPs.
In many social networks both the out- and indegree distribution are skewed just as in our Twitter network among MPs.
The mean outdegree is the same as the density in the network. In RSiena, if we want to predict the network structure we start by including the out-degree effect that takes the average outdegree into account. The average outdegree is the same as the average in-degree, so there is no separate effect or that. To take into account that some nodes have a higher out-degree than others, we could include the out-degree activity effect. This states that if a node has a high out-degree at time t1 it is likely that this node has a high out-degree at t2. The in-degree popularity effect does the same for in-degree. This effect tests whether nodes who have relatively many indegrees at t1 also have many indegrees at t2. We could also imagine that nodes who have relatively many outdegrees will also have relatively many indegrees and there are effects for that to in RSiena (indegree-activity and outdegree-popularity).
Dyad-level
Undirected Dyads
Dyads are the smallest possible networks. For undirected relations there are two possible dyad configurations. See Figure 3.
Figure 3. Undirected dyads configurations
Directed Dyads
And for directed dyads. There are four options. See Figure 4. However, two configurations (the two in the middle row) are isomorphic. We cannot distinguish them in the network. From now on, I won’t draw all isomorphs.
Figure 4. Directed dyads configurations
When there is a tie from i to j and vice versa, we call this a reciprocated dyad. I hope you see that the reasons for me to start or break a relation with you if you don’t have relation with me, may be different from the reasons for me to start or break a relation with you if you have relation with me. Consider the example of atmentioning among MPs. If MP i atmentions MP j this may increase the visibility of MP j on Twitter and this is something MP j probably evaluates positively. In return of this favor, MP j may atmention MP j another time. If I scratch your back, you scratch mine. We could use igraph to make a dyad count, which is called a dyad census. See below:
dyadcount <- dyad.census(G1)
dyadcount$total <- (vcount(G1)*(vcount(G1)-1))/2 # to add the number of dyads in the graph
dyadcount
## $mut
## [1] 203
##
## $asym
## [1] 634
##
## $null
## [1] 9894
##
## $total
## [1] 10731
We could compare these values with a random graph of the same size with the same density.
dens <- graph.density(G1)
size <- vcount(G1)
trial <- 1000
recip <- rep(NA, trial)
for ( i in 1:trial ){
random_graph <- erdos.renyi.game(n = size, p.or.m = dens, directed = TRUE)
recip[i] <- dyad.census(random_graph)$mut
}
{hist(recip, main="number of mutual dyads in random graph", xlab="", )
abline(v=dyadcount$mut, col="red", lwd=3)}
We thus observe way more (or significantly more) mutual dyads (203) as could be expected based solely on chance. Conclusion: MPs scratch each others back. In social networks of positive relations, reciprocity effects are very common
Undirected Dyads - multiplexity
If we consider two types of undirected relations simultaneously, we have three distinct configurations. See Figure 5.
Figure 5. Multiplex, undirected dyads configurations
Explanations for multiplex configuration are multicomplex. A first step to consider is whether there is a necessary, or likely, order in the relations. To illustrate. I first became friends with my wife and afterwards she became my wife. Although this situations hopefully holds true for most married couples, it is not logically necessary of course. A second step is to consider the valence of the relations. Are they positive or negative. It is not likely that I would name my friend a foe and vice versa but there is a thin line between love and hate. Feelings of hate and love for the same person may even coexist. Ever heard someone saying: “I can’t live with him/her but I can’t live without him/her either”?.
Triad-level
Undirected Triads
Figure 7. Undirected triad configurations
We observe an unconnected triad, a triad with a connected pair, an open triad and a closed triad. The open triad is also called a ‘forbidden triad’ and actor i in such a triad is said to hold a ‘brokerage position’.
Undirected Triads - Multiplexity
Figure 8. Multiplex, undirected triad configurations
Directed Triads
By now, I am a bit tired of drawing all these nodes and relations. Luckily, the net is full of pictures of the possible directed triad configurations. I stole this one from an online workshop on Social Network Analysis for Anthropologists here.
Figure 9. Directed triad configurations
These triads all have unique names. Depending on the number dyads without ties (last digit), the number of dyads with one tie (asymmetric dyads; second digit) and the number of dyads with two ties (mutual dyads; first digit). The subtypes are distinguished by the letters C, D, U, T.
This triad census has been developed by Davis and Leinhardt in 19721 And their original picture in the paper is too cool not to show here:
Figure 9. Original Triad census by Davis and Leinhard (1972)
Directed Triads - Multiplexity
I could not find a nice picture of all possible directed triad configurations for two relations simultaneously. Naturally, there are also specific configurations in which 4 nodes or even more nodes are involved.
Tie evolution
Suppose we would like to explain why there are so many transative triads in our network. We must realize that a transitive dyad may be the outcome of different evolution processes. See below:
Figure 10. Different pathways to a transitive triad.
That is, if we assume that each tie is made subsequently, thus not two tie are created at the same time. The reason why i closes the triad, may be very different from the reason why k closes the triad. It all depends on the social relation under consideration. Do you think both pathways are just as likely for (a) ‘friendship relations’, (b) ‘who kicks whom relations’ and (c) ‘who explains social network analysis to whom? relations’ Do you also observe that a combination of an in-degree activity effect and an in-degree popularity effect would make the top pathway more likely and that a combination of an out-degree activity effect and an indegree activity or indegree popularity effect would make the bottom pathway more likely? The take home message is that we need longitudinal data if we would like to disentangle specific explanations and that we generally need to take into account ‘lower order’ explanations first before we start to scrutinize more complext ‘higher order’ explanations.
Selection and influence
When I started to learn SNA I followed a RSiena workshop by Christian Steglich. And one of his slides made a big impression on me. See below:
Figure 10. An inspirational slide of Christian Steglich.
When scientists investigate social networks they generally do not have information on all the tie changes that take place. More commonly, the information they have is based on one or more snapshots of the network. Suppose we observe at T1 a tie between i who is white and j who is black and that at T2 we observer that there still is a tie between i and j but that i became black. Intuitively, we would interpret that some kind of influence (or rather assimilation) took place. Unfortunately, even though we have longitudinal data (two snapshots), we cannot rule out that actually a combination of selection and node evolution explain our observations; i breaks a tie with j who is dissimilar, i’s attribute evolves (for whatever reason) from white to black, i selects j who is similar. Christian Steglich’s take home message was that we need a statistical method (RSiena) that can test which pathway is more likely. My take home message is that you have to be aware of overdetermination, that there are possibly multiple causes leading to the same outcome.