Density test#
Here, we compare the two unmatched networks by treating each as an Erdos-Renyi network and simply comparing their estimated densities.
The Erdos-Renyi (ER) model#
The Erdos-Renyi (ER) model is one of the simplest network models. This model treats the probability of each potential edge in the network occurring as the same. In other words, an edge between any pair of nodes is just as likely as an edge between any other pair.
Math
Let \(n\) be the number of nodes. We say that for all \((i, j)\), \(i \neq j\), with \(i\) and \(j\) both running from \(1 ... n\), the probability of the edge \((i, j)\) occurring is

\[P[A_{ij} = 1] = p\]

where \(p\) is the global connection probability.

Each element of the adjacency matrix \(A\) is then sampled independently according to a Bernoulli distribution:

\[A_{ij} \sim Bernoulli(p)\]

For a network modeled as described above, we say it is distributed

\[A \sim ER(n, p)\]
Thus, for this model, the only parameter of interest is the global connection probability, \(p\). This is sometimes also referred to as the network density.
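To make this concrete, here is a minimal sketch of sampling a directed ER network and estimating its density, using numpy directly rather than the graspologic sampler used later in this notebook; the node count and probability below are arbitrary illustrative values.

import numpy as np

rng = np.random.default_rng(8888)

n = 10  # number of nodes (arbitrary, for illustration)
p = 0.3  # global connection probability (arbitrary, for illustration)

# sample each potential directed edge independently as a Bernoulli(p) draw
A = rng.binomial(1, p, size=(n, n))
np.fill_diagonal(A, 0)  # no self-loops

# the estimate of p is simply the observed density
n_potential_edges = n * (n - 1)
p_hat = A.sum() / n_potential_edges
print(f"estimated density: {p_hat:.2f}")

Sampling the full matrix and then zeroing the diagonal is just a convenience here; only the off-diagonal entries correspond to potential edges.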
Testing under the ER model#
In order to compare two networks \(A^{(L)}\) and \(A^{(R)}\) under this model, we simply need to compute their network densities (\(p^{(L)}\) and \(p^{(R)}\)), and then run a statistical test to see whether these densities are significantly different.
Math
Under this model, the total number of edges \(m\) comes from a \(Binomial(n(n-1), p)\) distribution, where \(n\) is the number of nodes: the networks are directed with no self-loops, so there are \(n(n-1)\) potential edges, and the number of observed edges is a sum of that many independent Bernoulli trials with the same probability. If \(m^{(L)}\) is the number of edges on the left hemisphere, and \(m^{(R)}\) is the number of edges on the right, then we have

\[m^{(L)} \sim Binomial(n^{(L)}(n^{(L)} - 1), p^{(L)})\]

and independently,

\[m^{(R)} \sim Binomial(n^{(R)}(n^{(R)} - 1), p^{(R)})\]

To compare the two networks, we are just interested in a comparison of \(p^{(L)}\) vs. \(p^{(R)}\). Formally, we are testing:

\[H_0: p^{(L)} = p^{(R)} \quad \text{vs.} \quad H_A: p^{(L)} \neq p^{(R)}\]
Fortunately, the problem of testing for equal proportions is well studied. In our case, we use Fisher’s exact test to evaluate the null and alternative hypotheses above.
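The erdos_renyi_test function used below comes from this project's pkg.stats module, and its internals are not shown here. As a hedged sketch of the idea, the comparison can be phrased as a 2x2 contingency table (edges vs. non-edges for each hemisphere) passed to scipy's Fisher's exact test; the helper name density_test and the table layout below are illustrative assumptions, not the package's actual implementation.

import numpy as np
from scipy.stats import fisher_exact


def density_test(adj_left, adj_right):
    # illustrative sketch; assumes directed networks with no self-loops
    n_left = adj_left.shape[0]
    n_right = adj_right.shape[0]
    possible_left = n_left * (n_left - 1)
    possible_right = n_right * (n_right - 1)
    edges_left = int((adj_left != 0).sum())
    edges_right = int((adj_right != 0).sum())
    # 2x2 table: rows are hemispheres, columns are (edge, no edge)
    table = np.array(
        [
            [edges_left, possible_left - edges_left],
            [edges_right, possible_right - edges_right],
        ]
    )
    stat, pvalue = fisher_exact(table, alternative="two-sided")
    return stat, pvalue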
import datetime
import time
import matplotlib.pyplot as plt
import numpy as np
import pandas as pd
import seaborn as sns
from pkg.data import load_network_palette, load_unmatched
from pkg.io import FIG_PATH, get_environment_variables
from pkg.io import glue as default_glue
from pkg.io import savefig
from pkg.plot import (
SmartSVG,
draw_hypothesis_box,
merge_axes,
networkplot_simple,
plot_density,
rainbowarrow,
set_theme,
soft_axis_off,
svg_to_pdf,
)
from pkg.stats import erdos_renyi_test
from pkg.utils import sample_toy_networks
from svgutils.compose import Figure, Panel, Text
from graspologic.simulations import er_np
_, _, DISPLAY_FIGS = get_environment_variables()
FILENAME = "er_unmatched_test"
def gluefig(name, fig, **kwargs):
    savefig(name, foldername=FILENAME, **kwargs)
    glue(name, fig, figure=True)
    if not DISPLAY_FIGS:
        plt.close()


def glue(name, var, **kwargs):
    default_glue(name, var, FILENAME, **kwargs)
t0 = time.time()
set_theme(font_scale=1.25)
network_palette, NETWORK_KEY = load_network_palette()
left_adj, left_nodes = load_unmatched("left")
right_adj, right_nodes = load_unmatched("right")
Environment variables:
RESAVE_DATA: true
RERUN_SIMS: true
DISPLAY_FIGS: False
Diagram of the ER model#
np.random.seed(8888)
ps = [0.2, 0.4, 0.6]
n_steps = len(ps)
fig, axs = plt.subplots(
2,
n_steps,
figsize=(6, 3),
gridspec_kw=dict(height_ratios=[2, 0.5]),
constrained_layout=True,
)
n = 18
for i, p in enumerate(ps):
    A = er_np(n, p)
    if i == 0:
        node_data = pd.DataFrame(index=np.arange(n))
    ax = axs[0, i]
    networkplot_simple(A, node_data, ax=ax, compute_layout=i == 0)
    label_text = f"{p}"
    if i == 0:
        label_text = r"$p = $" + label_text
    ax.set_title(label_text, pad=10)
fig.set_facecolor("w")
ax = merge_axes(fig, axs, rows=1)
soft_axis_off(ax)
rainbowarrow(ax, (0.15, 0.5), (0.85, 0.5), cmap="Blues", n=100, lw=12)
ax.set_xlim((0, 1))
ax.set_ylim((0, 1))
ax.set_xticks([])
ax.set_yticks([])
ax.set_xlabel("Increasing density")
gluefig("er_explain", fig)
Diagram of the density test#
A1, A2, node_data = sample_toy_networks()
node_data["labels"] = np.ones(len(node_data), dtype=int)
palette = {1: sns.color_palette("Set2")[2]}
fig, axs = plt.subplots(2, 2, figsize=(6, 6), gridspec_kw=dict(wspace=0.7))
ax = axs[0, 0]
networkplot_simple(A1, node_data, ax=ax)
ax.set_title("Compute global\nconnection density")
ax.set_ylabel(
"Left",
color=network_palette["Left"],
size="large",
rotation=0,
ha="right",
labelpad=10,
)
ax = axs[1, 0]
networkplot_simple(A2, node_data, ax=ax)
ax.set_ylabel(
"Right",
color=network_palette["Right"],
size="large",
rotation=0,
ha="right",
labelpad=10,
)
stat, pvalue, misc = erdos_renyi_test(A1, A2)
ax = axs[0, 1]
ax.text(
0.4,
0.2,
r"$p = \frac{\# \ edges}{\# \ potential \ edges}$",
ha="center",
va="center",
)
ax.axis("off")
ax.set_title("Compare ER\nmodels")
ax.set(xlim=(-0.5, 2), ylim=(0, 1))
ax = axs[1, 1]
ax.axis("off")
x = 0
y = 0.55
draw_hypothesis_box("er", -0.2, 0.8, ax=ax, fontsize="medium", yskip=0.2)
gluefig("er_methods", fig)
stat, pvalue, misc = erdos_renyi_test(left_adj, right_adj)
glue("pvalue", pvalue, form="pvalue")
n_possible_left = misc["possible1"]
n_possible_right = misc["possible2"]
glue("n_possible_left", n_possible_left, form="long")
glue("n_possible_right", n_possible_right, form="long")
density_left = misc["probability1"]
density_right = misc["probability2"]
glue("density_left", density_left, form="0.2g")
glue("density_right", density_right, form="0.2g")
density_ratio = density_left / density_right
glue("density_ratio", density_ratio, form="0.2f")
n_edges_left = misc["observed1"]
n_edges_right = misc["observed2"]
coverage = 0.95
glue("coverage", coverage, form="2.0f%")
plot_density(misc, palette=network_palette, coverage=coverage)
fig = plt.gcf()  # grab the density figure just created so gluefig saves it, not the previous figure
gluefig("er_density", fig)
Reject bilateral symmetry under the ER model#

Fig. 3 Comparison of estimated densities for the left and right hemisphere networks. The estimated density (probability of any edge across the entire network), \(\hat{p}\), for the left hemisphere is ~0.016, while for the right it is ~0.017. Black lines denote 95% confidence intervals for this estimated parameter \(\hat{p}\). The p-value for testing the null hypothesis that these densities are the same is 4.87e-24 (two-sided Fisher’s exact test).#
Figure 3 shows the comparison of the network densities between the left and right hemisphere induced subgraphs. We see that the density on the left is ~0.016, and on the right it is ~0.017. To determine whether a difference of this size is likely to be observed by chance under the ER model, we ran a two-sided Fisher’s exact test, which tests whether the success probabilities of two independent binomials are significantly different. This test yields a p-value of 4.87e-24, giving strong evidence to reject this version of our hypothesis of bilateral symmetry. While the difference between the estimated densities is not large, the low p-value results from the large sample size for this comparison: there are 2,266,530 potential edges on the left and 2,266,530 on the right.
To our knowledge, when neuroscientists have considered the question of bilateral symmetry, they have not meant such a simple comparison of proportions. In many ways, the ER model is too simple to be an interesting description of connectome structure. However, we note that even the simplest network model yields a significant difference between brain hemispheres for this organism. It is unclear whether this difference in densities is biological (e.g., a result of slightly differing rates of development for this individual), an artifact of how the data were collected (e.g., technological limitations causing slightly lower reconstruction rates on the left hemisphere), or something else entirely. Still, the ER test results provide important context for the other tests. Almost any network statistic (e.g., clustering coefficient, number of triangles), as well as many of the model-based parameters we consider in this paper, is strongly related to the network density. Thus, if the densities differ, tests based on any of these other statistics are also likely to reject the null hypothesis, and we will need ways of telling whether an observed difference for these other tests could be explained by a difference in density alone.
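For reference, the black lines in Figure 3 are confidence intervals for a binomial proportion; the exact interval method used by plot_density is not shown here. A hedged sketch using the Clopper-Pearson (exact) interval from scipy, applied to the edge and potential-edge counts computed above (n_edges_left, n_possible_left, and their right-hemisphere counterparts), could look like the following; the helper name is hypothetical.

from scipy.stats import binomtest


def density_confidence_interval(n_edges, n_possible, coverage=0.95):
    # Clopper-Pearson (exact binomial) interval for the estimated density
    result = binomtest(int(n_edges), int(n_possible))
    ci = result.proportion_ci(confidence_level=coverage, method="exact")
    return ci.low, ci.high


left_ci = density_confidence_interval(n_edges_left, n_possible_left, coverage)
right_ci = density_confidence_interval(n_edges_right, n_possible_right, coverage)
print(f"left density CI: ({left_ci[0]:.4f}, {left_ci[1]:.4f})")
print(f"right density CI: ({right_ci[0]:.4f}, {right_ci[1]:.4f})")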
FIG_PATH = FIG_PATH / FILENAME
fontsize = 9
methods = SmartSVG(FIG_PATH / "er_methods.svg")
methods.set_width(200)
methods.move(10, 20)
methods_panel = Panel(
methods, Text("A) Density test methods", 5, 10, size=fontsize, weight="bold")
)
density = SmartSVG(FIG_PATH / "er_density.svg")
density.set_height(methods.height)
density.move(10, 15)
density_panel = Panel(
density, Text("B) Density comparison", 5, 10, size=fontsize, weight="bold")
)
density_panel.move(methods.width * 0.9, 0)
fig = Figure(
(methods.width + density.width) * 0.9,
(methods.height) * 0.9,
methods_panel,
density_panel,
)
fig.save(FIG_PATH / "composite.svg")
svg_to_pdf(FIG_PATH / "composite.svg", FIG_PATH / "composite.pdf")
fig
End#
elapsed = time.time() - t0
delta = datetime.timedelta(seconds=elapsed)
print(f"Script took {delta}")
print(f"Completed at {datetime.datetime.now()}")
Script took 0:00:05.677516
Completed at 2023-03-10 13:26:05.593312